It could be an interesting feature. The only problem is that at the moment the crawler does not treat the page content as a DOM; it is just text. But maybe we can limit the section of the page that gets crawled in some other way. I will investigate this a bit more.
for example:
would only follow the links found inside .main-content
This way I don't have to keep crawling the header, footer, sidebars, etc. on every page.
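To make the idea concrete, here is a minimal sketch in Python of what "only follow links inside .main-content" could look like. It is not the crawler's actual code or API: the names ScopedLinkExtractor and extract_scoped_links are hypothetical, it only matches by class name rather than a full CSS selector, and it assumes reasonably well-formed HTML where non-void tags are closed.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not change the depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class ScopedLinkExtractor(HTMLParser):
    """Hypothetical helper: collect hrefs of <a> tags inside a scoped element."""

    def __init__(self, scope_class):
        super().__init__()
        self.scope_class = scope_class
        self.depth = 0    # greater than 0 while inside the scoped element
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if self.depth > 0:
            self.depth += 1              # nested element inside the scope
        elif self.scope_class in classes:
            self.depth = 1               # entering the scoped element
        if self.depth > 0 and tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.depth > 0:
            self.depth -= 1              # leaving a nested or scoped element


def extract_scoped_links(html, scope_class="main-content"):
    """Return the links found inside elements carrying scope_class."""
    parser = ScopedLinkExtractor(scope_class)
    parser.feed(html)
    parser.close()
    return parser.links
```

With something like this, links in the header, footer and sidebars are never queued, because they sit outside the scoped element. Pages with unclosed tags such as a bare `<li>` would throw off the depth counter, so a real implementation would probably want a proper DOM parser.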
Thank you for writing this!