Would be awesome to apply a selector to limit scope of crawled links #15

duggi · 2015-10-31T17:33:03Z

for example:

crawler.crawl({
  url: "http://localhost:8080/locations/",
  selector: ".main-content"

would only follow the links found inside .main-content

this way i don't have to keep crawling the header, footer, sidebars, etc on every page

thank you for writing this!

The text was updated successfully, but these errors were encountered:

amoilanen · 2015-11-01T20:15:31Z

Hi,

It can be an interesting feature, the only problem is that at the moment the crawler does not deal with the page content as DOM, it is just a text content. But maybe we can limit the section of the page which should be crawled in some other way. I will investigate this a bit more.

amoilanen added the enhancement label Nov 1, 2015

amoilanen self-assigned this Nov 1, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Would be awesome to apply a selector to limit scope of crawled links #15

Would be awesome to apply a selector to limit scope of crawled links #15

duggi commented Oct 31, 2015

amoilanen commented Nov 1, 2015

Would be awesome to apply a selector to limit scope of crawled links #15

Would be awesome to apply a selector to limit scope of crawled links #15

Comments

duggi commented Oct 31, 2015

amoilanen commented Nov 1, 2015