Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add order_groups option to Amazon scraper #32

Merged
merged 1 commit into from
Dec 17, 2020

Conversation

carljm
Copy link
Collaborator

@carljm carljm commented Dec 5, 2020

I have Amazon orders going back years, and scraping all of those order
list pages on every update takes forever and is flaky just due to the likelihood
that something will timeout along the way when making that many requests.

Once I've scraped the old ones, no new invoices are going to be added in
the past, so e.g. assuming I update at least every 3 months, I'm really fine
just scraping the 'past 3 months' order group.

This adds a flexible option to specify exactly which order groups should
be scraped; all others will be skipped.

I have Amazon orders going back years, and scraping all of them on every
update takes forever and is flaky just due to the likelihood that
something will timeout along the way when making that many requests.
Once I've scraped the old ones, no new invoices are going to be added in
a previous year, so assuming I update at least every 3 months, I'm
really fine just scraping the 'past 3 months' order group.

This adds a flexible option to specify exactly which order groups should
be scraped; others will be skipped.
@carljm
Copy link
Collaborator Author

carljm commented Dec 17, 2020

Hi @jbms, any thoughts on whether this addition makes sense? If you have other ideas for how to handle this, I'm open to implementing something different.

@jbms jbms merged commit 67dc2ac into jbms:master Dec 17, 2020
@jbms
Copy link
Owner

jbms commented Dec 17, 2020

Thanks!

This was referenced Jan 18, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants