Handling single dates in pipeline #10

Open
philip-schrodt opened this issue Feb 24, 2014 · 0 comments

We still need to get some experience with how those feeds handle dates as they come in. The last time I checked, the URLs in consecutive days were largely (but not completely) redundant. What we may want to do is provide an initial update, then after a couple of days go back, recode all of the available files (eliminating duplicate URLs), and replace the records for that day in the "final" file. Any given run of oneaday_formatter.py would then process a single day rather than multiple days. What we want to avoid is a file that is not in chronological order: that creates a mess with various routines.
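
To make the proposal concrete, here is a minimal sketch of the "recode, drop duplicate URLs, replace that day's records" step. This is not code from oneaday_formatter.py: the record layout (a `(date, url, text)` tuple with `YYYYMMDD` date strings) and the names `dedupe_by_url` and `replace_day` are assumptions made purely for illustration.

```python
from typing import List, Tuple

# Assumed record layout (hypothetical): (date as "YYYYMMDD", url, text)
Record = Tuple[str, str, str]


def dedupe_by_url(records: List[Record]) -> List[Record]:
    """Keep only the first record seen for each URL, preserving input order."""
    seen = set()
    unique = []
    for rec in records:
        url = rec[1]
        if url not in seen:
            seen.add(url)
            unique.append(rec)
    return unique


def replace_day(final_records: List[Record], day: str,
                recoded: List[Record]) -> List[Record]:
    """Swap out every record for `day` with the freshly recoded ones.

    The result is re-sorted by date so the "final" file stays chronological.
    """
    kept = [rec for rec in final_records if rec[0] != day]
    merged = kept + dedupe_by_url(recoded)
    # Python's sort is stable, and YYYYMMDD strings sort in date order.
    merged.sort(key=lambda rec: rec[0])
    return merged
```

Because each call touches one day only, a later re-run for an earlier date lands its records in the right position rather than being appended to the end of the file.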
