Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deduplication of failed jobs #193

Open
jhkennedy opened this issue Nov 21, 2024 · 0 comments
Open

Deduplication of failed jobs #193

jhkennedy opened this issue Nov 21, 2024 · 0 comments
Labels
wontfix This will not be worked on

Comments

@jhkennedy
Copy link
Contributor

When deduplicating, for performance reasons (see #119 ), we are moving away from querying the HyP3 API for all jobs associated with a reference scene and instead:

  • querying the S3 bucket directly for published scenes, which encompasses SUCCEEDED jobs (shipped in Release v0.5.8 #184)
  • querying the hyp3-its-live dynamodb table directly using the status_code index for PENDING and RUNNING jobs (should always be a small number and performant)

With that strategy, we won't account for FAILED jobs, potentially submitting multiple requests for the same job. Generally, we don't see many "unlucky" jobs that experience intermittent failure for all three attempts, so this is likely to submit jobs that will always fail.

There's a significant number of failed jobs, even at a nominal failure rate of 0.5%, in the 544-day search window, so searching for FAILED jobs is probably not a good option from a performance standpoint.

Since we shouldn't receive the same message that many times, I don't see this causing a significant cost increase (though we'll want to check in on this assumption at some point), but feel it's worth documenting.

Going to leave this open and labeled wontfix This will not be worked on as a reference

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
wontfix This will not be worked on
Projects
None yet
Development

No branches or pull requests

1 participant