Implement ECS task concurrency prevention for registry-sweepers #105

alexdunnjpl · 2024-04-03T16:33:38Z

💡 Description

Currently, if a sweeper executes for longer than its schedule cadence, multiple instances of the sweeper will run concurrently.

This increases cost through both redundant processing and a slowdown of all jobs under the increased database load, and could degrade service if the database is loaded heavily enough.

Implement configuration to allow execution of at most one container instance per task definition (i.e. per node) at any point in time.
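For illustration only, here is a minimal sketch of one way the "at most one instance per task definition" rule could be enforced from inside the sweeper itself, rather than through the AWS-side configuration this ticket asks for. It assumes an ECS client such as `boto3.client("ecs")` and uses the ECS `ListTasks` call filtered by `family` and `desiredStatus`. The function name `sweeper_already_running` and its parameters are hypothetical and not part of registry-sweepers.

```python
def sweeper_already_running(ecs_client, cluster, family, own_task_arn=None):
    """Return True if another RUNNING task of the given family exists in the cluster.

    ecs_client   -- assumed to behave like boto3.client("ecs")
    cluster      -- cluster name or ARN (hypothetical value supplied by the caller)
    family       -- task definition family of the sweeper
    own_task_arn -- ARN of the calling task, if known, so it does not count itself
    """
    kwargs = {"cluster": cluster, "family": family, "desiredStatus": "RUNNING"}
    running = []
    while True:
        page = ecs_client.list_tasks(**kwargs)
        running.extend(page.get("taskArns", []))
        token = page.get("nextToken")
        if not token:
            break
        # Follow pagination until every matching task has been listed.
        kwargs["nextToken"] = token
    return any(arn != own_task_arn for arn in running)
```

A sweeper could call this at startup and exit early when it returns True. The check is not atomic: two tasks that start at nearly the same moment can both see each other, so this reduces overlap rather than strictly guaranteeing a single instance.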

@jordanpadams this isn't blocking anything, but the sooner it's done, the shorter we can make our sweeper cadence, and the performance/cost impact is nontrivial.

jordanpadams · 2024-04-03T17:03:45Z

@alexdunnjpl when you say "implement configuration", is this an event scheduler configuration?

alexdunnjpl · 2024-04-03T17:30:04Z

@jordanpadams I'm fuzzy on the details, but I think it requires defining a cluster for each task definition and setting a container limit on each cluster. In short, "do some AWS Console stuff".

@sjoshi-jpl will have a better idea of the details, I suspect.

jordanpadams · 2024-04-03T21:01:48Z

Thanks @alexdunnjpl. As a task, this is 100% going to get lost in the 100s of tickets we have open right now. I will try to keep track of this and add to our overall release plan.

alexdunnjpl · 2024-04-12T06:14:10Z

The need for this should be somewhat mitigated (though not completely avoided) by NASA-PDS/registry-sweepers#115, since now only provenance should result in any redundant work being done.

EDIT: Actually, this is incorrect. There is still a concern of multiple instances tripping over each other if an influx of data causes container runtime to exceed cadencePeriod.
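To make the overlap concrete, here is a small worked sketch. It assumes a fixed schedule period and a constant runtime per run, which is a simplification. Under that assumption, the largest number of instances running at once is the runtime divided by the cadence period, rounded up. The function name and the example numbers are hypothetical.

```python
import math


def max_concurrent_runs(runtime_minutes, cadence_minutes):
    """Upper bound on overlapping runs for a fixed-period schedule.

    A new run starts every cadence_minutes and each run lasts runtime_minutes,
    so at most ceil(runtime / cadence) runs are active at the same time.
    """
    if runtime_minutes <= 0 or cadence_minutes <= 0:
        raise ValueError("runtime and cadence must be positive")
    return math.ceil(runtime_minutes / cadence_minutes)


# Hypothetical numbers: a 150-minute run on a 60-minute cadence.
print(max_concurrent_runs(150, 60))  # prints 3
```

Any runtime at or below the cadence period gives 1, which is the case the ticket wants to guarantee.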

alexdunnjpl · 2024-04-12T06:15:58Z

Possibly-related:

NASA-PDS/registry-sweepers#31
NASA-PDS/registry-sweepers#60

alexdunnjpl added B14.1 task i&t.skip labels Apr 3, 2024

alexdunnjpl assigned jordanpadams and sjoshi-jpl Apr 3, 2024

jordanpadams transferred this issue from NASA-PDS/operations Apr 3, 2024

This was referenced Apr 23, 2024

Registry-Sweeper ECS Enchancements (Post Multi-Tenancy) NASA-PDS/registry-sweepers#60

Closed

Implement Registry Multi-tenancy with Cognito in the loop NASA-PDS/registry#185

Closed

jordanpadams mentioned this issue Sep 25, 2024

Wrap up multi-tenancy Registry Migration NASA-PDS/registry#326

Open
