Test impact of using refresh: false for task manager internals #99444
Comments
Pinging @elastic/apm-ui (Team:apm) |
Pinging @elastic/kibana-alerting-services (Team:Alerting Services) |
Thanks Dario, added to our project for triage 👍 I would separate the additional instrumentation from this change. These are two distinct deliverables, and delivering them together will make it harder to tell what the source of an issue is. |
This issue seems to be a part of the current one or should be implemented as the next step. |
Based on the discussion with Dario, here are some details about setting up a local environment:
|
This issue didn't make it part of the 7.15 planning. @YulNaumenko, how much effort is left before resolving this? We are planning to move this to the backlog for now. |
Most of the remaining effort is in the functional and performance testing. It looks like loe:week |
Closing as done |
@ymao1 Did this get implemented or what was the outcome of the perf test? |
@mikecote just to check: the default from the SO client is 'wait_for', no? Do you mean that the Alerting/Task Manager sets it to |
That's correct, so we had to change a bunch of places to
I think it was bad enough when using refresh: wait_for that we couldn't run many tasks per minute nor GA anything 🙈 |
The linked PR in this ticket is tagged as 7.14. How do you reconcile that with:
Do you define "beginning of alerting" as GA? Because I cannot see how it is true in any other case. When did alerting go GA? |
There might be a few refresh settings that are missing or that have been added over time, but there isn't a specific release where we would see significant changes to alerting/task manager performance. Alerting went GA in 7.11. Could be cool to compare 7.11 to now though if someone had spare cycles. |
@mikecote have you looked at APM traces? Or more generally, how have you verified your assumptions are correct? E.g., the PR I put up as a result of me looking into this has 4 changes - out of those 4, only one place actually seems to have added a |
In #99160, we are considering setting refresh to false for Task Manager internal operations, like creating/updating/deleting tasks. By default, the Saved Objects client will use wait_for, which means that it will keep a connection open to Elasticsearch until a shard gets refreshed. By default, the refresh interval is 1 second. This means that workers are not freed up as quickly as they could be, which can have a negative impact on the rate of tasks that can be executed.
We should test not just the functionality, but also the performance impact, by running a small (local) load test.
We should also do a larger-scale load test, but given the complexity, we will address that separately.
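To get a feel for why the refresh setting could matter for throughput, here is a rough back-of-the-envelope sketch. It is not Task Manager code: the `WorkerModel` type, the `estimateTasksPerMinute` function, and all the numbers are hypothetical, and it assumes the worst case where every wait_for write blocks for a full refresh interval. A real load test is still needed to confirm the actual impact.

```typescript
// Hypothetical model for illustration only; names and numbers are assumptions,
// not taken from Task Manager or the Saved Objects client.
type RefreshSetting = 'wait_for' | false;

interface WorkerModel {
  workers: number; // concurrent task workers
  writesPerTask: number; // task document writes per task run
  taskRunMs: number; // time spent running the task itself
  writeMs: number; // time for a write, excluding any refresh wait
  refreshIntervalMs: number; // index refresh interval (1 second by default)
}

// Estimates an upper bound on tasks per minute under the given refresh setting.
// Assumes the worst case for wait_for: each write waits a full refresh interval.
function estimateTasksPerMinute(model: WorkerModel, refresh: RefreshSetting): number {
  const waitMs = refresh === 'wait_for' ? model.refreshIntervalMs : 0;
  const perTaskMs = model.taskRunMs + model.writesPerTask * (model.writeMs + waitMs);
  return Math.floor((model.workers * 60000) / perTaskMs);
}

const example: WorkerModel = {
  workers: 10,
  writesPerTask: 2,
  taskRunMs: 100,
  writeMs: 10,
  refreshIntervalMs: 1000,
};

console.log('wait_for:', estimateTasksPerMinute(example, 'wait_for'));
console.log('false:', estimateTasksPerMinute(example, false));
```

Under these made-up numbers, the time spent waiting on refreshes dominates the time per task, which is the effect the local load test should try to measure.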
cc @pmuellr @gmmorris