-
Notifications
You must be signed in to change notification settings - Fork 8.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
provide caps on task manager configuration settings #56573
Comments
Pinging @elastic/kibana-alerting-services (Team:Alerting Services) |
Not sure why we'd want to raise the min of poll_interval to 1000. |
Running the polling 10x/second seems like it's going to be overkill, if it's running at that speed all the time. Seems like we're going to eventually want some kind of adaptive story. It would be fine to run 10x/second if there's a ton of short-running task to process, and they need to be processed in a timely fashion, but if you don't have anything scheduled for the next 10 minutes, that's a lot of ES calls all resulting in empty results. I've used polling systems that adapt to "idle" situations by polling differently or less often. We could also optimistically take into account the time the next task actually has to run. If it's an hour from now, we could certainly scale back the amount of polling. I think we'll get a better sense for whether we need something like this, once we instrument TM a bit more (telemetry &| logging). It's probably also true that the built-in "lag" from the polling interval provides a bit of "spreading out" a stampeding herd of new tasks. |
It was discussed to deprecate the values above our limit and then in 8.0 to remove support. Log some sort of warning until 8.0, create follow up issue for cleanup. |
…limit resolves elastic#56573 In this PR we create a new task manager limit on the config property `xpack.task_manager.max_workers` of 100, but only log a deprecation warning if that property exceeds the limit. We'll enforce the limit in 8.0. The rationale is that it's unlikely going to be useful to run with more than some number of workers, due to the amount of simultaneous work that would end up happening. In practice, too many workers can slow things down more than speed them up. We're setting the limit to 100 for now, but may increase / decrease it based on further research.
…limit (#85574) resolves #56573 In this PR we create a new task manager limit on the config property `xpack.task_manager.max_workers` of 100, but only log a deprecation warning if that property exceeds the limit. We'll enforce the limit in 8.0. The rationale is that it's unlikely going to be useful to run with more than some number of workers, due to the amount of simultaneous work that would end up happening. In practice, too many workers can slow things down more than speed them up. We're setting the limit to 100 for now, but may increase / decrease it based on further research.
…limit (elastic#85574) resolves elastic#56573 In this PR we create a new task manager limit on the config property `xpack.task_manager.max_workers` of 100, but only log a deprecation warning if that property exceeds the limit. We'll enforce the limit in 8.0. The rationale is that it's unlikely going to be useful to run with more than some number of workers, due to the amount of simultaneous work that would end up happening. In practice, too many workers can slow things down more than speed them up. We're setting the limit to 100 for now, but may increase / decrease it based on further research.
…limit (#85574) (#85872) resolves #56573 In this PR we create a new task manager limit on the config property `xpack.task_manager.max_workers` of 100, but only log a deprecation warning if that property exceeds the limit. We'll enforce the limit in 8.0. The rationale is that it's unlikely going to be useful to run with more than some number of workers, due to the amount of simultaneous work that would end up happening. In practice, too many workers can slow things down more than speed them up. We're setting the limit to 100 for now, but may increase / decrease it based on further research.
Task manager currently exposes a number of configuration properties that could cause systemic problems if set to extreme values.
Here are the current config properties:
kibana/x-pack/plugins/task_manager/server/config.ts
Lines 12 to 45 in 0a17cde
Issue #56478 specifically calls out
max_workers
andpoll_interval
to whitelist on the cloud, and so these properties are ones we should consider capping first, to keep customers from shooting themselves in the foot.Suggestions:
max_workers
: should havemax
: 100poll_interval
: currentmin
: 100; suggest changing it to 1000 (1 second)The text was updated successfully, but these errors were encountered: