Allow to benchmark multi-machine clusters #71
Comments
What's the status of this issue?
It's in the early concept stage. We use GitHub's milestone feature, and it's planned for 0.5.0 (no release date). The current "workaround" is to use the "benchmark-only" pipeline, as I mentioned in the description of this ticket, provided you are able to apply enough load with a single Rally instance. I'd expect that you can apply a bit more load once #108, which I am currently working on, is ready.
Gotcha, thanks!
With this commit we change the client model to one process per client instead of one thread per client. We also allow queries to be run by more than one client. Clients communicate internally via an actor system, so we are already preparing Rally for truly distributed benchmarks, which will be implemented in #71. Closes #58 Closes #64 Closes #108
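As a rough illustration of the "one process per client, talking via messages" idea described in that commit, here is a minimal sketch. It is not Rally's implementation: a plain `multiprocessing` queue stands in for the actor system, the names `client` and `run_clients` are hypothetical, and the `fork` start method is assumed (POSIX only) so that the sketch runs as a plain script.

```python
import multiprocessing as mp

# Assumption: POSIX "fork" start method; a queue stands in for an actor system.
_ctx = mp.get_context("fork")


def client(client_id, queries, results):
    """Hypothetical client: runs in its own process and reports via messages."""
    for query in queries:
        # A real client would send the query to the cluster; here we only echo it.
        results.put((client_id, query))
    results.put((client_id, None))  # sentinel message: this client is done


def run_clients(num_clients, queries):
    """Start one process per client and collect the messages they send back."""
    results = _ctx.Queue()
    procs = [
        _ctx.Process(target=client, args=(i, queries, results))
        for i in range(num_clients)
    ]
    for p in procs:
        p.start()
    received, finished = [], 0
    while finished < num_clients:
        client_id, query = results.get()
        if query is None:
            finished += 1
        else:
            received.append((client_id, query))
    for p in procs:
        p.join()
    # Messages from different processes interleave, so sort for a stable result.
    return sorted(received)


if __name__ == "__main__":
    print(run_clients(2, ["match_all", "term"]))
```

Because the clients only exchange messages with the coordinator, the same structure could in principle be spread across machines, which is the direction the commit message points to.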
We'll implement this gradually, starting with support for single-machine clusters in #184. Then we can extend Rally to multi-machine clusters. This will reduce the risk of adding too many changes at once (as we will then also need a way to define the cluster topology).
For the first implementation we'll assume that every node has the same configuration, i.e. there is no possibility to define dedicated node roles (master, data, coordinator, ...) or different node configurations. We will allow multiple nodes per host, though.
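To make the constraint concrete, the following sketch lays out a homogeneous topology with several nodes per host. Everything in it is an assumption for illustration only: the `Node` type, the `build_topology` helper, the node naming scheme and the use of consecutive HTTP ports starting at 9200 are not taken from Rally.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    """Hypothetical node description; every node shares the same configuration."""
    name: str
    host: str
    http_port: int


def build_topology(hosts, nodes_per_host, base_port=9200):
    """Lay out identically configured nodes, possibly several per host."""
    nodes = []
    for host in hosts:
        for i in range(nodes_per_host):
            # Nodes on the same host need distinct ports; using consecutive
            # ports is an assumption made for this sketch.
            nodes.append(Node(name=f"{host}-node-{i}", host=host, http_port=base_port + i))
    return nodes


if __name__ == "__main__":
    for node in build_topology(["10.0.0.1", "10.0.0.2"], nodes_per_host=2):
        print(node)
```

There is deliberately no per-node role or configuration field here, which mirrors the restriction stated for the first implementation.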
Having Rally set-up multi-node clusters can be done now by specifying the respective nodes with Examples:
Currently, it is only possible to benchmark on a single machine, except for the special case where we run with --pipeline=benchmark-only, which (a) puts the burden of provisioning on the user and (b) does not gather system metrics (like CPU usage or index size). Rally should also be able to run benchmarks on clusters that span multiple machines.
In this ticket we want to collect the high-level ideas around making this possible. Work should be done in smaller, more focused tickets. At least these areas need to change:
Note: Distribution of the load generator is handled separately in #257.