[Scaling Investigation] Stress Load Generation Host & Determine Max Clients Per Worker Actor #558

IanHoang · 2024-06-20T20:00:03Z

Experiment 2:

This experiment is part of the scale testing RFC; see the RFC for more details.

To see other experiments in this analysis, see the META issue.

How many clients can OSB simulate on a single load generation host?
What is the max number of clients that a worker actor can have?

To answer these questions, we will run rounds on load generation hosts with varying numbers of physical CPU cores and amounts of RAM to determine how many search clients a single load generation host can successfully simulate. Contrary to the usual advice of running OSB on an over-provisioned load generation host to avoid bottlenecks, we will focus on finding the point at which the number of clients causes a bottleneck in the load generation host. It's believed that a single physical CPU core can handle up to 2500 simulated clients. Since the number of worker actors provisioned by OSB is constrained by the number of vCPUs, it is also worth finding out at what point there are too many clients per worker.
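As a rough illustration of the rule of thumb above, the following sketch estimates how many physical cores (and vCPUs) a load generation host would need for a given client count. The 2500 figure is the unverified belief this experiment sets out to test, and the 2 vCPUs per physical core ratio is an assumption about hyperthreaded instances; the function names are hypothetical and not part of OSB.

```python
import math

# Belief stated in this issue, not a measured value.
CLIENTS_PER_PHYSICAL_CORE = 2500
# Assumption: hyperthreading is enabled, so each physical core exposes 2 vCPUs.
VCPUS_PER_PHYSICAL_CORE = 2


def cores_needed(clients: int, clients_per_core: int = CLIENTS_PER_PHYSICAL_CORE) -> int:
    """Smallest number of physical cores that covers `clients` under the rule of thumb."""
    if clients < 0:
        raise ValueError("clients must be non-negative")
    return math.ceil(clients / clients_per_core)


def vcpus_needed(clients: int, vcpus_per_core: int = VCPUS_PER_PHYSICAL_CORE) -> int:
    """Convert the physical-core estimate into vCPUs."""
    return cores_needed(clients) * vcpus_per_core


if __name__ == "__main__":
    for n in (2500, 5000, 10000, 20000):
        print(f"{n} clients: {cores_needed(n)} cores, {vcpus_needed(n)} vCPUs")
```

If the rounds show bottlenecks earlier or later than this estimate, the constant can be replaced with the observed value.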

It's worth mentioning that this experiment has limited scope, as it focuses only on a few instance types from a single instance family and a specific cloud provider (AWS). We also suspect that finding a definitive answer to these questions will be difficult, since the number of clients a load generation host can simulate depends on other factors, such as workload type, SUT characteristics, network, and disk storage. However, there is still value in investigating this to get more clarity. Users will be able to use these results to make educated decisions about how many physical CPU cores and GB of RAM their load generation hosts should have, and can avoid overpaying for over-provisioned load generation hosts.

The following rounds will be run. Any bottlenecks encountered will give us a better idea of how many physical CPU cores and GB of RAM are needed to simulate N clients.

| LG Hosts with OpenSearch Benchmark | Simulated Clients (search_clients:N) | Instance Type | Instance Count | vCPUs | Memory (GB) | Clients Per Worker Actor |
|---|---|---|---|---|---|---|
| Round 1 | 2500 | c5.large | 1 | 2 | 4 | 1250 |
| Round 2 | 5000 | c5.xlarge | 1 | 4 | 8 | 2500 |
| Round 3 | 10000 | c5.2xlarge | 1 | 8 | 16 | 5000 |
| Round 4 | 20000 | c5.4xlarge | 1 | 16 | 32 | 10000 |
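For reference, the per-round ratios can be recomputed from the client and vCPU columns. The sketch below assumes one worker actor per vCPU with clients spread evenly, and 2 vCPUs per physical core; both are assumptions for illustration, and the "Clients Per Worker Actor" column above may have been derived differently. The helper names are hypothetical.

```python
# (round, simulated clients, instance type, vCPUs), copied from the table above.
ROUNDS = [
    ("Round 1", 2500, "c5.large", 2),
    ("Round 2", 5000, "c5.xlarge", 4),
    ("Round 3", 10000, "c5.2xlarge", 8),
    ("Round 4", 20000, "c5.4xlarge", 16),
]


def summarize(rounds, vcpus_per_core=2):
    """Return per-round client density under the stated assumptions."""
    rows = []
    for name, clients, instance, vcpus in rounds:
        physical_cores = vcpus / vcpus_per_core  # assumption: 2 vCPUs per core
        rows.append({
            "round": name,
            "instance": instance,
            # Assumption: one worker actor per vCPU, clients split evenly.
            "clients_per_vcpu": clients / vcpus,
            "clients_per_physical_core": clients / physical_cores,
        })
    return rows


if __name__ == "__main__":
    for row in summarize(ROUNDS):
        print(row)
```

Under these assumptions every round works out to the same density per vCPU and per physical core, so the rounds differ mainly in the absolute client count on the host.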

IanHoang added enhancement New feature or request untriaged labels Jun 20, 2024

IanHoang mentioned this issue Jun 20, 2024

[META] Scale-Up Improvements on Single Load Generation Host #505

Closed

4 tasks

IanHoang changed the title ~~[Scale Testing] Experiment 2~~ [Scale Testing] Experiment 2: Max Clients Per Worker Jun 20, 2024

IanHoang changed the title ~~[Scale Testing] Experiment 2: Max Clients Per Worker~~ [Scale Testing] Experiment 2: Max Clients Per Worker Actor Jun 20, 2024

IanHoang added Child Issue and removed untriaged labels Jun 20, 2024

IanHoang self-assigned this Jun 20, 2024

IanHoang added this to Search Project Board and OpenSearch Engineering Effectiveness Jun 20, 2024

github-project-automation bot moved this to 🆕 New in Search Project Board Jun 20, 2024

github-project-automation bot moved this to Backlog in OpenSearch Engineering Effectiveness Jun 20, 2024

IanHoang changed the title ~~[Scale Testing] Experiment 2: Max Clients Per Worker Actor~~ [Scaling Investigation] Experiment 2: Determine Max Clients Per Worker Actor Jul 24, 2024

IanHoang changed the title ~~[Scaling Investigation] Experiment 2: Determine Max Clients Per Worker Actor~~ [Scaling Investigation] Determine Max Clients Per Worker Actor Jul 24, 2024

IanHoang mentioned this issue Jul 24, 2024

[Scaling Investigation] Validate Client Simulation Accuracy #557

Open

IanHoang changed the title ~~[Scaling Investigation] Determine Max Clients Per Worker Actor~~ [Scaling Investigation] Stress Load Generation Host & Determine Max Clients Per Worker Actor Jul 24, 2024

getsaurabh02 moved this from 🆕 New to Later (6 months plus) in Search Project Board Aug 15, 2024
