Experiment 1 and 2: Scale Testing OSB with Distributed Workload Generation #415

IanHoang · 2023-11-15T20:28:03Z

Background

We're aware that OSB has limitations, but we don't know exactly under what circumstances they occur. For example, it's known that OSB can test single node and small multi node (3,4,5 etc.) OpenSearch clusters. However, when OSB tests a really large cluster (unclear on how large but let's just say 20 nodes), it tends to fail. To combat this, OSB has a feature called Distributed Workload Generation (DWG) which basically allows users to get over that hurdle of testing large 20 node clusters.

However, DWG has only been used to a certain extent and we do not know its limitations. Some questions we want to answer for this feature are:

How many nodes can we use for DWG? Is it only max 3 nodes?
What parameters do we need to ensure that users include when using DWG?
Does running DWG on a single node cluster or small multi node clusters perform better than just using standard OSB (1 node) ?
These aren't the only questions but these have been unanswered for some time.

See RFC #416 for more details.

Strategy

Experiment 1: Comparing OSB Without DWG and OSB With DWG

We should perform a series of tests for standard OSB And DWG of various clusters. When testing OSB without DWG and with DWG, we should run the following tests.

workloads: nyc_taxis
Run against: single node cluster, 3M3D multi-node cluster, 3M15D multi-node cluster
For each cluster, run the workload ~5 times (to collect consistent data)

In other words, we should run 30 tests total, 15 tests for each series.

Experiment 2: How many nodes can OSB use with DWG

workloads: nyc_taxis
Run against: 3M3D multi-node cluster
Run a few rounds, one with 3 LG Hosts, then 5 LG Hosts, and then try 10 LG Hosts

If possible, since we're running this with a search workload, we should also run this with a log analytics workload such as http_logs.

When assessing the results, we should aim to see if there are performance differences and when they occur.

IanHoang · 2024-04-05T17:14:49Z

Newer and improved issue here: #506

IanHoang added the enhancement New feature or request label Nov 15, 2023

github-actions bot added the untriaged label Nov 15, 2023

IanHoang added RFC Request for comment on major changes and removed untriaged RFC Request for comment on major changes labels Nov 15, 2023

IanHoang changed the title ~~Scale Testing and Understanding Distributed Workload Generation~~ Scale Testing OSB with Distributed Workload Generation Nov 15, 2023

IanHoang changed the title ~~Scale Testing OSB with Distributed Workload Generation~~ Experiment 1 and 2: Scale Testing OSB with Distributed Workload Generation Nov 15, 2023

IanHoang added the Medium Priority label Feb 15, 2024

gkamat added this to Performance Roadmap Mar 26, 2024

github-project-automation bot moved this to Todo in Performance Roadmap Mar 26, 2024

IanHoang closed this as completed Apr 5, 2024

github-project-automation bot moved this from Todo to Done in Performance Roadmap Apr 5, 2024

github-project-automation bot added this to OpenSearch Benchmark Roadmap Aug 30, 2024

github-project-automation bot moved this to Completed in OpenSearch Benchmark Roadmap Aug 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Experiment 1 and 2: Scale Testing OSB with Distributed Workload Generation #415

Experiment 1 and 2: Scale Testing OSB with Distributed Workload Generation #415

IanHoang commented Nov 15, 2023 •

edited

Loading

IanHoang commented Apr 5, 2024

Experiment 1 and 2: Scale Testing OSB with Distributed Workload Generation #415

Experiment 1 and 2: Scale Testing OSB with Distributed Workload Generation #415

Comments

IanHoang commented Nov 15, 2023 • edited Loading

Background

Strategy

Experiment 1: Comparing OSB Without DWG and OSB With DWG

Experiment 2: How many nodes can OSB use with DWG

IanHoang commented Apr 5, 2024

IanHoang commented Nov 15, 2023 •

edited

Loading