Create Workload Improvement: Multi-Process Index Extraction #375

IanHoang · 2023-09-25T22:57:19Z

Is your feature request related to a problem? Please describe.

When using OpenSearch Benchmark's create-workload feature to extract data from clusters, users often extract more than one index. For example, a user might have a cluster composed of 5 indices, and OSB will extract each corpus sequentially, one index at a time. This is time-consuming and could be made more efficient. For larger data sets, it can also lead to read timeouts like the one below:

[ERROR] Cannot create-workload. ConnectionTimeout caused by - ReadTimeoutError(HTTPSConnectionPool(host='<example host endpoint>', port=443): Read timed out. (read timeout=10)).

Describe the solution you'd like

To speed up the process, OSB could use N processes to extract N indices in parallel.
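A minimal sketch of the idea, not OSB's actual implementation and not the code from any linked PR: one task per index is submitted to a process pool and the results are collected as they finish. The names `extract_index` and `extract_indices` are hypothetical, and `extract_index` is only a stub standing in for the real per-index extraction. The `max_workers` cap and the injectable `executor_cls` are assumptions added for illustration.

```python
from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor, as_completed


def extract_index(index_name):
    """Hypothetical stand-in for the per-index extraction step.

    A real implementation would read the index from the cluster and write
    its documents to a corpus file; this stub only returns a fake summary.
    """
    return {"index": index_name, "corpus_file": f"{index_name}-documents.json"}


def extract_indices(index_names, extract_fn=extract_index,
                    executor_cls=ProcessPoolExecutor, max_workers=None):
    """Extract several indices concurrently, one task per index.

    Returns (results, failures): results maps each index name to the value
    returned by extract_fn, failures maps each index name to the exception
    that extract_fn raised for it.
    """
    index_names = list(index_names)
    if not index_names:
        return {}, {}

    # The request is N processes for N indices; the optional cap (an
    # assumption of this sketch) avoids spawning too many workers.
    workers = min(len(index_names), max_workers or len(index_names))

    results, failures = {}, {}
    with executor_cls(max_workers=workers) as pool:
        futures = {pool.submit(extract_fn, name): name for name in index_names}
        for future in as_completed(futures):
            name = futures[future]
            try:
                results[name] = future.result()
            except Exception as exc:
                # One failed index (for example a read timeout) should not
                # discard the indices that were extracted successfully.
                failures[name] = exc
    return results, failures
```

When run with the default `ProcessPoolExecutor`, the call to `extract_indices(...)` should sit under an `if __name__ == "__main__":` guard, and `extract_fn` must be a picklable top-level function. Passing `executor_cls=ThreadPoolExecutor` exercises the same logic with threads, which may be sufficient if the extraction is mostly waiting on network I/O.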



AkshathRaghav · 2023-11-04T00:57:54Z

@IanHoang 👋🏽, I've opened a PR for this -> #403

IanHoang added the enhancement and good first issue labels Sep 25, 2023

github-actions bot added the untriaged label Sep 25, 2023

IanHoang removed the untriaged and good first issue labels Sep 25, 2023

This was referenced Nov 1, 2023

Adding multi-process extraction to create_workloads #402

Closed

Multi-Processing Functionality for create-workload #403

Closed

IanHoang self-assigned this Jan 29, 2024

IanHoang added this to OpenSearch Engineering Effectiveness Jan 29, 2024

github-project-automation bot moved this to Backlog in OpenSearch Engineering Effectiveness Jan 29, 2024

IanHoang moved this from Backlog to In Progress in OpenSearch Engineering Effectiveness Feb 14, 2024

IanHoang mentioned this issue Feb 15, 2024

Create-Workload Improvements #463

Open

2 tasks

bbarani moved this from In Progress to on hold in OpenSearch Engineering Effectiveness Apr 4, 2024
