Provide a capability to increase the data corpus size for a workload #254

Closed
gkamat opened this issue Apr 4, 2023 · 3 comments
Labels
enhancement New feature or request High Priority

Comments

@gkamat
Collaborator

gkamat commented Apr 4, 2023

The data corpora supplied with the included workloads are generally small, under roughly 75 GB. That is not enough for performance testing of larger clusters, scale testing, or longevity testing.

Since acquiring larger data sets is not straightforward, it would be helpful to provide a mechanism to increase the corpus size for a workload. This could be done by duplicating (and appropriately modifying) the existing documents in the corpus, or by synthesizing new documents.
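To make the duplicate-and-modify idea concrete, here is a minimal sketch, not the implementation that was eventually shipped. It assumes the corpus is newline-delimited JSON and that each document carries an integer epoch-seconds timestamp. The function name `expand_corpus`, the `@timestamp` field name, and the `shift_seconds` parameter are all hypothetical and chosen only for illustration.

```python
import io
import json


def expand_corpus(src, dst, copies, timestamp_field="@timestamp", shift_seconds=86400):
    """Write `copies` passes over the NDJSON documents in `src` to `dst`.

    Pass 0 reproduces the documents unchanged. Pass k adds k * shift_seconds
    to the timestamp field (assumed to be integer epoch seconds) so that the
    duplicated documents do not collide in time with the originals.
    Returns the number of documents written.
    """
    written = 0
    for k in range(copies):
        src.seek(0)  # re-read the source on each pass instead of holding it in memory
        for line in src:
            if not line.strip():
                continue  # skip blank lines
            doc = json.loads(line)
            if timestamp_field in doc:
                doc[timestamp_field] += k * shift_seconds
            dst.write(json.dumps(doc) + "\n")
            written += 1
    return written


# Small in-memory example; a real corpus would use files opened on disk.
source = io.StringIO(
    '{"@timestamp": 100, "status": 200}\n'
    '{"@timestamp": 160, "status": 404}\n'
)
target = io.StringIO()
count = expand_corpus(source, target, copies=3, shift_seconds=1000)
expanded = [json.loads(line) for line in target.getvalue().splitlines()]
```

Shifting only the timestamp keeps the sketch short. Queries that depend on specific time ranges or value distributions would likely need to be regenerated against the expanded data.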

@dblock
Member

dblock commented Apr 25, 2023

Is this a dup/subset of #253?

@gkamat
Collaborator Author

gkamat commented May 25, 2023

> Is this a dup/subset of #253?

It is a child issue, referenced in that one. Additional issues will be added to that parent issue as work progresses on these items.

@gkamat
Collaborator Author

gkamat commented May 25, 2023

This capability is now available for the http_logs workload. Several enhancements are possible, including modifying the documents and regenerating queries, and adding a similar capability to the other workloads, but those will be addressed in dedicated issues that will be opened separately.

Closing this one.
