Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multi-benchmark tuning #208

Merged

Conversation

gevtushenko
Copy link
Collaborator

Description

closes #151

This PR provides several CUB tuning infrastructure improvements. It allows multiple benchmarks in a single tuning, provides a new workload to batched memcpy benchmark, and combines all segmented sort benchmarks in a single tuning. Besides that, brute force seeker now randomizes variant space to improve coverage. Benchmark is now killed when the search is terminated. All runtime workloads are now covered in a single benchmark launch, reducing the overhead of context initialization. This optimization provides about 20% speedup of one generation with five workloads per benchmark. This number should be more significant for benchmarks containing many workloads (segmented algorithms).

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@gevtushenko gevtushenko requested review from miscco and elstehle July 12, 2023 09:34
@gevtushenko gevtushenko added the benchmark Feature related to benchmarking our libraries label Jul 12, 2023
Copy link
Collaborator

@elstehle elstehle left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Approving for benchmark changes 👍

@gevtushenko gevtushenko merged commit 84fc205 into NVIDIA:main Jul 12, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
benchmark Feature related to benchmarking our libraries
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

[FEA]: Allow multiple benchmarks in a single tuning
3 participants