Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Benchmark batched memcpy #136

Merged

Conversation

gevtushenko
Copy link
Collaborator

@gevtushenko gevtushenko commented Jun 28, 2023

Description

Apart from fixing #135, this issue addresses NVIDIA/cub#719 related issue by making nvbench helper object library instead of shared one. Also adds a pair plot to help see correlation in tuning parameters.

pairplot

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@gevtushenko gevtushenko requested a review from elstehle June 28, 2023 15:30
@jrhemstad jrhemstad linked an issue Jun 28, 2023 that may be closed by this pull request
1 task
Copy link
Collaborator

@elstehle elstehle left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Curious about the tuning policies we'll arrive at. Thanks for moving some hard-coded parameters into the tuning policies.

@gevtushenko gevtushenko merged commit eb78562 into NVIDIA:monorepo Jun 30, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

[FEA]: Implement batched memcpy benchmark / tuning
3 participants