-
Notifications
You must be signed in to change notification settings - Fork 195
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Experimental Python cooperative algorithms #1973
Merged
Merged
+3,156
−27
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This is a bit more complex since this is the first subproject that doesn't live in a directory with the same name as the project.
This reverts commit 058bb5f2725a70fa8b4cd8ff39b84e0c4c53c2b0.
pre-commit.ci autofix |
/ok to test |
/ok to test |
/ok to test |
🟩 CI finished in 2h 36m: Pass: 100%/421 | Total: 2d 04h | Avg: 7m 31s | Max: 57m 47s | Hits: 95%/523017
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
+/- | CUB |
Thrust | |
CUDA Experimental | |
+/- | pycuda |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
+/- | pycuda |
🏃 Runner counts (total jobs: 421)
# | Runner |
---|---|
305 | linux-amd64-cpu16 |
65 | linux-amd64-gpu-v100-latest-1 |
28 | linux-arm64-cpu16 |
23 | windows-amd64-cpu16 |
/ok to test |
jrhemstad
approved these changes
Jul 11, 2024
/ok to test |
🟨 CI finished in 2h 08m: Pass: 99%/421 | Total: 2d 05h | Avg: 7m 40s | Max: 50m 24s | Hits: 96%/523017
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
+/- | CUB |
Thrust | |
CUDA Experimental | |
+/- | pycuda |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
+/- | pycuda |
🏃 Runner counts (total jobs: 421)
# | Runner |
---|---|
305 | linux-amd64-cpu16 |
65 | linux-amd64-gpu-v100-latest-1 |
28 | linux-arm64-cpu16 |
23 | windows-amd64-cpu16 |
/ok to test |
/ok to test |
CI failures are unrelated network issues and so I'm admin merging this. |
🟨 CI finished in 4h 30m: Pass: 99%/421 | Total: 2d 01h | Avg: 7m 06s | Max: 59m 34s | Hits: 94%/521318
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
+/- | CUB |
Thrust | |
CUDA Experimental | |
+/- | pycuda |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
+/- | pycuda |
🏃 Runner counts (total jobs: 421)
# | Runner |
---|---|
305 | linux-amd64-cpu16 |
65 | linux-amd64-gpu-v100-latest-1 |
28 | linux-arm64-cpu16 |
23 | windows-amd64-cpu16 |
pciolkosz
pushed a commit
to pciolkosz/cccl
that referenced
this pull request
Jul 17, 2024
* Python exposure of cooperative algorithms * Update inpect changes to be aware of pycudax. This is a bit more complex since this is the first subproject that doesn't live in a directory with the same name as the project. * Print working directory when running CI commands. * Update CI for pycudax. * Update module name in CI This reverts commit 058bb5f2725a70fa8b4cd8ff39b84e0c4c53c2b0. * [pre-commit.ci] auto code formatting * Remove accidental directory * Fix Thrust pair docs * Fix pkg resource usage --------- Co-authored-by: Allison Piper <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
1 task
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
This PR adds Python library with speed-of-light cooperative block and warp level reduction, prefix sum, merge and radix sort.The API is in experimental stage, so there should be no concern with API stability for now. New sphinx project is intentionally omitted in the top-level toctree for now.
Below is the performance comparison of segmented prefix sum implemented with block scan (prefix callback overload) in C++ and Python:
Checklist