llama : add pipeline parallelism support #6017
Merged
Commits on Mar 12, 2024
- 822121f: llama : add pipeline parallelism support for batch processing with multiple CUDA GPUs
  ggml-ci
- 1ac668e
- 4ddccc2
- 937966d: llama : fix Mamba inference for pipeline parallelism
  Tested to work correctly with both `main` and `parallel` examples.
- 00a415d
- 89bfa1f: add LLAMA_SCHED_MAX_COPIES to configure the number of input copies for pipeline parallelism
  default increase to 4 (from 2)
  changing this value may improve performance for some systems, but increases memory usage
- aa1e2f8
- deb3e24
- ead5c8b
- 255c1ec
Commits on Mar 13, 2024
- 4400153
- 9e7cecc
- b25a0f1
- 529e749
- 54cdd47
- cda49d3
- 015e1bf
- 0d934ee
- 3c38789
- 9092883
- cb580a6
- 1f56481
- 976176d
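
Commit 89bfa1f notes that raising LLAMA_SCHED_MAX_COPIES may improve performance at the cost of memory. The toy model below is one rough way to picture that trade-off. It is a sketch under stated assumptions, not the ggml scheduler's actual logic: every pipeline stage (one per GPU) takes exactly one tick per micro-batch, and a micro-batch holds one input copy from the moment it enters the first stage until it leaves the last. The function name `pipeline_ticks` and its parameters are hypothetical and do not appear in the PR.

```cpp
#include <algorithm>
#include <vector>

// Toy model (assumption, not llama.cpp code): number of ticks needed to push
// n_batches micro-batches through n_stages pipeline stages when at most
// n_copies micro-batches may be in flight at once.
// Requires n_stages >= 1 and n_copies >= 1.
int pipeline_ticks(int n_stages, int n_batches, int n_copies) {
    if (n_batches <= 0) {
        return 0;
    }
    // start[i] is the tick at which micro-batch i enters stage 0.
    std::vector<int> start(n_batches, 0);
    for (int i = 1; i < n_batches; ++i) {
        // Stage 0 is free again one tick after the previous micro-batch entered it.
        int t = start[i - 1] + 1;
        if (i >= n_copies) {
            // All copies are in use: wait until micro-batch (i - n_copies)
            // has left the last stage and released its input copy.
            t = std::max(t, start[i - n_copies] + n_stages);
        }
        start[i] = t;
    }
    // The last micro-batch needs n_stages ticks to drain through the pipeline.
    return start[n_batches - 1] + n_stages;
}
```

In this model, 8 micro-batches over 4 stages take 32 ticks with 1 copy, 17 ticks with 2 copies, and 11 ticks with 4 copies. Copies beyond the number of stages give no further gain here, while each extra copy would still cost memory. Real behavior depends on the hardware and workload, as the commit message itself cautions.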