-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Run GptSession without openmpi? #1220
Comments
The code reference seems outdated. @haohuanw , can you please post the snippet that forces the MPI initialization? Generally, the code base is tightly integrated with MPI. If it is an option, the easy way forward would be installing MPI in your environment. Also, note that |
basically Previously, i just need to make sure |
@MartinMarciniszyn i had a chance to check new executor interface. to me it seems that if i can fake a i currently see that there are no method in communicator that i can override, do you know when would that happen? |
The code in KV Cache Manager has been modified. It should not require initialization of MPI for single device setups anymore. This will be released next Tuesday. |
I have some relatively basic use cases (no parallelism) that I am able to run in environments that doesn't have MPI installed. However, i found that since 0130 release #1019 MPI is force initialized in this function call: https://github.com/NVIDIA/TensorRT-LLM/blob/main/cpp/tensorrt_llm/runtime/gptSession.cpp#L191-L192 which prevents me to do a trt-llm upgrade.
since this function is part of closed source, is it possible to update the function to not initialize MPI when it is not necessary?
The text was updated successfully, but these errors were encountered: