Run GptSession without openmpi? #1220

haohuanw · 2024-03-03T04:00:10Z

I have some relatively basic use cases (no parallelism) that I am able to run in environments that doesn't have MPI installed. However, i found that since 0130 release #1019 MPI is force initialized in this function call: https://github.com/NVIDIA/TensorRT-LLM/blob/main/cpp/tensorrt_llm/runtime/gptSession.cpp#L191-L192 which prevents me to do a trt-llm upgrade.

since this function is part of closed source, is it possible to update the function to not initialize MPI when it is not necessary?

MartinMarciniszyn · 2024-03-07T12:38:36Z

The code reference seems outdated. @haohuanw , can you please post the snippet that forces the MPI initialization? Generally, the code base is tightly integrated with MPI. If it is an option, the easy way forward would be installing MPI in your environment. Also, note that GptSession will not be actively maintained going forward. The official entry point in the API will be the Executor class.

haohuanw · 2024-03-07T16:05:20Z

basically KVCacheManager forces a MPI.SESSION initialize starting #1019 releases.

Previously, i just need to make sure WorldConfig is not constructed with mpi static function and it actually works fine.

haohuanw · 2024-03-11T17:55:15Z

@MartinMarciniszyn i had a chance to check new executor interface. to me it seems that if i can fake a Communicator class what i am looking for will be fulfilled.

i currently see that there are no method in communicator that i can override, do you know when would that happen?

MartinMarciniszyn · 2024-03-15T13:07:36Z

The code in KV Cache Manager has been modified. It should not require initialization of MPI for single device setups anymore. This will be released next Tuesday.

haohuanw changed the title ~~Run TRT-LLM without openmpi?~~ Run GptSession without openmpi? Mar 3, 2024

byshiue assigned MartinMarciniszyn Mar 5, 2024

MartinMarciniszyn closed this as completed Mar 15, 2024

kaiyux mentioned this issue Mar 19, 2024

Update TensorRT-LLM #1315

Merged

kaiyux mentioned this issue Apr 12, 2024

Update TensorRT-LLM Release branch #1445

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run GptSession without openmpi? #1220

Run GptSession without openmpi? #1220

haohuanw commented Mar 3, 2024

MartinMarciniszyn commented Mar 7, 2024

haohuanw commented Mar 7, 2024

haohuanw commented Mar 11, 2024

MartinMarciniszyn commented Mar 15, 2024

Run GptSession without openmpi? #1220

Run GptSession without openmpi? #1220

Comments

haohuanw commented Mar 3, 2024

MartinMarciniszyn commented Mar 7, 2024

haohuanw commented Mar 7, 2024

haohuanw commented Mar 11, 2024

MartinMarciniszyn commented Mar 15, 2024