Does mistral support multiple gpus? #451

NiuBlibing · 2024-06-20T13:10:37Z

Describe the bug
I build mistral with cargo build --release --features "cuda flash-attn" and run model with ./target/release/mistralrs-server --port 1234 -n 8 plain -m ./Qwen/Qwen2-72B-Instruct/ -a qwen2 on 8*a100 device, the nvitop shown only one gpu' memory is growing and then oom.

Latest commit
3a79137

The text was updated successfully, but these errors were encountered:

b0xtch · 2024-06-21T05:15:29Z

#375

I don't think so

NiuBlibing · 2024-06-21T06:28:30Z

#395

EricLBuehler · 2024-06-21T16:36:53Z

Hi @NiuBlibing! Multiple GPUs is not supported yet, but I will add cross-GPU mapping support first before NCCL support.

NiuBlibing added the bug Something isn't working label Jun 20, 2024

NiuBlibing closed this as completed Jun 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Does mistral support multiple gpus? #451

Does mistral support multiple gpus? #451

NiuBlibing commented Jun 20, 2024

b0xtch commented Jun 21, 2024

NiuBlibing commented Jun 21, 2024

EricLBuehler commented Jun 21, 2024

Does mistral support multiple gpus? #451

Does mistral support multiple gpus? #451

Comments

NiuBlibing commented Jun 20, 2024

b0xtch commented Jun 21, 2024

NiuBlibing commented Jun 21, 2024

EricLBuehler commented Jun 21, 2024