
bug: Concurrent chat doesn't work on Mac Silicon #1569

Closed
2 of 6 tasks
gabrielle-ong opened this issue Oct 29, 2024 · 2 comments
Assignees
Labels
category: model running Inference ux, handling context/parameters, runtime type: bug Something isn't working
Milestone

Comments

@gabrielle-ong
Contributor

Cortex version

1.0.1-203

Describe the Bug

Mac: Concurrent chats for the same model are queued rather than run in parallel

  • Models tested: tinyllama, llama3.2
  • I expect to open 2 CLI windows / a Postman window and have concurrent chats
  • Works well with separate models (e.g., a tinyllama chat and a llama3.2 chat)

May be related to the n_parallel parameter in model.yaml

Windows, Ubuntu: Working as expected
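For reference, a sketch of the model.yaml fields that plausibly govern this behavior. The field names n_parallel and cont_batching follow the cortex.llamacpp model template; the exact values and defaults here are assumptions, not the reporter's configuration:

```yaml
# model.yaml (excerpt) -- illustrative values only
name: tinyllama
engine: cortex.llamacpp
ctx_len: 4096        # total context length, shared across decoding slots
n_parallel: 1        # number of parallel decoding slots; 1 serializes requests
cont_batching: true  # continuous batching, needed for concurrent requests
```

If n_parallel defaults to 1 on the Mac build, two simultaneous chats against the same model would be processed one after the other, which matches the queued behavior described above.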

Steps to Reproduce

No response

Screenshots / Logs

No response

What is your OS?

  • MacOS
  • Windows
  • Linux

What engine are you running?

  • cortex.llamacpp (default)
  • cortex.tensorrt-llm (Nvidia GPUs)
  • cortex.onnx (NPUs, DirectML)
@gabrielle-ong gabrielle-ong added the type: bug Something isn't working label Oct 29, 2024
@github-project-automation github-project-automation bot moved this to Investigating in Menlo Oct 29, 2024
@gabrielle-ong gabrielle-ong added the category: model running Inference ux, handling context/parameters, runtime label Oct 29, 2024
@vansangpfiev vansangpfiev moved this from Investigating to Review + QA in Menlo Oct 30, 2024
@gabrielle-ong
Contributor Author

@vansangpfiev do I need to change anything for this to work?
I redownloaded the models, but chats are still non-concurrent on my local machine and the VM test-macos-13-1,
i.e., the right chat finishes, and only then does the left chat begin.
[Screenshot attached]

@gabrielle-ong gabrielle-ong added this to the v1.0.2 milestone Nov 5, 2024
@gabrielle-ong gabrielle-ong moved this from Review + QA to Completed in Menlo Nov 5, 2024
@gabrielle-ong
Contributor Author

Works with n_parallel = 2, marking as complete
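A sketch of how the fix can be verified from the command line. The port 39281 and the OpenAI-compatible /v1/chat/completions endpoint are assumptions about the local cortex server setup, not details from this thread; adjust both to your installation. The script requires a running server, so timings are only meaningful there:

```shell
#!/bin/sh
# Sketch: fire two chat requests at once and check that they overlap.
# URL and port are assumptions about a local cortex server; adjust as needed.
URL="http://127.0.0.1:39281/v1/chat/completions"
BODY='{"model":"tinyllama","messages":[{"role":"user","content":"Count to 50."}]}'

start=$(date +%s)
# Launch both requests in the background, then wait for both to finish.
curl -s -X POST "$URL" -H "Content-Type: application/json" -d "$BODY" > /tmp/chat1.json &
curl -s -X POST "$URL" -H "Content-Type: application/json" -d "$BODY" > /tmp/chat2.json &
wait
end=$(date +%s)

# With n_parallel >= 2 the total wall time should be close to a single
# request's time; with n_parallel = 1 it is roughly doubled.
echo "total seconds: $((end - start))"
```

The same check can be done informally, as in the report above, by opening two CLI windows and watching whether both chats stream at once.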

@github-project-automation github-project-automation bot moved this from Completed to Review + QA in Menlo Nov 5, 2024
@gabrielle-ong gabrielle-ong moved this from Review + QA to Completed in Menlo Nov 5, 2024
Projects
Archived in project
Development

No branches or pull requests

2 participants