Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Make sure that all workers are notified about end of execution loop (#…
…730) Currently we will have a hang at the end of script when using TP>1 and multistep scheduling. This is caused by lack of notification from driver worker about ending the execution loop. This is a workaround for this issue, by making sure that all workers are notified at the end of `llm_engine` loop. Other possible workaround could be modification of this check: https://github.com/HabanaAI/vllm-fork/blob/habana_main/vllm/engine/llm_engine.py#L1379 with `or not self.has_unfinished_requests()`.
- Loading branch information