Load Balancing #34

wzh1994 · 2024-06-24T06:26:06Z

During inference, when the available number of GPUs exceeds the demand, the tasks are distributed across multiple computing nodes. This is achieved through a load balancing mechanism to manage the task execution.

wzh1994 added the V0.2 label Jun 24, 2024

wzh1994 added this to the LazyLLM v0.2 milestone Jun 24, 2024

wzh1994 modified the milestones: LazyLLM v0.2, LazyLLM v0.3 Aug 14, 2024

wzh1994 added v0.3 and removed V0.2 labels Aug 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Load Balancing #34

Load Balancing #34

wzh1994 commented Jun 24, 2024

Load Balancing #34

Load Balancing #34

Comments

wzh1994 commented Jun 24, 2024