This repository has been archived by the owner on Oct 25, 2024. It is now read-only.

[vLLM] Support vLLM CPU backend and provide QBits acceleration #4181

Triggered via pull request May 24, 2024 01:35

synchronize #1551

Status Cancelled

Total duration 52m 37s

Artifacts –

unit-test-neuralchat.yml

on: pull_request

Matrix: neuralchat-unit-test

4 errors

Canceling since a higher priority waiting request for 'NeuralChat Unit Test-1551' exists

The operation was canceled.

Canceling since a higher priority waiting request for 'NeuralChat Unit Test-1551' exists

The operation was canceled.