Skip to content
This repository has been archived by the owner on Oct 25, 2024. It is now read-only.

[vLLM] Support vLLM CPU backend and provide QBits acceleration #4181

[vLLM] Support vLLM CPU backend and provide QBits acceleration

[vLLM] Support vLLM CPU backend and provide QBits acceleration #4181

Triggered via pull request May 24, 2024 01:35
Status Cancelled
Total duration 52m 37s
Artifacts

unit-test-neuralchat.yml

on: pull_request
Matrix: neuralchat-unit-test
Generate-NeuralChat-Report
0s
Generate-NeuralChat-Report
Fit to window
Zoom out
Zoom in

Annotations

4 errors
neuralchat-unit-test-PR-test
Canceling since a higher priority waiting request for 'NeuralChat Unit Test-1551' exists
neuralchat-unit-test-PR-test
The operation was canceled.
neuralchat-unit-test-baseline
Canceling since a higher priority waiting request for 'NeuralChat Unit Test-1551' exists
neuralchat-unit-test-baseline
The operation was canceled.