Skip to content

CUDA: faster q8_0 -> f16 dequantization#4895

Merged
JohannesGaessler merged 1 commit intoggerganov:masterfrom JohannesGaessler:cuda-faster-q8_0-to-half-2Jan 12, 2024

Commits

Commits on Jan 12, 2024