When using llama.cpp via LocalAI, after running for a while the error "failed to find free space in the KV cache" appears. The responses it can generate gradually get shorter, and eventually it stops responding altogether.
I looked at past issues such as #4185, but I don't really understand how to solve this.
I would like to either clear the KV cache or increase its capacity — how can I do that?
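For what it's worth, in llama.cpp the KV cache is allocated based on the context size (the `-c`/`--ctx-size` option), so enlarging the context window enlarges the cache. When running through LocalAI, this is typically set per model in its YAML config. A minimal sketch, assuming a hypothetical model file named `mymodel.yaml` and weights `ggml-model.gguf` (names are illustrative, not from this issue):

```yaml
# Hypothetical LocalAI model config (e.g. models/mymodel.yaml).
# context_size is passed through to llama.cpp's context window,
# which determines how much KV cache is allocated.
name: mymodel
context_size: 4096   # larger value -> larger KV cache
parameters:
  model: ggml-model.gguf
```

Note that a larger context size increases memory usage, and it works around the symptom rather than clearing the cache; whether this resolves the error depends on how LocalAI manages sequences internally.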
The version of llama.cpp seems to be 8228b66.