llama: significance of truncating input to `ctxLen - 4` #16

iboB · 2024-07-22T08:43:16Z

Why truncate to ctxLen - 4? Why is that 4 significant.

This is kept for now as per llama.cpp demos, but we should investigate.

The text was updated successfully, but these errors were encountered:

pminev · 2024-08-13T10:48:21Z

I was looking at the commits and PRs:

Longer and infinite output ggerganov/llama.cpp#71
ggerganov/llama.cpp@e2d490d
this was a linked problem: https://github.com/ggerganov/llama.cpp/pull/1789/files,

Finally I talked with G.Gerganov and it was added to secure space (at least 4) in KV cache for the new generated tokens. I didn't ask him further why it's explicitly 4, but it seems like when new tokens are generated the input will be truncated, in order to have enough space again.

iboB · 2024-10-04T11:30:02Z

this can be closed now. There is a link to this issue in the code for reference

iboB added the question Further information is requested label Jul 22, 2024

iboB added this to Local SDK MVP Jul 22, 2024

github-project-automation bot moved this to Todo in Local SDK MVP Jul 22, 2024

iboB added the good first issue Good for newcomers label Jul 29, 2024

iboB assigned pminev Jul 29, 2024

iboB added the i:llama label Oct 2, 2024

iboB closed this as completed Oct 4, 2024

github-project-automation bot moved this from In Progress to Done in Local SDK MVP Oct 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama: significance of truncating input to `ctxLen - 4` #16

llama: significance of truncating input to `ctxLen - 4` #16

iboB commented Jul 22, 2024

pminev commented Aug 13, 2024

iboB commented Oct 4, 2024

llama: significance of truncating input to ctxLen - 4 #16

llama: significance of truncating input to ctxLen - 4 #16

Comments

iboB commented Jul 22, 2024

pminev commented Aug 13, 2024

iboB commented Oct 4, 2024

llama: significance of truncating input to `ctxLen - 4` #16

llama: significance of truncating input to `ctxLen - 4` #16