
Fix a segfault with simple.cpp #3803

Merged
1 commit merged into ggerganov:master on Oct 27, 2023
Conversation

tterrasson (Contributor)

Update simple.cpp to use the new llama_batch_clear() and llama_batch_add() API and fix issue #3753.
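
For context, a minimal sketch of the pattern this change moves to, assuming the llama_batch_clear()/llama_batch_add() helpers from llama.cpp's common.h at the time of this PR; ctx, tokens_list, new_token_id, n_cur and n_len are placeholders standing in for simple.cpp's locals, and the 3-argument llama_batch_init form is assumed for this era of the API:

```cpp
// Sketch only: writing batch.token/batch.pos/batch.seq_id by hand can run past
// the arrays the batch was allocated with and leave n_tokens out of sync,
// which is the kind of bug behind the segfault in #3753. The common.h helpers
// keep those fields consistent.

// batch sized for up to 512 tokens, no embeddings, 1 sequence (assumed init form)
llama_batch batch = llama_batch_init(512, 0, 1);

// evaluate the prompt
llama_batch_clear(batch);
for (size_t i = 0; i < tokens_list.size(); i++) {
    // token id, position, sequence ids, whether to return logits
    llama_batch_add(batch, tokens_list[i], i, { 0 }, false);
}
// request logits only for the last prompt token
batch.logits[batch.n_tokens - 1] = true;

if (llama_decode(ctx, batch) != 0) {
    fprintf(stderr, "%s: llama_decode() failed\n", __func__);
    return 1;
}

// generation loop: one new token per step, reusing the same batch
while (n_cur <= n_len) {
    // ... sample new_token_id from the logits of the previous decode ...

    llama_batch_clear(batch);
    llama_batch_add(batch, new_token_id, n_cur, { 0 }, true);
    n_cur += 1;

    if (llama_decode(ctx, batch) != 0) {
        fprintf(stderr, "%s: failed to eval\n", __func__);
        return 1;
    }
}
```

The helpers increment batch.n_tokens and fill the per-token seq_id arrays themselves, which is what the direct field writes no longer did correctly after the batch layout changed.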

@KerfuffleV2 (Collaborator) left a comment

Looks good. Tested and confirmed it fixes the issue.

@KerfuffleV2 merged commit c8d6a1f into ggerganov:master on Oct 27, 2023
31 checks passed
mattgauf added a commit to mattgauf/llama.cpp that referenced this pull request Oct 27, 2023
* master: (350 commits)
  speculative : ensure draft and target model vocab matches (ggerganov#3812)
  llama : correctly report GGUFv3 format (ggerganov#3818)
  simple : fix batch handling (ggerganov#3803)
  cuda : improve text-generation and batched decoding performance (ggerganov#3776)
  server : do not release slot on image input (ggerganov#3798)
  batched-bench : print params at start
  log : disable pid in log filenames
  server : add parameter -tb N, --threads-batch N (ggerganov#3584) (ggerganov#3768)
  server : do not block system prompt update (ggerganov#3767)
  sync : ggml (conv ops + cuda MSVC fixes) (ggerganov#3765)
  cmake : add missed dependencies (ggerganov#3763)
  cuda : add batched cuBLAS GEMM for faster attention (ggerganov#3749)
  Add more tokenizer tests (ggerganov#3742)
  metal : handle ggml_scale for n%4 != 0 (close ggerganov#3754)
  Revert "make : add optional CUDA_NATIVE_ARCH (ggerganov#2482)"
  issues : separate bug and enhancement template + no default title (ggerganov#3748)
  Update special token handling in conversion scripts for gpt2 derived tokenizers (ggerganov#3746)
  llama : remove token functions with `context` args in favor of `model` (ggerganov#3720)
  Fix baichuan convert script not detecing model (ggerganov#3739)
  make : add optional CUDA_NATIVE_ARCH (ggerganov#2482)
  ...
@KerfuffleV2 (Collaborator)

At least spell "viewed" correctly if you're going to spam us with a million notifications.

olexiyb pushed a commit to Sanctum-AI/llama.cpp that referenced this pull request Nov 23, 2023