Skip to content

Commit

Permalink
merge test (#2)
Browse files Browse the repository at this point in the history
* feat: add support for KV cache quantization options (abetlen#1307)

* add KV cache quantization options

abetlen#1220
abetlen#1305

* Add ggml_type

* Use ggml_type instead of string for quantization

* Add server support

---------

Co-authored-by: Andrei Betlen <[email protected]>

* fix: Changed local API doc references to hosted (abetlen#1317)

* chore: Bump version

* fix: last tokens passing to sample_repetition_penalties function (abetlen#1295)

Co-authored-by: ymikhaylov <[email protected]>
Co-authored-by: Andrei <[email protected]>

* feat: Update llama.cpp

* fix: segfault when logits_all=False. Closes abetlen#1319

* feat: Binary wheels for CPU, CUDA (12.1 - 12.3), Metal (abetlen#1247)

* Generate binary wheel index on release

* Add total release downloads badge

* Update download label

* Use official cibuildwheel action

* Add workflows to build CUDA and Metal wheels

* Update generate index workflow

* Update workflow name

* feat: Update llama.cpp

* chore: Bump version

* fix(ci): use correct script name

* docs: LLAMA_CUBLAS -> LLAMA_CUDA

* docs: Add docs explaining how to install pre-built wheels.

* docs: Rename cuBLAS section to CUDA

* fix(docs): incorrect tool_choice example (abetlen#1330)

* feat: Update llama.cpp

* fix: missing logprobs in response, incorrect response type for functionary, minor type issues. Closes abetlen#1328 abetlen#1314

* fix: missing logprobs in response, incorrect response type for functionary, minor type issues. Closes abetlen#1328 Closes abetlen#1314

* feat: Update llama.cpp

* fix: Always embed metal library. Closes abetlen#1332

* feat: Update llama.cpp

* chore: Bump version

---------

Co-authored-by: Limour <[email protected]>
Co-authored-by: Andrei Betlen <[email protected]>
Co-authored-by: lawfordp2017 <[email protected]>
Co-authored-by: Yuri Mikhailov <[email protected]>
Co-authored-by: ymikhaylov <[email protected]>
Co-authored-by: Sigbjørn Skjæret <[email protected]>
  • Loading branch information
7 people authored Apr 6, 2024
1 parent 76b51c3 commit bf766bd
Showing 0 changed files with 0 additions and 0 deletions.

0 comments on commit bf766bd

Please sign in to comment.