Pull requests: ggerganov/llama.cpp
vulkan: initial support for IQ4_XS quantization
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#11501
opened Jan 29, 2025 by
remyoudompheng
Correctly identify LF token for GPT-2 style BPE tokenizer
#11496
opened Jan 29, 2025 by
mgroeber9110
Start work on wave 64 optimization
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#11495
opened Jan 29, 2025 by
IMbackK
vulkan: Make Vulkan optional at runtime (#11493).
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#11494
opened Jan 29, 2025 by
daym
server : update json snippets in README.md [no ci]
examples
server
#11492
opened Jan 29, 2025 by
danbev
ggml : x2 speed for WASM by optimizing SIMD
ggml
changes relating to the ggml tensor library for machine learning
#11453
opened Jan 27, 2025 by
ngxson
llama: Add support for RWKV v7 architecture
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#11452
opened Jan 27, 2025 by
MollySophia
1 task
Optimized DeepSeek V2/V3 implementation (MLA)
python
python script changes
#11446
opened Jan 27, 2025 by
fairydreaming
Draft
ci : allow creating artifacts on PRs on demand
artifacts
Creates artifacts for pull requests
devops
improvements to build systems and github actions
llama : add option to override model tensor buffers
demo
Demonstrate some concept or idea, not intended to be merged
need feedback
Testing and feedback with results are needed
docs: update fedora cuda guide for 12.8 release
documentation
Improvements or additions to documentation
#11393
opened Jan 24, 2025 by
teihome
ggml-cpu: Add CPU backend support for KleidiAI library
ggml
changes relating to the ggml tensor library for machine learning
#11390
opened Jan 24, 2025 by
chaxu01
gguf_convert_endian.py: implement byteswapping for q4_k and q6_k
python
python script changes
#11349
opened Jan 22, 2025 by
AlekseiNikiforovIBM
cpu_pnp_strategy changes
ggml
changes relating to the ggml tensor library for machine learning
#11326
opened Jan 21, 2025 by
savesanketsw
Draft
cmake: refined conditions for math library linking on windows
ggml
changes relating to the ggml tensor library for machine learning
#11312
opened Jan 20, 2025 by
Xarbirus
ggml: reserve in gguf_writer and added const pointers as params
ggml
changes relating to the ggml tensor library for machine learning
#11297
opened Jan 18, 2025 by
GermanAizek
Removed const references for simple types and structures less 16 bytes
ggml
changes relating to the ggml tensor library for machine learning
#11294
opened Jan 18, 2025 by
GermanAizek
Align structures for 64bit, reorder params and ignore error-warn for Clang 19
ggml
changes relating to the ggml tensor library for machine learning
#11291
opened Jan 18, 2025 by
GermanAizek
added rudimentary support for outetts v0.3 500m and 1b models
examples
#11287
opened Jan 18, 2025 by
LostRuins
fix makefile and cmake logic for AARCH64
ggml
changes relating to the ggml tensor library for machine learning
#11246
opened Jan 15, 2025 by
Haus1
Allow s390x to load little endian models unmodified
examples
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#11234
opened Jan 14, 2025 by
AlekseiNikiforovIBM