-
Notifications
You must be signed in to change notification settings - Fork 10.7k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ci : fix (again) arm64 build fails
devops
improvements to build systems and github actions
#11895
opened Feb 15, 2025 by
ngxson
Loading…
CUDA: use async data loading for FlashAttention
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#11894
opened Feb 15, 2025 by
JohannesGaessler
Loading…
vulkan: Added GGML_VK_DEVICE{idx}_MEMORY environment variable for setting device memory to manually allocate workload.
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#11878
opened Feb 14, 2025 by
System233
Loading…
Overlap CUDA graph building and processing to minimize GPU idle time and improve tokens per seconds performance.
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#11867
opened Feb 14, 2025 by
aendk
Loading…
Upgrade init_tensor API to return a ggml_status
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#11854
opened Feb 14, 2025 by
WilliamTambellini
Loading…
WebUI - Mcp SSE server support in client
examples
python
python script changes
server
#11853
opened Feb 14, 2025 by
brucepro
Loading…
MUSA: enable dp4a and fix compile errors on ARM64
build
Compilation issues
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
#11843
opened Feb 13, 2025 by
BodhiHu
Loading…
vulkan: improve im2col and RDNA1 performance
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#11826
opened Feb 12, 2025 by
daniandtheweb
Loading…
cmake: Fix ggml backend dependencies and installation
ggml
changes relating to the ggml tensor library for machine learning
#11818
opened Feb 12, 2025 by
vvuksanovic
Loading…
Improved KV cache loading performance for Vulkan, resulting in a 20x …
#11815
opened Feb 12, 2025 by
idales
Loading…
Add Granite Vision Support
examples
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
#11794
opened Feb 10, 2025 by
alex-jw-brooks
Loading…
ggml: move kvalues_iq4nl definition to ggml-common.h
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#11785
opened Feb 10, 2025 by
HungMingWu
Loading…
chore: update ggml-cpu-aarch64.cpp
ggml
changes relating to the ggml tensor library for machine learning
#11782
opened Feb 10, 2025 by
eltociear
Loading…
server (webui): Fix issue with muliple
<think>
tags in response
examples
server
#11779
opened Feb 10, 2025 by
stduhpf
Loading…
ggml : fix more imatrix nan cases
ggml
changes relating to the ggml tensor library for machine learning
#11773
opened Feb 9, 2025 by
slaren
Loading…
vulkan: implement several ops relevant for ggml_opt
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#11769
opened Feb 9, 2025 by
remyoudompheng
Loading…
ci
: fix certificate revocation error failures on windows workers (WIP)
devops
ggml-cpu-aarch64: Fix compilation issues
ggml
changes relating to the ggml tensor library for machine learning
#11745
opened Feb 8, 2025 by
MarsDoge
Loading…
blas build: use configuration from pkg_check_modules(DepBLAS openblas) and alike
ggml
changes relating to the ggml tensor library for machine learning
ggml : replace reallocation to reuse vector
ggml
changes relating to the ggml tensor library for machine learning
Previous Next
ProTip!
Updated in the last three days: updated:>2025-02-12.