Actions: ggerganov/llama.cpp

Pull Request Labeler

5,969 workflow runs

server : (refactoring) do not rely on JSON internally
Pull Request Labeler #5969: Pull request #10643 synchronize by ngxson
December 4, 2024 18:52 20s
server : (refactoring) do not rely on JSON internally
Pull Request Labeler #5968: Pull request #10643 synchronize by ngxson
December 4, 2024 18:36 18s
server : (refactoring) do not rely on JSON internally
Pull Request Labeler #5967: Pull request #10643 synchronize by ngxson
December 4, 2024 18:26 18s
server : (refactoring) do not rely on JSON internally
Pull Request Labeler #5966: Pull request #10643 synchronize by ngxson
December 4, 2024 18:19 14s
server : (refactoring) do not rely on JSON internally
Pull Request Labeler #5965: Pull request #10643 synchronize by ngxson
December 4, 2024 17:58 18s
server: add request aggregation functionallity
Pull Request Labeler #5964: Pull request #10660 opened by kalabYibeltal
December 4, 2024 17:51 18s
Refactor/online repacking
Pull Request Labeler #5963: Pull request #10446 synchronize by Djip007
December 4, 2024 17:18 16s
Extend how Llama.cpp locates metal resources
Pull Request Labeler #5962: Pull request #10657 opened by ormandi
December 4, 2024 16:38 14s
GGUF: backend support, fixed-width I/O, misc fixes
Pull Request Labeler #5961: Pull request #10655 synchronize by JohannesGaessler
December 4, 2024 16:24 1m 34s
Vulkan: VK_KHR_cooperative_matrix support to speed up prompt processing
Pull Request Labeler #5960: Pull request #10597 synchronize by 0cc4m
December 4, 2024 16:04 21s
vulkan: Add VK_NV_cooperative_matrix2 support for mul_mat and FlashAttention2
Pull Request Labeler #5959: Pull request #10206 synchronize by jeffbolznv
December 4, 2024 15:49 20s
vulkan: Add VK_NV_cooperative_matrix2 support for mul_mat and FlashAttention2
Pull Request Labeler #5958: Pull request #10206 synchronize by jeffbolznv
December 4, 2024 15:47 19s
vulkan: Add VK_NV_cooperative_matrix2 support for mul_mat and FlashAttention2
Pull Request Labeler #5957: Pull request #10206 synchronize by jeffbolznv
December 4, 2024 14:57 22s
vulkan: Add VK_NV_cooperative_matrix2 support for mul_mat and FlashAttention2
Pull Request Labeler #5956: Pull request #10206 synchronize by jeffbolznv
December 4, 2024 14:50 20s
server : (refactoring) do not rely on JSON internally
Pull Request Labeler #5955: Pull request #10643 synchronize by ngxson
December 4, 2024 14:03 8m 31s
GGUF: backend support, fixed-width I/O, misc fixes
Pull Request Labeler #5954: Pull request #10655 synchronize by JohannesGaessler
December 4, 2024 13:16 14s
GGUF: backend support, fixed-width I/O, misc fixes
Pull Request Labeler #5953: Pull request #10655 opened by JohannesGaessler
December 4, 2024 13:02 22s
server : fix speculative decoding with context shift
Pull Request Labeler #5952: Pull request #10641 synchronize by ggerganov
December 4, 2024 11:11 21s
server : fix free of spec context and batch
Pull Request Labeler #5951: Pull request #10651 opened by ggerganov
December 4, 2024 09:34 21s
Fix HF repo commit to clone lora test models
Pull Request Labeler #5950: Pull request #10649 opened by ltoniazzi
December 4, 2024 09:02 1m 51s
server : fix speculative decoding with context shift
Pull Request Labeler #5949: Pull request #10641 synchronize by ggerganov
December 4, 2024 09:00 20s
Add lora test workflow (WIP)
Pull Request Labeler #5948: Pull request #9058 synchronize by ltoniazzi
December 4, 2024 08:56 18s
server : fix speculative decoding with context shift
Pull Request Labeler #5947: Pull request #10641 synchronize by ggerganov
December 4, 2024 08:45 18s
gguf-py: Improve GGUFReader read-only mode performance
Pull Request Labeler #5946: Pull request #10159 synchronize by Isotr0py
December 4, 2024 07:25 16s
Add support for GLM-Edge and GLM-Edge-V series models
Pull Request Labeler #5945: Pull request #10573 synchronize by piDack
December 4, 2024 03:50 12s