
Pull requests: ggml-org/llama.cpp

Pull requests list

common, ggml : fix non-ASCII file path handling on Windows (labels: ggml)
#21838 opened Apr 13, 2026 by Anai-Guo
Expose build_info in router mode (labels: examples, python, server)
#21835 opened Apr 13, 2026 by gaspardpetit
Fix unbounded VRAM usage creep on HIP/ROCm backend when quantizing kv cache (labels: ggml, Nvidia GPU)
#21830 opened Apr 13, 2026 by stragulus Draft
ggml-rpc: fix 32-bit ARM (ILP32) serialization bugs (labels: ggml)
#21828 opened Apr 12, 2026 by rovmo
fix(poetry): update python version
#21819 opened Apr 12, 2026 by Wingless-Archangel
Metal: TurboQuant GPU dequant kernels + host buffer type (labels: Apple Metal, examples, ggml, server, testing)
#21818 opened Apr 12, 2026 by sachmans Draft
5 of 7 tasks
TurboQuant: Apple Accelerate + norm correction for CPU dequant (labels: examples, ggml, server, testing)
#21817 opened Apr 12, 2026 by sachmans Draft
3 of 4 tasks
server: allow cancel loading model (labels: examples, server)
#21814 opened Apr 12, 2026 by ngxson Contributor
common : add download cancellation and temp file cleanup
#21813 opened Apr 12, 2026 by angt Member
docs/android.md: Add dependency libandroid-spawn for building on termux (labels: documentation)
#21812 opened Apr 12, 2026 by aafsmarak
2
TP: fix 0-sized tensor slices, AllReduce fallback (labels: ggml, Nvidia GPU)
#21808 opened Apr 12, 2026 by JohannesGaessler Contributor
kv: Add optional mmap kv cache (labels: examples, testing)
#21792 opened Apr 12, 2026 by skiz
vulkan: fix output corruption on GCN 2.0/3.0 (Vulkan 1.2) (labels: ggml, Vulkan)
#21787 opened Apr 12, 2026 by rafikb
chat: dedicated DeepSeek v3.2 parser + "official" template (labels: testing)
#21785 opened Apr 12, 2026 by pwilkin Member
ggml-metal: add Metal kernel for ggml_roll (labels: Apple Metal, ggml)
#21782 opened Apr 11, 2026 by stephencox-ict Contributor
vendor : update cpp-httplib to 0.42.0 (labels: python, script)
#21781 opened Apr 11, 2026 by cabelo Contributor
ggml: add graph_reused (labels: ggml, Nvidia GPU)
#21764 opened Apr 11, 2026 by am17an Contributor
CUDA: only init NCCL for setups with multi GPU (labels: ggml, Nvidia GPU)
#21761 opened Apr 11, 2026 by EldarBorge