Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

webgpu : fix build on emscripten build Compilation issues ggml changes relating to the ggml tensor library for machine learning
#15826 opened Sep 5, 2025 by ngxson Draft
Add support for Qwen3-Reranker examples python python script changes server
#15824 opened Sep 5, 2025 by iamlemec Loading…
Rewrite llama-run to use llama-server examples
#15818 opened Sep 5, 2025 by ericcurtin Loading…
kleidiai: generalize compute_forward_kv_cache to compute_forward_fp16 ggml changes relating to the ggml tensor library for machine learning
#15817 opened Sep 5, 2025 by chaxu01 Loading…
ggml : split graph allocations according to backend max buffer size ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#15815 opened Sep 5, 2025 by Acly Loading…
CANN: implement LRU cache for ACL graphs in CANN backend Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#15814 opened Sep 5, 2025 by noemotiovon Loading…
CUDA: Conv2d Tensor Core ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#15813 opened Sep 5, 2025 by mnehete32 Loading…
CANN: Switch to stream synchronization Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#15809 opened Sep 5, 2025 by noemotiovon Loading…
Add conv2d Implicit GEMM ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#15805 opened Sep 4, 2025 by bssrdf Draft
vulkan: add mul_mat variant for embedded gpus ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#15800 opened Sep 4, 2025 by rmatif Loading…
ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs Vulkan Issues specific to the Vulkan backend
#15797 opened Sep 4, 2025 by slaren Loading…
vulkan: support im2col_3d ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#15795 opened Sep 4, 2025 by jeffbolznv Loading…
vulkan: Support pad_ext ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#15794 opened Sep 4, 2025 by jeffbolznv Loading…
docker : Fix AMDGPU_TARGETS deprecated warnning message devops improvements to build systems and github actions
#15786 opened Sep 4, 2025 by haiyuewa Loading…
ggml: allow casting between f32 and i32 Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related Vulkan Issues specific to the Vulkan backend
#15783 opened Sep 4, 2025 by ngxson Loading…
4 tasks done
CUDA: faster tile FA (Pascal/AMD), headsize 256 ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#15769 opened Sep 3, 2025 by JohannesGaessler Loading…
CUDA: Add mul_mat_id support for the mmf kernel ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#15767 opened Sep 3, 2025 by am17an Loading…
Consolidate multiple tensor copies to reduce API overhead ggml changes relating to the ggml tensor library for machine learning
#15750 opened Sep 2, 2025 by agray3 Loading…
nix: Added missing packages and options for ROCm build devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning nix Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#15747 opened Sep 2, 2025 by SteelPh0enix Loading…
Add scale_diag_mask_inf_softmax operation for transformer attention ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#15738 opened Sep 2, 2025 by Arya-Hari Draft
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.