-
Notifications
You must be signed in to change notification settings - Fork 12.4k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add LLaDA 8b Diffusion model
examples
python
python script changes
#14771
opened Jul 19, 2025 by
am17an
Loading…
Vulkan: Fix fprintf format-security warning
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14770
opened Jul 19, 2025 by
0cc4m
Loading…
docs : mention apt installation method
documentation
Improvements or additions to documentation
#14766
opened Jul 19, 2025 by
vp2177
Loading…
feat: Add extended sampling API with candidate token lists #14612
#14765
opened Jul 19, 2025 by
baonudesifeizhai
Loading…
webui: add missing messages in export (#13552)
examples
server
#14764
opened Jul 18, 2025 by
srogmann
Loading…
cuda : implement bf16 cpy ops and enable bf16 cont
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14763
opened Jul 18, 2025 by
CISC
Loading…
tests : add non-cont K,V FA tests
testing
Everything test related
#14756
opened Jul 18, 2025 by
ggerganov
Loading…
Fix MinicpmV model converter and clip to avoid using hardcode.
examples
python
python script changes
#14750
opened Jul 18, 2025 by
gryffindor-rr
Loading…
[ROCm] Fix HIP version check for HIPBLAS V2 API compatibility
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14744
opened Jul 17, 2025 by
danielholanda
Loading…
metal: SSM_SCAN performance
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#14743
opened Jul 17, 2025 by
gabe-l-hart
Loading…
examples : predicted output for text generation
examples
#14739
opened Jul 17, 2025 by
iamlemec
Loading…
Improve Mistral models integration with llama.cpp
python
python script changes
#14737
opened Jul 17, 2025 by
juliendenize
•
Draft
CUDA: skip masked out KQ slices in mma FA kernel
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
#14735
opened Jul 17, 2025 by
JohannesGaessler
Loading…
feat: Add optional prompt processing progress streaming
examples
server
#14731
opened Jul 17, 2025 by
baonudesifeizhai
Loading…
mtmd : Support jinja in libmtmd (Only for QwenVL and Qwen Omni)
examples
#14730
opened Jul 17, 2025 by
alielmorsy
Loading…
server: add prompt processing progress streaming for /completion endpoint #14685
examples
server
#14728
opened Jul 16, 2025 by
baonudesifeizhai
Loading…
vulkan: Add logging for bf16 features to ggml_vk_print_gpu_info (#13274)
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14707
opened Jul 16, 2025 by
Peter0x44
Loading…
Fix KleidiAI compilation errors with -DGGML_NATIVE=OFF (issue #14464)
ggml
changes relating to the ggml tensor library for machine learning
#14700
opened Jul 15, 2025 by
baonudesifeizhai
Loading…
Adding a simple-function-call example - hopefully not doing anything wrong
examples
#14682
opened Jul 14, 2025 by
klogdotwebsitenotdotcom
Loading…
kleidiai: add support for get_rows
ggml
changes relating to the ggml tensor library for machine learning
#14676
opened Jul 14, 2025 by
chaxu01
Loading…
bug fix: handle saving/loading null layers in recurrent memory
#14675
opened Jul 14, 2025 by
l3utterfly
Loading…
Add Pad Reflect 1D CUDA support
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14659
opened Jul 13, 2025 by
YavorGIvanov
Loading…
webui : add a preset feature to the settings
examples
server
#14649
opened Jul 12, 2025 by
gabriellarson
Loading…
Add CUDA non-contiguous Unary Ops support
build
Compilation issues
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#14639
opened Jul 11, 2025 by
YavorGIvanov
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.