Update july 2025 #1225

Merged: martindevans merged 5 commits into SciSharp:master from update_july_2025 on Aug 6, 2025

martindevans (Member) commented Jul 26, 2025

Updated llama.cpp binaries to 11dd5a44eb180e1d69fac24d3852b5222d66fb7f, compiled with this run.

Testing:

  • Windows CPU
  • Windows CUDA
  • Windows Vulkan
  • Linux CPU (x64)
  • Linux CPU (ARM64)
  • Linux CPU (musl)
  • Linux CUDA
  • Linux Vulkan
  • MacOS

m0nsky (Contributor) commented Jul 26, 2025

Unit tests passed on Windows CUDA & Linux CUDA.

Test application is running fine on:

  • Windows CPU
  • Windows CUDA
  • Windows Vulkan
  • Linux CPU
  • Linux CUDA
  • Linux Vulkan

jihadkhawaja commented Jul 27, 2025

Is there any possibility of including a llama win-arm64-cpu binary in this release? See #600.

Posting this from a Windows ARM64 device 😥

martindevans (Member, Author) commented Jul 27, 2025

@jihadkhawaja It should be possible; it looks like llama.cpp supports it.

The first step would be to update the build action (see here) to add a Windows ARM build. If you're interested in contributing that, please feel free to open an issue and I can give you some tips.

jihadkhawaja commented

I would love to. I will open a PR for that. Thanks a lot, @martindevans

LSXAxeller commented Aug 5, 2025

You may want to update to the latest llama.cpp commit before releasing. The PR for the GLM-4.5 model series was merged yesterday, and the models seem promising.

EDIT: OpenAI GPT-OSS was released a few hours ago, and support for it has been merged into llama.cpp.

martindevans (Member, Author) commented

That will have to wait for the next update; this one is basically ready to go. I'm pretty busy right now, but I'll try to find some time this weekend to do another update. Hopefully not much has changed in the last 2 weeks!

Lyrcaxis (Contributor) commented Aug 6, 2025

They did a huge PR for gpt-oss 😅 ggml-org/llama.cpp#15091
From a quick scan, though, not much changed at the mid level except the addition of some enums.

martindevans merged commit 8ff2e89 into SciSharp:master on Aug 6, 2025
6 checks passed
martindevans deleted the update_july_2025 branch on August 6, 2025 at 21:38
5 participants