Releases: ggml-org/llama.cpp
b5477
releases : bundle llvm omp library in windows release (#13763)
b5476
releases : enable openmp in windows cpu backend build (#13756)
b5475
ggml-cpu : set openmp wait time if not set (#13758)
b5474
Move GLM4 f32 attention fix to the correct function (#13750)
b5473
ggml : add ggml_gelu_erf() CUDA kernel (#13719)
b5472
vocab : fix ugm tokenizer precision (#13743)
b5471
CUDA: fix race condition in FA vector kernels (#13742)
b5468
hparams : initialize arrays (#13728)
b5466
server : support audio input (#13714)
* add audio support on webui
b5465
CANN: Support MUL_MAT_ID for q8_0 and q4_0 (#13705)