Releases: ggml-org/llama.cpp
b5526
mtmd : move helpers to dedicated library (⚠️ breaking change) (#13866)
* mtmd : move helpers to dedicated library
* fix server build
* rm leftover cmakelist code
b5524
llama : add support for BertForSequenceClassification reranker (#13858)
* convert: add support for BertForSequenceClassification
* add support for reranking using BertForSequenceClassification
* merge checks of eos and sep
* fix lint
Co-authored-by: dinhhuy <[email protected]>
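Reranking is typically consumed through llama-server. Below is a hedged C++/libcurl sketch of what a rerank request might look like, assuming a server started with a reranking-capable model on `localhost:8080` and an OpenAI-style `/v1/rerank` endpoint with `query`/`documents` request fields; the exact endpoint path, flags, and field names are assumptions and should be checked against the server README for the build in use.

```cpp
// Hedged sketch: server URL, endpoint path, and JSON field names are assumptions,
// not taken from this release entry -- check the llama-server README for your build.
#include <curl/curl.h>
#include <iostream>
#include <string>

// libcurl write callback: append the response body to a std::string
static size_t collect(char * ptr, size_t size, size_t nmemb, void * userdata) {
    static_cast<std::string *>(userdata)->append(ptr, size * nmemb);
    return size * nmemb;
}

int main() {
    curl_global_init(CURL_GLOBAL_DEFAULT);
    CURL * curl = curl_easy_init();
    if (!curl) { return 1; }

    // one query and two candidate documents to be scored against it
    const std::string body = R"({
        "query": "what is a panda?",
        "documents": [
            "the giant panda is a bear species endemic to China",
            "pandas is a Python library for data analysis"
        ]
    })";

    std::string response;
    curl_slist * headers = curl_slist_append(nullptr, "Content-Type: application/json");

    curl_easy_setopt(curl, CURLOPT_URL, "http://localhost:8080/v1/rerank"); // assumed endpoint
    curl_easy_setopt(curl, CURLOPT_HTTPHEADER, headers);
    curl_easy_setopt(curl, CURLOPT_POSTFIELDS, body.c_str());
    curl_easy_setopt(curl, CURLOPT_WRITEFUNCTION, collect);
    curl_easy_setopt(curl, CURLOPT_WRITEDATA, &response);

    const CURLcode res = curl_easy_perform(curl);
    if (res == CURLE_OK) {
        // expected shape (assumed): {"results": [{"index": 0, "relevance_score": ...}, ...]}
        std::cout << response << std::endl;
    } else {
        std::cerr << "request failed: " << curl_easy_strerror(res) << std::endl;
    }

    curl_slist_free_all(headers);
    curl_easy_cleanup(curl);
    curl_global_cleanup();
    return res == CURLE_OK ? 0 : 1;
}
```

A sequence-classification reranker scores each (query, document) pair, so the server returns one relevance score per document rather than a generated completion.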
b5522
server: fix remove 'image_url'/'input_audio' json-object effectively fo…
b5519
CUDA: fix FA tg at long context for CC >= 8.9 (#13852)
b5517
CANN: Add SOC TYPE printing in cmake configuration (#13837)
b5516
opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, …
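These kernels implement existing backend-agnostic ggml operators for the OpenCL backend. As a minimal sketch of what two of the named ops do at the graph level, run here on the CPU backend for simplicity and assuming `ggml_graph_compute_with_ctx` is declared in `ggml-cpu.h` as in recent trees:

```cpp
// Illustrative sketch of ggml graph-level usage of sigmoid and argsort;
// header layout and the CPU compute helper are assumptions about the current tree.
#include <cstdio>
#include "ggml.h"
#include "ggml-cpu.h"   // assumed home of ggml_graph_compute_with_ctx

int main() {
    ggml_init_params params = { /*.mem_size =*/ 16u * 1024 * 1024, /*.mem_buffer =*/ nullptr, /*.no_alloc =*/ false };
    ggml_context * ctx = ggml_init(params);

    // a small f32 vector with unordered values
    ggml_tensor * a = ggml_new_tensor_1d(ctx, GGML_TYPE_F32, 6);
    const float vals[6] = { 0.5f, -2.0f, 3.0f, 1.0f, -0.5f, 2.0f };
    for (int i = 0; i < 6; ++i) ((float *) a->data)[i] = vals[i];

    // two of the ops named in the release entry, expressed as graph nodes
    ggml_tensor * sig = ggml_sigmoid(ctx, a);                       // element-wise sigmoid
    ggml_tensor * ord = ggml_argsort(ctx, a, GGML_SORT_ORDER_DESC); // i32 indices, largest first

    ggml_cgraph * gf = ggml_new_graph(ctx);
    ggml_build_forward_expand(gf, sig);
    ggml_build_forward_expand(gf, ord);
    ggml_graph_compute_with_ctx(ctx, gf, /*n_threads =*/ 2);

    for (int i = 0; i < 6; ++i) {
        printf("sigmoid(%+.1f) = %.3f   rank %d -> index %d\n",
               vals[i], ((float *) sig->data)[i], i, ((int32_t *) ord->data)[i]);
    }

    ggml_free(ctx);
    return 0;
}
```

Roughly speaking, a backend advertises which operators it can run; ops it does not support fall back to the CPU, so adding kernels like these mainly affects how much of a graph can stay on the GPU.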
b5515
opencl: mark `mul_mat` `f32f32` as supporting non-contiguous tensors …
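Non-contiguous operands usually arise as views (slices, transposes) over a larger tensor. The purely illustrative sketch below, again on the CPU backend, builds a column-slice view whose row stride does not match its row size, and inserts the `ggml_cont` copy that a backend without non-contiguous `mul_mat` support would need; the reading of this entry is that the OpenCL f32 x f32 path can now take such views directly, making that extra copy unnecessary there.

```cpp
// Illustrative sketch of a non-contiguous view feeding mul_mat (CPU backend).
#include <cstdio>
#include "ggml.h"
#include "ggml-cpu.h"   // assumed home of ggml_graph_compute_with_ctx

int main() {
    ggml_init_params params = { /*.mem_size =*/ 16u * 1024 * 1024, /*.mem_buffer =*/ nullptr, /*.no_alloc =*/ false };
    ggml_context * ctx = ggml_init(params);

    // 4x8 activation matrix (ne0 = 8 columns, ne1 = 4 rows), filled with dummy data
    ggml_tensor * x = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, 8, 4);
    for (int i = 0; i < 8 * 4; ++i) ((float *) x->data)[i] = (float) i;

    // view of the first 4 columns of every row: same data, but the row stride (nb1)
    // is still the full 8-column stride, so the view is NOT contiguous
    ggml_tensor * xv = ggml_view_2d(ctx, x, 4, 4, x->nb[1], 0);
    printf("view contiguous: %d\n", ggml_is_contiguous(xv) ? 1 : 0);   // prints 0

    // weights whose ne0 matches the view's ne0 (= 4)
    ggml_tensor * w = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, 4, 3);
    for (int i = 0; i < 4 * 3; ++i) ((float *) w->data)[i] = 0.1f * i;

    // backends that only accept contiguous f32 x f32 operands need an explicit copy
    // first; per the release entry, the OpenCL mul_mat should no longer require it
    ggml_tensor * y = ggml_mul_mat(ctx, w, ggml_cont(ctx, xv));

    ggml_cgraph * gf = ggml_new_graph(ctx);
    ggml_build_forward_expand(gf, y);
    ggml_graph_compute_with_ctx(ctx, gf, /*n_threads =*/ 2);

    printf("y[0,0] = %f\n", ((float *) y->data)[0]);

    ggml_free(ctx);
    return 0;
}
```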
b5514
vulkan: use timestamp queries for GGML_VULKAN_PERF (#13817)
Also change it to be controlled by an env var rather than a cmake flag.
b5513
cmake : add llama-cparams.cpp to build (#13832)
b5512
SYCL: add gelu_erf kernel (#13749)
* SYCL: add gelu_erf kernel
* refactor code
* Use scope_op_debug_print
Co-authored-by: Atharva Dubey <[email protected]>