Skip to content

Releases: ggerganov/llama.cpp

b3902

09 Oct 18:09
c81f3bb
Compare
Choose a tag to compare
cmake : do not build common library by default when standalone (#9804)

b3901

09 Oct 15:05
e702206
Compare
Choose a tag to compare
perplexity : fix integer overflow (#9783)

* perplexity : fix integer overflow

ggml-ci

* perplexity : keep n_vocab as int and make appropriate casts

ggml-ci

b3899

08 Oct 13:28
dca1d4b
Compare
Choose a tag to compare
ggml : fix BLAS with unsupported types (#9775)

* ggml : do not use BLAS with types without to_float

* ggml : return pointer from ggml_internal_get_type_traits to avoid unnecessary copies

* ggml : rename ggml_internal_get_type_traits -> ggml_get_type_traits

it's not really internal if everybody uses it

b3898

08 Oct 12:39
458367a
Compare
Choose a tag to compare
server : better security control for public deployments (#9776)

* server : more explicit endpoint access settings

* protect /props endpoint

* fix tests

* update server docs

* fix typo

* fix tests

b3896

07 Oct 21:46
6374743
Compare
Choose a tag to compare
ggml : add backend registry / device interfaces to BLAS backend (#9752)

* ggml : add backend registry / device interfaces to BLAS backend

* fix mmap usage when using host buffers

b3895

07 Oct 17:35
f1af42f
Compare
Choose a tag to compare
Update building for Android (#9672)

* docs : clarify building Android on Termux

* docs : update building Android on Termux

* docs : add cross-compiling for Android

* cmake : link dl explicitly for Android

b3892

07 Oct 13:28
96b6912
Compare
Choose a tag to compare
metal : single allocation of encode_async block (#9747)

* Single allocation of encode_async block with non-ARC capture in ggml-metal.m

* Moving Block_release to the deallocation code

* Release encode block when re-setting encoding buffer count if needed

* Update ggml/src/ggml-metal.m

---------

Co-authored-by: Georgi Gerganov <[email protected]>

b3889

06 Oct 11:01
b6d6c52
Compare
Choose a tag to compare
sync : llama.cpp

b3887

05 Oct 16:25
8c475b9
Compare
Choose a tag to compare
rerank : use [SEP] token instead of [BOS] (#9737)

* rerank : use [SEP] token instead of [BOS]

ggml-ci

* common : sanity check for non-NULL tokens

ggml-ci

* ci : adjust rank score interval

ggml-ci

* ci : add shebang to run.sh

ggml-ci

b3886

05 Oct 15:55
58b1669
Compare
Choose a tag to compare
sync : ggml