llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-11 11:34:10 +00:00

Files

Tim Neumann 2e97c5f96f backend sampling: support returning post-sampling probs (#22622 )

* server: Never return 0.0 post-sampling probabilities

* backend sampling: support returning post-sampling probs

2026-05-10 19:12:02 +02:00

batched-bench

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

cli

docs : update speculative decoding parameters after refactor (#22397 ) (#22539 )

2026-05-04 08:52:07 +03:00

completion

docs : update speculative decoding parameters after refactor (#22397 ) (#22539 )

2026-05-04 08:52:07 +03:00

cvector-generator

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

export-lora

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

fit-params

fit-params : refactor + add option to output estimated memory per device (#22171 )

2026-04-21 09:54:36 +03:00

gguf-split

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

imatrix

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

llama-bench

spec : refactor params (#22397 )

2026-04-28 09:07:33 +03:00

mtmd

mtmd: fix whisper audio tail truncation by exposing padded buffer to FFT (#22770 )

2026-05-07 14:01:01 +02:00

parser

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

perplexity

fit-params : refactor + add option to output estimated memory per device (#22171 )

2026-04-21 09:54:36 +03:00

quantize

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

results

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

rpc

fix: rpc-server cache may not work in Windows environments (#22394 )

2026-04-27 17:25:09 +03:00

server

backend sampling: support returning post-sampling probs (#22622 )

2026-05-10 19:12:02 +02:00

tokenize

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

tts

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

CMakeLists.txt

llama: end-to-end tests (#19802 )

2026-03-08 12:30:21 +01:00