llama.cpp/tools at b8330 - llama.cpp - Gitea: Git with a cup of tea

sdgoij/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-04 08:04:07 +00:00

Files

History

SoftwareRenderer d7ba99c485 server: reset counter related to kill-switch on client error (#20513 )

* server: reset kill-switch on client error

This avoids triggering a server kill switch.

If the client sends a request that exceeds the configured context size, an appropriate HTTP 400 response is provided and no tokens are generated.

However since no tokens are generated, update_slots() increments n_empty_consecutive. If the client sends 3 such messages in a row, the server terminates.

* moved counter reset as per recommendation

* cont : minor

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2026-03-13 19:58:09 +02:00

..

Fix locale-dependent float printing in GGUF metadata (#17331 )

2026-03-04 09:30:40 +01:00

common/parser: handle reasoning budget (#20297 )

2026-03-11 10:26:12 +01:00

chore : correct typos [no ci] (#20041 )

2026-03-05 08:50:21 +01:00

cvector-generator

chore : correct typos [no ci] (#20041 )

2026-03-05 08:50:21 +01:00

Fix locale-dependent float printing in GGUF metadata (#17331 )

2026-03-04 09:30:40 +01:00

llama-fit-params: keep explicit --ctx-size 0 (#19070 )

2026-01-24 22:13:08 +01:00

Fix locale-dependent float printing in GGUF metadata (#17331 )

2026-03-04 09:30:40 +01:00

chore : correct typos [no ci] (#20041 )

2026-03-05 08:50:21 +01:00

llama-bench: introduce -hf and -hff flags & use --mmap 1 by default (#20211 )

2026-03-09 09:05:44 +08:00

mtmd : rename mtmd_get_audio_bitrate to mtmd_get_audio_sample_rate (#20105 )

2026-03-13 12:30:02 +01:00

Autoparser - complete refactoring of parser architecture (#18675 )

2026-03-06 21:01:00 +01:00

chore : correct typos [no ci] (#20041 )

2026-03-05 08:50:21 +01:00

llama-quant : fail early on missing imatrix, refactor type selection, code cleanup (#19770 )

2026-03-10 08:16:05 +02:00

llama: end-to-end tests (#19802 )

2026-03-08 12:30:21 +01:00

Fix locale-dependent float printing in GGUF metadata (#17331 )

2026-03-04 09:30:40 +01:00

server: reset counter related to kill-switch on client error (#20513 )

2026-03-13 19:58:09 +02:00

Fix locale-dependent float printing in GGUF metadata (#17331 )

2026-03-04 09:30:40 +01:00

Fix locale-dependent float printing in GGUF metadata (#17331 )

2026-03-04 09:30:40 +01:00

CMakeLists.txt

llama: end-to-end tests (#19802 )

2026-03-08 12:30:21 +01:00