llama.cpp/tools/cli/cli.cpp at 49bfddeca18e62fa3d39114a23e9fcbdf8a22388

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-01 22:54:05 +00:00

Piotr Wilkin (ilintar) 5e54d51b19 common/parser: add proper reasoning tag prefill reading (#20424)

* Implement proper prefill extraction

* Refactor cli parameters, update docs, move reasoning budget sampler part to common/reasoning-budget.cpp

* Update tools/server/server-task.cpp

* refactor: move grammars to variant, remove grammar_external, handle exception internally

* Make code less C++y

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2026-03-19 16:58:21 +01:00

22 KiB
