llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-01 22:54:05 +00:00

Files

Georgi Gerganov 39173bcacb context : reserve new scheduler when graph topology changes (#18547)

* context : reserve new scheduler when graph topology changes

* cont : fix

* cont : fix reserve

* cont : reserve only when changes occur + timing

* context : add comments

* llama : reserve on sampler changes

* common : allow null common_sampler

* server : task declares needs (embd, logits, sampling)

* server : do not init sampler if not needed

* llama : fix need_reserve when unsetting a sampler

* server : consolidate slot reset/clear logic

2026-01-15 16:39:17 +02:00

llama-cpp.h

lora: make sure model keep track of associated adapters (#18490)

2026-01-15 10:24:28 +01:00

llama.h

context : reserve new scheduler when graph topology changes (#18547)

2026-01-15 16:39:17 +02:00