mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-01 22:54:05 +00:00

Files

Peter Sideris b42c7fa5b8 spec : fix vocab compat checks in spec example (#22426 )

* port #22358 PR to examples/speculative/speculative.cpp
* use vocab_[tgt,dft] instead of ctx_[tgt,dft] when logging on draft
  model / target model vocabulary mismatch

Co-authored-by: Petros Sideris <petros.sideris@nokia.com>

2026-04-30 08:18:25 +03:00

CMakeLists.txt

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

README.md

repo : update links to new url (#11886 )

2025-02-15 16:40:57 +02:00

speculative.cpp

spec : fix vocab compat checks in spec example (#22426 )

2026-04-30 08:18:25 +03:00

README.md

llama.cpp/examples/speculative

Demonstration of speculative decoding and tree-based speculative decoding techniques

More info:

https://github.com/ggml-org/llama.cpp/pull/2926
https://github.com/ggml-org/llama.cpp/pull/3624
https://github.com/ggml-org/llama.cpp/pull/5625