llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-02 07:04:19 +00:00

Files

Georgi Gerganov 254098a279 common : refactor common_sampler + grammar logic changes (#17937 )

* common : refactor common_sampler + grammar logic changes

* tests : increase max_tokens to get needed response

* batched : fix uninitialized samplers

2025-12-14 10:11:13 +02:00

batched-bench

batched-bench : add "separate text gen" mode (#17103 )

2025-11-10 12:59:29 +02:00

cli

cli: new CLI experience (#17824 )

2025-12-10 15:28:59 +01:00

completion

common : refactor common_sampler + grammar logic changes (#17937 )

2025-12-14 10:11:13 +02:00

cvector-generator

common : refactor common_sampler + grammar logic changes (#17937 )

2025-12-14 10:11:13 +02:00

export-lora

cmake : Do not install tools on iOS targets (#15903 )

2025-09-16 09:54:44 +07:00

gguf-split

cli: new CLI experience (#17824 )

2025-12-10 15:28:59 +01:00

imatrix

common : refactor common_sampler + grammar logic changes (#17937 )

2025-12-14 10:11:13 +02:00

llama-bench

ggml-zendnn : add ZenDNN backend for AMD CPUs (#17690 )

2025-12-07 00:13:33 +08:00

mtmd

common : refactor common_sampler + grammar logic changes (#17937 )

2025-12-14 10:11:13 +02:00

perplexity

common : refactor common_sampler + grammar logic changes (#17937 )

2025-12-14 10:11:13 +02:00

quantize

cli: new CLI experience (#17824 )

2025-12-10 15:28:59 +01:00

rpc

Install rpc-server when GGML_RPC is ON. (#17149 )

2025-11-11 10:53:59 +00:00

run

Manually link -lbsd to resolve flock symbol on AIX (#16610 )

2025-10-23 19:37:31 +08:00

server

common : refactor common_sampler + grammar logic changes (#17937 )

2025-12-14 10:11:13 +02:00

tokenize

cmake : Do not install tools on iOS targets (#15903 )

2025-09-16 09:54:44 +07:00

tts

common : refactor common_sampler + grammar logic changes (#17937 )

2025-12-14 10:11:13 +02:00

CMakeLists.txt

cli: new CLI experience (#17824 )

2025-12-10 15:28:59 +01:00