llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-08 01:54:10 +00:00

Files

Daniel Bevenius eaf1d7930c llama : add support for Nemotron 3 Super (#20411 )

* llama : add support for Nemotron 3 Super

This commit adds support for the Nemotron 3 Super model (120B.A12B)
enabling this model to be converted to GGUF format and run in llama.cpp.

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: Matt Clayton <156335168+mattjcly@users.noreply.github.com>

2026-03-11 19:27:53 +01:00

cmake

ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (#15094 )

2025-08-07 13:45:41 +02:00

include

ggml : bump RPC version (#20330 )

2026-03-10 21:36:57 +02:00

src

llama : add support for Nemotron 3 Super (#20411 )

2026-03-11 19:27:53 +01:00

.gitignore

vulkan : cmake integration (#8119 )

2024-07-13 18:12:39 +02:00

CMakeLists.txt

ggml : bump version to 0.9.7 (ggml/1425)

2026-02-15 22:24:29 +02:00