This flag gives the best performance for backend sampling on CUDA. It can be removed once CCCL 3.2 is bundled with the CUDA Toolkit (CTK) and llama.cpp requires that CTK version.
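
The note does not name the flag, so the CMake sketch below is only illustrative: `GGML_CUDA_BACKEND_SAMPLING_WORKAROUND` and the CTK version cutoff are hypothetical placeholders showing how such a flag could be kept only while the installed CUDA Toolkit predates the release that bundles CCCL 3.2.

```cmake
# Illustrative sketch only; the real flag name and the CTK cutoff are not
# given in the note above.
cmake_minimum_required(VERSION 3.18)
project(backend_sampling_flag_example LANGUAGES CXX)

# CUDAToolkit_VERSION is provided by CMake's FindCUDAToolkit module.
find_package(CUDAToolkit REQUIRED)

# Hypothetical placeholder: keep the workaround flag only while the installed
# CTK does not yet bundle CCCL 3.2 ("13.1" stands in for whichever CTK
# release actually ships it).
if (CUDAToolkit_VERSION VERSION_LESS "13.1")
    add_compile_definitions(GGML_CUDA_BACKEND_SAMPLING_WORKAROUND)
endif()
```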