llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-12 12:04:08 +00:00

Files

Oliver Simons 8cef8201a1 CUDA: directly include cuda/iterator (#22936 )

Before, we relied on a transient import from `cub/cub.cuh`, which is
bad practice to do as cub may not always expose cuda/iterator

2026-05-11 12:16:38 +02:00

2026-04-09 16:42:19 +02:00

2026-05-08 10:09:38 +02:00

2026-05-11 12:16:38 +02:00

.gitignore

2024-07-13 18:12:39 +02:00

CMakeLists.txt

2026-05-10 17:00:11 +03:00