llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-01 22:54:05 +00:00

Files

Aman Gupta de1aa6fa73 CUDA: check for buffer overlap before fusing (#21566 )

* CUDA: check for buffer overlap before fusing

* use ggml_cuda_check_fusion_memory_ranges

2026-04-08 00:57:04 +08:00

2025-08-07 13:45:41 +02:00

2026-04-07 15:28:27 +03:00

2026-04-08 00:57:04 +08:00

.gitignore

2024-07-13 18:12:39 +02:00

CMakeLists.txt

2026-04-02 10:39:00 +03:00