llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-11 11:34:10 +00:00

Files

Aman Gupta 1ee9d0b415 CUDA: use fastdiv + ggml_cuda_mad for mmvf (#16557)

* CUDA: use fastdiv + ggml_cuda_mad for mmvf

* use bf16 directly + fix formatting

* Add exception for HIP code

2025-10-14 13:16:21 +02:00

2025-08-07 13:45:41 +02:00

2025-10-04 12:49:16 +03:00

2025-10-14 13:16:21 +02:00

.gitignore        2024-07-13 18:12:39 +02:00

CMakeLists.txt    2025-10-07 13:48:56 -07:00