Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2026-05-11 03:24:21 +00:00
* CUDA: add bf16 and f32 support to cublas_mul_mat_batched
* Review: add type traits and make the function more generic
* Review: make the check more explicit, add back comments, and fix formatting
* Review: fix formatting, remove a useless type conversion, fix naming for bools
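The first bullet extends the batched cuBLAS matmul path beyond f16 so that bf16 and f32 tensors can take the same fast path. A minimal sketch of such a type-generic dispatch is below; the helper name `pick_cuda_type`, the integer type tags, and the plain-pointer interface are illustrative assumptions, not llama.cpp's actual code, but the cuBLAS call itself (`cublasGemmStridedBatchedEx` with a per-type `cudaDataType_t` and f32 accumulation) is the standard API for this:

```cpp
// Sketch only: dispatch a strided-batched GEMM over f32/f16/bf16 storage types.
// pick_cuda_type and the raw-pointer signature are hypothetical; the cuBLAS
// entry point and enums are the real CUDA API.
#include <cublas_v2.h>
#include <library_types.h>

static cudaDataType_t pick_cuda_type(int type_tag) {
    switch (type_tag) {
        case 0:  return CUDA_R_32F;   // f32 storage
        case 1:  return CUDA_R_16F;   // f16 storage
        default: return CUDA_R_16BF;  // bf16 storage
    }
}

// C = A * B for `batch` independent matrices, accumulating in f32
// regardless of the storage type (CUBLAS_COMPUTE_32F).
cublasStatus_t mul_mat_batched(cublasHandle_t handle,
                               const void * A, const void * B, void * C,
                               int m, int n, int k, int batch,
                               int type_tag) {
    const float alpha = 1.0f;
    const float beta  = 0.0f;
    const cudaDataType_t type = pick_cuda_type(type_tag);
    return cublasGemmStridedBatchedEx(
        handle, CUBLAS_OP_N, CUBLAS_OP_N,
        m, n, k,
        &alpha,
        A, type, m, (long long)m * k,   // lda, strideA
        B, type, k, (long long)k * n,   // ldb, strideB
        &beta,
        C, type, m, (long long)m * n,   // ldc, strideC
        batch,
        CUBLAS_COMPUTE_32F,             // accumulate in f32 for all inputs
        CUBLAS_GEMM_DEFAULT);
}
```

Keeping the compute type fixed at f32 while only the storage `cudaDataType_t` varies is what makes one call site serve all three input types; the later "type traits" review bullet points at the same idea, resolving the storage type at compile time instead of through a runtime tag.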