llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-01 22:54:05 +00:00

Files

Jeff Bolz 6a6780a232 vulkan: Support GGML_TYPE_NVFP4 (#21455 )

This adds nvfp4 support for get_rows, dequant, and mul_mat(_id). For
mul_mat, it does not add support for the dp4/q8_1 path, it's all via
fp16/fp32.

2026-04-14 11:34:23 +02:00

2026-04-09 16:42:19 +02:00

2026-04-09 16:42:19 +02:00

2026-04-14 11:34:23 +02:00

.gitignore

2024-07-13 18:12:39 +02:00

CMakeLists.txt

2026-04-09 16:42:19 +02:00