llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-07 09:34:07 +00:00

Files

Jeff Bolz b37124d2d2 vulkan: handle quantize_q8_1 overflowing the max workgroup count (#18515 )

* vulkan: handle quantize_q8_1 overflowing the max workgroup count

* vulkan: Fix small tile size matmul on lavapipe

* fix mul_mat_id failures

2026-01-05 11:30:14 +01:00

2025-08-07 13:45:41 +02:00

2026-01-01 08:58:27 +01:00

2026-01-05 11:30:14 +01:00

.gitignore

2024-07-13 18:12:39 +02:00

CMakeLists.txt

2025-12-31 18:54:43 +02:00