Split N into chunks so that each chunk fits into shared memory. If K > 128, use a larger workgroup with enough invocations. Add perf tests matching the qwen3next model.
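
A minimal host-side sketch of the chunking idea described above. All names, constants, and the shared-memory budget here are illustrative assumptions, not the actual llama.cpp Vulkan backend code; the real logic lives in the backend's dispatch path and compute shaders.

```cpp
// Sketch only: hypothetical constants and helpers illustrating the idea of
// splitting N into shared-memory-sized chunks and growing the workgroup
// when K > 128. Not the real llama.cpp implementation.
#include <algorithm>
#include <cstdint>
#include <cstdio>

// Assumed shared-memory budget per workgroup and per-column footprint.
constexpr uint32_t kSharedMemBytes = 32 * 1024;
constexpr uint32_t kBytesPerColumn = 256;

// Larger K gets a larger workgroup so there are enough invocations
// to cover the reduction; the 128 threshold comes from the description,
// the concrete sizes are assumptions.
static uint32_t pick_workgroup_size(uint32_t K) {
    return K > 128 ? 256 : 128;
}

// Split the N columns into chunks whose footprint stays within the budget,
// dispatching one workgroup-sized pass per chunk.
static void dispatch_in_chunks(uint32_t N, uint32_t K) {
    const uint32_t max_cols =
        std::max<uint32_t>(1, kSharedMemBytes / kBytesPerColumn);
    const uint32_t wg_size = pick_workgroup_size(K);

    for (uint32_t n0 = 0; n0 < N; n0 += max_cols) {
        const uint32_t cols = std::min(max_cols, N - n0);
        // In the real backend this would record a compute dispatch for the chunk.
        std::printf("chunk: columns [%u, %u), workgroup size %u\n",
                    n0, n0 + cols, wg_size);
    }
}

int main() {
    dispatch_in_chunks(/*N=*/1000, /*K=*/256);
    return 0;
}
```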