Mirror of https://github.com/ggml-org/llama.cpp.git
synced 2026-05-07 01:24:24 +00:00
CUDA 12.8 added an option to select stronger compression for binaries, so the compression mode now defaults to "size".
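
A minimal sketch of how such a default could be wired into the build, assuming the option is exposed as a CMake cache variable. The names `GGML_CUDA_COMPRESSION_MODE` and `CUDA_FLAGS` are assumptions for illustration and may not match the actual change. The flag is only passed when the toolkit is 12.8 or newer, since older `nvcc` versions would reject it.

```cmake
# Sketch only: GGML_CUDA_COMPRESSION_MODE and CUDA_FLAGS are assumed names.
find_package(CUDAToolkit REQUIRED)

# Default to the strongest compression; users can override at configure time.
set(GGML_CUDA_COMPRESSION_MODE "size" CACHE STRING
    "CUDA binary compression mode; requires CUDA 12.8+")
set_property(CACHE GGML_CUDA_COMPRESSION_MODE PROPERTY STRINGS
    none speed balance size)

# Older toolkits do not know the flag, so pass it only from 12.8 onward.
if (CUDAToolkit_VERSION VERSION_GREATER_EQUAL "12.8")
    list(APPEND CUDA_FLAGS -compress-mode=${GGML_CUDA_COMPRESSION_MODE})
endif()
```

Under these assumptions, a different mode could be chosen at configure time with something like `cmake -B build -DGGML_CUDA_COMPRESSION_MODE=speed`, trading a larger binary for faster loading.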