mirror of
https://github.com/ggml-org/llama.cpp.git
synced 2026-05-10 19:14:07 +00:00
* opencl: use subgrroup reduce for reduction in rms_norm_mul * opencl: add comment about workgroup size
* opencl: use subgrroup reduce for reduction in rms_norm_mul * opencl: add comment about workgroup size