mirror of
https://github.com/ggml-org/llama.cpp.git
synced 2026-05-14 13:04:08 +00:00
The NeoX cur_rot part is different because I'm pretty sure my original implementation was wrong.
66 KiB
66 KiB