llama.cpp/main.cpp at master-0b366e7

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-07 01:24:24 +00:00

Files

Erik Scholz 0b366e7357 Command line switch to use F16 for memory_k and memory_v (refactor of #154 ) (#294 )

* Use F16 for memory_k and memory_v

* add command line switch to use f16 instead of f32 for memory k+v

---------

Co-authored-by: Ty Everett <ty@tyweb.us>

2023-03-19 19:57:00 +02:00

39 KiB

Raw Permalink Blame History

View Raw

39 KiB Raw Permalink Blame History

39 KiB

Raw Permalink Blame History