llama.cpp/examples/parallel/parallel.cpp at c062ffd18cbbf7a7e905223bfee87fefe9746db3

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-07 17:44:09 +00:00

Files

Georgi Gerganov fcca0a7004 refact : fix convert script + zero out KV cache to avoid nans (#3523 )

* refact : fix convert script + zero out KV cache to avoid nans

* ggml : silu(-inf) should never happen

* metal : assert various kernel requirements

2023-10-09 14:32:17 +03:00

16 KiB

Raw Blame History

View Raw

16 KiB Raw Blame History

16 KiB

Raw Blame History