llama.cpp/examples/perplexity/perplexity.cpp at a1c004ef2e056cdeffcd47aaac196883bb123a3a

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-12 03:54:06 +00:00

Files

Georgi Gerganov ad19812cda perplexity : faster HellaSwag via batching (#5017 )

* perplexity : faster HellaSwag

ggml-ci

* perplexity : clean-up

ggml-ci

* perplexity : no need for decode_helper

ggml-ci

* perplexity : add comments

* perplexity : option to specify max batched tasks via `n_parallel`

* perplexity : remove HellaSwag restruction for n_batch

2024-01-18 15:33:01 +02:00

38 KiB

Raw Blame History

View Raw

38 KiB Raw Blame History

38 KiB

Raw Blame History