Files
llama.cpp/examples/perplexity/perplexity.cpp
Iwan Kawrakow ccc78a200e hellaswag: speed up even more by parallelizing log-prob evaluation
For Mistral-7B and fp16, time on my system goes down from 536 seconds
to 423 seconds for the full evaluation dataset (10042 tasks).
2024-01-18 18:25:29 +02:00

40 KiB