mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-15 13:34:06 +00:00

Files

ggerganov 43f14a0a46 llama-eval : support multiple evaluation endpoints with dynamic task distribution

- Add ServerConfig dataclass (url, threads, name)
- Accept comma-separated --server, --threads, --server-name CLI args
- Dynamic shared-queue task distribution across servers (fast servers do more work)
- One ThreadPoolExecutor per server, workers pull from shared Queue
- Track which server processed each task (server_name in results)
- Thread-safe EvalState with threading.Lock for concurrent mutations
- Server column in HTML report and console output
- Backward compatible: single server works as before

Assisted-by: llama.cpp:local pi

2026-05-10 20:42:23 +03:00

llama-eval.py

llama-eval : support multiple evaluation endpoints with dynamic task distribution

2026-05-10 20:42:23 +03:00

llama-server-simulator.py

sim : fix answer matching

2026-05-10 18:13:46 +03:00

README.md

remove junk

2026-05-10 18:13:50 +03:00

test-simulator.sh

test : fix path

2026-05-10 18:13:46 +03:00

README.md

llama-eval

Simple evaluation tool for llama.cpp with support for multiple datasets.

TODO: add usage