mirror of https://github.com/ggml-org/llama.cpp.git
synced 2026-05-10 19:14:07 +00:00
* server: enrich health endpoint with available slots, return 503 if no slots are available
* server: document new status "no slot available" in the README.md
122 KiB