llama.cpp/examples/server/server.cpp at 0f95689c170fdef48bb72632cca26c4a5da628a7

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-11 11:34:10 +00:00

Files

Tobias Lütke 7a3895641c allow server to multithread

because web browsers send a lot of garbage requests we want the server
to multithread when serving 404s for favicon's etc. To avoid blowing up
llama we just take a mutex when it's invoked.

2023-07-04 09:14:49 -04:00

42 KiB

Raw Blame History

View Raw

42 KiB Raw Blame History

42 KiB

Raw Blame History