llama.cpp/examples/server/server.cpp at e9095e6098dc21ec51f97f96dd7d9d3e18ed9753

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-14 04:54:06 +00:00

Pavel Fatin 1b17ed7ab6 Direct I/O and Transparent HugePages

--direct-io for bypassing page cache (and using THP on Linux)

Up to 3-6x faster uncached loading, fewer pageouts, no page cache pollution.
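
As a rough illustration of the technique the commit message names, here is a minimal sketch of reading a file with `O_DIRECT` into an anonymous mapping that is hinted for Transparent HugePages. It is not taken from this commit: `direct_buffer`, `read_file_direct`, `free_direct_buffer`, and the 4096-byte `k_align` are hypothetical names and values chosen for the example, and the fallback to buffered reads is an assumption about how one might handle filesystems that reject direct I/O.

```cpp
#include <fcntl.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

#include <cassert>
#include <cerrno>
#include <cstddef>

// Assumed alignment: O_DIRECT needs buffer, offset, and length aligned to the
// device's logical block size (commonly 512 or 4096); 4096 satisfies both.
static const size_t k_align = 4096;

// Hypothetical holder for the loaded bytes (not a llama.cpp type).
struct direct_buffer {
    void * data     = nullptr;
    size_t capacity = 0; // mapped size, a multiple of k_align
    size_t size     = 0; // bytes actually read from the file
};

static void free_direct_buffer(direct_buffer & buf) {
    if (buf.data != nullptr) {
        munmap(buf.data, buf.capacity);
    }
    buf = direct_buffer{};
}

// Reads the whole file, bypassing the page cache where the platform allows it.
static bool read_file_direct(const char * path, direct_buffer & out) {
    int  fd     = -1;
    bool direct = false;
#ifdef O_DIRECT
    fd     = open(path, O_RDONLY | O_DIRECT);
    direct = fd >= 0;
#endif
    if (fd < 0) {
        fd = open(path, O_RDONLY); // fall back to ordinary buffered I/O
    }
    if (fd < 0) {
        return false;
    }

    struct stat st;
    if (fstat(fd, &st) != 0) {
        close(fd);
        return false;
    }
    const size_t file_size = (size_t) st.st_size;

    // Round the buffer up so every direct read has an aligned length.
    size_t capacity = (file_size + k_align - 1) / k_align * k_align;
    if (capacity == 0) {
        capacity = k_align;
    }
    assert(capacity % k_align == 0);

    // Anonymous mappings are page-aligned, which satisfies k_align.
    void * data = mmap(nullptr, capacity, PROT_READ | PROT_WRITE,
                       MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (data == MAP_FAILED) {
        close(fd);
        return false;
    }
#ifdef MADV_HUGEPAGE
    madvise(data, capacity, MADV_HUGEPAGE); // best-effort THP hint; result ignored
#endif

    size_t off = 0;
    while (off < file_size) {
        ssize_t n = read(fd, (char *) data + off, capacity - off);
        if (n > 0) { off += (size_t) n; continue; }
        if (n == 0) { break; }               // file shrank underneath us
        if (errno == EINTR) { continue; }
#ifdef O_DIRECT
        if (errno == EINVAL && direct) {
            // The filesystem rejected the direct read: drop O_DIRECT and retry.
            int flags = fcntl(fd, F_GETFL);
            if (flags >= 0 && fcntl(fd, F_SETFL, flags & ~O_DIRECT) == 0) {
                direct = false;
                continue;
            }
        }
#endif
        munmap(data, capacity);
        close(fd);
        return false;
    }
    (void) direct;
    close(fd);

    out.data     = data;
    out.capacity = capacity;
    out.size     = off;
    return true;
}
```

The aligned buffer and length are what let the kernel transfer data straight from the device into user memory without populating the page cache; the `madvise` call only requests huge pages, and the kernel may ignore it depending on the system's THP settings.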

2024-05-21 01:35:23 +02:00
