Files
llama.cpp/examples/server/server.cpp
Pavel Fatin 1b17ed7ab6 Direct I/O and Transparent HugePages
--direct-io for bypassing page cache (and using THP on Linux)

Up to 3-6x faster uncached loading, fewer pageouts, no page cache pollution.
2024-05-21 01:35:23 +02:00

154 KiB