llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-12 20:14:09 +00:00

Files

Gaurav Garg 41e3f02647 cuda : revert CUDA_SCALE_LAUNCH_QUEUES override until investigated (#19227)

Hangs were reported on Jetson Orin AGX when CUDA_SCALE_LAUNCH_QUEUES=4x is set. This reverts the previous PR (#19042) and updates the documentation to suggest setting CUDA_SCALE_LAUNCH_QUEUES=4x for faster throughput on multi-GPU systems.

2026-02-03 08:41:02 +02:00

android

android: fix missing screenshots for Android.md (#18156)

2025-12-19 09:32:04 +02:00

backend

Remove support for Nvidia & AMD GPU, because the oneAPI plugin for Nvidia & AMD GPU is unavailable: download/installation channels are out of work. (#19246)

2026-02-02 21:06:21 +08:00

development

docs : fix links in parsing.md (#18245)

2025-12-21 09:35:40 +01:00

multimodal

docs : Minor cleanups (#19252)

2026-02-02 08:38:55 +02:00

ops

sycl: implement GGML_OP_TOP_K (#19242)

2026-02-02 21:05:51 +08:00

android.md

android: fix missing screenshots for Android.md (#18156)

2025-12-19 09:32:04 +02:00

build-riscv64-spacemit.md

refactor : remove libcurl, use OpenSSL when available (#18828)

2026-01-14 18:02:47 +01:00

build-s390x.md

ggml-zdnn: fix #15414, activate FP16 and BF16 acceleration and incorrect zTensor free (#15839)

2025-09-13 02:39:52 +08:00

build.md

cuda : revert CUDA_SCALE_LAUNCH_QUEUES override until investigated (#19227)

2026-02-03 08:41:02 +02:00

docker.md

CLI: fixed adding cli and completion into docker containers, improved docs (#18003)

2025-12-16 11:52:23 +01:00

function-calling.md

common : implement new jinja template engine (#18462)

2026-01-16 11:22:06 +01:00

install.md

docs : add "Quick start" section for new users (#13862)

2025-06-03 13:09:36 +02:00

llguidance.md

llguidance build fixes for Windows (#11664)

2025-02-14 12:46:08 -08:00

multimodal.md

mtmd : add support for Voxtral (#14862)

2025-07-28 15:01:48 +02:00

ops.md

sycl: implement GGML_OP_TOP_K (#19242)

2026-02-02 21:05:51 +08:00

preset.md

preset: allow named remote preset (#18728)

2026-01-10 15:12:29 +01:00

speculative.md

spec : various improvements ton ngram-map + docs (#19253)

2026-02-02 08:26:58 +02:00