Default Branch

05e141a6b3 · vulkan: Support asymmetric FA in coopmat2 path (#21753) · Updated 2026-05-01 13:28:32 +00:00

Branches

da1f16886f · load directly from downloaded state · Updated 2026-05-01 13:39:00 +00:00 · 0 behind, 14 ahead

88d8c574ac · Converge implementation with export-graph-ops · Updated 2026-05-01 13:38:53 +00:00 · 0 behind, 8 ahead

033e652e92 · output device group info · Updated 2026-05-01 12:53:12 +00:00 · 20 behind, 4 ahead

1b2bd8699c · fix windows build · Updated 2026-04-30 19:52:31 +00:00 · 10 behind, 8 ahead

9d5887035f · testing · Updated 2026-04-30 16:18:57 +00:00 · 7 behind, 2 ahead

a7c1110e87 · server : avoid checkpoint data host copies · Updated 2026-04-30 16:16:31 +00:00 · 7 behind, 1 ahead

211e58178a · wip · Updated 2026-04-30 08:24:32 +00:00 · 13 behind, 20 ahead

c64e772d35 · pi : add rule to use gh CLI for GitHub resources · Updated 2026-04-30 06:50:39 +00:00 · 10 behind, 2 ahead

6eddb1c6e3 · pi : add rule to use gh CLI for GitHub resources · Updated 2026-04-30 06:49:54 +00:00 · 10 behind, 2 ahead

a7fb22fc50 · server : validate --tools CLI argument against known tool names · Updated 2026-04-30 06:40:58 +00:00 · 10 behind, 1 ahead

c6a04cb5c3 · ggml-metal: fix 2D async copy to use row-by-row transfers · Updated 2026-04-29 11:57:48 +00:00 · 20 behind, 3 ahead

fd6f79c7a4 · download : prefer q8_0 when q4_k not available · Updated 2026-04-27 09:08:25 +00:00 · 49 behind, 1 ahead

cb9fc575e4 · common : use pimpl in debug.h to reduce header dependencies · Updated 2026-04-26 06:49:28 +00:00 · 70 behind, 3 ahead

b9421898b6 · add for Q4_0 · Updated 2026-04-23 07:33:19 +00:00 · 212 behind, 2 ahead

a5355a0226 · server: keep router model refcount to avoid unloading models that have running requests · Updated 2026-04-22 08:07:13 +00:00 · 125 behind, 15 ahead

35df147d80 · cont : remove /api/tags · Updated 2026-04-20 12:45:42 +00:00 · 140 behind, 2 ahead

4943e3a396 · gen-libllama-abi: compile sort-key regex once outside the lambda · Updated 2026-04-15 12:04:44 +00:00 · 195 behind, 4 ahead

c5b682b25c · various clean up · Updated 2026-04-13 15:39:14 +00:00 · 216 behind, 3 ahead

4cabbe36e0 · state · Updated 2026-04-09 11:00:31 +00:00 · 292 behind, 16 ahead

a30369d515 · cpu: fix ARM NEON nvfp4 vec dot · Updated 2026-04-06 08:27:03 +00:00 · 323 behind, 1 ahead