Default Branch

b97ebdc98f · llama-quant : fix --tensor-type when default qtype is overriden (#22572) · Updated 2026-05-01 17:55:55 +00:00

Branches

121fe62182 · test · Updated 2026-03-06 14:30:32 +00:00

783
7

4b436e4e5e · flake8 fix · Updated 2026-02-23 10:48:01 +00:00

891
20

5d45884106 · metal : fix build · Updated 2026-02-18 07:14:31 +00:00

998
23

5da56dc1d8 · args : add -kvu to llama-parallel · Updated 2026-02-12 19:50:01 +00:00    sdgoij

998
17

e7fbfc9b80 · ci : tmp fixes · Updated 2026-02-11 13:48:40 +00:00    sdgoij

1048
22

5372fc6461 · wip · Updated 2026-02-10 21:44:42 +00:00    sdgoij

1013
18

b9b56b017e · Apply suggestion from @ggerganov (src->buffer to buf_src) v2 · Updated 2026-02-10 11:00:44 +00:00    sdgoij

1013
13

5144018e7b · cont : simplify · Updated 2026-02-07 12:50:05 +00:00    sdgoij

1034
4

1213a03564 · qwen3next : fix chunking · Updated 2026-02-04 08:06:38 +00:00    sdgoij

1067
1

5b01d8575d · examples : add compare-mlx · Updated 2026-01-31 07:57:35 +00:00    sdgoij

1104
1

6c8a04576e · experiments · Updated 2026-01-28 07:45:07 +00:00    sdgoij

1155
29

8b407e3978 · quant : manual overrides of tensor types take precedence · Updated 2026-01-20 09:20:24 +00:00    sdgoij

1219
1

3bfbbcc5fc · winget : update komac version · Updated 2026-01-18 08:29:03 +00:00    sdgoij

1229
1

e2751545b9 · cont : inline verification · Updated 2026-01-17 12:33:07 +00:00    sdgoij

1241
5

36f0132464 · CUDA: Factor out and re-use block_reduce function (#18785) · Updated 2026-01-15 02:44:54 +00:00    sdgoij

1260
0
Included

60864997fe · fit-params : print signed int for -ngl param · Updated 2026-01-14 17:59:23 +00:00    sdgoij

1263
1

5292965711 · Merge branch 'master' into xsn/lora_keep_track · Updated 2026-01-13 12:44:22 +00:00    sdgoij

1277
4

08b5d956fc · minor : std::unordered_set over std::set · Updated 2026-01-12 11:35:25 +00:00    sdgoij

1416
3

4a2751258a · server : simplify prompt state transition branches · Updated 2026-01-09 15:46:03 +00:00    sdgoij

1315
11

caff0fd247 · server : adjust unified KV cache tests · Updated 2026-01-09 12:26:14 +00:00

1315
1