AesSedai
046e284437
Add flash attention MMA / Tiles to support MiMo-V2.5 (#22812)
* mimo-v2.5: add flash attention mma/tiles for d_kq=192 d_v=128
* mimo-v2.5: follow (256, 256) fattn templates
* mimo-v2.5: cleanup comments
* mimo-v2.5: further comment cleanup
* mimo-v2.5: address PR feedback
  fix GQA handling
  check for other dangling 320/576 carveouts and mirror them for 192
  add to backend ops test so new paths are covered
2026-05-09 11:28:29 +08:00