yulo
f3dd7b8e68
HIP: add mmf for CDNA (#18896)
* refactor mmf rows_per_block
* speed up compile
* pass cdna compile
* fix cuda error
* clean up mmf
* f32 mmf
* clean float mma
* fix mmf error
* faster mmf
* extend tile k
* fix compile error
* Revert "extend tile k"
This reverts commit 4d2ef3d483.
* fix smem overflow
* speed up compiling mmf
* speed up compile for hip
* 512 block for cdna
* config pad size
* fix as comment
* update select logic
* move some code to cuh
* fix as comment
* correct cdna3 config
---------
Co-authored-by: zhang hui <you@example.com>
2026-01-29 11:10:53 +01:00