Files
tinygrad/extra/gemm
qazal 616e9c1483 CDNA assembly gemm in tensor.py with flag (#14310)
* work

* work

* the assembly

* remove the old one

* remove ws bufs, assert splitk

* notes cleanup

* work

* gemm args

* gemm in mixins would be nice

* add gemm gradient

* print counters

* the realize is for DEBUG=2 aesthetics

* dedup

* rewrite to python dsl, no list copies

* leave that

* add B, M, N, K to gemm name

* it's M0 not NULL

* fp16 support

* test cleanup + more gemms

* work from viz

* more work

* gemm batch_size

* xccg path work

* tiny comments on the label naming

* s_waitcnt
2026-01-31 22:34:14 +09:00
..
2025-07-28 19:35:48 -07:00
2025-03-10 16:05:30 -04:00
2023-12-05 13:28:24 -08:00
2023-12-05 13:28:24 -08:00
2025-06-26 16:14:57 -07:00
2025-09-10 15:15:48 -04:00
2025-06-06 18:38:37 -04:00
2025-06-06 18:38:37 -04:00
2025-03-10 16:05:30 -04:00
2025-03-10 16:05:30 -04:00
2025-08-07 14:19:17 -07:00