George Hotz
cb500466c2
assembly/amd: amd_asm_matmul (#13989)
* amd_asm_matmul
* dsl transform
* asm roundtrip
* fixed
* less
* better
* more
* simpler
* simplify
* lil
* simpler
* compact
* work
* cleanups
* simplify
* simpler
* cleanup
* name the regs
* simp
* big simp
* big simp
* simp
* acc grid
* fast
* stuff
* fast
* simpler
* owrks
* save vgprs
* save vgprs
* Compact
* less VGPRs
* after
* SQTT support
* fastest
* faster
* lil faster
* tile regs
* faster
* readable
* one more
* simpler
* lil simpler
* NO_GLOBAL skips early globals
* stock kernel
* cleanups
* cleanups
* one b reg
* safe reg changes
* acc is compact now
* remove confusing stuff
* sregs
* lds cleanups
* vopd
2026-01-07 20:11:05 -08:00
..
2025-07-28 19:35:48 -07:00
2026-01-03 18:34:23 +09:00
2025-03-19 15:04:57 +08:00
2025-12-28 21:45:42 +09:00
2026-01-07 20:11:05 -08:00
2025-12-19 17:04:24 -04:00
2026-01-03 18:34:23 +09:00
2024-11-20 00:10:29 +08:00
2024-11-20 00:10:29 +08:00
2025-03-10 16:05:30 -04:00
2023-12-05 13:28:24 -08:00
2023-12-05 13:28:24 -08:00
2025-06-26 16:14:57 -07:00
2024-11-20 00:10:29 +08:00
2025-09-10 15:15:48 -04:00
2025-11-03 18:09:09 -08:00
2024-03-26 20:38:03 -07:00
2025-06-06 18:38:37 -04:00
2025-06-06 18:38:37 -04:00
2025-10-31 19:40:36 +08:00
2025-11-13 09:09:28 -08:00
2025-11-03 13:01:48 +08:00
2024-08-07 22:32:11 -07:00
2025-03-10 16:05:30 -04:00
2025-12-19 17:04:24 -04:00
2025-03-10 16:05:30 -04:00
2025-12-19 17:04:24 -04:00
2025-11-13 16:38:40 -08:00
2025-12-19 17:04:24 -04:00
2025-08-07 14:19:17 -07:00