qazal
2cc64d71b0
simplify mi350x gemm / viz asm tests (#13984)
* mi350x gemm cleanup
* asm tests work
* simpler asm tests
2026-01-03 11:11:07 +09:00
qazal
9726500de8
enable using assembly in Tensor.custom_kernel (#13895)
2026-01-02 00:12:01 +09:00
George Hotz
b998a80b5d
assembly/amd: split generated stuff into enum/ins (#13924)
2025-12-31 10:10:52 -05:00
George Hotz
81cf9ea0ab
rename to extra.assembly.amd (#13879)
2025-12-29 14:10:55 -05:00
George Hotz
f07c39cfa4
hwtest fixes for rdna3 dsl (#13865)
2025-12-28 20:42:29 -05:00
qazal
2180eee5e4
use the asm dsl in remu hwtest.py (#13856)
* remu hw test with the asm dsl
* simpler
* nthreads and exec mask
* cmp/cmpx
* assembler error in s_mov_b32
* vopd in dsl?
2025-12-28 11:32:41 +09:00
qazal
2f95c10702
remu new instructions / use volatile in emulator tests (#12862)
* remu new instructions
* start moving to volatile
* test_simple works
* test_exec_mov works and lid is still here
* test_exec_cmp_vopc
* clang did s_mov_b32 exec_lo, 1
* don't hardcode v1
* support volatile in tests
* hw_test passes
* only the volatile version
* subrev saturating behavior
2025-10-23 11:13:43 +08:00
George Hotz
32e9949052
rename lazydata to uop (#10698)
2025-06-08 08:42:22 -07:00
qazal
17f0f5e764
add v_rcp_f32_e64 to remu (#10393)
* tests from the box
* add v_rcp_f32_e64 to remu
* f32::from_bits utils
* v_cndmask_b32 tests
2025-05-18 17:08:21 +03:00
qazal
be8202b293
add s_abs_i32 instruction to remu (#10334)
2025-05-15 16:47:58 +03:00
qazal
9210280811
add v_fmac_f16 vop3 instruction to remu (#10247)
* fmac vop3
* from the box
2025-05-10 23:48:25 +03:00
qazal
ac37510f60
remu: only write v_cmp result if exec is set (#10084)
2025-04-28 20:31:52 +08:00