qazal
c7f908b788
sqtt: fix rdna4 structs ( #15111 )
...
* work
* DEBUG=2
2026-03-03 23:32:14 +09:00
qazal
8dd691761d
sqtt: remove old files ( #15108 )
2026-03-03 22:43:24 +09:00
b1tg
a9ea36de79
assembly/amd: v_cmp_lg_f32 is ordered not-equal ( #14982 )
2026-03-03 15:37:48 +08:00
qazal
b8a55d5f68
sqtt: new packet types, add discovery script ( #14960 )
2026-02-28 04:27:27 +09:00
qazal
ad99b77f6d
assembly/amd: add gfx12_asm_vflat llvm tests, disasm fixes ( #15046 )
...
* add gfx12_asm_vflat.s
* work
2026-02-27 20:20:31 +09:00
qazal
d6145736c7
sqtt: examples generator changes from inst_discovery ( #14961 )
...
* sqtt examples generator changes from inst_discovery
* rdna4
* rdna3
* cdna
* sad reality for mi300x
2026-02-23 14:42:48 +09:00
qazal
60f90dd97c
sqtt: fix jitted program deduping, failing test for graphed kernels ( #14951 )
...
* work
* hcq_profile fix, test with JIT=2 passes
* ci, -n=auto
* rm duplicate test
* less
2026-02-22 15:22:31 +09:00
qazal
16ae96fa58
finish rdna4 sqtt ( #14903 )
...
* unskip
* it's a wave pair in rdna4
* work
* that
* hidden archive
* generic s_delay, mystery InstOpRDNA4.UNK_60
* branch failing test
* UNK_60 is OTHER_VMEM_STORE
* rdna4 has both s_delay_alu and s_wait_alu
* real branch failing test
* rdna4 doesn't have JUMP_NO, it's NEXT with a flag for no jump
* make inst_delay skips recursive
* all rdna4 tests pass
* simm16 unwraps
* that has a name
2026-02-20 16:06:13 +09:00
qazal
52b51a0324
test fixes from rdna4 sqtt ( #14902 )
2026-02-20 14:42:33 +09:00
qazal
32f569b573
viz/sqtt: decoder fixes pre rdna4/cdna4 work ( #14900 )
...
* viz/sqtt: decoder fixes pre rdna4/cdna4 work
* fix
* branch_inst + more tests
* smaller
2026-02-20 12:10:15 +09:00
qazal
911399bee5
assembly/amd: move the kernel capture stuff out of helpers ( #14881 )
2026-02-19 16:28:48 +09:00
George Hotz
47d39a6b8b
add sqtt support to the emulator ( #14791 )
...
* add sqtt support to the emulator
* more sqtt
* cleanup
* cleanups
* simpler tests
* some decent tests
* test branch
2026-02-16 16:48:26 +08:00
qazal
ac62d28ddc
viz: amdgpu arch cleanup ( #14790 )
...
* viz: amdgpu arch cleanup
* don't do that
* simpler sqttmap
* work
* self.arch
2026-02-16 16:48:12 +09:00
George Hotz
401095e3e7
emulator barrier tests ( #14789 )
2026-02-16 15:31:01 +08:00
George Hotz
dff9cf35c2
amd asm emulator fixes + run it in CI ( #14786 )
...
* amd asm fix, try 2
* fix tests
2026-02-16 13:24:21 +08:00
George Hotz
0abcb9aac2
move more to mixins ( #14780 )
...
* move more to mixins
* revert
* move some
* do not change
* more
* fix tests
* Revert "more"
This reverts commit d942d59fa4 .
* go
* work
* more
* work
* guard
* base
2026-02-16 11:35:00 +08:00
George Hotz
e8bd432bf6
move amd emulator out of tree ( #14740 )
...
* move amd emulator out of tree
* move the readme too
2026-02-14 10:32:00 +08:00
George Hotz
5289b4e882
renderer/amd: add cdna emulator ( #14721 )
...
* renderer/amd: add cdna emulator
* fixes
* no predecode
* no early
* REMU_PATH
* delete that
* round
* Fix cache invalidation check in _compile_smem
2026-02-13 16:06:58 +08:00
George Hotz
4088d686b2
remove llvm requirement from amd ( #14717 )
...
* remove llvm requirement from amd
* tests pass
* test
* sink kernarg_size
* move stuff
* amd_asm_matmul to new style
* default type
* fix tests, simpler
* cu mode is faster and simpler
* darken
2026-02-13 10:50:12 +08:00
George Hotz
d3adb8428e
Revert "hotfix: skip test/amd in macpytest" ( #14704 )
...
* Revert "hotfix: skip test/amd in macpytest"
This reverts commit b7dade2adf .
* no llvm subprocess
* simpler
* sys.exec
* cleanup
* process safe
* diag
* arm ftz support
* 5 sec
* this one
2026-02-13 08:00:24 +08:00
George Hotz
19e68a1833
skip AMD on not AMD ( #14703 )
2026-02-12 18:56:54 +08:00
George Hotz
4680247e35
renderer/amd: move in tree ( #14702 )
...
* renderer/amd: move in tree
* fix paths in tests
* 24000 lines
* no delete for amd files
2026-02-12 18:09:16 +08:00