George Hotz
267bbb163e
progress
2026-01-01 21:11:29 -05:00
George Hotz
de29a49ea3
all the ones i can find
2026-01-01 20:56:30 -05:00
George Hotz
742e10a572
remove fake ones
2026-01-01 20:26:53 -05:00
George Hotz
447fe8907b
more
2026-01-01 20:22:52 -05:00
George Hotz
b0cfcec183
good
2026-01-01 20:12:20 -05:00
George Hotz
1726084b2a
filt
2026-01-01 19:40:43 -05:00
George Hotz
de069a4876
many
2026-01-01 19:21:46 -05:00
George Hotz
4573e91e61
more
2026-01-01 18:51:31 -05:00
George Hotz
8d43212bc6
assembly/amd: start work on SQTT parsing/emulation
2026-01-01 18:40:58 -05:00
George Hotz
a8bea4ec52
remove __all__
2026-01-01 16:14:15 -05:00
George Hotz
388514c5b1
better
2026-01-01 16:03:29 -05:00
George Hotz
729bb04d8c
fix test failure
2026-01-01 13:21:55 -05:00
George Hotz
8f4de73141
two tests
2026-01-01 13:13:01 -05:00
George Hotz
a5959ef0f1
fix all tests
2026-01-01 13:11:51 -05:00
George Hotz
5ba06892c0
generic
2026-01-01 12:46:08 -05:00
George Hotz
469efe313d
that's a hack
2026-01-01 12:40:14 -05:00
George Hotz
e3b3cb163d
fix emu test
2026-01-01 12:12:47 -05:00
George Hotz
3e32185faf
more tests
2026-01-01 12:04:41 -05:00
George Hotz
5328913d2b
fix flat bug
2026-01-01 11:51:10 -05:00
George Hotz
9c49ec1cc1
update autogen
2026-01-01 11:36:33 -05:00
George Hotz
000d4a125b
fix ds op
2026-01-01 10:36:37 -05:00
George Hotz
63289902d8
refactors
2025-12-31 17:57:27 -05:00
George Hotz
b596f77e33
assembly/amd: add pcode ds ops
2025-12-31 16:59:02 -05:00
George Hotz
2bb07d4824
assembly/amd: move Reg out of the psuedocode (#13934)
* assembly/amd: move Reg out of the psuedocode
* remove extra
* fix pcode tests
* simpler pcode
* simpler
* simpler
* cleaner
* fix mypy
2025-12-31 15:34:51 -05:00
chenyu
52acadc160
consolidate IGNORE_OOB=0 tests (#13937)
add a new unit test file and add more cases
2025-12-31 15:24:20 -05:00
chenyu
c0c1c1c8c8
remove unused validate rule (#13936)
2025-12-31 15:02:49 -05:00
chenyu
b6d08f247d
assert z3_xor input type (#13933)
2025-12-31 13:37:57 -05:00
George Hotz
f14428090f
assembly/amd: speed up emulator (#13932)
2025-12-31 13:32:25 -05:00
Christopher Milan
13973e4dea
refactor image pitch (#13928)
2025-12-31 13:22:38 -05:00
chenyu
051fe6c8bc
less toposort iteration in oob validate (#13929)
2025-12-31 13:16:34 -05:00
chenyu
a9a7b33404
IGNORE_OOB=0 in CI (#13903)
2025-12-31 12:56:59 -05:00
George Hotz
29402034a1
assembly/amd: cleanups to asm and emu (#13912)
* a bunch of cleanups
* ops are back
* bug fixes
* cleanups
* a lil simpler
* more refactors
* _disasm_vop1
* sops
* more
* continue
* more
* num_srcs
* simpler
* no _is16
* op cleanups
* isinstnace
2025-12-31 12:46:11 -05:00
chenyu
ba9aa5cd6f
skip some PTX IGNORE_OOB validation (#13927)
2025-12-31 12:40:21 -05:00
chenyu
4968060ad4
fix IGNORE_OOB=0 for WEBGPU (#13926)
2025-12-31 10:41:28 -05:00
chenyu
35bd39e4ba
update mypy and torch version in ci (#13925)
2025-12-31 10:29:28 -05:00
George Hotz
b998a80b5d
assembly/amd: split generated stuff into enum/ins (#13924)
2025-12-31 10:10:52 -05:00
chenyu
404755bafd
merge ci ruff tests and update ruff version (#13922)
2025-12-31 09:53:49 -05:00
nimlgen
25440f0f72
all2all (#13902)
* all2all
* um
* fix
* x
* um
* simler
* mypy
* fix
* t
* cmnts
2025-12-31 16:38:32 +03:00
nimlgen
f7ee644950
amd: lazy sdma queue allocation (#13920)
* ams: lazy queue
* nv
* linter
* f
2025-12-31 15:17:13 +03:00
nimlgen
b063518ea7
am: several sdmas (#13919)
* am: several sdmas
* fix
2025-12-31 14:19:22 +03:00
qazal
b23f4517ab
prep mi350x gemm for python dsl (#13918)
* start by pruning existing asm
* better branch names
* split to template and real instructions
2025-12-31 20:00:57 +09:00
qazal
3f3786ded9
mmapeak: fix compiler import (#13915)
2025-12-31 16:52:23 +09:00
Christopher Milan
a14896fff2
refactor QCOM arg parsing (#13914)
* refactor QCOM arg parsing
* ruff
* mypy
2025-12-30 19:26:02 -05:00
Christopher Milan
c475c3a6d7
remove useless cast (#13911)
2025-12-30 19:24:29 -05:00
George Hotz
0221b96761
assembly/amd: fix all ops tests (#13910)
* assembly/amd: fix all ops tests
* test_ops with smaller sizes
* ds store/load 2addr
2025-12-30 18:01:34 -05:00
chenyu
dc27eb48ac
remove PYTHONPATH="." from test.yml (#13909)
2025-12-30 17:00:16 -05:00
George Hotz
efc99d0c55
assembly/amd: more refactors (#13907)
* assembly/amd: more refactors
* more refactors
* more refactors
* simpler emu
* generate.py
* regen all
* cleanups
* more
* work
* more readme
* lil
2025-12-30 16:13:24 -05:00
George Hotz
49d1bf93d6
assembly/amd: refactor asm.py to be simpler (#13900)
* assembly/amd: refactor asm.py
* assembly/amd: refactor asm.py to be simpler
* multiple fxns
* fast
* more tests pass
* regen
* stop decode
2025-12-30 13:51:40 -05:00
George Hotz
04c79505ec
no subnormal bf16 (#13905)
2025-12-30 13:02:53 -05:00
chenyu
39f99b207a
update IGNORE_OOB error message (#13904)
IGNORE_OOB=1 to disable
2025-12-30 12:25:55 -05:00