Commit Graph

11570 Commits

Author SHA1 Message Date
George Hotz
267bbb163e progress 2026-01-01 21:11:29 -05:00
George Hotz
de29a49ea3 all the ones i can find 2026-01-01 20:56:30 -05:00
George Hotz
742e10a572 remove fake ones 2026-01-01 20:26:53 -05:00
George Hotz
447fe8907b more 2026-01-01 20:22:52 -05:00
George Hotz
b0cfcec183 good 2026-01-01 20:12:20 -05:00
George Hotz
1726084b2a filt 2026-01-01 19:40:43 -05:00
George Hotz
de069a4876 many 2026-01-01 19:21:46 -05:00
George Hotz
4573e91e61 more 2026-01-01 18:51:31 -05:00
George Hotz
8d43212bc6 assembly/amd: start work on SQTT parsing/emulation 2026-01-01 18:40:58 -05:00
George Hotz
a8bea4ec52 remove __all__ 2026-01-01 16:14:15 -05:00
George Hotz
388514c5b1 better 2026-01-01 16:03:29 -05:00
George Hotz
729bb04d8c fix test failure 2026-01-01 13:21:55 -05:00
George Hotz
8f4de73141 two tests 2026-01-01 13:13:01 -05:00
George Hotz
a5959ef0f1 fix all tests 2026-01-01 13:11:51 -05:00
George Hotz
5ba06892c0 generic 2026-01-01 12:46:08 -05:00
George Hotz
469efe313d that's a hack 2026-01-01 12:40:14 -05:00
George Hotz
e3b3cb163d fix emu test 2026-01-01 12:12:47 -05:00
George Hotz
3e32185faf more tests 2026-01-01 12:04:41 -05:00
George Hotz
5328913d2b fix flat bug 2026-01-01 11:51:10 -05:00
George Hotz
9c49ec1cc1 update autogen 2026-01-01 11:36:33 -05:00
George Hotz
000d4a125b fix ds op 2026-01-01 10:36:37 -05:00
George Hotz
63289902d8 refactors 2025-12-31 17:57:27 -05:00
George Hotz
b596f77e33 assembly/amd: add pcode ds ops 2025-12-31 16:59:02 -05:00
George Hotz
2bb07d4824 assembly/amd: move Reg out of the psuedocode (#13934)
* assembly/amd: move Reg out of the psuedocode

* remove extra

* fix pcode tests

* simpler pcode

* simpler

* simpler

* cleaner

* fix mypy
2025-12-31 15:34:51 -05:00
chenyu
52acadc160 consolidate IGNORE_OOB=0 tests (#13937)
add a new unit test file and add more cases
2025-12-31 15:24:20 -05:00
chenyu
c0c1c1c8c8 remove unused validate rule (#13936) 2025-12-31 15:02:49 -05:00
chenyu
b6d08f247d assert z3_xor input type (#13933) 2025-12-31 13:37:57 -05:00
George Hotz
f14428090f assembly/amd: speed up emulator (#13932) 2025-12-31 13:32:25 -05:00
Christopher Milan
13973e4dea refactor image pitch (#13928) 2025-12-31 13:22:38 -05:00
chenyu
051fe6c8bc less toposort iteration in oob validate (#13929) 2025-12-31 13:16:34 -05:00
chenyu
a9a7b33404 IGNORE_OOB=0 in CI (#13903) 2025-12-31 12:56:59 -05:00
George Hotz
29402034a1 assembly/amd: cleanups to asm and emu (#13912)
* a bunch of cleanups

* ops are back

* bug fixes

* cleanups

* a lil simpler

* more refactors

* _disasm_vop1

* sops

* more

* continue

* more

* num_srcs

* simpler

* no _is16

* op cleanups

* isinstnace
2025-12-31 12:46:11 -05:00
chenyu
ba9aa5cd6f skip some PTX IGNORE_OOB validation (#13927) 2025-12-31 12:40:21 -05:00
chenyu
4968060ad4 fix IGNORE_OOB=0 for WEBGPU (#13926) 2025-12-31 10:41:28 -05:00
chenyu
35bd39e4ba update mypy and torch version in ci (#13925) 2025-12-31 10:29:28 -05:00
George Hotz
b998a80b5d assembly/amd: split generated stuff into enum/ins (#13924) 2025-12-31 10:10:52 -05:00
chenyu
404755bafd merge ci ruff tests and update ruff version (#13922) 2025-12-31 09:53:49 -05:00
nimlgen
25440f0f72 all2all (#13902)
* all2all

* um

* fix

* x

* um

* simler

* mypy

* fix

* t

* cmnts
2025-12-31 16:38:32 +03:00
nimlgen
f7ee644950 amd: lazy sdma queue allocation (#13920)
* ams: lazy queue

* nv

* linter

* f
2025-12-31 15:17:13 +03:00
nimlgen
b063518ea7 am: several sdmas (#13919)
* am: several sdmas

* fix
2025-12-31 14:19:22 +03:00
qazal
b23f4517ab prep mi350x gemm for python dsl (#13918)
* start by pruning existing asm

* better branch names

* split to template and real instructions
2025-12-31 20:00:57 +09:00
qazal
3f3786ded9 mmapeak: fix compiler import (#13915) 2025-12-31 16:52:23 +09:00
Christopher Milan
a14896fff2 refactor QCOM arg parsing (#13914)
* refactor QCOM arg parsing

* ruff

* mypy
2025-12-30 19:26:02 -05:00
Christopher Milan
c475c3a6d7 remove useless cast (#13911) 2025-12-30 19:24:29 -05:00
George Hotz
0221b96761 assembly/amd: fix all ops tests (#13910)
* assembly/amd: fix all ops tests

* test_ops with smaller sizes

* ds store/load 2addr
2025-12-30 18:01:34 -05:00
chenyu
dc27eb48ac remove PYTHONPATH="." from test.yml (#13909) 2025-12-30 17:00:16 -05:00
George Hotz
efc99d0c55 assembly/amd: more refactors (#13907)
* assembly/amd: more refactors

* more refactors

* more refactors

* simpler emu

* generate.py

* regen all

* cleanups

* more

* work

* more readme

* lil
2025-12-30 16:13:24 -05:00
George Hotz
49d1bf93d6 assembly/amd: refactor asm.py to be simpler (#13900)
* assembly/amd: refactor asm.py

* assembly/amd: refactor asm.py to be simpler

* multiple fxns

* fast

* more tests pass

* regen

* stop decode
2025-12-30 13:51:40 -05:00
George Hotz
04c79505ec no subnormal bf16 (#13905) 2025-12-30 13:02:53 -05:00
chenyu
39f99b207a update IGNORE_OOB error message (#13904)
IGNORE_OOB=1 to disable
2025-12-30 12:25:55 -05:00