11633 Commits

Author SHA1 Message Date
George Hotz
0e240fb987 Merge branch 'master' into amd_sqtt 2026-01-02 20:30:16 -05:00
George Hotz
d2c1712e4c more tests 2026-01-02 17:29:48 -08:00
George Hotz
96b0ee0966 lil 2026-01-02 16:53:31 -08:00
George Hotz
9b5c4bc698 shorter 2026-01-02 16:48:26 -08:00
George Hotz
6ea3586101 short 2026-01-02 16:45:34 -08:00
George Hotz
92cb8b6776 tests pass 2026-01-02 16:43:03 -08:00
George Hotz
c416b20668 failures 2026-01-02 15:54:02 -08:00
George Hotz
415b83ba18 tests pass 2026-01-02 15:47:39 -08:00
George Hotz
8c7eacea59 getting close 2026-01-02 15:25:18 -08:00
George Hotz
81542699f8 work 2026-01-02 14:39:52 -08:00
Christopher Milan
9dc524536f IMAGE=1 creates "dynamic" images (#13769)
* remove image from BufferSpec

* cl tiny_gemm (64) works

* mypy

* padding

* openpilot CL

* reshape properly

* remove extra qcom checks

* pad output

* mypy

* update compile test

* move undo

* TestImageCopy valid images

* TestImageRealization valid images

* TestImageDType valid images

* cleanups

* test_renderer_failures

* ruff

* mypy

* simplify ops_qcom

* bump step time

* Revert "bump step time"

This reverts commit 75a037c7d0.

* "dynamic textures" are optional

* a start

* IMAGE=1 works, no FLOAT16

* fast but wrong

* mypy

* some fixes

* better

* works

* refactor

* oops
2026-01-02 16:22:39 -05:00
George Hotz
79f55a5d5e test_snop is correct 2026-01-02 12:01:08 -08:00
George Hotz
37518fb236 start with nop 2026-01-02 11:40:08 -08:00
George Hotz
672008ccab framework 2026-01-02 11:31:41 -08:00
George Hotz
849af761a4 simpler 2026-01-02 11:10:40 -08:00
Christopher Milan
61dc70f1a8 add driving_vision IMAGE=1 benchmark (#13979) 2026-01-02 13:58:27 -05:00
George Hotz
ab46b3d8d3 origin/master 2026-01-02 10:47:00 -08:00
George Hotz
df20197bfb rever emu to master 2026-01-02 10:46:46 -08:00
George Hotz
2b56c264d5 compare tests 2026-01-02 10:39:07 -08:00
George Hotz
c7e5c2f996 Merge origin/master, remove deleted test_emu.py 2026-01-02 09:41:34 -08:00
George Hotz
0e282025ff assembly/amd: split test_emu into hw tests (#13966)
* assmebly/amd: split test_emu into hw tests

* hw tests

* bugfixes

* more tests and fix
2026-01-02 08:04:56 -08:00
chenyu
2e2b5fed12 fix misspellings (#13976) 2026-01-02 10:37:38 -05:00
nietras
f49e4714af Fix spelling errors in README for AMD assembly (#13975) 2026-01-02 10:15:20 -05:00
b1tg
a78fcc55a4 amd tc 1616128 (#13439)
* amd tc 1616128

* fix test

* remove hardcoded check in test
2026-01-02 09:01:05 -05:00
chenyu
fcbb896e05 remove unused to_struct [pr] (#13973) 2026-01-02 08:54:57 -05:00
George Hotz
659aa14043 orks 2026-01-02 05:29:48 -08:00
nimlgen
ff7853a65a am: fix aid doorbells (#13971) 2026-01-02 15:53:44 +03:00
nimlgen
42abb0586c am: fix aid doorbells (#13972) 2026-01-02 15:53:13 +03:00
nimlgen
ebbaad6bfd am: enable all sdma engines (#13970) 2026-01-02 15:25:15 +03:00
qazal
5f52266225 mi350x gemm: use Tensor.custom_kernel in asm test (#13969)
* mi350x gemm: use Tensor.custom_kernel in asm test

* A @ B for baseline
2026-01-02 18:30:50 +09:00
George Hotz
21ffa1a86b 64 nops 2026-01-02 00:38:27 -05:00
George Hotz
29f3fb7af3 still stable 2026-01-01 23:45:19 -05:00
George Hotz
1edc7fc519 stable 2026-01-01 23:43:43 -05:00
George Hotz
c9a3ac988c cleanest 2026-01-01 23:18:19 -05:00
George Hotz
5a1a561e0f assembly/amd: rdna4 autogen (#13967)
* assembly/amd: add pcode ds ops

* refactors

* fix ds op

* update autogen

* fix flat bug

* more tests

* fix emu test

* that's a hack

* generic

* fix all tests

* two tests

* fix test failure

* better

* remove __all__

* assembly/amd: fix autogen for RDNA4
2026-01-01 23:12:18 -05:00
George Hotz
77d96acbe3 clean 2026-01-01 22:59:07 -05:00
George Hotz
660ecf272b work 2026-01-01 22:50:50 -05:00
George Hotz
267bbb163e progress 2026-01-01 21:11:29 -05:00
wozeparrot
b27527f05a fix: missed inner tracked range (#13964) 2026-01-01 18:09:57 -08:00
wozeparrot
ecbac8a338 tk: fa cleanups + causal test (#13963) 2026-01-01 18:05:00 -08:00
George Hotz
de29a49ea3 all the ones i can find 2026-01-01 20:56:30 -05:00
George Hotz
742e10a572 remove fake ones 2026-01-01 20:26:53 -05:00
George Hotz
447fe8907b more 2026-01-01 20:22:52 -05:00
George Hotz
b0cfcec183 good 2026-01-01 20:12:20 -05:00
George Hotz
1726084b2a filt 2026-01-01 19:40:43 -05:00
chenyu
af0392efea only set DiskDevice.size if it opens successfully (#13962) 2026-01-01 19:33:26 -05:00
George Hotz
de069a4876 many 2026-01-01 19:21:46 -05:00
George Hotz
4573e91e61 more 2026-01-01 18:51:31 -05:00
George Hotz
8d43212bc6 assembly/amd: start work on SQTT parsing/emulation 2026-01-01 18:40:58 -05:00
chenyu
e036d6df89 properly fix DiskDevice reuse (#13961) 2026-01-01 18:08:23 -05:00