Commit Graph

11715 Commits

Author SHA1 Message Date
wozeparrot
21d0f6bb76 tk: flat global -> local load (#14033) 2026-01-05 23:35:53 -08:00
qazal
3170365a5b visualize SQTT with the same cfg infrastructure (#13870)
* start

* rough sketch

* post render dag

* art

* intro g key

* work

* custom color scale

* colors

* more blue

* better

* smaller

* use for loop in test
2026-01-06 14:53:20 +09:00
George Hotz
640dac46c2 pcode_exec 2026-01-05 21:15:29 -08:00
Christopher Milan
0120d69caa autogen: avcodec (and simplify workflow) (#14031)
* simplify autogen workflow and add avcodec verification

- Consolidate all regeneration into single steps (delete + import)
- Remove continue-on-error and individual diff checks
- Use git diff at end to catch all differences
- Show artifact URL in failure message
- Add avcodec.py verification

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* patch avcodec

---------

Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-05 23:30:25 -05:00
George Hotz
05129e58b0 pcode back 2026-01-05 20:15:30 -08:00
George Hotz
b7dc59a68d fix emu 2026-01-05 20:09:20 -08:00
George Hotz
ec7ec99cbd better 2026-01-05 20:06:52 -08:00
George Hotz
c8c6346336 tests 2026-01-05 19:57:30 -08:00
George Hotz
6de310c87f parsing 2026-01-05 19:54:56 -08:00
George Hotz
ffba806b65 pdf/qcode work 2026-01-05 19:42:47 -08:00
George Hotz
c4016d5cac fix psuedocode parsing 2026-01-05 18:58:55 -08:00
George Hotz
a5587fbda1 Merge origin/master, delete pcode.py 2026-01-05 18:53:43 -08:00
George Hotz
20653d2996 assembly/amd: make pdf.py code shine (#14029)
* assembly/amd: make pdf.py code shine

* no merge

* pdf2 is the future

* something

* regen enums

* test

* work

* remove junk

* write

* pcode extraction

* pdf2 passes all tests

* simplify

* simpler pdf

* late filter

* remove hacks

* simplify pdf2.py

* field type

* remove defaults

* don't export srcenum

* simple pdf.py

* simpler

* cleaner

* less hack in PDF
2026-01-05 18:49:40 -08:00
qazal
ea7b149ca5 viz command line tool (#14030) 2026-01-06 10:19:47 +09:00
Christopher Milan
f86c728440 load libclang as 'libclang.so' too (#14028) 2026-01-05 16:56:16 -05:00
chenyu
eda6a73897 clean up canonicalize_device (#14027)
centralize the type check
2026-01-05 10:29:55 -05:00
chenyu
ce464b147a clean up comments that mentioned outdated terms (#14026)
no MultiLazyBuffer and no ShapeTracker in comments
2026-01-05 09:42:58 -05:00
chenyu
83063cc3e4 onnx TensorScatter (#14024) 2026-01-05 09:05:22 -05:00
chenyu
9497ec00f2 fix onnx attention permute (#14025)
* fix onnx attention permute

* skip test_attention_4d_fp16_cpu too
2026-01-05 08:58:50 -05:00
qazal
5cff5698f7 viz: g key toggles graph and text view (#14023) 2026-01-05 22:41:45 +09:00
chenyu
7a81a3cb98 more passed onnx tests (#14022) 2026-01-05 07:46:27 -05:00
kim yongjin
34fe105386 remove unused LazySeq (#14020) 2026-01-05 07:38:33 -05:00
qazal
4f2f38bf64 viz: split cfg and table render (#14021) 2026-01-05 20:59:08 +09:00
George Hotz
74da1c6310 test qcode 2026-01-05 02:42:27 -08:00
nimlgen
70405b4f3c am_smi: mi350 (#14018) 2026-01-05 13:10:56 +03:00
George Hotz
400d59c06b simpler 2026-01-04 20:37:06 -08:00
Christopher Milan
b2a0b9c551 autogen: dump patch in CI (#14010)
* autogen: don't fast-fail, produce patch artifact on differences

All verification steps now use continue-on-error to run completely.
Each job generates a patch artifact containing all differences found.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

* add gen from header test

* fix tests

* fail if diff

* add forward decl autogen test

* remove confusing/wrong comments

* macos unittests set LIBCLANG_PATH

---------

Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-04 22:38:12 -05:00
chenyu
aae08b20e0 enable passed onnx tests (#14017) 2026-01-04 22:12:50 -05:00
George Hotz
57684d2777 no pcode 2026-01-04 18:35:16 -08:00
chenyu
785d04d127 simpler einsum (#14014) 2026-01-04 20:38:59 -05:00
George Hotz
8147a78d24 wide dtypes 2026-01-04 17:26:12 -08:00
George Hotz
486248f775 fix pcode 2026-01-04 17:04:52 -08:00
George Hotz
87e72f1540 ftz 2026-01-04 16:32:35 -08:00
chenyu
f6a78a29e0 support einsum trace (#14012)
* support einsum trace

* test_einsum_scalar_cpu
2026-01-04 19:27:27 -05:00
George Hotz
b52ff63896 fixes 2026-01-04 15:48:31 -08:00
George Hotz
404eed6172 assembly/amd: improve tests for asm (#14007)
* assembly/amd: improve tests for asm

* upd

* skip

* tests

* re bug

* more passing

* cleanups

* cdna fixups

* improve tests, better CDNA parsing

* fix CI

* no defs

* simpler

* all pass

* from pdf

* regen
2026-01-04 15:14:08 -08:00
George Hotz
7f7f12d5b4 99% match 2026-01-04 15:05:05 -08:00
George Hotz
b10ae6958e roundtripping 2026-01-04 14:31:40 -08:00
wozeparrot
f550f9204c fa: failing test for bwd jit (#14009)
* tk: failing test for bwd jit

* feat: mark expectedFailure

* clean: spaces
2026-01-04 16:57:43 -05:00
George Hotz
10e2c47d52 don't make dtype 2026-01-04 13:49:47 -08:00
George Hotz
058816dd92 use tinygrad UOps as DSL 2026-01-04 13:40:15 -08:00
George Hotz
28846cb6c4 simpler dsl 2026-01-04 13:24:01 -08:00
George Hotz
958bfa1c5b Op 2026-01-04 13:05:38 -08:00
George Hotz
23cf30820f more correct 2026-01-04 12:52:20 -08:00
George Hotz
ea51512f90 CMPLE 2026-01-04 12:45:21 -08:00
George Hotz
e9664fdf28 dtype is uop 2026-01-04 12:31:40 -08:00
George Hotz
5d50281896 Merge remote-tracking branch 'origin/master' into asm_ucode
# Conflicts:
#	tinygrad/dtype.py
2026-01-04 12:22:52 -08:00
George Hotz
cfeeab8485 work 2026-01-04 12:22:01 -08:00
George Hotz
7abf4591ba use bitsize on dtype (#14011)
* use bitsize on dtype [pr]

* bitsize

* bitsize in js export, but might be wrong

* reverts

* revert that
2026-01-04 12:16:21 -08:00
George Hotz
2be5f8b688 work 2026-01-04 11:57:42 -08:00