George Hotz
fb3011ac61
improve matcher speed [run_process_replay] (#5438)
...
* improve matcher speed [run_process_replay]
* don't use arg set in ptx
2024-07-12 20:02:19 -07:00
George Hotz
d13654a820
move uopgraph to file [run_process_replay] (#5364)
...
* move uopgraph to file [run_process_replay]
* fix print tree test
2024-07-10 17:34:50 -07:00
kormann
3d452195e4
[bug fix] nested commutative pattern _match [run_process_replay] [no_assert] (#5340)
...
* deep pat test
* lint
* min diff
* min lines
* nothing
* is res extra
* cleanup2
* add res back
* reduce lines
* type anno
---------
Co-authored-by: qazal <qazal.software@gmail.com>
2024-07-09 16:38:39 +03:00
qazal
24c89a2a33
move assert_equiv_uops to helpers + use == for dtypes (#5067)
...
* dtypes should use ==
* use TestUOps
* should use assertIs
2024-06-20 16:39:34 +03:00
kormann
7c3b877216
rename uop [run_process_replay] (#5031)
...
* rename
* fix unittests
* rename vin
* fix test
* fix type [run_process_replay]
* rm pre commit hook change
2024-06-18 21:34:05 +03:00
George Hotz
9823752397
make uops.add private (#4950)
...
* make uops.add private
* modernize all tests
2024-06-14 03:23:25 -07:00
chenyu
b56ae5606c
cosmetic changes to uop _match (#4897)
...
minor cleanup before fixing two level match
[run_process_replay]
2024-06-09 18:29:42 -04:00
Alec Chen
5ac30c29d8
Construct UOps patterns using UPat (#4821)
...
* Allow UPat pattern definitions
* Convert pattern matcher tests to UPat constructions
* Convert constant_folder patterns to upat constructions
* Convert assembly patterns to upat constructions
* [run_process_replay] Drop UPat.from_dict
2024-06-05 10:29:37 +02:00
Alec Chen
4909a0d16f
Fix arg set in pattern matcher (#4830)
2024-06-04 15:10:09 -04:00
Alec Chen
c96026ac65
Add arg set regression test for pattern matcher (#4827)
...
* Add arg set regression test for pattern matcher
* real regression
---------
Co-authored-by: qazalin <qazal.software@gmail.com>
2024-06-04 13:35:09 -04:00
Alec Chen
e89bc42cc7
Add UOps pattern matcher regression tests (#4725)
...
* add pattern matcher regression tests
* Remove test for dtype str after rebasing
* Make test uops match type spec
* leave const const, add const alu vin test
* correct uops
* actually correct uops
2024-05-30 17:12:20 +03:00
George Hotz
07b350a8f4
new uops is an actual graph (#4560)
...
* new uops is an actual graph
* it's way slower
* simpler
* fix define acc
* render_loop unique
* ops test pass
* add pattern matcher back, there's bugs
* rewrite
* use priority queue
* recursive children
* fix tests
* fix tests with SINK
* fix abstractions
* fix assembly
* simpler
* link define_acc
* fix DEFINE_ACC placement
* type verify
* full cmp
* fix cmp
* ACCESS_ACC
* insert DEFINE_ACC
* fix PHI
* recursive rewrite
* fix many tests
* sum collapse
* more patterns
* correct change
* fold arange
* fix that lin test
* space
* big folding rule works
* close
* has more maxes, meh
* cached node replace
* set changed
* simplest folding yet
* works
* works
* DIV
* all tests pass
* del
* fuzz linearizer fails
* sum_collapse
* test depth 2 cf
* fix lin test 14
* fix clang depth
* disable that
* failure 14 is fixed
* fix ptx
* failure 27 is fixed
* fix llama
* run_cnt
* Revert "Optimize PTX gated loads index calculation (#4304)"
This reverts commit d97d5a7689.
* fix uops loop
* fix ptx bugs
* add barrier
* print
* mem_type in ptx direct
* bypass tests that fail in CI but pass locally
* ptx remove ptr_ar
* more ptx passing
* fix ptx tests
* assert compile support
* remove model inference benchmark from red
2024-05-17 18:00:18 -07:00
chenyu
e7ff5102cf
failed test in test_pattern_matcher (#4080)
...
something about the PTX rewrite is incorrect that it has duplicated rewritten uops
2024-04-05 02:53:50 -04:00
Szymon Ożóg
82b7b9655f
test for dtype set (#4069)
2024-04-04 11:24:33 -04:00
Szymon Ożóg
e5a9bff899
Add pattern matcher tests, move uop transforms from assembly to pattern matcher (#4056)
2024-04-03 09:06:43 -07:00