qazal
67e66ac1ab
hotfix: schedule_uop in process replay ( #8260 )
...
* hotfix: schedule_uop in process replay
* notes
2024-12-15 21:24:54 +08:00
qazal
5864627abe
process replay filter warnings [pr] ( #8199 )
2024-12-13 17:43:43 +08:00
qazal
07b6d5cf63
assign early folding ( #8093 )
...
* assign early folding [pr]
* move to to_si
* -
* fix generate_dataset
* diff too big
* no recreation, no diff
* gzip
* new sops from tiny10
* final try
2024-12-07 17:02:55 +08:00
qazal
9828277c03
view doesn't have buffer, fix the tests [pr] ( #7841 )
...
* view doesn't have buffer, fix the tests [pr]
* need assigns
2024-11-22 20:41:55 +08:00
qazal
1e31b5ba6b
hotfix: ctx doesn't impact process replay [pr] ( #7785 )
2024-11-19 20:17:01 +08:00
chenyu
207bca6cea
set PAGE_SIZE=1 and generate new dataset ( #7559 )
...
13080 rows in total. both generating and loading this are pretty broken now. filters are wrong for example
2024-11-05 11:25:01 -05:00
qazal
e955aa1bee
hotfix: process replay ( #7418 )
2024-10-30 22:45:40 +02:00
qazal
51c0c8d27e
cachable small graph rewrite ( #7371 )
2024-10-29 22:28:13 +08:00
qazal
e46edc22aa
use unittest helpers in TestTensorMetadata [pr] ( #7329 )
...
* use unittest helpers in TestTensorMetadata [pr]
* fix that
* 5 args
2024-10-28 18:38:30 +08:00
qazal
8d9459f281
always run process replay with contextvars ( #7323 )
...
* always run process replay with contextvars [pr]
* not the last two
* extra
* no pr
2024-10-27 20:44:42 +02:00
qazal
d482d927a8
hotfix: nobody uses [run_process_replay] [pr] ( #7264 )
2024-10-24 13:37:29 +03:00
qazal
dae908299e
full_ast_rewrite api with ScheduleItemContext ( #7223 )
2024-10-22 23:17:05 +03:00
qazal
40f33c110b
big graph var_vals as rewrite context ( #7007 )
...
* var_vals as rewrite context
* no default arg
* add st var_vals
* delete some stuff
* add the rewrite rule again
* extra
* this whole part is preschedule
* test with a second context
* redo
* i always forget tensor variable
2024-10-16 07:31:44 +03:00
qazal
09de958855
move print_diff to test/helpers ( #7071 )
2024-10-15 22:00:39 +03:00
qazal
13846930cd
hotfix: extract_dataset.py ( #7029 )
2024-10-13 11:18:23 +03:00
qazal
2800520dd5
even smaller process_replay.py [pr] ( #6941 )
...
* even smaller process_replay.py [pr]
* delete those tests
* dedup asts
2024-10-08 20:43:22 +08:00
qazal
b82023c97e
process replay cleanup to generic _pmap [pr] ( #6929 )
...
* process replay cleanup to generic _pmap [pr]
* delete `COMPARE_SCHEDULE`
2024-10-07 13:57:05 +08:00
qazal
16312b4c59
rip out old scheduler process replay stuff, diff pure UOps [pr] ( #6927 )
2024-10-07 13:20:35 +08:00
George Hotz
4df5c7a4ef
move lazy to engine [pr] ( #6886 )
...
* move lazy to engine [pr]
* engine.lazy
2024-10-04 23:19:26 +08:00
qazal
c5b252cdb3
add pr alias [pr] ( #6834 )
2024-10-01 18:48:44 +08:00
qazal
a16a8c5958
color process replay stats [run_process_replay] ( #6830 )
2024-10-01 15:29:11 +08:00
qazal
5ad2f95d01
process replay diff stats ( #6736 )
...
* process replay diff stats
* fix tuples
2024-09-25 15:19:56 +08:00
qazal
cefc3e9382
make all schedules immutable [run_process_replay] ( #6718 )
...
* compute inputs and outputs in LBScheduleItem [run_process_replay]
* simpler metadata, delete __hash__
* no dynamic field
* test_diff_schedule
2024-09-24 21:08:16 +08:00
qazal
6be1bf09f1
hotfix: bring COMPARE_SCHEDULE=0 back ( #6657 )
2024-09-23 10:39:43 +08:00
qazal
6b65d8c461
more process replay tracing work [run_process_replay] ( #6650 )
2024-09-22 16:16:58 +08:00
qazal
5bafed2f88
process replay traceback ( #6642 )
2024-09-21 16:53:34 +08:00
qazal
4ffb722d4e
var_vals prereq for deleting LBScheduleItem [run_process_replay] ( #6511 )
2024-09-14 17:00:30 +08:00
George Hotz
904f6a63fa
Revert "Revert "cleanup process_replay/* namings [run_process_replay] ( #6429 )…" ( #6442 )
...
This reverts commit eda177da84 .
2024-09-10 07:00:16 +08:00
George Hotz
eda177da84
Revert "cleanup process_replay/* namings [run_process_replay] ( #6429 )" ( #6437 )
...
This reverts commit f4e83b30b4 .
2024-09-09 18:52:36 +08:00
qazal
f4e83b30b4
cleanup process_replay/* namings [run_process_replay] ( #6429 )
2024-09-09 16:59:04 +08:00
qazal
935b6b658f
delete seen from the scheduler api [run_process_replay] ( #6427 )
...
docs
2024-09-09 16:26:34 +08:00
qazal
99018a4aa1
minor schedule differ utils [run_process_replay] ( #6348 )
...
* minor schedule differ utils [run_process_replay]
* rm
2024-09-04 03:41:38 +08:00
qazal
dd4e5f1c8d
process replay rewrite ( #6284 )
...
* process replay rewrite
p2
* start some unittests + exceptions and exits
* shebang
* remove extra kernel init
2024-08-29 15:08:27 +03:00
qazal
d2f8eeed2e
make [compare_schedule] the default [run_process_replay] ( #6273 )
...
* make [compare_schedule] the default
* capture ctx
* logging
* set capture to false
2024-08-26 21:40:03 +08:00
qazal
ae23540d6e
refresh process replay schedule ref in reset.py ( #6265 )
2024-08-24 16:12:51 +03:00
qazal
478145cb8e
lowering error in diff_schedule is fine [run_process_replay] ( #6185 )
2024-08-19 20:51:12 +03:00
qazal
e28d29641f
more scheduler process replay tooling [run_process_replay] ( #6178 )
2024-08-19 15:35:51 +03:00
qazal
d5e3217076
hotfix: scheduler differ ( #6115 )
...
* hotfix: scheduler differ
* add the test back
* track keys
2024-08-16 23:34:49 +03:00
qazal
c23d44c779
AST is UOp ( #6030 )
...
* most of the work from the uops2 branch
* schedule
* realize
* kernel
* lowerer
* search
* green
* merge uops with ops
* Revert "merge uops with ops"
This reverts commit 1408a59f12 .
* fix benchmark
* remove extra dedup
2024-08-16 22:09:00 +03:00
chenyu
6b3112d525
fix qcom process_replay for kernel diff ( #6079 )
...
* debug why qcom process_replay does not run
skipping the wrong exception?
* um-hum
* get_step_times was parsed incorrectly
* cleanup
2024-08-14 15:05:49 -04:00
qazal
30035df5a4
add metal process replay back ( #6068 )
...
test this new one
2024-08-14 12:29:56 +03:00
qazal
0e62076cf5
more process replay cleanups ( #6013 )
...
* more process replay cleanups
* comma benchmark missing
2024-08-10 17:29:10 +03:00
qazal
7373b05ee8
assert conv bw reduceops merge [compare_schedule] ( #6001 )
...
* assert conv bw reduceops merge [compare_schedule]
* diff with ref_commit_hash
2024-08-09 19:29:56 +03:00
qazal
a833f1a735
scheduler process replay with [compare_schedule] ( #5997 )
2024-08-09 16:58:22 +03:00
qazal
24c7c41ce0
diff LazyBuffer schedules in process replay ( #5996 )
...
* start diff printing
* this should be 2
* add to process_replay.py
* enable schedule capture
* arange diff is process replay
2024-08-09 14:16:43 +03:00
qazal
d6f4a61c42
graph LBScheduleItem [run_process_replay] ( #5960 )
...
* add toposort key to LBScheduleItem
* use dedup
* graph LBScheduleItem
* make that comment beautiful again
* diff_schedule utils
* update fuzz_schedule
2024-08-07 19:59:11 +03:00
qazal
728b7e189e
diff_schedule tests [run_process_replay] ( #5958 )
...
* diff_schedule tests [run_process_replay]
* ok to run serial
2024-08-07 13:50:27 +03:00
George Hotz
3a0515ea22
hotfix: process_replay/diff_schedule.py to LBScheduleItem
2024-08-06 17:01:05 -07:00
qazal
a7db4c3ee9
show timings for DIFF_ARANGE=1 ( #5935 )
...
* show timings for DIFF_ARANGE=1
* always with DEBUG=2
2024-08-06 17:20:38 +03:00
qazal
102a8c184b
diff fused arange schedules with ARANGE_DIFF=1 ( #5934 )
...
* diff fused arange schedules with ARANGE_DIFF=1
* better llama diff
2024-08-06 16:52:26 +03:00