George Hotz
2c90f3ea76
split test_advancedindex
2025-10-14 19:02:21 +08:00
George Hotz
d99457657b
svd nonfull in parallel
2025-10-14 18:50:11 +08:00
George Hotz
8a34a4e2c7
fix up some slow tests that launch python
2025-10-14 18:42:42 +08:00
George Hotz
fb61f3519f
remove assign contiguous hack ( #12659 )
...
* remove assign contiguous hack
* remove bad contiguous usage in torch backend
* assign
2025-10-14 16:42:14 +08:00
Sieds Lykles
e06cbfcb8a
combine pm_drop_and_clauses ( #12660 )
...
* combine those
* wino kernels decreased
2025-10-14 10:09:41 +02:00
George Hotz
84d4589ed4
remove pylint from pre-commit and CI ( #12658 )
...
* remove pylint from pre-commit and CI
* multidevice test is fast
* faster pre-commit
* 8 is faster than 4
* better name
* how did that typecheck?
2025-10-14 15:39:59 +08:00
George Hotz
b9eb5b5d49
clean up the LLM tokenizer ( #12653 )
...
* clean up the LLM tokenizer
* simple tokenizer is actually simple
* ugh write good code
2025-10-14 14:22:01 +08:00
wozeparrot
47e0c43976
feat: Tensor.{load, store} ( #12629 )
2025-10-13 08:04:41 -07:00
Sieds Lykles
e0139fafc1
UOp symbolic tests use eval to check against string ( #12643 )
2025-10-13 14:19:42 +02:00
Sieds Lykles
e537e895b1
drop unused invalid conditions ( #12635 )
...
* drop where conditions if the ranges are not used inside the index
* remove allow_any_len
2025-10-13 10:52:21 +02:00
qazal
fd51ecf983
process_replay for get_rangeify_map ( #12624 )
2025-10-12 15:14:40 +03:00
qazal
b5afa3848e
viz: fix memory graph total nbytes ( #12622 )
...
* viz: fix memory graph total nbytes
* post increment
* simple regression test
* loop with markers + slightly off text baseline
* cpu events clear
2025-10-12 14:32:46 +03:00
Sieds Lykles
772a8dfe31
reshape uses valid when simplifying ( #12597 )
...
* reshape uses valid when simplifying
* try with IGNORE_OOB=0
* is it this test?
* skipif gpuocelot
2025-10-11 17:02:54 +02:00
Sieds Lykles
a2ae56674a
uop_given_valid try multiple clauses (#12615 )
...
* uop_given_valid uses less simplify
* enable test
* try all expressions together
* enable test
2025-10-11 11:53:42 +02:00
Sieds Lykles
dccdd190aa
uop_given_valid uses less simplify ( #12612 )
...
* uop_given_valid uses less simplify
* enable test
2025-10-11 10:57:39 +02:00
George Hotz
cab034b863
improve typing ( #12611 )
...
* improve typing and bump to 3.11
* no need for Self yet
* improve typing
* binop also
2025-10-11 16:20:23 +08:00
Sieds Lykles
4300ebc455
cache apply_movement_op ( #12609 )
...
* cache apply_movement_op
* pyling and clear cache
* fix types
* ignore
* cleanup
2025-10-11 08:53:10 +02:00
George Hotz
7596c1b8f5
TestOuterworldReduce works ( #12608 )
2025-10-10 20:06:41 +08:00
chenyu
001b3710d3
enable some test_ops tests ( #12607 )
2025-10-10 07:23:21 -04:00
chenyu
03ef5197fc
move get_contraction to helpers [pr] ( #12594 )
2025-10-10 04:28:57 -04:00
chenyu
af90dc00de
remove some View add logic [pr] ( #12584 )
...
no longer simplify the case of v0+v1 where v0 has a mask
2025-10-10 03:47:56 -04:00
qazal
caae46cfba
fix process replay progress update ( #12587 )
2025-10-10 10:20:55 +03:00
chenyu
c8dfd10257
ShapeTracker.real_strides -> is_expanded [pr] ( #12579 )
...
only keep the used part
2025-10-09 22:52:45 -04:00
George Hotz
5977df267f
outerworld uses expand ( #12578 )
2025-10-10 10:25:25 +08:00
chenyu
f2c3a72b0c
remove RANGEIFY flag [pr] ( #12577 )
2025-10-09 21:52:54 -04:00
qazal
b86ad6053a
test_schedule independent of RANGEIFY flag ( #12568 )
...
* test_schedule independent of RANGEIFY flag
* comment for expectedFailure + test_cast_padded_view
* test_cast_padded_const works
* don't use full_shape it's fine
* add todos for the rest
2025-10-09 20:00:50 +03:00
chenyu
678f83e41b
delete ShapeTracker to_valid_uop and substitute [pr] ( #12563 )
2025-10-09 05:06:10 -04:00
chenyu
cf8232ec6a
clean up more RANGEIFY flag ( #12556 )
2025-10-09 03:06:48 -04:00
George Hotz
a8a9ac0e95
add more uop gc test ( #12553 )
2025-10-09 14:49:32 +08:00
chenyu
250f05a776
run some hashing test only on METAL ( #12554 )
...
quite slow on CPU
2025-10-09 02:39:49 -04:00
chenyu
ae51bdd06a
remove trivial use of RANGEIFY flag ( #12550 )
...
some tests need update still
2025-10-09 02:29:38 -04:00
George Hotz
1dc500426e
remove restrictions on range ending in indexing ( #12543 )
...
* remove restrictions on range ending in indexing
* early simplify
* Revert "early simplify"
This reverts commit 657d9972c2 .
* disable const folding tests
2025-10-09 13:53:08 +08:00
chenyu
585bd95b50
fix ruff 0.14.0 [pr] ( #12547 )
2025-10-09 01:52:30 -04:00
chenyu
43bce1f39f
delete View minify [pr] ( #12538 )
2025-10-08 23:25:53 -04:00
chenyu
20d98b19c3
delete more unused ShapeTracker stuff ( #12536 )
2025-10-08 23:09:44 -04:00
qazal
bb5671a837
some more ops.py cleanups ( #12525 )
...
* remove GroupOp.Meta and st_arg
* inline axis_arg
* only allow .buffer on reshapes (or the buffer)
* gate is the other way
* still want can_pad?
* use op_in_backward_slice_with_self
* .buffer is recursive
* lint
* pathlib there
2025-10-09 06:06:44 +03:00
chenyu
be05028419
move ASSERT_MIN_STEP_TIME to compile3 ( #12535 )
...
threshold is current time +20%
2025-10-08 22:16:59 -04:00
chenyu
c4732a18bd
update tests that depend on SPLIT_REDUCEOP ( #12534 )
2025-10-08 21:53:30 -04:00
chenyu
28edea5d67
delete FUSE_CONV_BW ( #12527 )
2025-10-08 10:41:38 -04:00
George Hotz
0774575442
delete the old rangeify path and all the children stuff ( #12524 )
...
* delete the old rangeify path and all the children stuff
* remove the on_stack stuff and any retries
* don't use the p word
* Revert "remove the on_stack stuff and any retries"
This reverts commit 49a2b328b9 .
2025-10-08 21:24:04 +08:00
qazal
b6835f4134
remove Ops.VIEW and related UOp methods ( #12522 )
...
* remove Ops.VIEW and related UOp methods
* update abstractions2.py
* no ShapeTrackers in abstractions2.py
* it's a size 1
2025-10-08 14:47:02 +03:00
George Hotz
3b0b3a2e64
fast RANGEIFY ( #12504 )
...
* rtoposort is fast, can replace rangeify with this
* fast rangeify
* work
* fast rangeify works for mnist
* should work
* progress
* pad fix
* FAST
* tests passing
* don't delete those shape ops
* put in rangeify map
* ending ranges fix
* tests
* mstack/mselect no hacks
* move to indexing.py
* touch up tests + add comments
* disable failing test
* actually make the file readable
* failing
* error
2025-10-08 19:38:06 +08:00
qazal
9448924d9e
update gpt2 kernel count tests in CI=0 ( #12523 )
2025-10-08 14:29:11 +03:00
chenyu
ee0382ad99
remove ShapeTracker.invert ( #12520 )
2025-10-08 18:37:34 +08:00
chenyu
d5058427ea
remove ShapeTracker.real_size ( #12519 )
2025-10-08 06:15:29 -04:00
qazal
6f26603f06
delete swizzler.py ( #12518 )
...
* delete swizzler
* remove merge_views tests
* don't need rewrites_for_views
* apply_rewrites
2025-10-08 13:02:34 +03:00
qazal
7e0b14243e
delete grouper and kernelize ( #12517 )
...
* delete grouper and kernelize
* +sys.setrecursionlimit
2025-10-08 12:27:26 +03:00
chenyu
e701106a64
remove FUSE_ARANGE ( #12511 )
...
it was the default already
2025-10-08 04:54:07 -04:00
qazal
ad49f8148b
switch process_replay to rangeify ( #12509 )
2025-10-08 11:26:43 +03:00
nimlgen
4a756a37d8
amd: support rocm7 ( #12502 )
...
* amd: support rocm7
* mock
2025-10-08 14:30:39 +08:00