qazal
3de4c9839f
viz: display UOp tags ( #10751 )
...
* viz: display UOp tags
* g.tag
2025-06-10 16:02:23 +03:00
nimlgen
800d1796d5
am_smi: kill process group ( #10750 )
2025-06-10 15:23:39 +03:00
qazal
5bd4ad2e8b
viz: remove unused arg ( #10747 )
2025-06-10 12:00:09 +03:00
George Hotz
413e223d6e
Revert "remove cpu graph, it's different from the others ( #10743 )" ( #10745 )
...
This reverts commit 3d64a98432 .
2025-06-09 22:40:48 -07:00
George Hotz
3d64a98432
remove cpu graph, it's different from the others ( #10743 )
...
* remove cpu graph, it's different from the others
* remote was blacklisting CPUGraph
2025-06-09 22:17:10 -07:00
George Hotz
245b1d3a46
move add/mul to MathTrait [pr] ( #10741 )
...
* move add to MathTrait [pr]
* both add and mul
2025-06-09 21:48:55 -07:00
George Hotz
c28eceaf44
move to mathtraits.py ( #10742 )
2025-06-09 21:17:35 -07:00
George Hotz
acf72872b3
move view left to the outer graph prereqs + testing ( #10725 )
...
* move view left to the outer graph
* global view right
* dont need that one
* remove comment
* test kernelize
* simple
* split onnx, test sdxl null
* fix testing
* ugh, wrong one
* Update test.yml
2025-06-09 20:43:25 -07:00
chenyu
b7198fdcfd
linearizer failure from wino fuse arange cifar ( #10739 )
2025-06-09 23:10:19 -04:00
George Hotz
58eebdb507
don't reassign metadata to the same uop + ignore oob in pr [pr] ( #10737 )
2025-06-09 18:43:39 -07:00
chenyu
364b903850
minor cleanups in linearize.py [pr] ( #10735 )
2025-06-09 19:49:19 -04:00
George Hotz
81ef879da3
non recursive top_down_rewrite ( #10729 )
...
* non recursive top_down_rewrite
* nicer algorithm
* rewrite bottom up also
* only top down is broken?
* simpler iterative algo
* no recursion errors
* top down and bottom up
* unified rewrite
* simpler rewrite
* clean up comments
* move that comment
2025-06-09 16:33:04 -07:00
chenyu
53cbd4254b
suppress filter_too_much on test_float_cast_to_unsigned ( #10733 )
...
falky, already done in test_float_cast_to_unsigned_overflow and test_float_cast_to_unsigned_underflow
2025-06-09 18:30:04 -04:00
George Hotz
916bbd5c6b
fixed point rewrite [pr] ( #10732 )
2025-06-09 14:46:20 -07:00
chenyu
55cdbb9a20
fix mask in expand into symbolic size ( #10730 )
...
failed before when old size is 1 and it expands into symbolic size, because `resolve(s != ns, False)` is False and it does not expand the mask
2025-06-09 17:33:22 -04:00
wozeparrot
926b11381c
failing test for symbolic expand after pad ( #10727 )
...
* feat: failing test for symbolic expand after pad
* feat: mark test as failing
2025-06-09 16:55:21 -04:00
chenyu
49f999d919
update _reshape_mask for symbolic shape expand ( #10726 )
...
* don't merge shape symbolic reshape symbolic
* proper fix
2025-06-09 16:35:02 -04:00
wozeparrot
27dd97f688
support variable shape none slice in getitem ( #10724 )
2025-06-09 11:53:02 -07:00
Ignacio Sica
afd5140a09
remove no longer used IndexContext acc_num var ( #10720 )
2025-06-09 14:06:59 -04:00
George Hotz
f84c320548
better external_benchmark_schedule [pr] ( #10722 )
2025-06-09 10:26:11 -07:00
George Hotz
6270c0eac0
default ignore oob to 0 ( #10660 )
2025-06-09 10:25:43 -07:00
b1tg
24d328e313
onnx parser ( #10435 )
...
* onnx parser
* fix compile, lint
* onnx.load -> onnx_load
* compatible with ModelProto
* fix test external_test_onnx_ops.py
* fix tests
* fix signed int
* reduce to 261 lines
* fix TypeProto.Optional
* debug for _parse_message, add TypeProto.Sequence, cleanup
* onnx_load from Tensor
* remove BufferedReader
* 174 lines and reduce tensor copy
* cleanup
* use onnx_load in external_model_benchmark.py
* fix qcom test
* [onnx] parser support external data
---------
Co-authored-by: b1tg <b1tg@users.noreply.github.com >
Co-authored-by: chenyu <chenyu@fastmail.com >
2025-06-09 12:44:28 -04:00
Sieds Lykles
cfa65bea05
Subtract 1 from Variable upper bound ( #10715 )
2025-06-09 09:25:53 -07:00
George Hotz
ef58ab340a
hotfix: remove n=auto from REMOTE=1 test
2025-06-09 09:19:36 -07:00
qazal
419a1286f2
viz: share cacheKey [pr] ( #10717 )
2025-06-09 17:48:29 +03:00
chenyu
35523dc35f
move BLOCK_REORDER to caller [pr] ( #10711 )
...
so block_reorder tests won't fail with flag set to 0
2025-06-08 23:26:01 -04:00
chenyu
bb34c28b36
debug flag for linearize block_reorder [pr] ( #10710 )
2025-06-08 22:26:06 -04:00
chenyu
d93a0bee6b
mlperf ci uses its own cache ( #10705 )
...
not to interfere with regular cache which is used by benchmark
2025-06-08 19:43:32 -04:00
qazal
8cdf6e4d1e
viz memory graph tiny fixes [pr] ( #10709 )
...
* sched_sink is a step
* offset for yaxis
* clear existing
* scale offset
2025-06-09 01:10:12 +03:00
George Hotz
81b9c04574
move high level stuff to unit tests [pr] ( #10708 )
...
* move high level stuff to unit tests [pr]
* process replay on unit tests
* fix pr, less compute
* set omp num threads
* set 200MB buffer size limit
* delete junk
* fix tests
* faster
* move test_indexing to unit
* faster
2025-06-08 14:05:56 -07:00
nimlgen
171580e9ec
am: fix reg update ( #10707 )
2025-06-08 21:45:55 +03:00
George Hotz
4305f532d9
clean up apt stuff ( #10706 )
...
* clean up apt stuff
* single apt install
* fixes
* fix opencl + ldconfig
2025-06-08 11:06:09 -07:00
George Hotz
4e2c3560b4
smaller tests are faster tests [pr] ( #10704 )
...
* remove del spam from CI
* more
* preconstruct default buffer spec
* ignore those errors
* check exception
* more exception check
* skip stuff
* smaller tests mean faster tests
* a few more
2025-06-08 10:54:19 -07:00
George Hotz
67a1c92fc0
remove del spam from CI ( #10699 )
...
* remove del spam from CI
* more
* preconstruct default buffer spec
* ignore those errors
* check exception
* more exception check
* skip stuff
2025-06-08 10:14:30 -07:00
George Hotz
32141ec867
make apt CI faster ( #10702 )
2025-06-08 09:43:39 -07:00
chenyu
4f535641f7
add one huggingface_onnx test to mac benchmark ci ( #10700 )
...
this crashed for me on onnx parser pr but seems fine for the author. see if ci mac is fine
2025-06-08 12:26:12 -04:00
George Hotz
32e9949052
rename lazydata to uop ( #10698 )
2025-06-08 08:42:22 -07:00
uuuvn
8e3f337075
Skip flaky test in ci ( #10696 )
...
`test_data_parallel_resnet_train_step` is already skipped on LLVM/CPU:
```python
@unittest.skipIf(CI and REAL_DEV in ("CUDA", "NV", "LLVM", "CPU"), "slow, and flaky on LLVM/CPU")
@unittest.skipIf(REAL_DEV == "WEBGPU" and not OSX, "WEBGPU Vulkan can only run kernels with up to 10 buffers")
def test_data_parallel_resnet_train_step(self):
```
It looks like `test_data_parallel_resnet` (no `_train_step`) is flaky in a similar way:
https://github.com/tinygrad/tinygrad/actions/runs/15472667248/job/43560773882?pr=10642#step:9:64
2025-06-08 08:24:09 -07:00
George Hotz
3ece2e4bb5
hotfix: remove accel from extra
2025-06-08 08:20:34 -07:00
qazal
1ad8062591
more generic naming in VIZ [pr] ( #10695 )
...
* note
* rename kernel to ctx
* rename uop things to currentStep + expandSteps
* already destructured
* some things that were called ctx are steps
* still a kernel
2025-06-08 15:37:39 +03:00
qazal
c70486908e
viz: clicking a KERNEL node can open codegen rewrite ( #10683 )
...
* work
* now it doesn't have 20% slowdown
* label like this
* closer
* ansiStrip
* remove
* better
* id is faster
* fix that
2025-06-08 13:11:03 +03:00
George Hotz
48eb7d76b1
use ALLOW_DEVICE_USAGE context variable instead of MainProcess check ( #10693 )
...
* use DISALLOW_DEVICE_OPEN context variable instead of MainProcess check
* device usage can be disallowed
2025-06-08 00:07:40 -07:00
geohotstan
dedff0e96c
fix run huggingface onnx debug ( #10679 )
2025-06-08 00:59:20 -04:00
George Hotz
8c76250d31
speed up a few tests ( #10692 )
2025-06-07 20:39:25 -07:00
chenyu
e80870e27c
BasicBlock2 -> BasicBlock [pr] ( #10691 )
2025-06-07 23:33:51 -04:00
George Hotz
7ff175c022
cache a venv to avoid pip usage ( #10689 )
...
* try built in pip caching
* try venv
* export venv
* set VIRTUAL_ENV
* revert that
* venv key
* fix
* ci cache hit?
* fix windows
2025-06-07 20:13:41 -07:00
ihar
40c1479267
added unit tests for 'argfix' ( #10678 )
2025-06-07 22:17:10 -04:00
ihar
74b849b5e1
remove unnecessary 'argfix' because 'view' is an alias to 'reshape'. all functionality must be inside 'reshape' ( #10677 )
...
* remove unnecessary 'argfix' because 'view' is an alias to 'reshape'. all functionality must be inside 'reshape'
* added the same set of unit tests for 'view' as for 'reshape' since 'view' is just an alias for 'reshape'
* improved tests for 'view' op
2025-06-07 22:15:31 -04:00
chenyu
e88fe41d37
update vits vctk model to use download from huggingface ( #10688 )
...
google drive points to a warning page that does not work
2025-06-07 20:47:28 -04:00
Sieds Lykles
c29a56dd51
Fix whisper OOB ( #10685 )
...
* fix whisper and test
* remove import
2025-06-07 20:23:50 -04:00