chenyu
a1dfd288bb
different valid order ( #7589 )
...
in simplify_valid, we start with valids that are in others' parent so the others is more likely to be simplified
2024-11-07 20:27:56 -05:00
chenyu
4378b100ad
make UOp.range arg a tuple [pr] ( #7583 )
...
* make UOp.range arg a tuple [pr]
so render works on output of ShapeTracker.to_indexed_uops
* fix
2024-11-07 11:58:09 -05:00
chenyu
bb7b5362be
uop_given_valid in real_strides ( #7231 )
...
simplified idx allows deriving more strides
2024-11-07 09:41:16 -05:00
uuuvn
c846dd70b2
Increase test tolerance for probabilistic test ( #7580 )
2024-11-07 09:35:11 -05:00
George Hotz
205befa788
move is_dtype_supported to device [pr] ( #7575 )
2024-11-07 20:38:03 +08:00
qazal
1f5ea1e412
late fusion tests, early merge view GroupOp.Buffer [pr] ( #7577 )
...
* test_late_fusion_double_transpose
* early merge view buffer ops
2024-11-07 20:04:57 +08:00
qazal
f0fc34e594
swizzle tests from the delete_fuse branch [pr] ( #7576 )
...
* swizzle tests from the delete branch [pr]
* actually test torch
* atol
2024-11-07 18:29:06 +08:00
chenyu
a011562450
fix view add with symbolic shape ( #7569 )
...
the issue is that the symbolic shape is not greedily simplified and canonicalized before reshape
2024-11-06 11:39:20 -05:00
qazal
6a19ca81c9
failing test for View.__add__ RecursionError ( #7567 )
...
* failing test for View.__add__ RecursionError
* move to test_symbolic_shapetracker
2024-11-06 23:46:47 +08:00
qazal
a9a040398c
don't print the entire schedule on assert [pr] ( #7565 )
...
* don't print the entire schedule on assert [pr]
* extra
2024-11-06 18:29:50 +08:00
chenyu
c805e3fff5
skip test_jit_batch_split if JIT >= 2 ( #7561 )
...
* skip test_jit_batch_split if JIT >= 2
only test graphs
* 1600
2024-11-05 14:59:04 -05:00
chenyu
f2fa183651
increase threshold test_strongly_connected_DAG ( #7560 )
...
it shoult test some other properties. flakying with time test https://github.com/chenyuxyz/tinygrad/actions/runs/11688403523/job/32548762512
2024-11-05 11:44:39 -05:00
Carl Basho
630a7f37cf
update tests ( #7554 )
...
Co-authored-by: John Doe <null@mail.com >
Co-authored-by: chenyu <chenyu@fastmail.com >
2024-11-05 11:35:15 -05:00
chenyu
207bca6cea
set PAGE_SIZE=1 and generate new dataset ( #7559 )
...
13080 rows in total. both generating and loading this are pretty broken now. filters are wrong for example
2024-11-05 11:25:01 -05:00
geohotstan
934fb73994
fix test_schedule conv2d bug ( #7549 )
...
* tests tests tests
* slap a resolve on it
* fix comment
2024-11-05 09:07:25 -05:00
George Hotz
99bd4372a5
Ops.ALU is no more, the arg is just an op ( #7525 )
...
* op arg alu [pr]
* more
* more passing
* fix more tests
* more tests passing
* fix single failing test
* so much cleaner
* noop to not have process replay trigger
* fix ptx
2024-11-05 00:22:22 +08:00
Ahmed Harmouche
36488a2a43
Use is_dtype_supported in more places in tests ( #7529 )
2024-11-04 09:21:15 -05:00
qazal
b5718ae135
image dtype fusion tests [pr] ( #7530 )
...
* update test_lil_model
* add test_image_matmul
2024-11-04 22:00:16 +08:00
George Hotz
9c3ee64a3e
hotfix: QoL assert if op is a str
2024-11-04 17:11:38 +08:00
George Hotz
0c19b6298b
rename ops to have unique names ( #7522 )
2024-11-04 17:09:45 +08:00
George Hotz
9a7cc04843
fix viz [pr] ( #7519 )
...
* fix viz [pr]
* Update serve.py
2024-11-04 15:02:41 +08:00
George Hotz
6bb230287b
pass the src into Metal [pr] ( #7518 )
...
* pass the src into Metal [pr]
* put that comment back
* keep old functionality
* move all to disassembler
* metal supports parallel beam
* touchups
* comment in correct place
2024-11-04 12:35:30 +08:00
George Hotz
bac251d2c1
idx_load_store in lowerer [pr] ( #7477 )
...
* idx_load_store in lowerer [pr]
* fix tests (#7513 )
Co-authored-by: John Doe <null@mail.com >
* work
---------
Co-authored-by: Carl Basho <76494676+oldpondplop@users.noreply.github.com >
Co-authored-by: John Doe <null@mail.com >
2024-11-04 10:18:40 +08:00
chenyu
7758f7211b
Revert "s/UPat/Pat ( #7506 )" [pr] ( #7517 )
...
* Revert "s/UPat/Pat (#7506 )"
This reverts commit 400011a8c1 .
* fix
2024-11-03 16:33:02 -05:00
chenyu
df49439b9a
remove reassoc from LLVM flags ( #7512 )
...
reassoc reorders compute and breaks transcendental
2024-11-03 13:11:56 -05:00
chenyu
2f70fb893e
move transcendental fuzzer test to test_transcendental ( #7511 )
2024-11-03 12:36:50 -05:00
chenyu
84592225d8
tweak tqdm ( #7510 )
...
reduce parentheses and fuzz more tests now there's no sleep
2024-11-03 12:07:11 -05:00
chenyu
c25a69b97e
fix tqdm tests ( #7509 )
...
time.sleep masked two issues:
(1) iters_per_sec might have unitscale in it, and calling `float` on it fails
(2) default rate is too low to ensure the output matches, it might skip updating
2024-11-03 10:53:22 -05:00
chenyu
4617c9a565
move COMMUTATIVE flipping to symbolic ( #7507 )
...
* move COMMUTATIVE flipping to symbolic
it cannot go with TRANSCENDENTAL
* skip LLVM
2024-11-03 09:03:45 -05:00
chenyu
400011a8c1
s/UPat/Pat ( #7506 )
2024-11-03 08:26:19 -05:00
George Hotz
c8bf09b7d4
s/UOps/Ops ( #7500 )
...
* s/UOps/Ops [pr]
* fix
2024-11-03 11:26:10 +08:00
chenyu
91a3b27fa9
disable test_setitem_inplace_operator again ( #7495 )
...
it was flaky, not broken broken
2024-11-02 19:01:23 -04:00
chenyu
ba0c246cfd
update test_setitem_overlapping_inplace1 ( #7494 )
...
failed on LLVM and remu, not real AMD
2024-11-02 18:40:53 -04:00
chenyu
f887de0fd6
update test_setitem ( #7493 )
...
some tests passed now
2024-11-02 17:53:04 -04:00
geohotstan
b1866cbfd9
failure test case for pool ops ( #7483 )
...
* add failure test case
* minimum case
2024-11-02 12:13:38 -04:00
geohotstan
585f3a0f24
Add isinf and isnan ops to Tensor ( #7484 )
...
* move isinf and isnan to new branch
* sneak a roll documentation fix in
* add to docs
* update test coverage for detect_positive and detect_negative
* add types to isinf args
2024-11-02 12:12:52 -04:00
George Hotz
72a9ac27e9
support image dtype in cloud [pr] ( #7482 )
...
* support image dtype in cloud [pr]
* remove outdated osx hack
* unused imports
2024-11-02 23:54:27 +08:00
chenyu
18e159c9ac
comment about multi real and more tests [pr] ( #7467 )
2024-11-01 11:49:11 -04:00
geohotstan
6513690223
Add Tensor.hardsigmoid ( #7433 )
...
* move hardsigmoid to new branch
* add to test
* add NOTE to mention differing values for alpha and beta that match torch
* shift from relu6
* correct shift implementation
* or we just use relu? no more 666
2024-11-01 08:36:52 -04:00
George Hotz
a7ba3d2d91
move reduce to lowerer [pr] ( #7462 )
...
* move reduce to lowerer [pr]
* simpler
2024-11-01 16:39:20 +08:00
Tobias Fischer
1a9e145388
Tensor Clone Function ( #7154 )
...
* implemented clone function
* cleanup linting, single func
* added tests, cleaned up grad cloning
* fixed whitespace
2024-11-01 12:24:43 +08:00
chenyu
a21434504b
update payne_hanek_reduction [pr] ( #7455 )
2024-10-31 18:41:22 -04:00
chenyu
4065c3dec8
remove special 0 case in frexp ( #7450 )
...
we can safely assume input is non-zero, also removed unneeded bitcast
2024-10-31 13:02:33 -04:00
chenyu
53db3478fe
cast to float32 for float16 xlog2 ( #7447 )
...
formula has 2X error with denormal floats
2024-10-31 10:36:29 -04:00
George Hotz
5dd1ffd5d0
don't const rewrite in cstyle ( #7442 )
...
* don't const rewrite in cstyle
* Update cstyle.py
* simple_symbolic
* fix bfloat16 const on AMD
2024-10-31 19:16:49 +08:00
George Hotz
50ddd11350
lil cleanup matchers [pr] ( #7437 )
...
* move delete_redundant_gates [pr]
* simpler uops test
* addr in delete_redundant_gates
* lines
* correct early delete gates
* shorter find_gate
2024-10-31 17:52:22 +08:00
George Hotz
2e3048fc57
Revert "improve full_graph_rewrite matchers for speed ( #7431 )" ( #7434 )
...
This reverts commit 996152d2de .
2024-10-31 16:16:47 +08:00
George Hotz
996152d2de
improve full_graph_rewrite matchers for speed ( #7431 )
...
* remove finalize [pr]
* early transcendental
* fix tests
* load store indexing runs with devectorize
* move delete_redundant_gates
* ptx has to wait for the mask to move
2024-10-31 16:13:11 +08:00
George Hotz
17c9a9fde4
pm_render [pr] ( #7430 )
...
* pm_render [pr]
* test fixes
* use gep, not src
* ptx only symbolic, not sym
* move cast rules
2024-10-31 15:04:50 +08:00
George Hotz
e446e95974
enforce ctx is called ctx [pr] ( #7424 )
...
* enforce ctx is called ctx [pr]
* fix bug and use has_ctx
* inspect signature
* assert
* no slow asserts
* now we can support contextual reduce
2024-10-31 11:39:19 +08:00