chenyu
2c93f27652
remove explicit np.array and np.int32 in test_div_int [pr] ( #8395 )
...
vals default loads as int32 now in test_ops
2024-12-24 13:09:30 -05:00
qazal
3273972f44
delete is_unrealized_const, it's just CONST [pr] ( #8390 )
2024-12-24 16:46:12 +08:00
qazal
3a556a7e8b
fully local tensor const representation: CONST(VIEW(DEVICE)) [pr] ( #8389 )
2024-12-24 16:15:56 +08:00
chenyu
0d6fe6200c
test case view from an empty view ( #8388 )
...
currently it behaves differently depending on the first view somehow
2024-12-23 17:40:49 -05:00
chenyu
c587b3b08c
test case view the padded area of a view ( #8386 )
...
these cases view the padded area of first view
2024-12-23 16:47:31 -05:00
geohotstan
78cb47dfc5
docs and tests clean ups ( #8383 )
2024-12-23 11:12:13 -05:00
chenyu
a556adf028
add test for Tensor silu and swish ( #8381 )
...
was only tested in onnx, added to test_ops for completeness
2024-12-22 21:08:59 -05:00
chenyu
b7397c1322
more typing cleanups [pr] ( #8376 )
...
List, Tuple, DefaultDict
2024-12-22 05:21:03 -05:00
qazal
514a6740e4
Revert "CONST(VIEW(DEVICE)) ( #8365 )" ( #8372 )
...
This reverts commit 83284985f0.
2024-12-22 04:44:34 +02:00
qazal
83284985f0
CONST(VIEW(DEVICE)) ( #8365 )
2024-12-22 04:18:35 +02:00
qazal
88bc51385c
scheduler: don't trade complexity for speed ( #8370 )
...
* scheduler: don't trade complexity for speed
* don't need is_scheduled
* make those tests real world
* graph_rewrite dedup
2024-12-22 03:30:51 +02:00
qazal
991b91d4d6
fix string repr of arg in viz and print [pr] ( #8369 )
2024-12-21 23:44:10 +02:00
ignaciosica
ba0c844a83
special tol when f16 and bf16 are tc input dtypes ( #8183 )
2024-12-21 11:32:26 -05:00
qazal
72aa38aa3b
BIND in tensor_uop_spec + cleanups [pr] ( #8363 )
...
* Ops.BIND pattern in tensor_uop_spec + cleanups [pr]
* use metaops there
2024-12-21 21:26:47 +08:00
chenyu
18dca3c3d7
isolate train_gpt2 slow kernels [pr] ( #8358 )
...
also fixed run_linearizer with var_vals=None
2024-12-20 17:59:01 -05:00
qazal
2649e87546
delete the fake buffer from const ( #8355 )
...
* delete the fake buffer from const
* fix test_sink_childless_const_alt
* it should be CONST(VIEW(DEVICE))
2024-12-21 04:20:28 +08:00
George Hotz
074315ec08
hotfix: simpler test_mnist_model
2024-12-20 10:18:17 -08:00
qazal
5776ea9386
hotfix: account for all changes in process_replay early stopping [pr] ( #8348 )
2024-12-20 23:46:46 +08:00
George Hotz
9f306e12ac
hotfix: test_net_speed can't backward before realize
2024-12-19 20:32:59 -08:00
chenyu
7153f7709f
update test_merge_view_recursion_err2 [pr] ( #8339 )
...
the view was not created through View.create, updated the test to show the expected behavior
2024-12-19 18:29:34 -05:00
chenyu
2bf47b75da
temp fix for symbolic shape view add [pr] ( #8337 )
...
something is still wrong with symbolic shape shrink, but it should not recurse forever
2024-12-19 16:10:42 -05:00
chenyu
791a80a1c7
add failed merge view example to test_simplify_valid_idx [pr] ( #8334 )
...
* add failed merge view example to test_simplify_valid_idx [pr]
* !=True is fine
2024-12-19 12:54:03 -05:00
qazal
8e266091fb
tensor const spec [pr] ( #8331 )
2024-12-19 22:41:30 +08:00
George Hotz
0ad264ed2d
new from uops [pr] ( #8330 )
...
* new from uops [pr]
* mem_estimate is its own thing
2024-12-18 23:42:58 -08:00
geohotstan
423d823c50
add GatherND and ScatterND to onnx ops ( #8241 )
...
* implemented
* this implementation is now correct
* this is fine I guess
* better variable names
* finally correct gathernd
* add a note
* eh just leave it at this for now
* teeny adjustment
2024-12-19 00:35:04 -05:00
George Hotz
6608ba316d
add size of the buffer to the ptr dtype ( #8322 )
2024-12-18 12:46:35 -08:00
George Hotz
52243b258c
move flops_mem to renderer [pr] ( #8320 )
2024-12-18 12:13:17 -08:00
George Hotz
8f95b578f6
use Estimates class [pr] ( #8319 )
...
* use Estimates class [pr]
* frozen dataclass
2024-12-18 10:19:32 -08:00
George Hotz
bd9c015b09
tests from grad uop path [pr] ( #8313 )
2024-12-18 09:25:05 -08:00
qazal
fddaeb6344
scheduler deduping spec and asserts [pr] ( #8307 )
...
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
2024-12-18 09:21:41 -08:00
qazal
69eb55a529
bugfix: dtype mismatch in TestAssign.test_double_assign_alt [pr] ( #8310 )
2024-12-18 22:29:07 +08:00
George Hotz
32df46cd73
test const pattern [pr] ( #8304 )
...
* test const pattern [pr]
* add model to test_tiny
2024-12-17 23:34:17 -08:00
Jyotirmaya Mahanta
45f2fb82d5
add failing tests for merge views ( #8306 )
...
* add failing tests for merge views
* assert is not none
* make linter happy
2024-12-18 01:27:38 -05:00
George Hotz
801e199196
change buffer to not be pointer [pr] ( #8302 )
2024-12-17 16:47:51 -08:00
qazal
673a76398a
fix viz with untracked graph_rewrite calls ( #8298 )
...
* fix viz with untracked graph_rewrite calls
* mark as green
2024-12-17 23:37:53 +02:00
qazal
5977a3d8a6
regression test viz failure when there's no tracked context ( #8297 )
...
* regression test viz failure when there's no tracked context
* test inner rewrite locations, keep notes
2024-12-17 23:23:27 +02:00
nimlgen
777d2aec05
metal profiler + cpu_profile ( #8291 )
...
* metal + cpu_profile
* gpt example
* linter + revert gpt2 for now
* a bit of readme
* linter
* unrelated
* tests
* linter
* b
2024-12-18 00:06:56 +03:00
nimlgen
af87e4b53c
viz profiler ( #8287 )
...
* only hcq
* fix get_metadata
* linter
* oops
* tiny
* linter
* time
* print pm
* hmm
* nits
2024-12-17 20:00:53 +03:00
George Hotz
0794af97db
consts do not realize
2024-12-17 08:53:53 -08:00
George Hotz
4764a4c172
Revert "TIP 3 - Tensor realization spec tests ( #8288 )" ( #8289 )
...
This reverts commit c0d4346b5a.
2024-12-17 08:36:11 -08:00
qazal
c0d4346b5a
TIP 3 - Tensor realization spec tests ( #8288 )
2024-12-18 00:04:50 +08:00
qazal
f16188b8c0
viz simplify tracked rewrites [pr] ( #8285 )
2024-12-17 23:18:08 +08:00
qazal
fd23738d9d
start viz better typing, unsupport bottom_up=True [pr] ( #8284 )
...
* start viz refactor
* delete bottom_up tracking
* more cleanup
* early continue
2024-12-17 19:52:30 +08:00
George Hotz
e3731766c9
add a test for UOp representation as Tensor [pr] ( #8278 )
2024-12-16 19:41:29 -08:00
chenyu
3195bd0d12
more test examples to merge views [pr] ( #8277 )
...
these have masks in self and masks in the merged views
2024-12-16 20:44:35 -05:00
chenyu
6e2e56c0ff
unit test for view add when self has a mask [pr] ( #8276 )
2024-12-16 20:07:35 -05:00
chenyu
2bb298f38d
add a missing unittest.main() [pr] ( #8274 )
2024-12-16 14:28:10 -05:00
chenyu
f05fd118a2
few minor code cleanups [pr] ( #8267 )
2024-12-15 23:44:51 -05:00
chenyu
9789a83064
hotfix DEBUG in speed_v_theoretical.py conv ( #8266 )
...
infinite recursion when DEBUG is set manually, e.g. `DEBUG=2 python test/external/speed_v_theoretical.py -k conv`:
```
File "/Users/chenyu/code/tinygrad/tinygrad/helpers.py", line 95, in __ge__
def __ge__(self, x): return self.value >= x
^^^^^^^^^^^^^^^
[Previous line repeated 4984 more times]
RecursionError: maximum recursion depth exceeded in comparison
```
2024-12-15 19:44:45 -05:00
uuuvn
da2245a458
Fix double => half cast on clang ( #8265 )
2024-12-15 11:24:05 -08:00