qazal
f4f705a07c
can push SWIZZLE through reduce both ways ( #6453 )
2024-09-10 16:00:50 +08:00
qazal
1347e49e82
second iteration on UOps.SWIZZLE ( #6451 )
...
* new swizzle
* fix the failing tests
* test a double swizzle
* ci
2024-09-10 14:43:21 +08:00
chenyu
e0d35e3657
update test_padto_sum_not_ok ( #6450 )
...
updated the setup as `exp() < -1` could be folded to False
2024-09-09 22:46:42 -04:00
qazal
95c9fe841e
UOp.st infra for the new SWIZZLE ( #6449 )
2024-09-10 09:39:45 +08:00
qazal
abfbd9fd2f
fix Variable init from the DEFINE_VAR refactor ( #6448 )
...
prereq for UOps.VALID.
2024-09-10 09:14:29 +08:00
chenyu
fcc69adfc5
simplify c0*x<c1 for negative int c0,c1 ( #6431 )
...
* simplify c0*x<c1 for negative int c0,c1
* fine if rhs is zero
2024-09-09 21:05:53 -04:00
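The arithmetic behind the `c0*x<c1` simplification above can be checked with a small brute-force sketch. This is not the rewrite rule from #6431 itself. It assumes integer `x`, a negative integer `c0` and a non-positive integer `c1`, and it uses one equivalent form, `-x < -((-c1) // (-c0))`, chosen here for illustration. The function names are hypothetical.

```python
def original(c0: int, x: int, c1: int) -> bool:
    # The comparison as written: c0*x < c1.
    return c0 * x < c1

def simplified(c0: int, x: int, c1: int) -> bool:
    # Assumes c0 < 0 and c1 <= 0 (illustrative preconditions, not taken from the commit).
    # Multiplying by -1 flips the comparison: a*x > b with a = -c0 > 0 and b = -c1 >= 0.
    # For integer x, x > b/a is the same as x > floor(b/a), i.e. -x < -(b // a).
    a, b = -c0, -c1
    return -x < -(b // a)

def agree_everywhere() -> bool:
    # Exhaustively compare both forms on a small grid of integers.
    return all(
        original(c0, x, c1) == simplified(c0, x, c1)
        for c0 in range(-6, 0)
        for c1 in range(-20, 1)
        for x in range(-30, 31)
    )
```

The grid includes `c1 == 0`, which corresponds to the "fine if rhs is zero" note in the commit body.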
kormann
f6f4f3222f
whisper long batch ( #6335 )
...
* reset
* test
* only part refactor
2024-09-09 21:03:59 -04:00
qazal
29e63097a0
st is a cached_property on UOp [run_process_replay] ( #6433 )
2024-09-10 08:30:35 +08:00
qazal
cf64f8bb40
start with the UOps.VALID spec [run_process_replay] ( #6435 )
...
* document UOps.VALID [run_process_replay]
* now the assert
2024-09-10 08:00:19 +08:00
Tim Becker
58a1b4f427
Faster UOp hashing ( #6447 )
...
* Faster hashing of Enums and UOp
* NOp should not define __eq__
---------
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
2024-09-10 07:16:04 +08:00
George Hotz
92e4126793
Revert "Revert "RewriteContext [run_process_replay] ( #6428 )" ( #6438 )" ( #6443 )
...
This reverts commit e7dd08448f .
2024-09-10 07:00:26 +08:00
George Hotz
904f6a63fa
Revert "Revert "cleanup process_replay/* namings [run_process_replay] ( #6429 )…" ( #6442 )
...
This reverts commit eda177da84 .
2024-09-10 07:00:16 +08:00
nimlgen
8d3450ceab
qcom remove unused commands ( #6445 )
...
* qcom remove unused commands
* linter
2024-09-09 20:26:07 +03:00
nimlgen
f63a9fd649
hcq _cur_cmd_idx for readability ( #6444 )
...
* hcq _cur_cmd_idx for readability
* linter
2024-09-09 20:04:45 +03:00
George Hotz
dbd4536167
Revert "add UOps.VALID ( #6387 )" ( #6441 )
...
This reverts commit 8186e4e7d6 .
2024-09-09 21:33:00 +08:00
George Hotz
e7dd08448f
Revert "RewriteContext [run_process_replay] ( #6428 )" ( #6438 )
...
This reverts commit e1d61b048b .
2024-09-09 18:53:18 +08:00
George Hotz
eda177da84
Revert "cleanup process_replay/* namings [run_process_replay] ( #6429 )" ( #6437 )
...
This reverts commit f4e83b30b4 .
2024-09-09 18:52:36 +08:00
George Hotz
d5bd38c278
add min max rule for expand [run_process_replay] ( #6434 )
2024-09-09 18:30:20 +08:00
George Hotz
42e5c8335e
remove args from min/max [run_process_replay] ( #6430 )
...
* remove args from min/max [run_process_replay]
* it's a ConstType
* sconst_like unused
* any const is fine
2024-09-09 18:18:20 +08:00
qazal
f4e83b30b4
cleanup process_replay/* namings [run_process_replay] ( #6429 )
2024-09-09 16:59:04 +08:00
George Hotz
8186e4e7d6
add UOps.VALID ( #6387 )
...
* uops valid
* broke full_shape
* fixup that st (hardcoded asts still red)
* fixup DEFINE_VAR
debug
more debug
* start moving stuff to ast_const
* move test_linearizer
* move test_linearizer_failures to ast_const
* fixup test_schedule
* small diff change
* regenerate dataset
* fixup test_multitensor
* regen dataset try 2
---------
Co-authored-by: qazal <qazal.software@gmail.com>
2024-09-09 16:58:43 +08:00
George Hotz
e1d61b048b
RewriteContext [run_process_replay] ( #6428 )
2024-09-09 16:49:02 +08:00
qazal
935b6b658f
delete seen from the scheduler api [run_process_replay] ( #6427 )
...
docs
2024-09-09 16:26:34 +08:00
George Hotz
6c7abd18df
non-optional bounds (faster) [run_process_replay] ( #6425 )
...
* non-optional bounds (faster) [run_process_replay]
* pre-fetch min/max
* Revert "pre-fetch min/max"
This reverts commit cdd71840c5 .
2024-09-09 16:00:16 +08:00
qazal
c5bae55ec8
new generate_dataset.sh ( #6423 )
...
* new generate_dataset.sh
* keep those there
* test: rm expected failures
* rename to extract
2024-09-09 15:13:07 +08:00
chenyu
1941e66cc9
real strides with uops ( #6365 )
...
* real strides with uops [run_process_replay]
* compare with old
* Revert "compare with old"
This reverts commit f53a8d4276 .
* make those @unittest.expectedFailure
2024-09-09 03:06:27 -04:00
chenyu
ac98f5056e
move lt-folding to a function [run_process_replay] ( #6422 )
...
and added more tests (some failed to match symbolic)
2024-09-09 02:04:52 -04:00
qazal
ff8a9ac3c1
test new style gated store rendering ( #6413 )
...
* test new style gated store rendering
* switch to lidx
* make lidx optional
* fixup [run_process_replay]
2024-09-09 13:59:22 +08:00
George Hotz
90fb17304f
put rewrite back in ops [run_process_replay] ( #6421 )
2024-09-09 13:53:51 +08:00
chenyu
047ab7d256
minor Program post_init size cleanup [run_process_replay] ( #6415 )
2024-09-08 23:41:27 -04:00
qazal
442150a8df
more ast_const for hardcoding consts [run_process_replay] ( #6418 )
2024-09-09 11:35:08 +08:00
chenyu
25af78c593
failed uop_symbolic divmod test by variable ( #6414 )
2024-09-08 23:08:58 -04:00
qazal
88941bcf16
fold bitwise noops ( #6412 )
...
from `8269a721cd6f5c6030ce120e1139095d7ba117eb`
Co-authored-by: timmy <timmy0x@proton.me>
2024-09-09 10:18:38 +08:00
chenyu
ad05302232
tests of real_stride of symbolic shape ( #6409 )
...
these would have failed in #6365
2024-09-08 21:37:19 -04:00
Obada Khalili
0fbd141038
tinygrad Tensor Puzzles ( #6315 )
...
* Update index.md
* update readme
* Revert "update readme"
This reverts commit 8415a8e90c .
* update readme
* remove mention
* update index.md
2024-09-09 09:32:38 +08:00
qazal
935b4ddff6
use ast_const in test_linearizer asts [run_process_replay] ( #6407 )
2024-09-09 08:46:58 +08:00
qazal
750696a026
use uop vars for ast DEFINE_VAR [run_process_replay] ( #6404 )
...
prereq for UOps.VALID.
2024-09-08 17:52:51 +08:00
qazal
9a67ec6174
refactor to list of kernels [run_process_replay] ( #6403 )
2024-09-08 17:19:45 +08:00
chenyu
7df4373fd9
tensor reduction touchup ( #6402 )
...
- fixing spacing
- use get_args to get valid Literal values and raise ValueError to match, and a test for that
- use `Y` to be consistent
2024-09-08 03:55:51 -04:00
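The `get_args` approach mentioned in the tensor reduction touchup above (#6402) can be sketched in plain Python. The alias `ReductionStr`, the function `apply_reduction`, and the three allowed values are assumptions made for illustration; the commit does not show the actual names or the set of values used in tinygrad.

```python
from typing import Literal, get_args

# Hypothetical alias; the real Literal in tinygrad may list different values.
ReductionStr = Literal["mean", "sum", "none"]

def apply_reduction(values: list[float], reduction: ReductionStr = "mean"):
    # get_args returns the tuple of values the Literal allows, so the
    # runtime check and the type annotation cannot drift apart.
    if reduction not in get_args(ReductionStr):
        raise ValueError(f"{reduction=} must be one of {get_args(ReductionStr)}")
    if reduction == "mean":
        return sum(values) / len(values)
    if reduction == "sum":
        return sum(values)
    return values  # "none": leave the values unreduced
```

Deriving the valid set from the `Literal` keeps a single source of truth: adding a new mode to the alias automatically widens the runtime check.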
geohotstan
65da03e186
remove _slice [run_process_replay] ( #6395 )
...
* try
* pass
* clean up
* done
* I'm becoming dumber
* clean up 2
* remove useless max
* useless but make computer brrr [run_process_replay]
* try process replay
* try again
* 1 less line, just use pad2d
2024-09-08 09:12:39 +08:00
nimlgen
22e3379578
qcom sampler touchups ( #6400 )
2024-09-07 22:37:46 +03:00
chenyu
3b2e1b922d
Tensor.roll touchup ( #6398 )
...
simplified a bit.
it might be possible to write it with only movement ops, but the backward would then contain a reduce.
2024-09-07 04:48:43 -04:00
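For readers unfamiliar with roll semantics, the operation touched by the two `Tensor.roll` commits here can be illustrated on a plain Python list. This is only a sketch of what a circular shift computes, assuming the usual convention that a positive shift moves elements toward higher indices. It is not tinygrad's implementation, and the function name `roll_list` is hypothetical.

```python
def roll_list(xs: list, shift: int) -> list:
    # Circularly shift xs by `shift` positions; elements that fall off
    # one end reappear at the other.
    n = len(xs)
    if n == 0:
        return []
    k = shift % n  # normalise negative and oversized shifts into [0, n)
    # The last k elements move to the front, the rest follow.
    return xs[n - k:] + xs[:n - k]
```

The body is two slices followed by a concatenation, which is why a roll can be described in terms of slicing and joining alone.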
Irakli Salia
2e01efc35f
tensor roll ( #6375 )
...
* tensor roll function and tests
* fix type annotations
* reduce line count
* more readable
2024-09-07 05:14:28 +08:00
Tim Becker
dfb818788e
Support reduction parameter in more loss functions ( #6302 )
2024-09-07 05:11:20 +08:00
nimlgen
d3f38faeed
tiny qcom touchups ( #6396 )
2024-09-06 22:49:21 +03:00
nimlgen
40e49b6b1a
hcq share signal wait ( #6394 )
...
* hcq share signal wait
* linter
2024-09-06 18:35:06 +03:00
chenyu
26c5d8346a
remove Variable from UOp.DEFINE_VAR ( #6393 )
...
now it's just arg = (expr as str, min as UOp.const, max as UOp.const)
2024-09-06 05:55:19 -04:00
chenyu
9ed2b8b818
fix DEFINE_VAR setup in test_uop_graph [run_process_replay] ( #6392 )
...
making sure arg always has 3 items
2024-09-06 05:32:12 -04:00
George Hotz
282af21b95
hotfix: DEBUG_EXPAND -1 and NOOPT in benchmark schedule
2024-09-06 17:22:30 +08:00
George Hotz
6c15bd960b
flip radd to be correct [run_process_replay] ( #6390 )
2024-09-06 17:09:46 +08:00