George Hotz
60e52fbe36
support opts in contig, simpler ( #12400 )
2025-10-01 17:20:04 +08:00
chenyu
6c95b1f39d
explicitly set device for CI unit test ( #12399 )
2025-10-01 05:16:54 -04:00
chenyu
6ba8bf282f
skip test_masked_select for RANGEIFY PYTHON ( #12395 )
2025-10-01 04:13:31 -04:00
chenyu
689ab9151b
more RANGEIFY tests ( #12393 )
...
would have caught the load alt regression without adding too many tests
2025-10-01 03:43:58 -04:00
chenyu
adc8c3b28f
Revert "load alt value with cast ( #12384 )" ( #12392 )
...
This reverts commit 05e91a248d .
2025-10-01 03:20:04 -04:00
b1tg
154d114364
rangeify: fix abstractions2.py ( #12386 )
...
* rangeify: fix abstractions2.py
* tests
* lint
* only abstractions2
* base
2025-10-01 09:58:56 +03:00
George Hotz
fe96c8d345
add HALF flag to tinygrad.apps.llm
2025-10-01 14:44:59 +08:00
George Hotz
f205352cd7
remove ranges with 1s ( #12388 )
...
* use op_in_parents
* remove the ranges of 1
* fix CL image thing
* fix realize
2025-10-01 14:43:29 +08:00
qazal
90b1c0dd96
rangeify: test_where_fold kernel count ( #12379 )
...
* rangeify: test_where_fold kernel count
* get these from the index
* replace ranges
* fine
* movement ops
* diff
* better
2025-10-01 09:35:12 +03:00
b1tg
42748ccb92
rangeify: fix test_prequant_conv2d_1x1 ( #12391 )
2025-10-01 02:33:47 -04:00
Sieds Lykles
05e91a248d
load alt value with cast ( #12384 )
...
* add or_casted
* add tests and fix old tests
* cast load
* move that to pm_render
2025-10-01 07:14:26 +02:00
qazal
714500edfd
viz: add font-weight to OffscreenCanvas config ( #12390 )
2025-10-01 08:08:47 +03:00
b1tg
57ad46c6e4
rangeify: increase atol for test_two_binops_no_rerun passing on real windows machine ( #12389 )
...
CPU_LLVM=1
2025-10-01 00:56:45 -04:00
George Hotz
e02da8f5ac
use op_in_parents ( #12385 )
2025-10-01 12:37:29 +08:00
chenyu
0662946fac
atol in test_two_binops_no_rerun ( #12387 )
...
for RANGEIFY LLVM
2025-10-01 00:05:47 -04:00
b1tg
da52006bde
rangeify: fix test_scatter_reduce ( #12380 )
...
* rangeify: fix test_scatter_reduce
* ext_vector_type
* set alignment=1 on boolean
2025-09-30 23:26:36 -04:00
George Hotz
1c1b4d14e9
minor cleaups in rangeify ( #12382 )
...
* minor cleaups in rangeify
* op_in_parents
* don't use toposort
* Revert "don't use toposort"
This reverts commit 257d8e2529 .
2025-10-01 11:19:48 +08:00
wozeparrot
4204edc60b
feat: skip test_long ( #12383 )
2025-09-30 20:07:39 -07:00
chenyu
8def8145e4
ALLOWED_KERNEL_COUNT openpilot 0.9.4 with RANGEIFY ( #12381 )
2025-09-30 22:58:59 -04:00
George Hotz
4c9a930de2
rangeify attn tests ( #12377 )
2025-10-01 09:59:19 +08:00
qazal
26247573e1
rangeify multi tests on gpu ( #12376 )
...
* rangeify multi tests on gpu
* fix limit_bufs
2025-10-01 04:53:04 +03:00
qazal
f2eb92948d
rangeify: ban view pushing ( #12371 )
...
* rangeify: ban view pushing
* don't shape INDEX
* fix the codegen cache
* make space
2025-10-01 04:37:52 +03:00
George Hotz
a128fa0f8a
removing double reshapes was wrong ( #12375 )
2025-10-01 09:25:35 +08:00
hooved
969a1b35ca
LR scheduler for Stable Diffusion mlperf training ( #12201 )
...
* add lr scheduler for stable diffusion training
* add lr scheduler test
* rerun ci
* rerun CI
* use np for testing
* move test to CI path
* remove unneeded copy
2025-09-30 21:21:08 -04:00
George Hotz
9ef319f349
bad conv in rangeify ( #12373 )
...
* bad conv with broken rangeify
* no maxpool needed
* add empty_like
* typo
* no self
* issue remains for test
2025-10-01 08:56:22 +08:00
nimlgen
080b26e7d7
use suppress_finalizing to not mute all exceptions ( #12372 )
2025-09-30 21:24:31 +03:00
George Hotz
44558a37f7
fix some rangeify tests ( #12370 )
...
* fix bad range merges
* fix rng
* fix uop gc
* fix some rangeify tests
* now that needs rangeify 2 also
2025-09-30 20:12:08 +08:00
nimlgen
2c397eb2a2
rangeify: buf limit ( #12336 )
...
* limit bufs
* g
* fix buffer limit
* um?
* fix
* only these?
* typo
* f
* cleaner
2025-09-30 14:59:47 +03:00
George Hotz
a83f219253
fix bad range merges ( #12368 )
...
* fix bad range merges
* fix rng
* fix uop gc
2025-09-30 19:30:21 +08:00
qazal
a95159d579
remove TestShapeSpec, it relies on ShapeTracker [pr] ( #12369 )
2025-09-30 14:20:35 +03:00
George Hotz
9cf5e66899
minimal rangeify stable diffusion fix ( #12367 )
...
* minimal rangeify stable diffusion fix
* more minimal
2025-09-30 18:48:35 +08:00
chenyu
b4a4817c9c
fix rangeigy test_linalg ( #12365 )
2025-09-30 06:28:35 -04:00
qazal
de1d562b69
rangeify: update test_pickle asserts ( #12366 )
...
* realized exists on the base
* use is_realized
2025-09-30 13:27:41 +03:00
b1tg
c9ef5d8fe5
rangeify: fix test_tensor_index_overflow (CPU_LLVM=1) ( #12362 )
...
* rangeify: fix test_tensor_index_overflow (CPU_LLVM=1)
* add test
---------
Co-authored-by: b1tg <b1tg@users.noreply.github.com >
2025-09-30 05:55:15 -04:00
qazal
e8c595c29e
remu: add new instructions introduced in RANGEIFY ( #12363 )
...
* add v_mad_i64_i32 for test_output_padded_conv_transpose2d
* run amd test_ops
* skip test_masked_select
2025-09-30 12:36:29 +03:00
George Hotz
360980f1a3
work on rangeify cost function heuristics ( #12360 )
...
* work on rangeify cost function heuristics
* dedup
* better cost function
2025-09-30 16:44:29 +08:00
qazal
109c63b904
update Tensor unit tests for RANGEIFY ( #12359 )
...
* update test_kernelize for RANGEIFY
* also kernelizes user contiguous
* skip that test
* tensor uop repr
* 4 kernels, still realizes a float
2025-09-30 11:17:21 +03:00
George Hotz
7129419500
fix cifar training in RANGEIFY ( #12355 )
...
* fix cifar training in RANGEIFY
* even more wino fuse
* bugfix
* test to show issue
2025-09-30 15:59:19 +08:00
qazal
4ff7f20b9d
rangeify: fix kernelize ( #12357 )
2025-09-30 10:10:08 +03:00
chenyu
86c5c969ea
linalg cosmetic change ( #12356 )
2025-09-30 03:00:59 -04:00
qazal
6a56d3c859
rangeify: only test correctness in multi ( #12339 )
...
* work
* more work
* back here
* skip tests
* work
2025-09-30 09:55:59 +03:00
George Hotz
ab6b0d3a21
enable cleanup_dead_axes ( #12351 )
...
* enable cleanup_dead_axes
* don't mess with user contig
* correct tag behavior
* double reshape isn't correct
* block on assign too
* skip messing with symbolic
* Fix tests
* disable RANGEIFY=2
* test w rangeify
2025-09-30 14:09:39 +08:00
qazal
2a7310ab59
rangeify: fix remaining multi correctness issue ( #12354 )
2025-09-30 08:08:27 +03:00
Sieds Lykles
73b25bf47d
z3 fix loaded mask ( #12353 )
...
* z3 fix loaded mask
* indentation
2025-09-30 06:55:50 +02:00
wozeparrot
2a0caa09c2
push copy to disk ( #12348 )
2025-09-29 21:55:05 -07:00
chenyu
881709cd33
don't skip rangeify test_instancenorm_3d ( #12350 )
...
seems fine now
2025-09-30 00:05:59 -04:00
hooved
39aae679e4
Support bfloat16 on NULL backend ( #12340 )
...
* add failing test
* move test
* only run test with NULL default
* add skip reason
* add fix
2025-09-30 00:02:30 -04:00
chenyu
af935e7d32
Revert "reduce const folding ( #12344 )" ( #12349 )
...
This reverts commit 8e508a9927 .
2025-09-29 23:45:30 -04:00
George Hotz
f522e83a02
fix rangeify elu fusion for openpilot ( #12341 )
...
* fix rangeify elu fusion for openpilot
* flip the metadata
* copy over permuted contiguous support
* this is correct
* update that
2025-09-30 11:41:52 +08:00
qazal
d95d018bb5
add name to multi rewrite [pr] ( #12346 )
2025-09-30 06:34:58 +03:00