tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-01-10 07:28:15 -05:00

Author	SHA1	Message	Date
qazal	57c7e0a8f8	RANGEIFY=1 test_jit (#12254 ) * RANGEIFY=1 test_jit * don't do any of that * disk * simple disk tensor * more work * run more tests * it also doesn't copy everytime * skip tests that hang everything	2025-09-20 17:34:32 +03:00
chenyu	393c6b236c	test case to sum twice in different order (#12253 ) * test case to sum twice in different order fixed by #12251 * try metal	2025-09-20 10:11:57 -04:00
qazal	4756971c88	skip test_bf16_disk_write_read on CL=1 (#12256 )	2025-09-20 17:11:06 +03:00
chenyu	5e794be8af	tighter spec for RANGE (#12250 )	2025-09-20 07:59:50 -04:00
Sieds Lykles	73c8dae60d	add missing remove_blockend case (#12251 ) * add missing remove_blockend case * remove expectedFailure * better comment	2025-09-20 06:29:19 +02:00
wozeparrot	dc4dd898b7	fix: close mmap (#12249 )	2025-09-19 14:09:12 -07:00
Sieds Lykles	bb1f376ae6	profile z3 (#12248 )	2025-09-19 22:52:06 +02:00
Sieds Lykles	7e06d3ebba	enable test_symbolic_jit (#12245 ) Co-authored-by: qazal <77887910+Qazalin@users.noreply.github.com>	2025-09-19 20:23:42 +02:00
qazal	bb59eed82f	rangeify: don't tag consts, they are global (#12247 ) * rangeify: don't tag consts, they are global * don't map movement ops * sym failing test * remove that * update comment * simpler test * work	2025-09-19 15:25:03 +03:00
Sieds Lykles	cc038b31b6	Shrink instead of reshape to unregister symbolic (#12241 ) * Slice to unbind symbolic * use vmax for now * assert shape in reshape is valid * update test_symbolic_ops to use shrink instead of reshape * remove infer_with_bound_values for npw * symbolic output doesnt have symbolic strides * symbolic jit tests use shrink to unregister symbolic * update test * update more tests * wrap vmax in int() * only create a new st if the store is not an assigne * unwrap st * comments	2025-09-19 06:04:35 +02:00
chenyu	a531a649fb	test_resize_upsample_scales_cubic_align_corners_cpu is fixed (#12244 )	2025-09-18 20:55:26 -04:00
Sieds Lykles	8d703a6369	z3 xor doesnt use bitcast (#12243 )	2025-09-19 00:31:44 +02:00
chenyu	0dad6cc518	good RANGEIFY kernel counts in external_test_opt (#12242 ) no push permute stuff. the model ones are less clear if it's good, some got slower	2025-09-18 17:58:54 -04:00
chenyu	cff1065f5e	test CL=1 RANGEIFY=1 onnx (#12240 ) all except test_resize_upsample_scales_cubic_align_corners_cpu runs	2025-09-18 16:49:46 -04:00
Sieds Lykles	ef05178855	fix 0//0 infinite rewrite in rangeify onnx (#12239 )	2025-09-18 21:59:50 +02:00
chenyu	87707ef0b8	unify range_start [pr] (#12236 )	2025-09-18 13:52:54 -04:00
qazal	825f148469	rangeify: fix copy size mismatch errs (#12232 ) * rangeify: fix copy size mismatch errs * const folding can happen in sym assert it * shippable * rangeify copy is completely wrong * pre_bufferize * tag bufferize * pre back	2025-09-18 18:23:32 +03:00
chenyu	f82b16a0e9	RANGEIFY test_tensor (#12235 )	2025-09-18 10:35:43 -04:00
chenyu	7487c13b61	truncate_fp16 -> float_to_fp16 (#12234 ) match float_to_bf16 and float_to_fp8	2025-09-18 09:48:27 -04:00
b1tg	54c15d74a4	python float8 support (#11960 ) * basic support * alu * nan in exec_alu * rand_for_dtype * inf + 0.0 * finfo * revert rand_for_dtype * clean * truncate fp8s inf * spec ok * float_to_fp8 nan/inf * least_upper_dtype * clean up --------- Co-authored-by: b1tg <b1tg@users.noreply.github.com>	2025-09-18 09:17:09 -04:00
qazal	dbbc261075	rangeify: fix COPY simplifier (#12233 )	2025-09-18 14:35:33 +03:00
Sieds Lykles	f1108f1cbe	Enable test_symbolic_ops on rangeify (#12230 ) * enable * merge correctly	2025-09-18 02:12:36 +02:00
Sieds Lykles	812f485cd7	Enable threefry_doesnt_use_long test on rangeify (#12229 ) * dont bufferize rangeify * enable doesnt_use_long test	2025-09-18 01:58:34 +02:00
nimlgen	3c5b8bf50c	am: bump fw to rocm7 (#12226 )	2025-09-17 21:20:22 +03:00
qazal	525f80e0d2	rangeify: enable putting consts back in the tensor graph (#12225 ) * rangeify: enable putting consts back in the tensor graph * work * sym in ci	2025-09-17 19:45:04 +03:00
chenyu	edffc246ed	MUL in reduce_unparented (#12223 ) * MUL in reduce_unparented * some test	2025-09-17 11:56:39 -04:00
qazal	7733c217c5	remove spam comments in test_schedule (#12224 )	2025-09-17 18:24:55 +03:00
qazal	d917895569	map out rangeify errors in test_schedule (#12211 ) * map out rangeify errors in test_schedule * skip that * add to ci	2025-09-17 09:10:28 +03:00
Sieds Lykles	158506b91e	Upgrade some divmod folding for symbolic divs (#12216 ) * use const_factor() instead of arg * add test * change div min_max * add tests * add divide_by_symbolic_gcd * add tests * one more test * Slice to unbind symbolic * deal with const factor properly * minor cleanup * divide_by_symbolic_gcd becomes UOp.gcd and UOp.divide_exact * add tests * add gcd_without_const * fix divide_exact bug * add factor_remainder * add tests * fix imports * elif -> if * remove expectedFailure * add more tests * add more unwrap * fix signature of pop_const * remove that * remove that	2025-09-17 03:00:50 +02:00
Sieds Lykles	328bfe6b9b	fix map_expand for symbolic shapes (#12218 ) fix incorrect default argument in resolve	2025-09-17 01:20:18 +02:00
chenyu	5b12764b83	add arange cat arange test (#12217 ) simple test case to catch wrong reduce const folding. also clean up the old arange complexity test	2025-09-16 17:12:32 -04:00
nimlgen	53655a4ee5	cuda: cleanup old comment (#12215 )	2025-09-16 23:11:32 +03:00
chenyu	6b808c5fe6	update TestSymbolicJit.test_plus1_pad (#12214 ) was failing because movement was not captured	2025-09-16 15:57:50 -04:00
Shun Usami	2a72b00679	Add test for 2D tensor indexing in setitem (#12193 ) * Add test for 2D tensor indexing in setitem * Fix _masked_setitem to handle multi dim indexing correctly * Fix indent * Add fuzz test for 3D tensor indexing in setitem * Skip indexing fuzz test (slow)	2025-09-16 14:57:25 -04:00
chenyu	c7b03457d7	Revert "Revert "more llvm intrinsics (#11961 )" (#12194 )" (#12195 ) This reverts commit `df1c183e46`.	2025-09-16 14:55:31 -04:00
chenyu	494bb12500	skip slow cifar bf16 on red benchmark (#12213 ) very slow to compile the fake bf16	2025-09-16 14:55:01 -04:00
chenyu	419e997187	increase benchmark timeout (#12212 ) account for compile cache, and it's annoying that job died due to timeout also messes the machine	2025-09-16 14:09:02 -04:00
chenyu	84d2d047ea	Tensor.pad_to and Tensor.shrink_to (#12210 ) most of the time i want this instead of spelling out the args also add more input validation to shrink	2025-09-16 12:24:55 -04:00
qazal	122a50fe8c	assert kernel count (#12205 )	2025-09-16 14:24:39 +03:00
chenyu	e555748807	test rangeify const folding (#12200 ) * test rangeify const folding reduce i know how to fix, multi and test_cast_padded tbd * test_instancenorm_3d is very slow	2025-09-15 20:03:48 -04:00
chenyu	f732f66709	rangeify test_nn almost pass (#12198 ) * rangeify test_nn almost pass * issue with jit * flaky	2025-09-15 17:49:20 -04:00
chenyu	82e037aad5	ci test.yml updates (#12197 ) * ci test.yml updates move docs together and external_benchmark_schedule to unit * torch	2025-09-15 17:09:02 -04:00
chenyu	146c31586d	split RANGEIFY ci (#12196 ) one CPU and one CL for speed	2025-09-15 15:41:10 -04:00
chenyu	df1c183e46	Revert "more llvm intrinsics (#11961 )" (#12194 ) This reverts commit `d01e3d7719`.	2025-09-15 13:56:43 -04:00
b1tg	d01e3d7719	more llvm intrinsics (#11961 ) * more llvm intrinsics * assert nan * skip test_log_nan on metal --------- Co-authored-by: b1tg <b1tg@users.noreply.github.com>	2025-09-15 13:05:23 -04:00
nimlgen	b63bd02969	update runtime docs (#12191 )	2025-09-15 17:46:20 +03:00
qazal	57e8bf61e8	viz: fix Specificity for rect styling (#12190 )	2025-09-15 17:33:37 +03:00
chenyu	72e010d816	fix rangeify ci (#12189 ) CL=1, and multitensor needs to test with CPU since CL does not support multi in CI	2025-09-15 10:24:57 -04:00
qazal	f1bd06134d	test fuse with RANGEIFY=2 (#12187 )	2025-09-15 15:51:23 +03:00
qazal	ef0ef705fe	viz: remove async from event listener (#12186 )	2025-09-15 15:08:28 +03:00

10255 Commits