tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-01-13 00:48:21 -05:00

Author	SHA1	Message	Date
George Hotz	d1223922b1	fixed and test is real	2025-12-04 16:52:11 -08:00
George Hotz	05c4b18f91	Merge branch 'master' into sched_cache	2025-12-04 16:46:23 -08:00
chenyu	42f6cf3a90	tighter test_real_world mem and kernel count bounds (#13573 ) also check if actual usage is within 20% of set limit, the old limits are too big to be useful	2025-12-04 13:35:39 -05:00
chenyu	89f9e1dcd5	add SGD to beautiful_mnist (#13571 )	2025-12-04 12:17:29 -05:00
Rory Clear	6eab756578	fix and test loading num_batches_tracked (#13538 ) * fix and test loading num_batches_tracked * add failing reverse case * try reshape state dict if mismatch * reshape for () and (1,) --------- Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>	2025-12-04 01:22:49 -08:00
Douglas Nyberg	a8a62bc08e	add max/min reduction support to ScatterND (#13562 )	2025-12-04 00:53:47 -08:00
ayanhan	edf929ec9d	fix: add __delitem__ to Tensor with proper TypeError (#13561 )	2025-12-04 00:53:08 -08:00
ayanhan	92b40290c7	fix: add test_sum_int and remove outdated TODO in test_custom_kernel (#13560 )	2025-12-03 21:51:58 -05:00
Christopher Milan	0a54434b15	mitigate ctypes c_bool bitfield bug (#13558 ) * mitigate ctypes c_bool bitfield bug * don't delete old test	2025-12-03 20:46:04 -05:00
George Hotz	7c66e44454	fix JIT in examples/gradaccum_mnist.py	2025-12-03 16:00:28 -08:00
George Hotz	24ca8eeaa7	small fixups from schedule_cache (#13557 )	2025-12-03 15:41:16 -08:00
George Hotz	8c69e26d22	metadata is best effort	2025-12-03 15:22:58 -08:00
George Hotz	bf5de6ba5f	delete abstractions2	2025-12-03 15:02:20 -08:00
George Hotz	9ba612f0b4	Merge branch 'master' into sched_cache	2025-12-03 14:50:29 -08:00
Douglas Nyberg	f5abd38132	remove tfa dependency: use keras.optimizers.Lamb and tf.raw_ops for LARS (#13555 )	2025-12-03 17:48:27 -05:00
George Hotz	a4c4e48385	add LUNIQUE op (#13554 )	2025-12-03 14:34:34 -08:00
George Hotz	723179dfd6	Merge branch 'master' into sched_cache	2025-12-03 13:43:58 -08:00
chenyu	22777a89ea	minor test_uop_symbolic updates (#13551 )	2025-12-03 13:17:44 -05:00
chenyu	a205f98ef4	tighter bound for MOD (#13550 )	2025-12-03 11:24:29 -05:00
nimlgen	549f3287a8	fix caching for fetch (#13544 )	2025-12-03 14:34:14 +03:00
George Hotz	81bafb1af3	Merge branch 'master' into sched_cache	2025-12-02 19:59:48 -08:00
George Hotz	6bd355fa26	add needs_second_gpu decorator (#13543 ) * add needs_second_gpu decorator * more skips * two more fixes	2025-12-02 19:08:23 -08:00
wozeparrot	0d55aec605	fix after end (#13542 )	2025-12-02 18:42:58 -08:00
George Hotz	055d5aeb7f	add external_test_process_count	2025-12-02 17:26:30 -08:00
George Hotz	ed89217ef2	fix tests	2025-12-02 17:14:06 -08:00
George Hotz	79f2cfcb96	schedule cache cleanup	2025-12-02 16:59:32 -08:00
chenyu	e8879f7e31	match torch clamp backward (#13533 ) * match torch clamp backward * fix PYTHON	2025-12-02 17:58:32 -05:00
Roelof van Dijk	c158e3c988	add cifar gated uop_given_valid regression test (#13536 )	2025-12-02 16:02:47 -05:00
George Hotz	b4c3a6977e	Merge branch 'master' into sched_cache	2025-12-02 12:54:14 -08:00
Roelof van Dijk	e329baffa7	fix cifar while keeping openpilot fused (#13528 ) * this works * test now passes	2025-12-02 12:05:56 -08:00
nimlgen	0874ba8cc8	test_hevc: do not download the whole file (#13531 ) * test_hevc: do not download the whole file * fix	2025-12-02 21:31:28 +03:00
qazal	366badaa68	require renderer argument in get_program, removes device opening in process replay [pr] (#13524 )	2025-12-03 02:05:31 +08:00
Douglas Nyberg	6a7c58abf1	fix(onnx): unwrap list/tuple value in Pad op (#13500 ) * fix(onnx): unwrap list/tuple value in Pad op * add regression test for Pad list value * remove trailing whitespace * use _resolve_const for Pad constant_value	2025-12-02 07:47:20 -08:00
George Hotz	7f7aa0a7f8	start work on schedule cache	2025-12-02 07:44:10 -08:00
nimlgen	77a76d1b13	device: respect compiler ContextVars (#13523 ) * device: envvars for cc * fix * fix * x * um * fix * remote * em * cleanup * typing * fix * debug * lvp? * ugh * singl * rm * lol * fix * ? * this? * why? * rev * mod test * l	2025-12-02 14:42:04 +03:00
wozeparrot	1b7dbfb37f	tk: named kernels + per kernel range id (#13522 )	2025-12-01 22:51:04 -08:00
nimlgen	455dd88236	nv: minimal hevc (#13502 ) * nv: minimal hevc * validate * not needed * tralin * var * cpu * fxi * desc * move * cleanup	2025-11-30 16:46:55 +03:00
George Hotz	fd373fea7a	fix a few tests [pr] (#13498 )	2025-11-29 13:43:45 -08:00
George Hotz	6a140f74fe	split out unique_const and cache const [pr] (#13493 ) * split out unique_const * add cache to const * call const in unique_const	2025-11-29 10:44:28 -08:00
George Hotz	c38b7684dc	improve microbenchmarks (#13492 ) * improve microbenchmarks * bugfix + ubench * lil * no src in const method	2025-11-29 10:15:22 -08:00
kamilisjon	3d76ef9ba8	Update tests (#13479 )	2025-11-28 18:35:28 -08:00
qazal	ae9c56134e	skip test_tk failing locally on macbook (#13476 )	2025-11-29 01:15:37 +08:00
qazal	72ef533d9c	tracing: use u32 for buffer args encoding (#13472 )	2025-11-28 00:19:51 +08:00
George Hotz	18addc0a1d	process replay only get_program (#13475 )	2025-11-27 08:18:18 -08:00
George Hotz	a8e005b095	enable process replay (non-checking) by default (#13474 )	2025-11-27 07:28:44 -08:00
George Hotz	05cd2279d0	add cache on reshape (#13466 ) * remove cache on divmod, way less objects * _apply_reshape * reshape * no gc on realize * wow that cache is fast	2025-11-26 18:57:40 -08:00
George Hotz	19228e8d37	test_graph is flaky	2025-11-26 16:37:42 -08:00
George Hotz	e4cd649ff0	remove kernelize to prepare for refactors (#13463 ) * remove kernelize to prepare for refactors * less kernelize * last test	2025-11-26 14:18:50 -08:00
wozeparrot	ffc31a23f4	tk mi350 (#13288 )	2025-11-25 15:49:44 -08:00
qazal	7238df7a94	viz: cleanup sort_fn (#13454 )	2025-11-26 04:10:10 +08:00

1 2 3 4 5 ...

4735 Commits