tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-01-14 09:28:04 -05:00

Author	SHA1	Message	Date
Sieds Lykles	53985297bd	add test, fix rewrite rule and raise error on division by zero (#11073 )	2025-07-03 08:25:06 -04:00
George Hotz	d049639221	little setitem test (#11064 ) * setitem has one less realize, why broken * put realize back	2025-07-02 15:10:24 -07:00
George Hotz	3b85534df0	outerworld range test [pr] (#11059 ) * outerworld range test [pr] * bound range * grad acc test * more tests * 5 steps is fine	2025-07-02 14:28:44 -07:00
qazal	ad155f5454	print inputs to get_program in process replay [pr] (#11051 ) * print inputs to get_program in process replay [pr] * colors * keep dataclass default escapes * Revert "keep dataclass default escapes" This reverts commit `c6db7e8a7a`. * note for ast_repr * add that back	2025-07-02 20:20:01 +03:00
qazal	a919b8325b	add test_kernel_info (#11054 ) * add test_kernel_info * reorder	2025-07-02 19:48:12 +03:00
kevvz	3b041d188f	[bounty] Singular Value Decomposition (#10875 ) * inital commit * add qr + expand svd to full matrix * add odd number support * add linalg tests * qr supports dims of arbitrary size * add qr tests * svd supports dims of arbitrary size * small cleanip * improvements over svd batch handling * improve linalg tests * make u_pad match q shape * add nonfull matrix tests * little less verbose nonfull svd test * added dtypes on svd + return vt instead of vt * lint * more lint * lint + set seed * small fix * small lint * lint * add int casting to indices and shapes * remove int from shape tuple in svd * small cleanup * add return types * reuse inverse_permute * refactoring * whitespace * remove regularization term to prevent bad outputs on ill conditioned matrices * remove seed * refactor * lint * refactor * spacing * remove clone * line reduction * smarter heuristic for iterations_per_round * add big test * lint * turns out no constant needed? * wrap tests * some small matrices need the constant * remove realize --------- Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>	2025-07-02 09:06:03 -07:00
Ahmed Harmouche	e992ed10dc	WebGPU on Windows (#10890 ) * WebGPU on Windows * Fix dawn-python install * New test * pydeps * Minor fix * Only install dawn-python on windows webgpu --------- Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>	2025-07-02 08:38:45 -07:00
chenyu	4626e9c172	is_numpy_ndarray helper [pr] (#11050 )	2025-07-02 09:12:53 -04:00
qazal	452b22c9b6	fix process replay diff in PYTHON device [pr] (#11052 ) * fix process replay diff in PYTHON device [pr] The PYTHON backend pickles and encodes UOps, the encoded binary can't be directly diffed in process replay. * note	2025-07-02 11:06:46 +03:00
geohotstan	8ebf0abaae	ONNX external_test_onnx_backend use PYTHON device for model (#10915 ) * try * ruff check --fix * no skip test * hmmmmmmm I don't get this D: * run CI again * why is PYTHON device faster than CPU? * run ci again and fix lint * actually doesn't PYTHON device make sense here? * see cpu speed again * Revert "see cpu speed again" This reverts commit `1e366f2256`. * trigger CI * pretty good --------- Co-authored-by: chenyu <chenyu@fastmail.com>	2025-07-01 12:11:17 -04:00
qazal	8b0871ac31	viz: test for no lockup on infinite loop (#11041 ) * viz: add test infinite loop fallback * assert * continue til the end * work * bring that back * fallback to nop	2025-07-01 17:44:20 +03:00
b1tg	fcbefde8f5	fix DiskDevice reuse (#11039 ) * fix DiskDevice reuse * fix mypy and DiskDevice.count * mypy * add test --------- Co-authored-by: b1tg <b1tg@users.noreply.github.com>	2025-07-01 10:29:21 -04:00
George Hotz	0597735f28	remove TC=3 not porting this (#11045 )	2025-06-30 15:12:49 -07:00
George Hotz	cccfe6b422	hotfix: test_no_inf_loop_bottom_up	2025-06-30 14:21:45 -07:00
George Hotz	b829331219	infinite loop detect in fixed_point_rewrite [pr] (#11038 )	2025-06-30 08:57:29 -07:00
George Hotz	cb531dba42	detect infinite loop in graph rewrite [pr] (#11036 )	2025-06-30 08:15:13 -07:00
qazal	2ea4737930	viz: fix newlines breaking label colors (#11030 ) * viz: fix newlines breaking label colors * TestViz.test_colored_label * TestWordWrap	2025-06-30 13:39:44 +03:00
George Hotz	5911b71404	early support for bidirectional pattern matcher (#11027 ) * early support for bidirectional pattern matcher * expose it and add a test * no bottom up arg there * disable flaky test	2025-06-29 16:54:07 -07:00
Piyush	454bc3393d	redundant code (#11014 )	2025-06-29 09:06:10 -07:00
chenyu	126fcf4129	clean up AMD_LLVM in tests (#11021 )	2025-06-28 22:45:47 -04:00
qazal	4c8d2a0383	buffer viz (#10960 ) * add mem_layout * ui * cleanup * work * debugLine work and expander * tooltip style * real expand device * wheel does one thing * diff * shows llama oom * add y axis * mypy chill * work * unittests for the memory layout	2025-06-28 21:50:32 +03:00
George Hotz	be53ef4f0a	rename DEFINE_ACC -> DEFINE_REG (#11006 ) * rename DEFINE_ACC -> DEFINE_REG * add CMPEQ to groupops	2025-06-27 11:09:25 -07:00
George Hotz	5a1911b7c4	apply the global dims late (#11002 ) * apply the global dims late [pr] * late gpudims * tests passing * remove the random local_dims inc * simpler	2025-06-27 09:54:34 -07:00
qazal	4ef10c57f9	remove unused test helper (#10999 )	2025-06-27 13:48:48 +03:00
qazal	a39343e39f	viz: move timeline layout to python (#10998 ) * viz: move timeline layout to python * DevEvent has a device and a name	2025-06-27 13:06:00 +03:00
George Hotz	b4eb876d5a	kernel.py no longer permutes reduce axis [pr] (#10968 ) * kernel.py no longer permutes reduce axis [pr] * delete tests that handcode uops * regen of sops is broken... * put import back * just remove that * disable those tests	2025-06-26 17:44:58 -07:00
qazal	1127302c46	move perfetto to extra (#10994 ) * move perfetto to extra * update TestViz and fix tests * remove perfetto.html from viz directory * work * mypy	2025-06-27 01:53:54 +03:00
qazal	712980e167	fix extract_dataset + add tests to CI (#10995 ) * fix extract_dataset + tests * add CI * sops.gz itself is same as master * yml + gzip -c + ge * don't commit that * bump limit to 1000 * axis=7 * test_tiny	2025-06-27 01:51:36 +03:00
Ignacio Sica	579194f523	remove some linearize calls from tests 2 [pr] (#10992 ) * refactor count_float4 to take uops as input instead of kernel * remove some calls to linearize in test_linearizer * remove some more calls * remove one more call	2025-06-26 18:22:27 -03:00
geohotstan	50936b4a18	ONNX real float16 (#10694 ) * squash commits * temp fix for const tensor * actually realizing float16 can only happen in raw_data * .float -> cast(float) to rerun CI --------- Co-authored-by: chenyu <chenyu@fastmail.com>	2025-06-26 14:05:12 -04:00
chenyu	49bba2f0a0	improve test_nll_loss (#10986 ) build target and weight tensors outside so it tests backward too.	2025-06-26 02:46:55 -04:00
chenyu	0612acfc70	improve Tensor.cross_entropy (#10985 ) separate when Y is prob vs indices and check shapes for indices. also fix higher dim cases	2025-06-26 01:39:48 -04:00
chenyu	8751d47985	CosineAnnealingLRWithWarmup (#10981 )	2025-06-25 17:45:21 -04:00
Ignacio Sica	21f1c4cc09	remove some linearize calls from tests [pr] (#10978 ) * remove some linearize calls from tests speed_compare_cuda_ptx test_uop_spec test_linearizer test_uops test_winograd * more clear assert message	2025-06-25 12:37:17 -07:00
Ignacio Sica	98d2cde293	revert tc_group feature (#10971 )	2025-06-24 20:58:13 -07:00
George Hotz	cf60ccac6a	support new const lowering (#10967 ) * support new const lowering * delete invalid linearizer failure tests	2025-06-24 15:21:41 -07:00
George Hotz	8a65720528	hotfix: disable test_tensor_core_opts_group test on real metal	2025-06-24 15:21:33 -07:00
George Hotz	8743ca40e2	force reduce to be in axis order (#10837 ) * force reduce to be in axis order * disable rule causing loop * disable that rule * no ra there * only move non reduce * fix tests	2025-06-24 13:00:16 -07:00
chenyu	bfa87f3490	clean up binary_crossentropy_logits (#10958 )	2025-06-24 12:23:40 -04:00
qazal	de4b9bf53b	add opts_to_apply option to AST KernelInfo (#10950 ) * proposal: add option to override opts in the get_program API * update test_linearizer_rewrite * state in uops * update process_replay and names * empty isn't none * fix process replay	2025-06-24 18:55:39 +03:00
chenyu	18e264a449	Tensor.logsigmoid (#10955 )	2025-06-24 11:16:14 -04:00
b1tg	cc32394b32	support copyin/copyout/is_allocated for subbuffers (#10869 ) * support copyin/copyout/is_allocated for subbuffers * simple * clean up * rm underlying_buf * add function is_initialized * add tests * better test_subbuffer_copy_in_out * fix allocator --------- Co-authored-by: b1tg <b1tg@users.noreply.github.com> Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>	2025-06-24 07:49:04 -07:00
chenyu	35504c938e	torch.clip(x,y) -> x.clip(y) in test_ops (#10954 ) * torch.clip(x,y) -> x.clip(y) in test_ops * test_binary_crossentropy_logits_pos_weights	2025-06-24 10:22:19 -04:00
Fang-Pen Lin	86d458533f	Add pos_weight for binary_crossentropy_logits (#10855 ) * Add pos_weight for binary_crossentropy_logits * Remove debug code * Code style * Code style * Rename	2025-06-24 09:42:37 -04:00
Sieds Lykles	61dad3740f	fix min_max and add test (#10952 )	2025-06-24 09:33:26 -04:00
qazal	f41c28a048	update test_tensor_uop_representation comments [pr] (#10946 ) These comments can update to match new tinygrad.	2025-06-24 10:47:09 +03:00
qazal	7a5e4e0bf1	fix unittests process replay [pr] (#10947 )	2025-06-24 10:30:23 +03:00
George Hotz	7d560dbd75	hotfix: corealize in the tiny mnist test	2025-06-23 17:41:16 -07:00
George Hotz	0f89660ce4	Revert "change clang -march flag to -mcpu on arm (#10841 )" (#10942 ) This reverts commit `897e42fd1b`.	2025-06-23 16:48:28 -07:00
Ignacio Sica	956a8391a5	minor cleanup on test_tensor_core_opts tests (#10924 ) * minor cleanup on test_tensor_core_opts tests Tests now notify when skipped Before, they silently skipped if backend didn't had half precision and accumulation Also cleaned up atol and rtol setup * refactor test_tensor_core_opts_group --------- Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>	2025-06-23 16:30:21 -07:00

... 13 14 15 16 17 ...

4667 Commits