tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-04-29 03:00:14 -04:00

Author	SHA1	Message	Date
chenyu	519336cfea	factor out partial in SumNode div int (#3841 ) * factor out partial in SumNode div int * div not rem * space	2024-03-20 16:34:33 -04:00
chenyu	455f7bea9b	test example from half resnet that idx has number outside of int32 (#3838 ) * test example from half resnet that idx has number outside of int32 * ruff	2024-03-20 13:44:20 -04:00
Patrick Tsai	b436c9792f	Fix factoring bug (O(n) arange related) (#3817 ) * Factoring bug * Another one in case * It works now so change tests back * large arange cumsum optimization * More cleanup * symbolic no factor div test * name change * Rename test --------- Co-authored-by: Patrick Tsai <patosai@users.noreply.github.com>	2024-03-19 11:49:42 -04:00
chenyu	968d109453	apply more create_lt_node (#3597 ) updated one in linearizer if condition, and various symbolic tests	2024-03-03 16:12:39 -05:00
Patrick Tsai	0082300a59	Fix symbolic negative floordiv (#3594 ) Co-authored-by: Patrick Tsai <patosai@users.noreply.github.com>	2024-03-03 11:40:52 -05:00
chenyu	e09619ab6c	explicitly create_lt_node when used in shapetracker _expr_view (#3561 ) * explicitly create_lt_node when used in shapetracker leave regular __lt__ and cmps for symbolic shape cmp * hmm it fixed that? * LtNode.substitute uses create_lt_node	2024-03-03 10:08:21 -05:00
chenyu	88939c3347	fix Node.max can be symbolic (#3514 ) Also made sure taking max twice can get int.	2024-02-27 17:21:31 -05:00
chenyu	61605ccc69	Remove special case of SumNode div SumNode (#3502 )	2024-02-26 09:42:06 -05:00
chenyu	0d326a48b8	fix LtNode simplification when lhs and rhs contain same variables (#3451 ) * fix LtNode simplification when lhs and rhs contain same variables `(Variable("a", 1, 5) < Variable("a", 1, 5))` should eval to `NumNode(0)` * fix with less perf impact	2024-02-20 09:06:55 -05:00
chenyu	2da734920e	use __getnewargs__ to fix unpickling Variable (#3441 ) it's recommended to use __getnewargs__ to update the args of classes that use __new__ when unpickling. It's preferred because it does not change the __new__ behavior.	2024-02-18 10:28:37 -05:00
David Hou	3378625773	name upcast variables (#3200 ) * name upcast variables * typing * unused	2024-01-22 11:37:28 -05:00
chenyu	f018a55ea1	update NumNode.__hash__ to be hash(self.b) (#3105 ) with this, `a:=NumNode(x) == b` implies `hash(a) == hash(b)`	2024-01-12 19:46:21 -05:00
chenyu	74cc6fd3c2	remove AndNode.__floordiv__ special case (#2996 ) * remove AndNode.__floordiv__ AndNode produces a Node that min/max is bounded by [0, 1] so `//` on top of that is almost always 0. we don't really use that either * keep the test	2024-01-03 17:44:55 -05:00
chenyu	8291986959	Variable.sum -> Node.sum, Variable.ands -> Node.ands (#2961 )	2024-01-01 16:21:28 -05:00
chenyu	3d720b5761	move expand_idx, iter_idxs and expand_node from symbolic to linearizer (#2959 )	2024-01-01 14:41:21 -05:00
Umut Zengin	8ad7cfeeb1	More simplification in to_image_idx and symbolic (#2679 ) * less valid * add test --------- Co-authored-by: chenyu <chenyu@fastmail.com>	2023-12-13 12:30:44 -05:00
George Hotz	35b5e95097	parallel beam search (#2610 ) * better print * fix beam search with vars * cleanups * parallel is not default * restore that * bugfix * cleanups * bugfix	2023-12-05 10:09:45 -08:00
Amrit Sahu	e8d6a6ef2e	view.reshape without symbolic (#2218 ) * handle reshape of contiguous subparts with explicit mask * remove the add/remove ones logic in reshape * accomodate ones in accumulate logic * make multiply commutative * fix linting * make mypy happy * add test for commutative mul * merge dimensions in shape_strides for 1 range masks * add offsets for merging * fix linting * add back explicit 1 reshapes * fix mypy errors * fix accumulate by includng state * include non-zero stride dimension in acc * small cleanup * more compact to_shape_strides * more logical cleanup * compress more * compress reshape mask * adding some comments * small bug fix * improve test coverage * remove explicit add remove ones * small bug in test * enable test_reshape_splitting_combining * small fix * 10 lines less to_shape_strides * shorten reshape mask * some more cleanup * more cleanup * introduce some symbols for compactness * more symbols * more cleaner * lessen symbols, it became less readable * remove merge_views from view.reshape * change to_shape_strides to _merge_dims * improve readability * fix corner case * cleanup * better handling of 1 <= Variable('i',1,10) & new_dim = Variable('i',1,10) * rewrite _reshape_mask for readability * fix white space * add comment * nice shorthands for readability * add proof in docs * small nit --------- Co-authored-by: chenyu <chenyu@fastmail.com>	2023-12-04 12:46:53 -05:00
chenyu	847f0a02b1	non-simplifiable mod should result in ModNode (#2490 ) * non-simplifiable mod should result in ModNode * space	2023-11-28 16:52:19 -05:00
Christopher Mauri Milan	7f01dd04f0	Apply ruff linting rules to tests (#2473 ) * everything except F821 * enable F821 with noqa * dumb fix * fix remaining imports and (former) lambdas * replace _ with noqa to avoid gc	2023-11-27 21:24:06 -08:00
Paul Gustafson	98cd9e8926	Add assertion to prevent nonsense mod values (#2474 )	2023-11-27 18:37:44 -08:00
chenyu	61a80a0675	asserts LtNodes of SumNode with MulNode of Nodes (#2465 )	2023-11-27 12:56:59 -05:00
Paul Gustafson	1d89c018fa	Add isinstance check before gcd call in SumNode.__lt__ (#2450 ) * Add isinstance check before gcd call * Delete blank lines * Fix unit test typo * Delete blank lines again --------- Co-authored-by: Paul Gustafson <paul.gustafson@theambrusgroup.com>	2023-11-26 13:05:04 -08:00
chenyu	d7d078c7f9	Node.vars() returns a set and properly dedup (#2356 ) * dedup RedNode.vars() * vars returns a set * fix more vars * unused import * update to_movement_ops * comment	2023-11-18 17:44:52 -05:00
chenyu	f02e17a967	Variable.num -> NumNode (#2354 )	2023-11-18 15:45:52 -05:00
Umut Zengin	01b98b7f42	MulNode.__lt__ rule (#2086 ) * Added the rule * Added tests * flake8 * self.b == -1 shortcut	2023-10-17 13:18:35 -07:00
Umut Zengin	776605f2fc	O(1) VALIDHACKS (#2072 ) * first refactoring * O(1) validhacks * O(1) validhacks * Some cleaning * mypy * flake8 * Trim trim * flake8 * clean * less chaotic * less chaotic * flake8 * Symbolic, SumNode include mulnode for gcd * fix tests * smal optim * revert * clean * clean * flake8 * small fix * Add symbolic test	2023-10-15 11:26:41 -07:00
Umut Zengin	6b7ac5c431	ModNode __mod__ rule (#2039 ) * Implement mod rule * mypy * feat: New test added	2023-10-12 11:30:10 -07:00
Umut Zengin	3987280daf	Fix VALIDHACKS for Images and make it default (#1832 ) * valid hacks * valid hacks * valid hacks * new method * new method * handtune * is gate load breaking? * lint ruff less junk new approach? maybe this? * Make it more clear * Make it more clear * Will deal with the linter later * hack for linter * subs the idx but dont touch the valid * Updated the mod rules * lint hack * I believe bug fix lets see * Mod Node left * revert * Maybe this wont break? * revert * implemented "handtuned garbage" * revert and use VALIDHACKS * Lets see the CI * still broken? * currently its jungle * maybe this jungle ? * This works for everything somehow * Added test for symbolic * lint * final touch * This still works * lint * midway clean * less garbage * lint * final form * Slow but working way * lint and other stuff * lint * mypy * Make sure CI test Openpilot valid checks * test if CI break * Convert back * refactor * refactor * Managed to reduce openpilot time from 30 secs to 5 secs * Refactor * Substitute a node with variable * flake8 * Comment and refactor * More comprehensive mod * refactor * bug fix * More shave off * remove not sure part	2023-09-23 07:34:43 +08:00
chenyu	a5090f0ee9	remove NumNode.int() (#1876 )	2023-09-21 10:29:16 +08:00
David Hou	e74a6ca7e4	expand in terms of substitute (#1827 )	2023-09-09 14:43:00 -07:00
chenyu	f964b9e5ee	visitor pattern for sym_infer and unit tests (#1733 ) * visitor pattern for sym_infer and unit tests * comments	2023-09-01 09:47:45 -07:00
George Hotz	5c403d43b9	New >3 indexing (#1729 ) * move reindexing into linearizer * get_grouped_dims * don't limit for clang	2023-08-31 21:24:15 -07:00
Max Hahn	f9cb31fdc2	added visitor pattern (#1669 ) * added visitor pattern * pylint bug workaround * added tests, made abstract OpNode inherit from ABC * fixed assert * fix check of abstract classes in negative test * remove assert False	2023-08-30 09:03:44 -07:00
George Hotz	86a32ffb1a	lt sum (#1617 )	2023-08-21 21:19:16 -07:00
chenyu	be50b2fe8f	more symbolic symbolic ops (#1564 ) * more symbolic symbolic ops * handle NumNode in __mul__	2023-08-18 09:21:41 -07:00
chenyu	11dd9b1741	symbolic codegen and exec (#1552 ) * symbolic codegen and exec * fix and add test * no sketchy * merge_dicts type * dtypes._arg_int32	2023-08-16 14:43:41 -07:00
chenyu	a89142e46f	ShapeTracker.var_vals (#1540 )	2023-08-14 18:53:37 -07:00
chenyu	3e0c2d256f	symbolic shapetracker (#1506 ) * symbolic shapetracker * no need * keep only symbolic and clean up * explicit // and % Node support * NumNode * Node	2023-08-12 12:22:58 -07:00
chenyu	34f348643b	Support constant expand to symbolic shape (#1411 )	2023-08-02 21:21:22 -07:00
chenyu	18d0a93f09	LazyBuffer.get_variable_buffers() (#1391 ) * LazyBudder.get_variable_buffers() * remove left_only, add ProdNode * no vars for OpNode.b * do not change symbolic vars, remove ProdNode	2023-08-02 09:01:35 -07:00
chenyu	32be39554c	Simplify symbolic.SumNode.__floordiv__ logic (#1220 )	2023-07-12 12:54:12 -07:00
George Hotz	d16c16ec28	new upcast works (#1066 ) * new upcast works * float4 try * fix unaligned float4 * disallow unaligned access * upcast dim * maybe good now * fix gpu half * vstore_half4 * fix deep image bugs * improve symbolic to fix issues * fix symbolic * cl test * this maybe * gcd of 1 is 1 * real fix for old python * improve fuzzer	2023-06-27 19:34:53 -07:00
George Hotz	c62c64f0b7	remove GeNode (#965 )	2023-06-09 21:48:56 -07:00
Rayan Hatout	8b2c2d6896	Optimizations in `symbolic.py` (#796 ) * optimizations in symbolic.py * fix infinite recursion when expanding sums * add test case to make sure NumNodes are hoisted up in cases where MulNodes cancel eachother out	2023-05-26 12:59:53 -07:00
George Hotz	8b7ecd63bb	Remove Zeroview (#748 ) * no zeroview start * closer * stride mask * st tests pass, delete ZeroView * byebye zv * close to working * not contiguous with mask * subtract, don't add * mask on view * ugh, that shouldn't have been in there * shape merge * bugfixes * fuzzer + 4 fuzzer failures * fuzzer for symbolic * more fuzzing and nothing * that fuzzer doesn't hit either * fixes padding...ugh * no more offsets * working * rewrite load and store * all checks * fix idxs * progress * bugfix * float4_axis * works * cleanups * complex valids_okay	2023-04-17 08:21:46 -07:00
George Hotz	5495c7d64e	linearizer! (#714 ) * linearizer outputs something * working ish * cstyle codegen * clang mostly works * fix load valid * fix numberless loop * fancy gen * working * fix enet compiler * cleanups * float4 upcasting * less lines * supports_float4 * constant folding * mulacc * internet tests flaky in CI * 90% image support * fix image generic * bugs exposed with shapetracker and single view * new llvm * use vload, remove OLD * that's really poorly done * ending up being more lines	2023-03-19 23:43:49 -07:00
George Hotz	f5467cfedc	Devicebufferless (#708 ) * runs one metal kernel * conv2d works * ops tests are passing * const folding * all ops work * pre commit always passes * torch works * working still * fix graph test * tests passing * image almost works * image conv works * most images * fix custom * fix assignment * fix compile enet * clean up comments * fix realize return value * include shapetracker in LB repr * copy should make a copy * reenable method cache * fix lna * dtypes in graph * forward only for IMAGE=2 * simple realize * getting close * fixup new api, it's good except the kernel count * back to 197 kernels * tests should pass * go to a real float * no type_on_cpu * fix the docs * put shapetracker back in it's proper place	2023-03-18 14:40:23 -07:00
George Hotz	c594a0a835	fix flip bug, add new unit tests	2023-03-12 23:55:31 -07:00
George Hotz	7a1d96fd76	No negative (#632 ) * behavior is correct without VALIDHACKS * simple div and mod * fix tests * no negative variables * alt form is correct * still correct * bug in mulnode * at least validhacks works now * cleanups * test validhacks, and to_image_idx * cache compare key * tests and __neg__	2023-03-03 16:48:14 -08:00

1 2

51 Commits