tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-01-07 22:23:55 -05:00

Author	SHA1	Message	Date
George Hotz	8a04a3a77a	rename LazyBuffer -> UOp [pr] (#8169 ) * rename LazyBuffer -> UOp [pr] * fix docs	2024-12-11 16:15:52 -08:00
geohotstan	0a2e10be1d	add SELU to Tensor (#7993 ) * add selu * more clean ups	2024-12-02 10:04:01 -05:00
nimlgen	10f431b96d	hcq replace update with sint (#7899 ) * try sym hcq * start with amd * move to nv * nv works * cache and qcom * fixes * signals * fix nv * qcom fixes * linter * linter * cache + typings * fixes * tiny fixes * linter * linter * lntr * ugh * comments	2024-11-29 20:08:13 +03:00
geohotstan	cea5853cfa	add Tensor.scatter (#7737 ) * working I think * where are my onnx scatter tests?? * forward_only for now * try if nan hack fix NV * looks like issue is different... CUDA WHY * oops that was wrong. Try if this fixes CUDA * simpler multiply * actually finish this up tmrw morning :x * fix tests? * improve tests * improve test and implementation * fix ruff * complete but lots of expected failure... * reviewed tests * add onnx tests * is this a processing op? * add return type to indicate that it's not in-place * final cleanups * use or and improve tests a little * add masked_index_select * call it masked_setitem instead * try * FIXED --------- Co-authored-by: chenyu <chenyu@fastmail.com>	2024-11-27 10:52:04 -05:00
chenyu	3b26e51fce	Tensor.cummax (#7854 ) generalized the existing cumsum and take Ops.MAX in addition to Ops.ADD	2024-11-22 15:55:02 -05:00
geohotstan	cf1ec90ad4	add inverse trig functions to Tensor (#7805 ) * implement inverse trig functions * guess we should still test nans? * magnitude as variable name :D * reorder onnx_ops ops * approximation -> x for consistency * address feedback * simpler acos * improvement? * actually just have asin depend on atan * actually this is nicer * remove a comment --------- Co-authored-by: chenyu <chenyu@fastmail.com>	2024-11-21 09:13:36 -05:00
George Hotz	9df5a62c5e	unify to HWQueue [pr] (#7812 ) * unify to HWCommandQueue [pr] * all is HWQueue	2024-11-21 10:33:08 +08:00
George Hotz	d71fe7faa5	rename allocator methods to not conflict [pr] (#7788 ) * rename allocator methods to not conflict [pr] * forgot those * transfer + offset	2024-11-20 00:10:29 +08:00
geohotstan	72a41095bc	add Tensor.meshgrid (#7714 ) * initial implementation and test * some other places that can use meshgrid * revert the onnx_ops change * add to docs * revert interpolate too * update * improve edge case test * might as well test grad * add to test can improve docs --------- Co-authored-by: chenyu <chenyu@fastmail.com>	2024-11-16 23:06:47 -05:00
ignaciosica	597a239e28	Remove UnaryOps, BinaryOps, TernaryOps, MetaOps [pr] (#7725 ) * remove unaryops * remove ternaryops * remove metaops * hotfix * remove binaryops * hotfix: test_pattern_matcher --------- Co-authored-by: qazal <77887910+Qazalin@users.noreply.github.com>	2024-11-16 20:56:56 +08:00
geohotstan	f8056a74d6	combine pad2d with pad (#7677 ) * I have pad2d, I have pad, uuh~, pad2dpad~ * fix some small things * strategically placed cast hack * fix more * fix more more * tests * periods	2024-11-14 17:56:02 +08:00
chenyu	51afc3cc88	update env_vars doc on VIZ link (#7689 ) existing one throws 404 because mkdocs does not allow traverse above doc root (i think?). so for now just stick the github link to it	2024-11-13 17:28:14 -05:00
geohotstan	9c41c376d3	add Tensor.nll_loss (#7683 ) * move nll_loss to new branch * make nll_loss examples practical * self is * add to docs * small	2024-11-13 13:12:13 -05:00
geohotstan	5eef59d732	add Tensor.linspace (#7609 ) * add linspace * shave off tests and forgot to add to docs crap * WHOOPS * better tests	2024-11-12 10:29:36 +08:00
Kinvert	6a0ed46b1c	adding viz to env_vars docs (#7630 )	2024-11-11 21:28:27 +08:00
George Hotz	c8bf09b7d4	s/UOps/Ops (#7500 ) * s/UOps/Ops [pr] * fix	2024-11-03 11:26:10 +08:00
geohotstan	585f3a0f24	Add isinf and isnan ops to Tensor (#7484 ) * move isinf and isnan to new branch * sneak a roll documentation fix in * add to docs * update test coverage for detect_positive and detect_negative * add types to isinf args	2024-11-02 12:12:52 -04:00
geohotstan	6513690223	Add Tensor.hardsigmoid (#7433 ) * move hardsigmoid to new branch * add to test * add NOTE to mention differing values for alpha and beta that match torch * shift from relu6 * correct shift implementation * or we just use relu? no more 666	2024-11-01 08:36:52 -04:00
chenyu	fb694a63eb	Tensor.erf (#7419 ) the same one used in onnx and the one in bert.	2024-10-30 18:12:28 -04:00
vinzentbeer	573a848229	fix small typo (#7399 ) "We use with Tensor.train() set the internal flag" -> "We use with Tensor.train() to set the internal flag"	2024-10-30 19:20:28 +08:00
chenyu	96fcc47e27	touchup abstraction docs (#7327 ) fix typing and use tinygrad tqdm	2024-10-27 22:29:55 -04:00
George Hotz	4438d6a467	Tensor.from_url API [pr] (#7210 ) * Tensor.fetch API [pr] * update docs * from_url	2024-10-22 14:54:17 +08:00
George Hotz	ded1b38b84	minor dtype cleanup [pr] (#7124 ) * minor dtype cleanup [pr] * use ptr() function	2024-10-17 17:41:23 +08:00
George Hotz	3169cb386d	remove graph [pr] (#7085 )	2024-10-16 11:40:07 +08:00
Harsh Natuskar	ace834ef7b	=docs update (#7027 )	2024-10-13 19:39:06 +08:00
jeffzh4ng	19a7e41113	implement logcumsumexp (#6921 ) * implement logcumsumexp * change axis=None to axis=0	2024-10-06 10:45:36 -04:00
George Hotz	f588169fdc	hotfix: ad for DEBUG=2 in the mnist tutorial	2024-10-06 21:05:48 +08:00
George Hotz	4df5c7a4ef	move lazy to engine [pr] (#6886 ) * move lazy to engine [pr] * engine.lazy	2024-10-04 23:19:26 +08:00
George Hotz	6b063450df	move hcq device to runtime [pr] (#6879 ) * things that are only used in one place don't belong in helpers [pr] * start moving hcq device [pr] * fix paths	2024-10-04 22:26:50 +08:00
nimlgen	3c56aeee70	add Tensor.from_blob (#6765 ) * draft tensor from pointer init * some docs and types * comment * cleaner * test * malloc * qcom cl interop * jit example * cleaner * dealoc * wording * docs	2024-09-26 18:33:19 +08:00
George Hotz	e015b41ce9	remove e( function just alu( [run_process_replay] (#6589 ) * remove e( function just alu( [run_process_replay] * missed two	2024-09-19 10:24:02 +08:00
George Hotz	bdd0c06f29	add void type to uop (#6471 ) * unwrap_dtype maybe * uopgraph stuff that hardcoded None * test_ops passes * dtypes.py fixups * update test_linearizer and friends * more ast updates * test_beam and test_schedule too * add void type to uop [run_process_replay] * remove dumb casts * start making it green * more cast cleanups * more cls methods to fix * regenerate dataset * split UOp and NOp const * maybe that too * fix docs * update test_uop_symbolic * test_verify_ast * new sops with no diff * meh, type_ignore is alright * remove that assert --------- Co-authored-by: qazal <qazal.software@gmail.com>	2024-09-11 18:16:28 +08:00
Obada Khalili	0fbd141038	tinygrad Tensor Puzzles (#6315 ) * Update index.md * update readme * Revert "update readme" This reverts commit `8415a8e90c`. * update readme * remove mention * update index.md	2024-09-09 09:32:38 +08:00
nimlgen	bf645d62b3	qcom docs (#6338 )	2024-09-02 20:42:20 +03:00
nimlgen	9b616cb33e	HCQArgsState lifetime docs (#6323 )	2024-08-30 00:31:49 +03:00
qazal	8c50ef8b7c	start uop docs (#6291 ) * start uop docs * only need show_labels * sink comes first * hotfix: invalid * touchups * 2 space indent works * limit some buffer uops * better BARRIER doc, Op -> UOp when it makes sense. * make KernelInfo optional * more work relative links don't work * this can be local in multi reduce+pads * add UOps.SHAPETRACKER details * UOps.CONST both types * nit: local buffer isn't device Buffer, habit * nit2: dtype -> DType	2024-08-29 15:22:39 +03:00
wozeparrot	ea5b7910b7	AMD support gfx103x (#5926 )	2024-08-28 14:17:08 -07:00
George Hotz	5ed6c6ef3e	hotfix: 220V 15A -> 220V 20A	2024-08-27 10:20:43 -07:00
wozeparrot	a7bf20c7cd	feat: updated tinybox docs (#6261 ) * feat: updated tinybox docs * fix: grammar	2024-08-23 18:27:46 -07:00
chenyu	590c0922b6	Tensor.prod (#6250 ) * Tensor.prod a new reduce op! * onnx ReduceProd	2024-08-23 10:06:32 -04:00
Alessandro Benetti	9328248610	support for std_mean and cross_entropy (#6181 ) * support for std_mean and cross_entropy (#3) * Cross entropy and std mean support * remove extra examples	2024-08-19 12:06:44 -07:00
George Hotz	9bc81c6db4	UOps.SHAPETRACKER (#6129 ) * UOps.SHAPETRACKER [run_process_replay] * no process replay	2024-08-16 23:26:34 -07:00
George Hotz	89c7989659	no shapetracker in ops [run_process_replay] (#6117 )	2024-08-16 17:23:27 -07:00
George Hotz	74ee9febec	remove iter from uopgraph (#6110 ) * remove iter from uopgraph * linearize returns uops * fix tests * linearize in linearize * tests fix * touchup * test failures	2024-08-16 15:58:29 -07:00
qazal	28c75bf2a6	merge uops with ops (#6111 ) Co-authored-by: chenyu <chenyu@fastmail.com>	2024-08-16 18:17:57 -04:00
qazal	c23d44c779	AST is UOp (#6030 ) * most of the work from the uops2 branch * schedule * realize * kernel * lowerer * search * green * merge uops with ops * Revert "merge uops with ops" This reverts commit `1408a59f12`. * fix benchmark * remove extra dedup	2024-08-16 22:09:00 +03:00
George Hotz	64563abc90	add LSTMCell to nn (#6080 ) * add LSTMCell to nn * lstmcell works with no input on first * fix no bias 0 * simpler	2024-08-14 12:08:42 -07:00
George Hotz	97c3563109	hotfix: clamp in docs	2024-08-13 16:06:30 -07:00
nimlgen	fa84e6ec48	init hcq args state (#6046 ) * init hcq args state * cleaner * amd * fillargs * fixes * myoy * docs * fix * not needed * spacing	2024-08-13 17:11:58 +03:00
chenyu	d82370f6ef	docs: fix broken links and update is_floating_point (#6023 ) * docs: fix broken links and update is_floating_point broken links would only show as INFO and not an error. * make doc andhors warn	2024-08-10 15:58:48 -04:00

1 2 3 4 5 ...

357 Commits