tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-01-24 22:38:16 -05:00

Author	SHA1	Message	Date
George Hotz	8c849e637c	that was in there twice, DEBUG>=4 to see loop opt	2022-10-30 15:31:39 -07:00
George Hotz	cfdf803b52	fix llvm vectorization by add analysis passes from the target machine	2022-10-30 15:28:36 -07:00
George Hotz	2f602a92ff	seperate STRIDED and EXPAND	2022-10-30 13:23:58 -07:00
George Hotz	4b6097f81d	more amx notes	2022-10-29 14:04:10 -07:00
George Hotz	fdb43fe553	gemm is 1.7 TFLOPS on a single M1 core	2022-10-29 13:42:33 -07:00
George Hotz	52bfbc31be	vectorization	2022-10-29 12:47:52 -07:00
George Hotz	e473d35f90	llvm doesn't vectorize	2022-10-29 11:59:48 -07:00
George Hotz	86eb06eb76	accurate flop estimation	2022-10-28 19:13:20 -07:00
George Hotz	dd543fbc7a	MovementOps is unused	2022-10-28 18:26:08 -07:00
George Hotz	71b336503f	no RESHAPEs in the AST	2022-10-28 18:25:30 -07:00
George Hotz	b65b70812a	Exec AST (#404 ) * working exec ast * exec_ast is staticmethod * GenericExecAST * fold that sometimes * ExplicitExecAST * exec_ast for GPU * gpu working * get_lazyop_shape * now gpubuffer is ExplicitExecAST * dedup * add a type * RESHAPE in opencl code * fix linter * that too for linter * cleanups * remove dead code * GenericShape is less lines * add ALLOWED_KERNEL_COUNT to tests * fix mypy * that's gotta be recursive * fix opencl shape processing * remove unneeded lambda	2022-10-28 08:27:03 -07:00
George Hotz	6a15fd3844	LLVM Backend take 2 (#403 ) * take 2 llvm * get_lazybuffers -> get_buffers * llvm tests pass * fix type issues and refactor LLVM	2022-10-26 20:32:31 -07:00