tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-01-10 23:48:01 -05:00

Author	SHA1	Message	Date
George Hotz	0615dcffe7	compile_tensorflow: save the weights	2023-02-22 21:05:45 -08:00
George Hotz	c537fd0614	compile_tensorflow: add initialize and tests	2023-02-22 20:50:53 -08:00
George Hotz	dc914cde50	compile_tensorflow	2023-02-22 20:08:58 -08:00
George Hotz	76b4d0577d	yolov8 works up to the MaxPool	2023-02-22 19:32:13 -08:00
Mischa Untaga	14bb2c40a2	Fix yolov3 example (#577 )	2023-02-21 09:24:00 -08:00
George Hotz	d9fa47ecc9	use the TinyJit in the efficientnet runner, 200ms -> 20ms	2023-02-20 19:58:16 -08:00
George Hotz	714bf4b108	clang backend (#572 ) * start clang backend * mostly working * no group for reduce w clang * it compiles * compiles * a11y * minor fixups * formatting * add a test * rename test	2023-02-20 18:18:18 -08:00
Jacky Lee	cb679cd051	Fix weight initialization (#566 ) * Fix weight initialization * Use scaled_uniform in serious_mnist	2023-02-19 11:25:29 -08:00
Kirill	7944cfdadc	Remove Tensor.data (#565 )	2023-02-18 16:36:12 -08:00
Jacky Lee	7e8b0305f3	Fix mnist gan example (#563 )	2023-02-18 13:45:37 -08:00
Jacky Lee	9fd41632c6	Import get_parameters from tinygrad.nn (#559 ) * get_parameter is in optim * Update all imports for get_parameters * Clean up * use optim.get_paramters	2023-02-17 15:22:26 -08:00
Jacky Lee	e172f0087a	BatchNorm2D -> BatchNorm2d (#558 ) * BatchNorm2D -> BatchNorm2d * Fix typo	2023-02-16 12:31:49 -08:00
Jacky Lee	c35fcc6964	Replace phrase for prompt (#555 )	2023-02-12 09:04:44 -08:00
George Hotz	191c76cfd7	hlb_cifar10 torch version	2023-02-11 18:04:40 -08:00
George Hotz	9057d98d36	no lr decay in cifar. test this in torch tomorrow	2023-02-11 17:42:54 -08:00
George Hotz	dd7accb9cc	decay LR, little bugfix	2023-02-11 17:34:15 -08:00
George Hotz	ba3bf5bdf7	cifar stops learning	2023-02-11 17:21:42 -08:00
George Hotz	7d33f2d659	CL.CACHE is over, GlobalCounters.cache is it	2023-02-11 12:00:14 -08:00
George Hotz	9152bb5b4a	momentum support in SGD	2023-02-11 10:22:37 -08:00
George Hotz	031edd01e6	switch openpilot compile to TinyJit	2023-02-11 09:51:44 -08:00
jspieler	8f912c3966	added deep deterministic policy gradient example (#531 )	2023-02-11 10:10:46 -06:00
George Hotz	608fd730d3	put the JIT in extra	2023-02-11 00:35:18 -06:00
George Hotz	ed8ae7522a	tinyjit	2023-02-11 00:22:36 -06:00
George Hotz	4c90a15689	make the fake data actually learnable	2023-02-10 23:35:21 -06:00
George Hotz	07629d7476	fakedata and move to new cache	2023-02-10 23:32:31 -06:00
George Hotz	63fa7daf30	wrong place for CL	2023-02-10 23:22:24 -06:00
George Hotz	fed95119dc	CL.mem_used -> GlobalCounters.mem_used	2023-02-10 23:13:29 -06:00
Kirill	27154db99a	Downloads weights in examples/stable_diffusion.py (#537 ) * Downloads weights in examples/stable_diffusion.py * use download_file_if_not_exists in fetch * make consistent with previous NOCACHE behavior	2023-02-10 14:37:04 -06:00
Jacky Lee	f08187526f	Fix examples (#540 ) * Fix examples * Remove training in parameters * Simplify a bit * Remove extra import * Fix linter errors * factor out Device * NumPy-like semantics for Tensor.__getitem__ (#506) * Rewrote Tensor.__getitem__ to fix negative indices and add support for np.newaxis/None * Fixed pad2d * mypy doesn't know about mlops methods * normal python behavior for out-of-bounds slicing * type: ignore * inlined idxfix * added comment for __getitem__ * Better comments, better tests, and fixed bug in np.newaxis * update cpu and torch to hold buffers (#542) * update cpu and torch to hold buffers * save lines, and probably faster * Mypy fun (#541) * mypy fun * things are just faster * running fast * mypy is fast * compile.sh * no gpu hack * refactor ops_cpu and ops_torch to not subclass * make weak buffer work * tensor works * fix test failing * cpu/torch cleanups * no or operator on dict in python 3.8 * that was junk * fix warnings * comment and touchup * dyn add of math ops * refactor ops_cpu and ops_torch to not share code * nn/optim.py compiles now * Reorder imports * call mkdir only if directory doesn't exist --------- Co-authored-by: George Hotz <geohot@gmail.com> Co-authored-by: Mitchell Goff <mitchellgoffpc@gmail.com> Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>	2023-02-10 12:09:37 -06:00
George Hotz	a5a55ac19e	GlobalCounters cache + assign in optim	2023-02-08 17:10:55 -06:00
George Hotz	3d63934995	refactor to keep cl in the runtime (#545 ) * refactor to keep cl in the runtime * fix thneed, rename cl to _cl * bugfix + _cuda * fix tests * thneed more correct	2023-02-08 16:46:09 -06:00
George Hotz	2844482a60	Mypy fun (#541 ) * mypy fun * things are just faster * running fast * mypy is fast * compile.sh * no gpu hack * refactor ops_cpu and ops_torch to not subclass * make weak buffer work * tensor works * fix test failing * cpu/torch cleanups * no or operator on dict in python 3.8 * that was junk * fix warnings * comment and touchup	2023-02-08 09:56:51 -06:00
George Hotz	f7291f6ca3	fixes big KOPT, breaks opencl (#505 ) * fixes big KOPT, breaks opencl * fix optimizer * KernelCache * oops, broke batchnorm * hack to fix it * fix llvm, less hacky gpu * disable the cache * cache just breaks things	2023-02-05 10:46:17 -08:00
James Roberts	db0a9b0a2d	Refactor CL.time_sum into GlobalCounters (#519 )	2023-02-01 20:13:56 -08:00
George Hotz	5e37f084db	stable diffusion: clean up constant folding	2023-02-01 12:53:16 -08:00
Jacky Lee	486f023e81	Rename Normalize and move to nn (#513 ) * Rename Normalize and move to nn * Match PyTorch for dim>1	2023-02-01 11:55:03 -08:00
Jacky Lee	799b3f185a	Refactor getenv into helpers (#508 ) * Refactor getenv into helpers * Remove unused os * Fix default value * Fix more defaults for CI * Fix bracket * Revert changes to openpilot/compile.py * Use getenv from helpers when possible	2023-01-31 15:09:09 -08:00
George Hotz	21f2af08d5	getenv + graphing	2023-01-30 19:15:03 -08:00
George Hotz	60ccddb58b	reenable SWAP	2023-01-30 17:32:02 -08:00
George Hotz	aea55eb196	found failing upcast	2023-01-30 16:12:56 -08:00
George Hotz	7ee0d99c70	CLCACHE	2023-01-30 14:02:06 -08:00
George Hotz	cccfea4b25	factor out KOPT code	2023-01-30 13:13:55 -08:00
George Hotz	de2c419fd4	make_pair and first attempt at hlb_cifar10	2023-01-30 11:07:23 -08:00
AllentDan	7b6b1f32b1	[Fix] fix typo: test_mnist -> datasets (#492 ) * test_mnist -> datasets * fix mnist_gan	2023-01-29 21:30:47 -08:00
George Hotz	2db272c7f7	Kernel Optimizer (#489 ) * kernel optimizer * 10x faster, but wrong. not good deal * move test -> extra * print x speedup * clcache * fix clcache + DEBUG * GFLOPS estimate * i==3	2023-01-29 17:15:00 -08:00
George Hotz	66da3bc3c0	reset the benchmark timer	2023-01-25 09:20:34 -08:00
George Hotz	487685919b	Revert "Rename Normalize and move to nn (#415 )" (#474 ) This reverts commit `d768acb6a9`.	2023-01-25 07:50:04 -08:00
Jacky Lee	d768acb6a9	Rename Normalize and move to nn (#415 ) * Rename Normalize and move to nn * Fix comparison to None error * Add test for GroupNorm * Rename test case * Flip parameters to match PyTorch * Increase error tolerance * Fix elementwise_affine on channels * Match arguments with PyTorch * Initialize weight and bias only when affine is true * Is this it? * A bit cleaner * Handle case where weight or bias is None	2023-01-25 07:47:59 -08:00
George Hotz	6d7658db12	delete opencl <celebration>	2023-01-24 14:18:35 -08:00
nogira	2e744ef2f2	confirmed (#449 ) w/ a bunch of print statements in the official model here: `ce05de2819/ldm/modules/diffusionmodules/openaimodel.py (L413)`	2023-01-07 08:41:06 -08:00

... 19 20 21 22 23 ...

1235 Commits