* quick math: 0 + x = x.
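The `0 + x = x` identity can be exploited to skip the first add in a gradient accumulator, which is presumably the kind of shortcut this commit refers to; a minimal sketch under that assumption (`accumulate` is a hypothetical helper, not tinygrad's API):

```python
import numpy as np

# Skip the add while the accumulator is still the additive identity:
# 0 + x == x, so the first contribution can be taken as-is.
def accumulate(acc, x):
    return x if acc is None else acc + x

acc = None
for g in [np.ones(3), np.full(3, 2.0)]:
    acc = accumulate(acc, g)
print(acc)  # [3. 3. 3.]
```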
* gradient w.r.t. x using cherry for conv
* gradient w.r.t. w for conv on cherry, but using vector dot products
* small optimization
* [cherry] optimize conv backward pass for large channel count
* get rid of numpy einsum
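Dropping `np.einsum` is often a win because many einsum patterns are just reshapes around a plain matmul; a hedged illustration (shapes are made up, not tinygrad's actual tensors):

```python
import numpy as np

# "ijk,kl->ijl" is a batched matmul in disguise: flatten the batch
# dims, do one dot, and reshape back. This removes the einsum call
# and is typically at least as fast.
x = np.random.randn(4, 5, 6)
w = np.random.randn(6, 7)

out_einsum = np.einsum("ijk,kl->ijl", x, w)
out_matmul = x.reshape(-1, 6).dot(w).reshape(4, 5, 7)

assert np.allclose(out_einsum, out_matmul)
```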
* added resnets
* fix minor
* fix minor
* resnet in models
* added resnet test
* added resnet train test
* added linear, conv2d nn tests
* fix minor in extra/training
* resnet in models
* fix minor
* fix tolerance for linear in nn test
* fix eval; this was causing CPU and GPU unit tests to fail
* revert transformer test
* fix minor for CPU test
* improved model get_params for sequential layer
* fix minor for params counting
* commented broken ops tests
* improved train for resnet
* ops_risk
* risk sim
* guessing is for winners
* minor
* better
* matmul with risk
* conv doesn't work
* closer
* conv2d works
* ops_risk
* opt2 works
* opt1 may not be possible
* opt1 is a mulacc
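"opt1 is a mulacc" presumably means the op reduces to a fused multiply-accumulate, the core primitive of conv and matmul inner loops; a minimal sketch (not the actual RISK instruction semantics):

```python
# A multiply-accumulate (mulacc) fuses a multiply and an add:
# acc = acc + a * b. Dot products, and hence conv/matmul inner
# loops, are just chains of mulaccs.
def mulacc(acc, a, b):
    return acc + a * b

acc = 0
for a, b in zip([1, 2, 3], [4, 5, 6]):
    acc = mulacc(acc, a, b)
print(acc)  # 32
```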
* arty
* attosoc example building on mac
* minor
* riscv assembler
* gucci gang
* we got C code
* not a scam
* hello
* make risk mergeable into master
* unop support
* use isinstance, some optimizations & whitespace removal
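The switch to `isinstance` matters because it respects subclasses and accepts a tuple of types, unlike a direct `type()` comparison; a small illustration (class names are hypothetical):

```python
# isinstance handles subclasses and tuples of types; type() does not.
class Tensor: pass
class GPUTensor(Tensor): pass

t = GPUTensor()
assert isinstance(t, Tensor)          # subclass counts
assert type(t) is not Tensor          # type() misses the subclass
assert isinstance(1.5, (int, float))  # one check for several types
```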
* revert whitespace changes
* revert more whitespace
* some more cleanup
* revert fstring (not a fan of the {{}})
* fix typo
* fix typo
* vgg7 implementation - not the best, but it works
* VGG7 implementation: Spread nansbane to deter NaNs, maybe improved training experience
* VGG7 implementation: Fix training, for real this time
Results actually attempt to approximate the input
* VGG7 implementation: Sample probability management
* Split tests
Split tests into "Test CPU" and "Test GPU".
Add test flag "TEST_DEVICES", which is a comma-separated list of devices:
CPU,GPU,ANE
* Run tests based on provided TEST_DEVICES flag
By default it will run all of "CPU,GPU,ANE"
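Parsing such a flag can be sketched as below; this is a hypothetical reconstruction of the harness logic, not the actual test code:

```python
import os

ALL_DEVICES = ["CPU", "GPU", "ANE"]

def devices_to_test():
    # TEST_DEVICES unset -> run everything; otherwise run only the
    # devices named in the comma-separated list.
    flag = os.environ.get("TEST_DEVICES")
    if flag is None:
        return ALL_DEVICES
    requested = [d.strip().upper() for d in flag.split(",") if d.strip()]
    return [d for d in ALL_DEVICES if d in requested]

os.environ["TEST_DEVICES"] = "cpu, gpu"
print(devices_to_test())  # ['CPU', 'GPU']
```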
* fix bad quote
* Revert changes and use GPU=1
This is done by setting the default Tensor Device to Device.CPU unless
GPU=1 is set.
Run GPU tests: GPU=1 pytest -s -v
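The GPU=1 convention can be sketched as an environment-variable check that picks the default device; a hedged illustration (the `Device` class here is illustrative, not tinygrad's exact definition):

```python
import os

class Device:
    CPU, GPU = "CPU", "GPU"

def default_device():
    # CPU unless GPU=1 is present in the environment.
    return Device.GPU if os.environ.get("GPU", "0") == "1" else Device.CPU

assert default_device() == Device.CPU
os.environ["GPU"] = "1"
assert default_device() == Device.GPU
```

With this scheme, `GPU=1 pytest -s -v` flips every test onto the GPU without touching the test code itself.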