* Refactor AttnBlock, CrossAttention, CLIPAttention to share code
* Reshape and transpose in loop
* Bugfix on attention mask
Co-authored-by: Jacky Lee <39754370+jla524@users.noreply.github.com>
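The refactor above folds AttnBlock, CrossAttention, and CLIPAttention onto one attention core. Below is a minimal NumPy sketch of such a core with an additive mask; the function name and interface are illustrative assumptions, not tinygrad's actual API:

```python
import numpy as np

def scaled_dot_product_attention(q, k, v, mask=None):
    """Illustrative shared attention core (not tinygrad's real API).

    q, k, v: (batch, heads, seq, head_dim). mask broadcasts over the
    (seq, seq) score matrix; 0 keeps a position, -inf masks it out.
    """
    scores = q @ k.transpose(0, 1, 3, 2) / np.sqrt(q.shape[-1])
    if mask is not None:
        scores = scores + mask  # additive mask
    # numerically stable softmax over the last axis
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v
```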
* add stable diffusion and llama
* pretty in CI
* CI was not true
* that
* CI=true, wtf
* PYTHONPATH
* debug=1
* oops, wrong place
* uops test broken for wgpu
* wgpu tests flaky
* third try at torch loading
* numpy fixed
* fix enet compile
* load_single_weight supports empty weights
* oops, CPU wasn't the default
* so many bugs
* Fix examples
* Remove training in parameters
* Simplify a bit
* Remove extra import
* Fix linter errors
* factor out Device
* NumPy-like semantics for Tensor.__getitem__ (#506)
* Rewrote Tensor.__getitem__ to fix negative indices and add support for np.newaxis/None
* Fixed pad2d
* mypy doesn't know about mlops methods
* normal Python behavior for out-of-bounds slicing
* type: ignore
* inlined idxfix
* added comment for __getitem__
* Better comments, better tests, and fixed bug in np.newaxis
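For reference, the NumPy semantics the rewritten `Tensor.__getitem__` mirrors, demonstrated with NumPy itself (tinygrad-specific details omitted):

```python
import numpy as np

x = np.arange(12).reshape(3, 4)

print(x[-1])              # negative index counts from the end: [ 8  9 10 11]
print(x[None].shape)      # None / np.newaxis inserts a new axis: (1, 3, 4)
print(x[:, 1:100].shape)  # out-of-bounds slice clamps, normal Python behavior: (3, 3)
```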
* update cpu and torch to hold buffers (#542)
* update cpu and torch to hold buffers
* save lines, and probably faster
* Mypy fun (#541)
* mypy fun
* things are just faster
* running fast
* mypy is fast
* compile.sh
* no gpu hack
* refactor ops_cpu and ops_torch to not subclass
* make weak buffer work
* tensor works
* fix failing test
* cpu/torch cleanups
* no `|` (or) operator on dict in Python 3.8 (see the compat sketch after this list)
* that was junk
* fix warnings
* comment and touchup
* dynamically add math ops
* refactor ops_cpu and ops_torch to not share code
* nn/optim.py compiles now
* Reorder imports
* call mkdir only if directory doesn't exist (see the compat sketch below)
---------
Co-authored-by: George Hotz <geohot@gmail.com>
Co-authored-by: Mitchell Goff <mitchellgoffpc@gmail.com>
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
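Two of the fixes above in miniature, as a hedged sketch with an invented path. The dict `|` operator only landed in Python 3.9 (PEP 584), and `os.makedirs(..., exist_ok=True)` covers the "mkdir only if missing" case without a pre-check race:

```python
import os

a, b = {"x": 1}, {"y": 2}
merged = {**a, **b}  # Python 3.8-safe merge; `a | b` needs 3.9+ (PEP 584)

os.makedirs("/tmp/example_cache", exist_ok=True)  # hypothetical path; no existence check needed
```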
* Rename Normalize and move to nn
* Fix comparison to None error
* Add test for GroupNorm
* Rename test case
* Flip parameters to match PyTorch
* Increase error tolerance
* Fix elementwise_affine on channels
* Match arguments with PyTorch
* Initialize weight and bias only when affine is true
* Is this it?
* A bit cleaner
* Handle case where weight or bias is None
* does this work?
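The shape these commits converge on matches PyTorch's `nn.GroupNorm(num_groups, num_channels, eps=1e-5, affine=True)`, with weight and bias left as None when affine is false. A minimal NumPy sketch of the computation, not the actual tinygrad module:

```python
import numpy as np

def group_norm(x, num_groups, weight=None, bias=None, eps=1e-5):
    """Sketch of GroupNorm on (N, C, H, W); weight/bias are per-channel
    and may be None (the affine=False case)."""
    n, c, h, w = x.shape
    g = x.reshape(n, num_groups, c // num_groups, h, w)
    mean = g.mean(axis=(2, 3, 4), keepdims=True)
    var = g.var(axis=(2, 3, 4), keepdims=True)
    x = ((g - mean) / np.sqrt(var + eps)).reshape(n, c, h, w)
    if weight is not None: x = x * weight.reshape(1, c, 1, 1)
    if bias is not None: x = x + bias.reshape(1, c, 1, 1)
    return x
```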
* glorot uniform
* requires_grad broke
* propagate the None correctly
* so this weight init works
* ahh, I think it's this
* can't beat this
* glorot is best for the autoencoder
* remove comments
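Glorot (Xavier) uniform draws from U(-limit, limit) with limit = sqrt(6 / (fan_in + fan_out)); a sketch of the init these commits settle on, function name illustrative:

```python
import numpy as np

def glorot_uniform(fan_in, fan_out):
    # Xavier/Glorot uniform: U(-limit, limit), limit = sqrt(6 / (fan_in + fan_out))
    limit = np.sqrt(6.0 / (fan_in + fan_out))
    return np.random.uniform(-limit, limit, size=(fan_in, fan_out)).astype(np.float32)
```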
* Added standalone CLIP tokenizer.
* Fixed empty phrase.
* Truncating long prompts.
* Keeping two slots for the start and end token.
* Fixed empty phrase.
* Using tokenizer for empty phrase.
* Typo.
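The truncation rule above, reserving two slots for the start and end tokens, in sketch form. The 77-token context length and the 49406/49407 start/end IDs are CLIP's; the helper itself and its padding choice are illustrative assumptions:

```python
def pack_tokens(token_ids, max_length=77, bos=49406, eos=49407):
    # truncate long prompts, keeping two slots for <start> and <end>;
    # an empty phrase reduces to [bos, eos] plus padding
    ids = [bos] + list(token_ids)[: max_length - 2] + [eos]
    return ids + [eos] * (max_length - len(ids))  # pad token varies by impl; eos assumed here
```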