George Hotz
29ae21bb0d
import tests from CL metal texture fix
2022-09-19 20:01:47 -04:00
George Hotz
a8aa1f9589
that's simpler
2022-09-18 20:40:46 -04:00
George Hotz
57e804a9bf
add min support
2022-09-18 20:39:41 -04:00
YassineYousfi
2f0f91ba3d
support float16 onnx weights ( #384 )
2022-09-15 09:12:18 -04:00
Comma Device
75f937227a
add barrier
2022-09-13 11:39:48 -04:00
George Hotz
3c3534736e
fix matmul kernel and tests
2022-09-13 08:31:04 -07:00
Comma Device
62e9419206
fix test failure on MATMUL=1 backward pass
2022-09-13 11:18:52 -04:00
Comma Device
3b82afc6a0
simple on device failing test
2022-09-13 10:59:15 -04:00
George Hotz
4efde1ba0a
test_matmul
2022-09-13 07:51:33 -07:00
George Hotz
894a7cee79
forgot a few
2022-09-12 09:21:46 -07:00
George Hotz
801ecd4a07
cleanup clip tokenizer
2022-09-12 09:20:12 -07:00
Fernand Pajot
ff0da4c802
Added standalone CLIP tokenizer ( #382 )
...
* Added standalone CLIP tokenizer.
* Fixed empty phrase.
* Truncating long prompts.
* Keeping two slots for the start and end token.
* Fixed empty phrase.
* Using tokenizer for empty phrase.
* Typo.
2022-09-12 09:12:55 -07:00
David Redmon
a1810c8617
update serious_mnist.py ( #380 )
2022-09-11 13:37:40 -07:00
George Hotz
ce348f0c92
Revert "change default opt to 2"
...
This reverts commit 726f4e98e9 .
2022-09-11 13:35:42 -07:00
George Hotz
726f4e98e9
change default opt to 2
2022-09-09 07:50:25 -07:00
YassineYousfi
1a7bdc51f8
support more onnx ops ( #376 )
...
* broadcast from right to left
* add another broadcasted add test
* more onnx ops
* use float32 range in clip
2022-09-07 15:15:24 -07:00
George Hotz
0b8c2221b5
relax mnist test a tiny bit
2022-09-07 07:52:05 -07:00
George Hotz
ecc1a0470d
add Linear to tinygrad.nn
2022-09-07 07:40:48 -07:00
George Hotz
d26bd73c1e
have to ignore that type
2022-09-07 07:24:27 -07:00
George Hotz
b7783565af
cpu line savings and cleaner
2022-09-06 21:24:22 -07:00
George Hotz
1c92a6da22
make gpu code readable
2022-09-06 21:17:36 -07:00
George Hotz
790af99a48
fix slice one multi, and linear can be simpler with new broadcasting
2022-09-06 19:51:33 -07:00
George Hotz
4f4ecbec97
add div to operators
2022-09-06 17:39:26 -07:00
George Hotz
5a76e652b8
simpler movement op
2022-09-06 17:27:33 -07:00
George Hotz
896f9f74a9
hmm, need this with broadcast change
2022-09-06 16:54:01 -07:00
George Hotz
a18a6a0773
fix sd with TORCH=1
2022-09-06 16:51:16 -07:00
YassineYousfi
5aad460c7a
broadcast from right to left ( #375 )
...
* broadcast from right to left
* add another broadcasted add test
2022-09-06 16:36:13 -07:00
George Hotz
0516359af8
fix stupid OPENCL=1 OOM
2022-09-06 14:29:23 -07:00
George Hotz
f215534a64
1100 lines, but sane linter rules
2022-09-06 13:47:45 -07:00
George Hotz
682dc64430
works at work
2022-09-06 08:06:11 -07:00
George Hotz
f683b26eef
bring back native exp log
2022-09-06 07:59:04 -07:00
George Hotz
d6f499fd69
improve opencl, why is it OOMing
2022-09-05 20:14:31 -07:00
George Hotz
0ba6179de7
stable diffusion in readme
2022-09-05 18:51:56 -07:00
George Hotz
c1d5af8b0c
stable diffusion cleanups
2022-09-05 18:34:13 -07:00
George Hotz
3728ef6d02
better alphas
2022-09-05 16:48:26 -07:00
George Hotz
0fda854b3e
other prompt example
2022-09-05 16:14:16 -07:00
George Hotz
16cb4290c4
cat horse winning ❗
2022-09-05 16:05:14 -07:00
George Hotz
1043fa067a
it renders something
2022-09-05 15:52:14 -07:00
George Hotz
5a685b93ac
brown img
2022-09-05 15:20:18 -07:00
George Hotz
98d6264987
all models match
2022-09-05 12:27:54 -07:00
George Hotz
b8bd34b5d2
fix last bug in unet probz
2022-09-05 11:32:44 -07:00
George Hotz
3df67aa0af
fix transformer bugs
2022-09-05 11:26:32 -07:00
George Hotz
2ed3bb6223
clip model is running
2022-09-05 11:26:32 -07:00
Ollin Boer Bohan
2c6f4e4c66
Make creation helpers use fp32 by default ( #374 )
...
* Make creation helpers use fp32 by default
half the big = twice the fast
* Fix flake8 with an extra multiply
2022-09-04 13:47:27 -07:00
George Hotz
1a54ea2417
runs on torch cpu
2022-09-04 12:06:42 -07:00
George Hotz
9590d92750
stable diffusion compiles (add no_init)
2022-09-04 11:40:50 -07:00
George Hotz
172683c314
work
2022-09-04 11:21:09 -07:00
George Hotz
bcb867cdd6
better idea for numbers, do the division in python
2022-09-03 16:23:39 -07:00
George Hotz
39e1d23c88
from_number_like to fix div issue
2022-09-03 16:19:16 -07:00
George Hotz
c2a030fe55
one liner that's more clear
2022-09-03 16:08:48 -07:00