Commit Graph

4667 Commits

Author SHA1 Message Date
George Hotz
85d17a2acd running resnet onnx 2022-06-11 13:17:15 -07:00
George Hotz
0225360191 fixed with one return x 2022-06-11 12:08:53 -07:00
George Hotz
db5a632e8c multicat + test onnx is generic onnx 2022-06-11 11:50:47 -07:00
George Hotz
a710b3a210 it's a real test now 2022-06-11 11:33:33 -07:00
George Hotz
8440dbfa5d support inputs 2022-06-11 11:21:45 -07:00
George Hotz
08de1aa636 add flatten to tinygrad 2022-06-11 11:15:16 -07:00
George Hotz
aee251cc41 op model test 2022-06-11 11:06:03 -07:00
George Hotz
d061ce8d5e add ELU support 2022-06-11 10:47:23 -07:00
George Hotz
8864b37333 fix torch convdw 2022-06-10 15:04:39 -07:00
George Hotz
aac1a9b419 this breaks tests 2022-06-10 12:20:42 -07:00
George Hotz
e01ed64d7c restore that naming 2022-06-09 08:38:34 -07:00
George Hotz
60a48455ad still over line count, maybe test pass 2022-06-08 09:51:28 -07:00
George Hotz
70561f3d90 way over the line limit 2022-06-08 09:36:31 -07:00
George Hotz
4f7ee235c5 not a real test now 2022-06-08 09:00:59 -07:00
George Hotz
ae33060dae early float4 stuff for binary 2022-06-08 08:59:54 -07:00
George Hotz
82f29b5dbf better GPU block 2022-06-08 08:01:04 -07:00
George Hotz
42ae78241e only run test on GPU 2022-06-08 07:54:40 -07:00
George Hotz
cdf4b5f142 opencl perf test 2022-06-08 07:49:08 -07:00
George Hotz
d8ee8a39ac sgd threestep graph is so pretty 2022-06-06 09:45:37 -07:00
George Hotz
c143c92828 adam threestep 2022-06-06 09:38:28 -07:00
George Hotz
d302049e53 don't use div 2022-06-06 09:25:31 -07:00
George Hotz
a1dff4061b minor cleanups 2022-06-06 08:14:52 -07:00
George Hotz
3dac8fa728 this fix the gc 2022-06-05 17:16:40 -07:00
George Hotz
0ee21ba115 add ViT test and car 2022-06-05 17:12:43 -07:00
George Hotz
1de75b67d5 fix bug in graph with use of id 2022-06-05 16:31:20 -07:00
George Hotz
f0fe37bd34 simpler graph demo 2022-06-05 12:40:12 -07:00
George Hotz
88de42fb6e document graph mode 2022-06-05 12:13:05 -07:00
George Hotz
845bb1fc34 bs 4 -> 2 in training test 2022-01-15 21:34:21 -08:00
George Hotz
c0d1254003 don't run unneeded grads 2022-01-15 21:32:13 -08:00
George Hotz
8ba3d1f803 fix bn test, affine is True 2022-01-15 19:52:15 -08:00
George Hotz
e28cdfb0cf clean up resnet 2021-11-30 16:14:54 -05:00
George Hotz
46bbbcf7f0 model touchups 2021-11-30 11:13:34 -05:00
George Hotz
bd21304e3c linear takes in weight and bias 2021-11-30 00:38:47 -05:00
George Hotz
de938c2d9d vit is now tested 2021-11-30 00:23:06 -05:00
George Hotz
58ed46963e fix broadcastdot 2021-11-29 18:54:57 -05:00
George Hotz
dca076dbf1 remove dumb nn ops 2021-11-29 18:05:31 -05:00
George Hotz
f909ab194f gelu with broken test 2021-11-29 15:00:50 -05:00
George Hotz
c752033283 fix GPU OOM in test 2021-11-29 13:05:59 -05:00
George Hotz
99b6051467 add ff_dim to transformer 2021-11-29 12:40:52 -05:00
George Hotz
29dee59368 cat: forward only not required 2021-11-29 00:14:56 -05:00
George Hotz
3cdc77f526 add cat support 2021-11-28 23:21:49 -05:00
George Hotz
ce3d198bb7 less lines and fix default device 2021-11-27 11:18:49 -05:00
George Hotz
7ae14179d3 refactor ops 2021-11-27 11:12:23 -05:00
George Hotz
c162e748f5 fix float64 warning on training 2021-10-30 20:07:31 -07:00
George Hotz
b0f14b4af8 move datasets into datasets 2021-10-30 19:55:50 -07:00
George Hotz
7472a7ebe2 not forcing 3.9 for a stupid type 2021-10-30 16:52:40 -07:00
George Hotz
fc6597a6d9 only resnet18, it's too slow otherwise 2021-10-30 16:48:39 -07:00
Evan Mays
285621aeda Cherry backprop for conv2d (#281)
* quick math: 0 + x = x.

* gradient w.r.t. x using cherry for conv

* gradient w.r.t. w for conv on cherry but doing vector dot products

* small optimization

* [cherry] optimize conv backpass for large channel count

* get rid of numpy einsum
2021-10-30 16:12:19 -07:00
Sebastian Kreft
8113eec4cf feat: add efficientnet test (#285)
Simple test using the Chicken example from https://upload.wikimedia.org/wikipedia/commons/4/41/Chicken.jpg and the image preprocessing from example/efficientnet.py

Note that EfficientNet loads the weights from the internet so running the tests may be slow the first time. We could speed up the tests by caching the /tmp folder.

Fixes #234
2021-10-30 15:53:51 -07:00
Guglielmo Camporese
2b7589db64 Added ResNet-{18, 34, 50, 101, 152} (#271)
* added resnets

* fix minor

* fix minor

* resnet in models

* added resnet test

* added resnet train test

* added linear, conv2d nn tests

* fix minor in extra/training

* resnet in models

* fix minor

* fix tolerance for linear in nn test

* fix eval, this causes cpu and gpu UT failing

* revert transformer test

* fix minor for CPU test

* improved model get_params for sequential layer

* fix minor for params counting

* commented broken ops tests

* improved train for resnet
2021-06-21 09:37:24 -07:00