George Hotz
85d17a2acd
running resnet onnx
2022-06-11 13:17:15 -07:00
George Hotz
0225360191
fixed with one return x
2022-06-11 12:08:53 -07:00
George Hotz
db5a632e8c
multicat + test onnx is generic onnx
2022-06-11 11:50:47 -07:00
George Hotz
a710b3a210
it's a real test now
2022-06-11 11:33:33 -07:00
George Hotz
8440dbfa5d
support inputs
2022-06-11 11:21:45 -07:00
George Hotz
08de1aa636
add flatten to tinygrad
2022-06-11 11:15:16 -07:00
George Hotz
aee251cc41
op model test
2022-06-11 11:06:03 -07:00
George Hotz
d061ce8d5e
add ELU support
2022-06-11 10:47:23 -07:00
George Hotz
8864b37333
fix torch convdw
2022-06-10 15:04:39 -07:00
George Hotz
aac1a9b419
this breaks tests
2022-06-10 12:20:42 -07:00
George Hotz
e01ed64d7c
restore that naming
2022-06-09 08:38:34 -07:00
George Hotz
60a48455ad
still over line count, maybe test pass
2022-06-08 09:51:28 -07:00
George Hotz
70561f3d90
way over the line limit
2022-06-08 09:36:31 -07:00
George Hotz
4f7ee235c5
not a real test now
2022-06-08 09:00:59 -07:00
George Hotz
ae33060dae
early float4 stuff for binary
2022-06-08 08:59:54 -07:00
George Hotz
82f29b5dbf
better GPU block
2022-06-08 08:01:04 -07:00
George Hotz
42ae78241e
only run test on GPU
2022-06-08 07:54:40 -07:00
George Hotz
cdf4b5f142
opencl perf test
2022-06-08 07:49:08 -07:00
George Hotz
d8ee8a39ac
sgd threestep graph is so pretty
2022-06-06 09:45:37 -07:00
George Hotz
c143c92828
adam threestep
2022-06-06 09:38:28 -07:00
George Hotz
d302049e53
don't use div
2022-06-06 09:25:31 -07:00
George Hotz
a1dff4061b
minor cleanups
2022-06-06 08:14:52 -07:00
George Hotz
3dac8fa728
this fix the gc
2022-06-05 17:16:40 -07:00
George Hotz
0ee21ba115
add ViT test and car
2022-06-05 17:12:43 -07:00
George Hotz
1de75b67d5
fix bug in graph with use of id
2022-06-05 16:31:20 -07:00
George Hotz
f0fe37bd34
simpler graph demo
2022-06-05 12:40:12 -07:00
George Hotz
88de42fb6e
document graph mode
2022-06-05 12:13:05 -07:00
George Hotz
845bb1fc34
bs 4 -> 2 in training test
2022-01-15 21:34:21 -08:00
George Hotz
c0d1254003
don't run unneeded grads
2022-01-15 21:32:13 -08:00
George Hotz
8ba3d1f803
fix bn test, affine is True
2022-01-15 19:52:15 -08:00
George Hotz
e28cdfb0cf
clean up resnet
2021-11-30 16:14:54 -05:00
George Hotz
46bbbcf7f0
model touchups
2021-11-30 11:13:34 -05:00
George Hotz
bd21304e3c
linear takes in weight and bias
2021-11-30 00:38:47 -05:00
George Hotz
de938c2d9d
vit is now tested
2021-11-30 00:23:06 -05:00
George Hotz
58ed46963e
fix broadcastdot
2021-11-29 18:54:57 -05:00
George Hotz
dca076dbf1
remove dumb nn ops
2021-11-29 18:05:31 -05:00
George Hotz
f909ab194f
gelu with broken test
2021-11-29 15:00:50 -05:00
George Hotz
c752033283
fix GPU OOM in test
2021-11-29 13:05:59 -05:00
George Hotz
99b6051467
add ff_dim to transformer
2021-11-29 12:40:52 -05:00
George Hotz
29dee59368
cat: forward only not required
2021-11-29 00:14:56 -05:00
George Hotz
3cdc77f526
add cat support
2021-11-28 23:21:49 -05:00
George Hotz
ce3d198bb7
less lines and fix default device
2021-11-27 11:18:49 -05:00
George Hotz
7ae14179d3
refactor ops
2021-11-27 11:12:23 -05:00
George Hotz
c162e748f5
fix float64 warning on training
2021-10-30 20:07:31 -07:00
George Hotz
b0f14b4af8
move datasets into datasets
2021-10-30 19:55:50 -07:00
George Hotz
7472a7ebe2
not forcing 3.9 for a stupid type
2021-10-30 16:52:40 -07:00
George Hotz
fc6597a6d9
only resnet18, it's too slow otherwise
2021-10-30 16:48:39 -07:00
Evan Mays
285621aeda
Cherry backprop for conv2d (#281)
* quick math: 0 + x = x.
* gradient w.r.t. x using cherry for conv
* gradient w.r.t. w for conv on cherry but doing vector dot products
* small optimization
* [cherry] optimize conv backpass for large channel count
* get rid of numpy einsum
2021-10-30 16:12:19 -07:00
Sebastian Kreft
8113eec4cf
feat: add efficientnet test (#285)
Simple test using the Chicken example from https://upload.wikimedia.org/wikipedia/commons/4/41/Chicken.jpg and the image preprocessing from example/efficientnet.py
Note that EfficientNet loads the weights from the internet so running the tests may be slow the first time. We could speed up the tests by caching the /tmp folder.
Fixes #234
2021-10-30 15:53:51 -07:00
Guglielmo Camporese
2b7589db64
Added ResNet-{18, 34, 50, 101, 152} (#271)
* added resnets
* fix minor
* fix minor
* resnet in models
* added resnet test
* added resnet train test
* added linear, conv2d nn tests
* fix minor in extra/training
* resnet in models
* fix minor
* fix tolerance for linear in nn test
* fix eval, this causes cpu and gpu UT failing
* revert transformer test
* fix minor for CPU test
* improved model get_params for sequential layer
* fix minor for params counting
* commented broken ops tests
* improved train for resnet
2021-06-21 09:37:24 -07:00