Commit Graph

71 Commits

Author SHA1 Message Date
George Hotz
835869974c clean up vit code 2021-11-30 10:58:03 -05:00
George Hotz
c39824bc62 oops, forgot some stars 2021-11-30 00:46:14 -05:00
George Hotz
908db3bdea support bias in conv like linear 2021-11-30 00:44:59 -05:00
George Hotz
bd21304e3c linear takes in weight and bias 2021-11-30 00:38:47 -05:00
George Hotz
535f02cc64 use sequential 2021-11-30 00:25:39 -05:00
George Hotz
de938c2d9d vit is now tested 2021-11-30 00:23:06 -05:00
George Hotz
aff810e722 unify transformer block 2021-11-29 18:58:15 -05:00
George Hotz
58ed46963e fix broadcastdot 2021-11-29 18:54:57 -05:00
George Hotz
033b04494a resnet pretrained is broken 2021-11-29 18:13:52 -05:00
George Hotz
dca076dbf1 remove dumb nn ops 2021-11-29 18:05:31 -05:00
George Hotz
8097b8f7d6 vit works 2021-11-29 16:28:14 -05:00
George Hotz
f909ab194f gelu with broken test 2021-11-29 15:00:50 -05:00
George Hotz
1eafa5580e layernorm with learnable parameters 2021-11-29 13:03:57 -05:00
George Hotz
c7f795ca1e added dot affine 2021-11-29 12:55:56 -05:00
George Hotz
30eb3afbe1 add bias term to transformer 2021-11-29 12:45:27 -05:00
George Hotz
99b6051467 add ff_dim to transformer 2021-11-29 12:40:52 -05:00
George Hotz
641b1dbb40 remove ane, start supporting ops_torch 2021-10-30 17:47:00 -07:00
George Hotz
7d12482d80 refactor efficientnet loading 2021-10-30 17:02:17 -07:00
Sebastian Kreft
3358770182 chore(efficientnet): don't use eval when loading weights (#286)
Because the weights are being loaded from a third party internet address, it's unsafe to use eval. Also with the change I think the code became a little bit more clear as now it's clearer which keys are being transformed.

Co-authored-by: Seba Kreft <sebastian.kreft@houm.com>
2021-10-22 15:10:04 -07:00
Guglielmo Camporese
2b7589db64 Added ResNet-{18, 34, 50, 101, 152} (#271)
* added resnets

* fix minor

* fix minor

* resnet in models

* added resnet test

* added resnet train test

* added linear, conv2d nn tests

* fix minor in extra/training

* resnet in models

* fix minor

* fix tolerance for linear in nn test

* fix eval, this causes cpu and gpu UT failing

* revert transformer test

* fix minor for CPU test

* improved model get_params for sequential layer

* fix minor for params counting

* commented broken ops tests

* improved train for resnet
2021-06-21 09:37:24 -07:00
George Hotz
d3f169b267 move good models to models, add a training step test 2021-06-19 11:24:15 -07:00