George Hotz
892ac661e1
enet readability
2022-06-07 10:23:05 -07:00
George Hotz
0ee21ba115
add ViT test and car
2022-06-05 17:12:43 -07:00
George Hotz
c8b569a8c7
cleaner comments
2022-05-14 21:28:39 -07:00
cjg91
7025c9bbeb
Transfer learning for ResNet ( #295 )
...
* Transfer learning for ResNet
* moved ResNet depth specifics into the class
2022-01-15 23:22:10 -05:00
George Hotz
55d792b065
Revert "fixup resnet"
...
This reverts commit 4eabe677ed .
2022-01-15 20:22:01 -08:00
George Hotz
4eabe677ed
fixup resnet
2022-01-15 20:21:02 -08:00
George Hotz
c0c2c0b041
support larger ViT models
2021-12-12 10:45:10 -08:00
George Hotz
e28cdfb0cf
clean up resnet
2021-11-30 16:14:54 -05:00
George Hotz
8f5779eeaa
very minor change
2021-11-30 15:54:03 -05:00
George Hotz
d31ef0ae48
make vit names match pytorch
2021-11-30 11:34:14 -05:00
George Hotz
4b7c31b5b7
break vit into it's own file
2021-11-30 11:19:22 -05:00
George Hotz
46bbbcf7f0
model touchups
2021-11-30 11:13:34 -05:00
George Hotz
835869974c
clean up vit code
2021-11-30 10:58:03 -05:00
George Hotz
c39824bc62
oops, forgot some stars
2021-11-30 00:46:14 -05:00
George Hotz
908db3bdea
support bias in conv like linear
2021-11-30 00:44:59 -05:00
George Hotz
bd21304e3c
linear takes in weight and bias
2021-11-30 00:38:47 -05:00
George Hotz
535f02cc64
use sequential
2021-11-30 00:25:39 -05:00
George Hotz
de938c2d9d
vit is now tested
2021-11-30 00:23:06 -05:00
George Hotz
aff810e722
unify transformer block
2021-11-29 18:58:15 -05:00
George Hotz
58ed46963e
fix broadcastdot
2021-11-29 18:54:57 -05:00
George Hotz
033b04494a
resnet pretrained is broken
2021-11-29 18:13:52 -05:00
George Hotz
dca076dbf1
remove dumb nn ops
2021-11-29 18:05:31 -05:00
George Hotz
8097b8f7d6
vit works
2021-11-29 16:28:14 -05:00
George Hotz
f909ab194f
gelu with broken test
2021-11-29 15:00:50 -05:00
George Hotz
1eafa5580e
layernorm with learnable parameters
2021-11-29 13:03:57 -05:00
George Hotz
c7f795ca1e
added dot affine
2021-11-29 12:55:56 -05:00
George Hotz
30eb3afbe1
add bias term to transformer
2021-11-29 12:45:27 -05:00
George Hotz
99b6051467
add ff_dim to transformer
2021-11-29 12:40:52 -05:00
George Hotz
641b1dbb40
remove ane, start supporting ops_torch
2021-10-30 17:47:00 -07:00
George Hotz
7d12482d80
refactor efficientnet loading
2021-10-30 17:02:17 -07:00
Sebastian Kreft
3358770182
chore(efficientnet): don't use eval when loading weights ( #286 )
...
Because the weights are being loaded from a third party internet address, it's unsafe to use eval. Also with the change I think the code became a little bit more clear as now it's clearer which keys are being transformed.
Co-authored-by: Seba Kreft <sebastian.kreft@houm.com >
2021-10-22 15:10:04 -07:00
Guglielmo Camporese
2b7589db64
Added ResNet-{18, 34, 50, 101, 152} ( #271 )
...
* added resnets
* fix minor
* fix minor
* resnet in models
* added resnet test
* added resnet train test
* added linear, conv2d nn tests
* fix minor in extra/training
* resnet in models
* fix minor
* fix tolerance for linear in nn test
* fix eval, this causes cpu and gpu UT failing
* revert transformer test
* fix minor for CPU test
* improved model get_params for sequential layer
* fix minor for params counting
* commented broken ops tests
* improved train for resnet
2021-06-21 09:37:24 -07:00
George Hotz
d3f169b267
move good models to models, add a training step test
2021-06-19 11:24:15 -07:00