Commit Graph

11106 Commits

Author SHA1 Message Date
George Hotz
6a56d5d030 remove pyopencl 2020-11-09 23:58:35 -08:00
George Hotz
943ff6490d logic error okay too 2020-11-09 23:55:21 -08:00
George Hotz
29b22d117f will gpu tests work? 2020-11-09 23:51:29 -08:00
George Hotz
abbf0d1328 cleanup logsoftmax 2020-11-09 23:47:04 -08:00
George Hotz
aeb90226a8 reduce lines with reduce_op 2020-11-09 23:36:44 -08:00
George Hotz
55c914912d minor cleanup 2020-11-09 23:14:59 -08:00
George Hotz
d41ad2bf37 uint2 saves lines 2020-11-09 23:11:20 -08:00
George Hotz
8b23033fa9 support all the enet sizes 2020-11-09 18:04:16 -08:00
George Hotz
9db95ab942 fix enet padding 2020-11-09 17:56:57 -08:00
George Hotz
866b759d3b match torch api for pad2d 2020-11-09 17:48:56 -08:00
George Hotz
daf073535f new -> zeros 2020-11-09 17:31:52 -08:00
Ryan Neph
16d564a53c finish unsupporting strided pool, add global avg pool test (#92) 2020-11-09 17:31:22 -08:00
George Hotz
7ac1b163a5 add backward to enet train 2020-11-09 16:05:52 -08:00
George Hotz
8ca9c0205f train_efficientnet is broken still 2020-11-09 16:01:16 -08:00
George Hotz
870b84a893 test pad2d backward on GPU 2020-11-09 15:50:43 -08:00
adamritter
b541c05d88 Pad2d backward pass on GPU (#89)
* Pad2d backward pass on GPU

* Faster Pad2D GPU backward pass (no zeroing needed)

* Fix out of bounds error

* Don't save prg

Co-authored-by: holonomicjl <58403584+holonomicjl@users.noreply.github.com>
2020-11-09 15:49:37 -08:00
George Hotz
e46d122f65 not supporting stride 2020-11-09 15:06:58 -08:00
Ryan Neph
c21c2a0b62 revert b0c0c5d: Strided Pool funcs (#74) (#87)
Strided CPU Pooling was introduced but assumes small kernel size
(<=(10,10)), but efficientnet.py feeds kernel_size=(112,112).

This causes a huge array buffer allocation in stack_for_pool() that
hangs inference for a long time or until system OOM.

Revert CPU Pooling for now, and re-introduce #74 later with a new
global-average-pooling op that can be used instead of avgpool2d with
large kernel size for efficientnet inference.

Co-authored-by: Ryan Neph <ryanneph@google.com>
2020-11-09 14:58:18 -08:00
George Hotz
53157fb876 add back scale 2020-11-09 10:20:56 -08:00
George Hotz
3ffbd47335 Revert "Revert "pygame is fine, cv2 can also do the trick (#79)" (#85)"
This reverts commit 6b982621f8.
2020-11-09 10:18:48 -08:00
George Hotz
6b982621f8 Revert "pygame is fine, cv2 can also do the trick (#79)" (#85)
This reverts commit e7f2f43331.
2020-11-09 10:03:38 -08:00
dustcollector12
e7f2f43331 pygame is fine, cv2 can also do the trick (#79)
* pygame is fine, cv2 can also do the trick

* retimg and copy constructor not needed

* shape is missing without copy constructor

* retimg put back

* addressing capture buffering
2020-11-09 10:02:06 -08:00
Ryan Neph
7e515308a5 label op subtests by params (#83) 2020-11-09 06:25:06 -08:00
Ryan Neph
5bedf566d1 tests should use rtol unless special case (#82) 2020-11-08 17:25:11 -08:00
Ryan Neph
04b9312a34 Fix GPU Pooling bug at boundary + better Pooling test coverage (#81)
* fixed Pooling bug

* Clarify Pooling tests
2020-11-08 17:25:01 -08:00
niclaswue
c57b1b9e7d deleted unnecessary import in utils (#78) 2020-11-08 15:55:16 -08:00
Ryan Neph
b0c0c5d0d6 strided Pool funcs (#74)
* *Pool2D GPU forward supports stride

* kernel_size from ctx instead of saved_tensors

* *Pool2D CPU forward supports stride

* update ctx.stride properly
2020-11-08 11:45:55 -08:00
George Hotz
06504a5824 bump version 2020-11-08 09:34:07 -08:00
ziofil
db3eccc16b implemented backward for Pad2D & test (#73) 2020-11-07 21:58:42 -08:00
George Hotz
75d69e956f readme more 2020-11-07 21:58:20 -08:00
Dimitar Vagalinski
35a5c82a2a done as he said (#71) 2020-11-07 18:28:39 -08:00
Ryan Neph
5265f6c578 add AvgPool2D backward pass on GPU (#68) 2020-11-07 12:27:29 -08:00
George Hotz
ce6c408d78 readmee 2020-11-07 12:26:57 -08:00
George Hotz
3ab8fb2189 oops, timing 2020-11-07 12:15:52 -08:00
George Hotz
e4c5fb0219 webcam support 2020-11-07 12:15:16 -08:00
George Hotz
5486135f2d readme 2020-11-07 11:41:27 -08:00
George Hotz
9b8fefe4e3 fix broadcast bugggg 2020-11-07 11:38:56 -08:00
George Hotz
b1ca4dd327 who loves speeeeed 2020-11-07 11:36:52 -08:00
George Hotz
e6c8321e5b enet is running on GPU 2020-11-07 11:33:46 -08:00
George Hotz
30442a086a some broadcasting, pool test is fail 2020-11-07 11:29:42 -08:00
Ryan Neph
7c630ca355 GPU {Avg,Max}Pool2D forward pass (#66) 2020-11-07 11:13:24 -08:00
George Hotz
40c538c299 fix reshape 2020-11-07 11:10:29 -08:00
George Hotz
19c2950310 refactor ops to prepare for broadcasting 2020-11-07 11:02:45 -08:00
George Hotz
94d44c97bf add pad2d on GPU 2020-11-07 10:46:36 -08:00
George Hotz
fbff6ab2e5 fix strided convs, GPU env var for enet 2020-11-07 10:26:37 -08:00
George Hotz
ec03eb44bd tinygrad does forward pass convs on GPU 2020-11-07 10:15:56 -08:00
Manjeet Singh
23c39d9f52 fetch() 'ran out of input' fix (#65)
If requests.get fails during a download, the incomplete file is passed
without a size check. This causes errors during image/model loading.
Not sure if this is a problem on all systems.
2020-11-07 09:45:10 -08:00
George Hotz
bc7758cc5b getting convs to work on gpu 2020-11-07 09:17:57 -08:00
George Hotz
3302286e68 yayay test_sgd_gpu passes 2020-11-07 08:48:17 -08:00
Ryan Neph
98d1a4f740 GPU Selection works when len(answers)>ngpus (#64)
Fixes TypeError: sequence item 0: expected str instance, int found
triggered by https://github.com/inducer/pyopencl/blob/v2020.2.2/pyopencl/__init__.py#L1500
in pyopencl v2020.2.2
2020-11-07 08:06:59 -08:00