Comma Device
75f937227a
add barrier
2022-09-13 11:39:48 -04:00
George Hotz
3c3534736e
fix matmul kernel and tests
2022-09-13 08:31:04 -07:00
Comma Device
62e9419206
fix test failure on MATMUL=1 backward pass
2022-09-13 11:18:52 -04:00
George Hotz
0516359af8
fix stupid OPENCL=1 OOM
2022-09-06 14:29:23 -07:00
George Hotz
f215534a64
1100 lines, but sane linter rules
2022-09-06 13:47:45 -07:00
George Hotz
f683b26eef
bring back native exp log
2022-09-06 07:59:04 -07:00
George Hotz
d6f499fd69
improve opencl, why is it OOMing
2022-09-05 20:14:31 -07:00
Comma Device
c07bf72d6e
save free 200ms
2022-08-31 20:31:42 -04:00
Comma Device
a734df98fa
TEST_ENET for openpilot compiler
2022-08-31 13:23:36 -04:00
George Hotz
e194ae0c1d
typos
2022-08-30 19:52:21 -07:00
George Hotz
5efab7cf1d
add reciprocal
2022-08-29 18:00:24 -07:00
George Hotz
dc7af8c3ac
thneed run float32
2022-08-28 11:03:35 -07:00
Comma Device
9678cb8a1a
hmm, the native exp/log breaks it too much
2022-08-22 17:13:08 -07:00
George Hotz
2162cd3383
fix typing
2022-08-22 16:25:15 -07:00
Comma Device
e0a8d0f836
image input works
2022-08-22 16:04:17 -07:00
George Hotz
18340e7d30
remove from_image
2022-08-22 15:52:26 -07:00
Comma Device
1b5f4e52d9
refactor getters
2022-08-22 13:29:08 -07:00
George Hotz
a8734df030
add openpilot tests to tinygrad
2022-08-21 12:03:37 -07:00
George Hotz
b132de677d
tinygrad.nn ( #367 )
...
* tinygrad.nn
* flake8
* working on pylint
* more pylint
* more pylint
* pylint passes
* networkx
* mypy can't infer that type
* junk
2022-08-18 07:41:00 -07:00
George Hotz
783c120a8c
rawcpu ( #365 )
...
* rawcpu
* add should work when we respect shapetracker
* now that's true
* still have to handle shapetracker
* copyin
* Fix mypy
2022-08-17 11:33:20 +02:00
George Hotz
57e5df9f28
ane: procPath issue. don't waste more time with this, focus on core tinygrad
2022-08-16 10:36:13 +02:00
George Hotz
bdfdbc8f8d
broken amfi patch
2022-08-13 10:41:25 +02:00
George Hotz
262efe5784
update readme
2022-08-09 11:08:52 +02:00
George Hotz
6267a3c8c2
notes
2022-08-09 00:42:14 +02:00
George Hotz
f4ff130947
docs
2022-08-09 00:06:24 +02:00
George Hotz
01de17eeb8
amfi note
2022-08-08 13:17:36 +02:00
George Hotz
136706169d
fix ane on new mac os x
2022-08-06 19:10:22 +00:00
George Hotz
f300caa486
notes
2022-08-06 15:21:26 +00:00
George Hotz
94d526f8fc
fix op estimate
2022-08-06 14:15:50 +00:00
George Hotz
f2847cb710
remove useless init, add ops counter
2022-08-06 14:05:25 +00:00
George Hotz
5d45c6e516
Fold reduce ( #362 )
...
* folding reduce
* fold through movementops
* fixup shapes
* was too aggressive
* i knew we needed that
* don't recompute reduce
* working
* fix openpilot compile
* prunegraph openpilot
* types and reduce_shape
* refactor
* cleanups
* neater
* 1009
* 1004
* clean up reduce for 998
2022-07-19 09:24:02 -07:00
George Hotz
5e96ed523a
fix opencl bug, no training on opencl
2022-07-17 12:55:26 -07:00
George Hotz
608e2431f7
test opencl, commit to removing the crap conv code from GPU
2022-07-17 11:55:37 -07:00
George Hotz
3c4565fa21
SLICE -> PAD,SHRINK
2022-07-17 11:33:59 -07:00
George Hotz
bcf422dfdd
Device2 ( #358 )
...
* option for matmul
* fixups
* fast like a nascar
* running
* thneed runner
* no buffer id makes no backing buffer
* move constant folding to the top
* runs on mac
* folded biases
* was v slow
* maybe just that
* elu touchup
* speed and float32
Co-authored-by: Comma Device <device@comma.ai >
2022-07-16 07:26:19 -07:00
George Hotz
817b64f5e5
A conv is a reduce op ( #356 )
...
* universal strided conv
* more correct
* hmm, CPU works
* cleaner cl code output
* make noconv a flag
* cleanup __getitem__
* refactor broadcasting
* put that back
* unneeded reshape in getitem
* fix strided for torch
2022-07-10 19:58:50 -07:00
George Hotz
68959be05d
precompute weights for opencl
2022-07-08 10:56:48 -07:00
George Hotz
d8e7f1f8bc
opencl type ignore
2022-07-08 10:33:55 -07:00
George Hotz
ae335b6d3e
opencl works, but tons of kernels
2022-07-08 10:22:04 -07:00
George Hotz
5b66d1bb0b
begin fixing up opencl
2022-07-08 10:20:14 -07:00
George Hotz
e822aae9ec
reorg opts, nicer graph
2022-07-02 22:29:09 -07:00
George Hotz
7276f8d6bf
improve constant folding, detach before moving tensor
2022-07-02 15:29:40 -07:00
George Hotz
07b438aa8b
move that to resolve time
2022-07-02 14:26:13 -07:00
George Hotz
dbf4aa09db
assert and tuple
2022-06-27 09:19:54 -07:00
George Hotz
37a6c0ef59
create with new ShapeTracker
2022-06-27 09:07:45 -07:00
George Hotz
e55a9833fb
a little more readable
2022-06-27 08:54:04 -07:00
George Hotz
3a414d7f50
cleanup, add flops tracking
2022-06-26 22:43:39 -07:00
George Hotz
a699f7cb0b
debug cleanups
2022-06-26 21:58:44 -07:00
George Hotz
15a16b98e6
remove get_root
2022-06-26 21:18:02 -07:00
George Hotz
e3c2579537
flip stride to match canonical
2022-06-26 19:19:53 -07:00