George Hotz
fd6082dcef
support all _pool2d. conv will eventually be an hlop
2023-02-23 08:19:47 -08:00
George Hotz
94ccab941e
compile_tensorflow: no cast required
2023-02-22 21:14:21 -08:00
George Hotz
135d0ddb78
compile_tensorflow: read weights from disk
2023-02-22 21:12:35 -08:00
George Hotz
0615dcffe7
compile_tensorflow: save the weights
2023-02-22 21:05:45 -08:00
George Hotz
c537fd0614
compile_tensorflow: add initialize and tests
2023-02-22 20:50:53 -08:00
George Hotz
dc914cde50
compile_tensorflow
2023-02-22 20:08:58 -08:00
George Hotz
10ba96029c
device: ImportError -> Exception
2023-02-22 19:48:59 -08:00
George Hotz
76b4d0577d
yolov8 works up to the MaxPool
2023-02-22 19:32:13 -08:00
Mischa Untaga
5190784cbb
Fix Tensor random functions determinism with same seed ( #580 )
...
* fix Tensor random functions determinism with same seed
* long lived rng
* TIL ClassVar typing
2023-02-22 19:08:43 -08:00
George Hotz
c8d89eb20e
avg/max pool strides
2023-02-22 18:00:48 -08:00
George Hotz
ac8daaeea5
refactor device, credit martinloretzzz
2023-02-22 17:29:40 -08:00
George Hotz
a3ddc1d484
CLANG doesn't support float4
2023-02-22 10:41:15 -08:00
George Hotz
c4c2c28738
a sustainable approach to float4 ( #582 )
...
* a sustainable approach to float4
* can_float4
* fix tests
* fix float4
* delete dead code
* types and minor cleanup
2023-02-22 09:45:08 -08:00
George Hotz
50e541174c
fix image copyout
2023-02-22 07:27:13 -08:00
George Hotz
628ce067a1
add tests to mypy
2023-02-22 07:07:38 -08:00
George Hotz
104c3c5e73
oops, forgot that debug
2023-02-22 06:58:27 -08:00
George Hotz
c5e2126d49
move DEBUG to helpers
2023-02-22 06:52:11 -08:00
Mischa Untaga
14bb2c40a2
Fix yolov3 example ( #577 )
2023-02-21 09:24:00 -08:00
Martin Loretz
8550b3e168
Fix kernel cache key ( #570 )
2023-02-21 03:53:07 -08:00
George Hotz
66b4b3bdd3
clang: linux needs fPIC
2023-02-20 21:09:33 -08:00
George Hotz
4f4b7d05a7
clang needs -lm and is very slow
2023-02-20 21:02:04 -08:00
George Hotz
4126bf2982
remove six (hopefully not needed)
2023-02-20 20:44:23 -08:00
George Hotz
efcb3f0cdd
fix metal dep
2023-02-20 20:43:32 -08:00
Connor Henderson
9670bf1fd1
Add unsqueeze ( #574 )
...
* Add unsqueeze
* remove UNSQUEEZE from llops part of readme
* make it an hlop
2023-02-20 20:14:59 -08:00
George Hotz
cfad2902d5
clang: that might have been getting GCed
2023-02-20 20:13:05 -08:00
Diogo
506970414a
added metal packages to setup and release metal buffers after del ( #571 )
2023-02-20 20:01:07 -08:00
George Hotz
d9fa47ecc9
use the TinyJit in the efficientnet runner, 200ms -> 20ms
2023-02-20 19:58:16 -08:00
George Hotz
714bf4b108
clang backend ( #572 )
...
* start clang backend
* mostly working
* no group for reduce w clang
* it compiles
* compiles
* a11y
* minor fixups
* formatting
* add a test
* rename test
2023-02-20 18:18:18 -08:00
George Hotz
353e81fa8f
need hand opt with IMAGE==2
2023-02-20 10:18:19 -08:00
George Hotz
b925d05e22
no networkx is okay
2023-02-20 09:57:12 -08:00
George Hotz
4d232c7c95
optional networkx + DEBUGCL=2
2023-02-20 09:50:46 -08:00
George Hotz
8b0082540b
openpilot compile cleanups
2023-02-20 09:16:03 -08:00
George Hotz
ea13504f35
fix METAL_XCODE
2023-02-19 20:02:12 -08:00
George Hotz
bbfec2fde7
8.46 TFLOPS
2023-02-19 13:21:25 -08:00
George Hotz
1ba847963d
reshape and retain metal_matmul
2023-02-19 13:07:23 -08:00
George Hotz
60008e55cd
sick of that failing
2023-02-19 13:05:37 -08:00
George Hotz
0b3f686530
Good fast triton ( #567 )
...
* runtime fixups
* uints and printbufs
* uints don't work
2023-02-19 12:21:55 -08:00
Jacky Lee
cb679cd051
Fix weight initialization ( #566 )
...
* Fix weight initialization
* Use scaled_uniform in serious_mnist
2023-02-19 11:25:29 -08:00
Diogo
a508c2b429
small tweaks to the metal runtime ( #562 )
...
* small tweaks to the metal runtime
* create buffer straight from numpy
* reverted back due to bug when adding 1+1
* removed comments
2023-02-19 11:25:13 -08:00
Martin Loretz
7e9a5e3f31
Refactor graph ( #560 )
...
* Refactor graph
* Add graph tests
* Use CPUBuffer for graph tests
* Remove the use of GlobalCounters
2023-02-19 10:41:30 -08:00
Kirill
7944cfdadc
Remove Tensor.data ( #565 )
2023-02-18 16:36:12 -08:00
George Hotz
4efe0169bb
remove old metal implementation
2023-02-18 13:51:48 -08:00
Jacky Lee
7e8b0305f3
Fix mnist gan example ( #563 )
2023-02-18 13:45:37 -08:00
Jacky Lee
9fd41632c6
Import get_parameters from tinygrad.nn ( #559 )
...
* get_parameter is in optim
* Update all imports for get_parameters
* Clean up
* use optim.get_paramters
2023-02-17 15:22:26 -08:00
George Hotz
fae7654924
fix sync issue
2023-02-17 12:42:45 -08:00
George Hotz
5e6265be6e
metal timing, fix speed test
2023-02-17 12:31:54 -08:00
George Hotz
121bd03cbd
metal globalcounters
2023-02-17 12:02:54 -08:00
George Hotz
67d1df80ba
gid is array, metal works
2023-02-17 11:54:50 -08:00
George Hotz
f9af0322e7
metal can add
2023-02-17 11:45:33 -08:00
Jacky Lee
e172f0087a
BatchNorm2D -> BatchNorm2d ( #558 )
...
* BatchNorm2D -> BatchNorm2d
* Fix typo
2023-02-16 12:31:49 -08:00