George Hotz
d9fa47ecc9
use the TinyJit in the efficientnet runner, 200ms -> 20ms
2023-02-20 19:58:16 -08:00
George Hotz
714bf4b108
clang backend (#572)
* start clang backend
* mostly working
* no group for reduce w clang
* it compiles
* compiles
* a11y
* minor fixups
* formatting
* add a test
* rename test
2023-02-20 18:18:18 -08:00
George Hotz
353e81fa8f
need hand opt with IMAGE==2
2023-02-20 10:18:19 -08:00
George Hotz
b925d05e22
no networkx is okay
2023-02-20 09:57:12 -08:00
George Hotz
4d232c7c95
optional networkx + DEBUGCL=2
2023-02-20 09:50:46 -08:00
George Hotz
8b0082540b
openpilot compile cleanups
2023-02-20 09:16:03 -08:00
George Hotz
ea13504f35
fix METAL_XCODE
2023-02-19 20:02:12 -08:00
George Hotz
bbfec2fde7
8.46 TFLOPS
2023-02-19 13:21:25 -08:00
George Hotz
1ba847963d
reshape and retain metal_matmul
2023-02-19 13:07:23 -08:00
George Hotz
60008e55cd
sick of that failing
2023-02-19 13:05:37 -08:00
George Hotz
0b3f686530
Good fast triton (#567)
* runtime fixups
* uints and printbufs
* uints don't work
2023-02-19 12:21:55 -08:00
Jacky Lee
cb679cd051
Fix weight initialization (#566)
* Fix weight initialization
* Use scaled_uniform in serious_mnist
2023-02-19 11:25:29 -08:00
Diogo
a508c2b429
small tweaks to the metal runtime (#562)
* small tweaks to the metal runtime
* create buffer straight from numpy
* reverted back due to bug when adding 1+1
* removed comments
2023-02-19 11:25:13 -08:00
Martin Loretz
7e9a5e3f31
Refactor graph (#560)
* Refactor graph
* Add graph tests
* Use CPUBuffer for graph tests
* Remove the use of GlobalCounters
2023-02-19 10:41:30 -08:00
Kirill
7944cfdadc
Remove Tensor.data (#565)
2023-02-18 16:36:12 -08:00
George Hotz
4efe0169bb
remove old metal implementation
2023-02-18 13:51:48 -08:00
Jacky Lee
7e8b0305f3
Fix mnist gan example (#563)
2023-02-18 13:45:37 -08:00
Jacky Lee
9fd41632c6
Import get_parameters from tinygrad.nn (#559)
* get_parameter is in optim
* Update all imports for get_parameters
* Clean up
* use optim.get_parameters
2023-02-17 15:22:26 -08:00
George Hotz
fae7654924
fix sync issue
2023-02-17 12:42:45 -08:00
George Hotz
5e6265be6e
metal timing, fix speed test
2023-02-17 12:31:54 -08:00
George Hotz
121bd03cbd
metal globalcounters
2023-02-17 12:02:54 -08:00
George Hotz
67d1df80ba
gid is array, metal works
2023-02-17 11:54:50 -08:00
George Hotz
f9af0322e7
metal can add
2023-02-17 11:45:33 -08:00
Jacky Lee
e172f0087a
BatchNorm2D -> BatchNorm2d (#558)
* BatchNorm2D -> BatchNorm2d
* Fix typo
2023-02-16 12:31:49 -08:00
Liam
09315ef34f
Add tinygrad.org reference in Readme. (#556)
2023-02-14 09:39:00 -08:00
George Hotz
82c257e8f5
more kernel search
2023-02-12 10:34:56 -08:00
Jacky Lee
c35fcc6964
Replace phrase for prompt (#555)
2023-02-12 09:04:44 -08:00
George Hotz
20a03d5017
woah, don't sync torch if it's not torch
2023-02-12 07:48:56 -08:00
George Hotz
de71c13934
test speed v torch uses jit
2023-02-12 07:43:17 -08:00
George Hotz
693d4b89a4
fixup TRITON backend to use new APIs
2023-02-12 06:57:49 -08:00
George Hotz
191c76cfd7
hlb_cifar10 torch version
2023-02-11 18:04:40 -08:00
George Hotz
9057d98d36
no lr decay in cifar. test this in torch tomorrow
2023-02-11 17:42:54 -08:00
George Hotz
dd7accb9cc
decay LR, little bugfix
2023-02-11 17:34:15 -08:00
George Hotz
ba3bf5bdf7
cifar stops learning
2023-02-11 17:21:42 -08:00
George Hotz
40f3949742
fancier KOPT
2023-02-11 16:40:25 -08:00
George Hotz
446442dbb3
fix tests symbolic
2023-02-11 15:16:47 -08:00
George Hotz
7a7046f264
sum_combine_num
2023-02-11 14:48:31 -08:00
Kirill
a4f5f2ff8b
Add missing packages to setup.py (#554)
2023-02-11 14:41:56 -08:00
George Hotz
20a351a3c6
hand optim CONVW
2023-02-11 14:41:08 -08:00
George Hotz
89499b303d
oops, bad else. why didn't linter catch
2023-02-11 12:02:09 -08:00
George Hotz
7d33f2d659
CL.CACHE is over, GlobalCounters.cache is it
2023-02-11 12:00:14 -08:00
George Hotz
b9eae94ae9
move Device back into lazy
2023-02-11 11:26:53 -08:00
George Hotz
9152bb5b4a
momentum support in SGD
2023-02-11 10:22:37 -08:00
George Hotz
0a2035e015
oops, GPU isn't defined
2023-02-11 10:10:02 -08:00
George Hotz
3421d4af10
the jit has a test
2023-02-11 10:04:03 -08:00
George Hotz
031edd01e6
switch openpilot compile to TinyJit
2023-02-11 09:51:44 -08:00
jspieler
8f912c3966
added deep deterministic policy gradient example (#531)
2023-02-11 10:10:46 -06:00
George Hotz
608fd730d3
put the JIT in extra
2023-02-11 00:35:18 -06:00
George Hotz
ed8ae7522a
tinyjit
2023-02-11 00:22:36 -06:00
George Hotz
4c90a15689
make the fake data actually learnable
2023-02-10 23:35:21 -06:00