George Hotz
54f499b623
Move rawbuffer ( #697 )
...
* move GlobalCounters to helpers
* that's not part of the public api
* move InterpretedBuffer
* remove fromCPU from devicebuffer
2023-03-13 22:30:36 -07:00
George Hotz
c594a0a835
fix flip bug, add new unit tests
2023-03-12 23:55:31 -07:00
George Hotz
a4abcf0969
improve test_example
2023-03-12 22:59:40 -07:00
George Hotz
5577634cf3
tests in pre commit
2023-03-12 22:42:26 -07:00
George Hotz
ce1564b05e
fix shapetracker test
2023-03-12 22:33:25 -07:00
George Hotz
fe0e8a306f
jittable llama
2023-03-12 14:15:04 -07:00
George Hotz
15e0b56e39
compile works ( #688 )
...
* compile works
* runtimes
* line count
* fix custom, to tg dtype
* meh, that's fine with lazy import
2023-03-12 11:01:25 -07:00
George Hotz
dc9a6b4bb7
fix float16 in CLANG on linux
2023-03-11 21:51:22 -08:00
George Hotz
37cf6fc4c0
err, external_test_opt.py broke...fusing will have to wait. correctness over speed
2023-03-11 17:54:47 -08:00
George Hotz
305b9f2d21
multistep optim tests passing
2023-03-11 17:49:53 -08:00
George Hotz
61071f881a
fix bug, and add unit test to catch failure
2023-03-11 16:57:25 -08:00
George Hotz
3ec457248c
failing llama test
2023-03-11 16:28:10 -08:00
Diogo
784afc6c6f
Eq magic function support ( #683 )
...
* add eq magic func
* changed from eq to __eq__
* ignore type for linter
* mypy doenst like descriptions :(
2023-03-11 10:31:46 -08:00
George Hotz
01f39b19dc
move to shapetracker.py
2023-03-11 07:50:07 -08:00
George Hotz
0b03216cc3
losing lines ( #678 )
...
* losing lines
* FLIP -> STRIDE
* shapetracker refactor
2023-03-10 21:57:05 -08:00
George Hotz
d7cb8e3e56
multithreaded fake_torch_load_zipped
2023-03-10 19:16:27 -08:00
Connor Henderson
8b7a16cf85
Add conv binops_no_rerun test assertions ( #665 )
...
* Add conv binops_no_rerun assertions
* use assert_allclose
* widen tolerance for elu
2023-03-10 19:09:48 -08:00
George Hotz
1826ff6b89
dtypes nice and clean ( #673 )
...
* add dtype class
* dtypes
* buffers are lazy
* dtype is tracked by lazybuffer and GenericShape
* fix types in llvm
* llvm store
* dtype tests
* fix tests maybe
* fix flop counter
* fix CI
* CI fix and check format
* fix dtype and dtype check
* fix custom test
* fix test graph
2023-03-10 16:56:07 -08:00
George Hotz
036737a12a
mem_estimate tracks bytes, not items
2023-03-10 09:44:12 -08:00
George Hotz
1a039306d2
good changes from llama branch ( #671 )
...
* good changes from llama
* transpose behavior changed
2023-03-09 20:51:22 -08:00
George Hotz
dbbaa0bdd7
int32, and refactor pad/shrink
2023-03-09 12:57:17 -08:00
George Hotz
fb5ee9260f
add pad tests to shapetracker
2023-03-09 12:51:18 -08:00
George Hotz
022c5835fc
fix GPU import error and old python Tuple
2023-03-08 12:22:11 -08:00
George Hotz
c22afc52db
move the custom function example to a test
2023-03-08 10:05:04 -08:00
George Hotz
00641aa45d
add challenge tests
2023-03-07 19:39:04 -08:00
George Hotz
e0244baf60
3 letters for graph op
2023-03-07 19:20:48 -08:00
George Hotz
4eb880550f
enable contract test
2023-03-07 17:32:28 -08:00
Alex Wang
d885d2d0f5
Allow 1s for contraction detection ( #663 )
...
* Allow 1s for contraction check
* More test cases for 1s
2023-03-07 17:31:28 -08:00
George Hotz
b561256a0e
allow all reduces ( #661 )
...
* allow all reduces
* push permute tests
* explict permute reshape push
* contractw1s
2023-03-07 15:36:01 -08:00
George Hotz
b14d31d6db
ConvNeXt + extras ( #657 )
...
* simple convnext implementation
* shorter function names
* need to realize the random functions now
* creating an optimizer realizes all params
* assign contiguous
* fix lazy lazy
* why was i doing that...add convnext to tests
* LazyNumpyArray
* enable assert + comment
* no two tiny
2023-03-06 22:10:56 -08:00
George Hotz
8c5dea8d72
fix CUDA float4 issues
2023-03-06 07:16:38 -08:00
George Hotz
7dbcc26582
fix up external tests
2023-03-06 06:52:28 -08:00
George Hotz
50012f679b
move get_contraction to shapetracker
2023-03-06 06:42:57 -08:00
Alex Wang
64ecbd91b5
Refactor contraction and add integration test cases for push permute ( #650 )
...
* Refactor contraction and add unit tests
* Fix typo; Fix TestConv.test_elu failure due to some ones in old_shape
* Add push permute test cases
* Fix mypy type annotation check error
* Add contraction unit test; Reshape to higher dimension is not contraction
2023-03-06 06:36:55 -08:00
George Hotz
382f346523
clean up opt ( #649 )
...
* clean up opt
* don't let global kernels get too small
* 8192 -> 1024
* disable local shape for clang
* fix can_merge
* unroll the 5x5 depthwise convs in op
* load float4 check
2023-03-05 20:49:36 -08:00
George Hotz
7930c6ab5c
CLImage backing bug + test_vec_mul
2023-03-05 16:32:05 -08:00
George Hotz
8de24e3b05
accumulator can be a float4 ( #647 )
...
* remove reduceopop
* not float4 yet
* float4 acc works
* group_float4 on store
2023-03-05 15:44:41 -08:00
George Hotz
7940ad258e
fix dropout test
2023-03-05 12:24:04 -08:00
George Hotz
b1ba78ac38
move applegpu disassembler
2023-03-05 11:21:12 -08:00
George Hotz
16b03f3c3b
wow, can't believe that was broken ( #642 )
...
* wow, can't believe that was broken
* remove namedtuple comment
2023-03-04 22:28:28 -08:00
George Hotz
4a607f7d65
more ext gpu tests
2023-03-04 21:00:08 -08:00
George Hotz
69198a73d2
test_1x1_24_6
2023-03-04 20:37:46 -08:00
George Hotz
b02a392d69
Improve local ( #635 )
...
* local is improving
* local is finding bugs
* new local should work
2023-03-04 09:30:49 -08:00
George Hotz
528cb3b3b9
fix ast test
2023-03-04 07:49:25 -08:00
George Hotz
28a6ada4ce
line reduction in metal
2023-03-03 23:14:40 -08:00
George Hotz
7a1d96fd76
No negative ( #632 )
...
* behavior is correct without VALIDHACKS
* simple div and mod
* fix tests
* no negative variables
* alt form is correct
* still correct
* bug in mulnode
* at least validhacks works now
* cleanups
* test validhacks, and to_image_idx
* cache compare key
* tests and __neg__
2023-03-03 16:48:14 -08:00
George Hotz
8c475ea86a
relax atol, merge_view
2023-03-03 07:48:44 -08:00
George Hotz
b9ce20c374
openpilot test wasn't running, factor out image idx
2023-03-03 07:41:53 -08:00
George Hotz
9bd2cdee08
skip broken bn training test for speed
2023-03-03 06:52:11 -08:00
George Hotz
999b44c274
fix external test + speed
2023-03-03 06:46:16 -08:00