George Hotz
41f0a25b53
lazy.py: cache consts ( #3577 )
...
* lazy.py: cache consts
* add regression test
* always always cache const
* bump by 1
2024-03-02 03:50:05 -08:00
George Hotz
a40df14fef
ops_ext to replace cpu import ( #3409 )
...
* ops_ext to replace cpu import
* don't allow zero copy with as buffer
* memoryview(bytearray
* reenable test
* fix jit issue
2024-02-15 13:03:42 +01:00
George Hotz
6356474d6d
Revert "ops_ext to replace cpu import ( #3406 )" ( #3408 )
...
This reverts commit 91eb93f85a .
2024-02-15 12:16:10 +01:00
George Hotz
91eb93f85a
ops_ext to replace cpu import ( #3406 )
...
* ops_ext to replace cpu import
* don't allow zero copy with as buffer
* memoryview(bytearray
* reenable test
2024-02-15 12:14:58 +01:00
George Hotz
2e60012bcf
move create schedule and delete old API ( #3377 )
...
* move create schedule and delete old API
* fix test multitensor
2024-02-12 18:10:45 +01:00
George Hotz
41efaa848c
move graph.py and jit.py into features ( #3376 )
...
* move graph.py into features
* move jit into features
* fix quickstart
2024-02-12 17:34:34 +01:00
George Hotz
2495ca95c7
early gate the graph ( #3070 )
2024-01-09 20:17:13 -08:00
George Hotz
a280cfe169
move dtypes to dtype.py ( #2964 )
...
* move dtypes to dtype.py
* fix urllib
2024-01-01 14:58:48 -08:00
George Hotz
1765849937
new lazy, benchmark ( #2878 )
...
* lazy rewrite, try 2
* min fix tests
* pass contig test
* put broken pads back
* move that to realize
* no contig child fixes array packing
* so wrong
* now that's correct
* base children
* fix bind issues
* disable to_image_idx
* fix tests
* that failure shouldn't break other tests
* more fixes
* fix torch
* skip failing tests in CI
* 1e-7
* half is broken
* 1e-6 margin of error
2023-12-20 14:33:21 -08:00
George Hotz
877c78b4ce
lazy tests ( #2796 )
...
* tests
* mini sd is very mini
2023-12-16 08:24:21 -08:00
George Hotz
96a276cc7c
hotfix: add test_reduce_permute_nofuse to master
2023-12-15 09:39:47 -08:00
George Hotz
c6eb618013
tests from new lazy branch ( #2774 )
...
* tests from new lazy branch
* fix lin 11
* that was needed
* doesn't fail
* mark
* meant that
* llvm passes
2023-12-14 23:06:39 -08:00
chenyu
aa4a0de287
simpler Tensor.pow to integer ( #2746 )
2023-12-13 11:39:20 -05:00
chenyu
67f4e03724
rewrite 0 size loadop into a CONST ( #2556 )
...
* rewrite 0 size loadop into a CONST
* check alloc size
* EMPTY is better
* Revert "EMPTY is better"
This reverts commit 574fe0f9ed28f1b97da5a81afdfd2cd5d9a94ff9.
* no ast is created
* fix test
2023-12-01 18:29:06 -05:00
Christopher Mauri Milan
7f01dd04f0
Apply ruff linting rules to tests ( #2473 )
...
* everything except F821
* enable F821 with noqa
* dumb fix
* fix remaining imports and (former) lambdas
* replace _ with noqa to avoid gc
2023-11-27 21:24:06 -08:00
George Hotz
9e07824542
move device to device.py ( #2466 )
...
* move device to device.py
* pylint test --disable R,C,W,E --enable E0611
* fix tests
2023-11-27 11:34:37 -08:00
George Hotz
0cbf6c1811
move things, clean up extra ( #2292 )
...
* move things
* idk why pylint needs that now
* delete unused
2023-11-13 20:18:40 -08:00
George Hotz
8ba7ced7f9
extract const if it's const ( #2193 )
...
* extract const if it's const
* fix if statement
* fast math issue
* fix graphing and casting
* disable flaky copyout test
2023-10-31 18:52:35 -07:00
George Hotz
881fd7c141
add mops to graph, refactor IMAGE ( #2100 )
...
* add mops to graph, refactor IMAGE
* no reshape pushing
* add todo
* fix openpilot model alt
* push reshapes reduces kernels in new op
* IMAGE=2 is a first class citizen now
2023-10-17 21:27:51 -07:00
George Hotz
e4846771b2
Revert "limit metal buffers and revert the 207 fix (try 2) ( #2088 )"
...
This reverts commit 5e24dc5a95 .
2023-10-16 17:50:11 -07:00
George Hotz
5e24dc5a95
limit metal buffers and revert the 207 fix (try 2) ( #2088 )
...
* limit metal buffers
* look at the base, not the srcs
* Revert "Revert "openpilot kernel fix from 209 to 207 (#2006 )" (#2065 )"
This reverts commit 924ecc4d6a .
* add a test for that
2023-10-16 14:52:16 -07:00
George Hotz
e8fcd2f3db
Revert "limit metal buffers and revert the 207 fix ( #2087 )"
...
This reverts commit 2fb10f6a19 .
2023-10-16 14:32:22 -07:00
George Hotz
2fb10f6a19
limit metal buffers and revert the 207 fix ( #2087 )
...
* limit metal buffers
* Revert "Revert "openpilot kernel fix from 209 to 207 (#2006 )" (#2065 )"
This reverts commit 924ecc4d6a .
2023-10-16 14:26:32 -07:00
George Hotz
cea4cbfc7a
move image+kopt to features ( #2015 )
...
* move image+kopt to features
* fix tests
* debug prints (unrelated)
2023-10-07 15:41:08 -07:00
George Hotz
121f7aa8c5
Schedule item ( #2012 )
...
* ScheduleItem
* put var_vals in the schedule
* fix tests, wow that proliferated quickly
* not ready to be in the schedule
2023-10-07 08:59:25 -07:00
George Hotz
f54959e5cd
move print tree into graph ( #2003 )
...
* move print tree into graph
* add winograd profiling test
* change pre-commit to run ruff first
2023-10-07 04:39:21 -07:00
George Hotz
ffa33d743a
good changes from openpilot_compile2 ( #2000 )
...
* good changed from openpilot_compile2
* float32 image type was wrong
* cleaner way to write that + a test
2023-10-06 13:33:24 -07:00
George Hotz
21a2c5df73
fix up contiguous ( #1978 )
2023-10-05 07:22:05 -07:00
George Hotz
d52df788d3
remove RawConst and add test ( #1939 )
2023-09-29 01:21:51 -07:00
George Hotz
22b8576887
more lazy cleanup ( #1938 )
...
* small lazy cleanups
* a few more
* cleanups
* no more realizing in the scheduler test
* a few more minor things
* that was just wrong
* fix graph. the graph test was completely useless
* make graph usable
* fix op graph
2023-09-29 00:53:29 -07:00
Yixiang Gao
094d3d71be
with Tensor.train() ( #1935 )
...
* add with.train
* remove the rest TODOs
* fix pyflake
* fix pyflake error
* fix mypy
2023-09-28 18:02:31 -07:00
George Hotz
adab724caa
schedule2, keep the tests working with small changes ( #1932 )
...
* lazy cleanups
* ast functions take in LazyOps
* op instead of self.op
* _base for mops
* fix contiguous
* start schedule
* test_schedule
* fix openpilot
* more tests
* bugfix and test skip
* work
* make sure things get freed
* fix zerosized tensors
* fix failing test
* fix ceil and friends
* fix openpilot
* disable training
* disable test collectives
2023-09-28 09:14:43 -07:00