b1tg
a78fcc55a4
amd tc 1616128 (#13439)
...
* amd tc 1616128
* fix test
* remove hardcoded check in test
2026-01-02 09:01:05 -05:00
George Hotz
2af2b4da5d
Revert "rewrites for renderer and compiler (#13646)" (#13806)
...
This reverts commit 339dadf056.
2025-12-22 19:21:33 -05:00
George Hotz
339dadf056
rewrites for renderer and compiler (#13646)
...
* rewrites for renderer and compiler
* full_rewrite_to_program
* fix pre-commit
* compiler passed into get_program
* no pkl compiler
* lib on program spec
* fix spec
* fix test
* no device
* compiler_device
* nm
* fix nir
* fix
* simplest
* fix tests
* revert
2025-12-22 18:58:43 -05:00
George Hotz
fa40df972f
fix tests for NV (#13744)
...
* small fix
* min diff
* bfloat16 out
2025-12-18 13:20:21 -04:00
George Hotz
3dbde178c1
mark slow tests as slow instead of as CI (#13736)
...
* mark slow tests as slow instead of as CI
* CI shouldn't have different behavior
* more skips / CI
* slow
2025-12-17 10:29:57 -04:00
qazal
366badaa68
require renderer argument in get_program, removes device opening in process replay [pr] (#13524)
2025-12-03 02:05:31 +08:00
Sieds Lykles
e22c5e7e73
process_replay uses opts argument for KernelInfo.opts_to_apply (#12946)
...
* opts_to_apply is opts
* skip beamed kernels
* simpler change
* fix the tensor cores tests for process replay
* use opts
2025-10-28 09:00:28 +01:00
George Hotz
1d1e1d9d88
delete the ShapeTracker (#12720)
...
* delete the ShapeTracker
* fix tests
* fix more
* fix gc test
2025-10-16 15:36:22 +08:00
chenyu
d25ceffe8d
update padto opts tests (#12679)
2025-10-14 17:00:42 -04:00
chenyu
ae51bdd06a
remove trivial use of RANGEIFY flag (#12550)
...
some tests need update still
2025-10-09 02:29:38 -04:00
qazal
b6835f4134
remove Ops.VIEW and related UOp methods (#12522)
...
* remove Ops.VIEW and related UOp methods
* update abstractions2.py
* no ShapeTrackers in abstractions2.py
* it's a size 1
2025-10-08 14:47:02 +03:00
Sieds Lykles
5a4b244e6b
Check for group inside another reduce (#12268)
...
* add check
* get the ranges correctly
* add test
* comment and better check
2025-09-23 00:32:41 +02:00
chenyu
0e266f376c
ops_gpu -> ops_cl (#12103)
2025-09-10 15:15:48 -04:00
George Hotz
9789337722
early reduce simplify (#12046)
...
* early reduce simplify
* min changes
* need that
* that goes in simplify
* no more arange reduce opt
2025-09-10 21:02:46 +08:00
nimlgen
1c6c42715f
unify cpu and llvm (#11982)
...
* try unify cpu and llvm
* fixes
* fix
* ops
* no llvm
* fix
* rm
* lvmm is ot
* oops
* override
* no llvm
* ignore
* skip llvm
* ooops
2025-09-09 13:54:44 +03:00
nimlgen
10ac427aaa
cpu threading (#11951)
...
* start cpu threading
* fix
* fix2
* fix
* hacks?
* threads
* minor
* no dsp
* dsp 2
* n
* more
* test
* xm
* cleaner
* readable
* f
* reorder
* when no threads
* rangeify
* typos
* not needed
* reapply
* remoev this
* linter
* fixed cpu count in ci
* fix
* fixes
* rm
* typo
* sort based on speed
* test if test works in ci
* Revert "test if test works in ci"
This reverts commit 1f05edb531.
* do not pad thread
2025-09-06 16:13:43 +03:00
George Hotz
12c7b1bb01
cleanup lin tests without Kernel (#12041)
...
* cleanup lin tests without Kernel
* no kernel.py there
* remove that test
2025-09-05 15:13:14 -07:00
George Hotz
30eb42a69e
fix POSTOPT pad (#11999)
...
* fix POSTOPT=1
* fix some tests
* Revert "fix some tests"
This reverts commit 8ee058e206.
* fix padding restrictions
* cuda has two tensor cores
* Set POSTOPT ContextVar to 0 in helpers.py
2025-09-04 14:28:58 -07:00
chenyu
dc8501af30
clean up wino tests (#12008)
...
removed the one that tests hcopt and added one for backward kernel counts
2025-09-04 16:14:55 -04:00
George Hotz
560df206cc
split tc test (#12003)
...
* split tc test
* split hand coded opts
* remove some skipped tests
* skips on emulated
2025-09-04 11:47:56 -07:00
George Hotz
09106e4aae
refactor and split test_linearizer (#12001)
...
* refactor and split test_linearizer
* forget that file
* imports
* remove from docs
* test gen float4
2025-09-04 10:53:07 -07:00