George Hotz
744af193f0
remove ScheduleItem and merge it with ExecItem ( #13759 )
...
* remove ExecItem and merge it with ScheduleItem
* less diff
* fix issues
* min diff
* don't change bufs in _lower
* min diff
* update
* revert
* fixes
* diff
2025-12-19 17:04:24 -04:00
George Hotz
3dbde178c1
mark slow tests as slow instead of as CI ( #13736 )
...
* mark slow tests as slow instead of as CI
* CI shouldn't have different behavior
* more skips / CI
* slow
2025-12-17 10:29:57 -04:00
George Hotz
6bd355fa26
add needs_second_gpu decorator ( #13543 )
...
* add needs_second_gpu decorator
* more skips
* two more fixes
2025-12-02 19:08:23 -08:00
nimlgen
da0b955be4
hcq: cpu can be graphed ( #11474 )
...
* hcq: cpu can be graphed
* ops
* new jit decisions
* fix test
* fix remote
* cleaner
* fix
2025-08-02 21:01:19 +03:00
George Hotz
32e9949052
rename lazydata to uop ( #10698 )
2025-06-08 08:42:22 -07:00
uuuvn
dba073e5c0
Less messy broken graph on paravirtualized metal workaround ( #10182 )
...
* Less messy broken graph on paravirtualized metal workaround
GitHub CI macOS runners use paravirtualized metal which is broken with
graph (some comments say that ICB in particular is broken but in my
testing it was fine sometimes, but other times hitting an assert inside
metal's code related to resouces, so not sure).
> Assertion failed: (resource != nil), function -[IOGPUMetalResource initWithResource:], file IOGPUMetalResource.m, line 458.
This can be reproduced locally with any virtualization software (like utm)
that can create macOS VMs with apple's own virtualization framework.
* unused import
2025-05-06 20:41:02 +03:00
quortus
9e49721c47
CPUGraph support for clang ( #10014 )
...
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com >
2025-04-24 07:52:35 -04:00
nimlgen
fa888ee077
minor test cleanups ( #9770 )
...
* fix test_graph on max
* pcie5
2025-04-07 15:29:12 +03:00
nimlgen
e88a640ca5
fix _access_resources for offset buffers ( #9580 )
...
* fix _access_resources for offset buffers
* test
2025-03-26 18:42:43 +07:00
chenyu
aaf0a8069f
xor -> bitwise_xor ( #9264 )
2025-02-26 10:21:14 -05:00
qazal
866dfa1f23
create_schedule([x.lazydata]) -> x.schedule() in tests ( #8449 )
2024-12-31 03:15:52 +08:00
qazal
9828277c03
view doesn't have buffer, fix the tests [pr] ( #7841 )
...
* view doesn't have buffer, fix the tests [pr]
* need assigns
2024-11-22 20:41:55 +08:00
qazal
aeeb917b6e
mask out writable bufs in runtime access_resources ( #7234 )
2024-10-23 16:13:50 +03:00
nimlgen
16405b973a
fix hcq sync ( #5062 )
...
* fix hcq sync
* rewrite
* linter + comment
* fix profiler
* no default dict
* correct sync of unjitted transfer
* fix test
2024-06-26 17:50:37 +03:00
nimlgen
fd27f19e92
graph tests ( #5153 )
...
* graph tests
* add test
* cleanup
2024-06-26 16:31:20 +03:00
George Hotz
8919ca8163
test cleanups
2023-03-03 06:36:06 -08:00
George Hotz
bfcec234a2
Refactor ASTs ( #622 )
...
* ugh worst branch name
* compiler refactor continues
* scc -> cloc
* buf -> _buf
* finish _buf, and program -> runtime
* gpu is still working, clang isn't
* clang in new style
* ops_metal
* something broke it
* improve metal
* clean up tons of cl crap
* hack fix sync
* cleaner gpu
* gpu metal clang
* cleanups
* minor refactor
* GPUCodegen
* fix up LLVM
* blind CUDA refactor
* codegen / runtime
* keep ops naming
* linter passes
* woah, llvm was allocing 4x what it needed to
* bugfixes
* fix openpilot compiler
* fix compile_efficientnet
* method cache should fix tests
* deal with duped functions
2023-03-01 18:57:29 -08:00
Martin Loretz
7e9a5e3f31
Refactor graph ( #560 )
...
* Refactor graph
* Add graph tests
* Use CPUBuffer for graph tests
* Remove the use of GlobalCounters
2023-02-19 10:41:30 -08:00