nimlgen
cc3c1e4c14
hcq: move cpu to hcq ( #11262 )
...
* hcq: move cpu to hcq
* import time
* upd
* fix
* windows support
* hm
* cleaner
* fix timer
* fix timing
* std is ns
* skip profiler
* mypy
* cleaner
* cleanups
* after merge
* default is back
2025-07-21 15:10:38 +03:00
qazal
e68af3b336
disable flaky assert in test_cpu_profile ( #11270 )
2025-07-17 06:50:39 +03:00
nimlgen
fb278c6a02
do not recreate Compiled.profile_events in helper_collect_profile ( #11171 )
2025-07-10 23:55:12 +03:00
qazal
bde80c0cdf
record GraphEvents in metal graph ( #11145 )
...
* record GraphEvents in metal graph
* add TestProfiler.test_graph, revert old stuff
* move profile capture to MetalGraph
* comment
* don't double record graph command buffers
* wait_check
* explicit delete
2025-07-10 21:32:06 +03:00
qazal
3dfc0ff887
move cpu_profile and shared ProfileEvents from device.py to helpers [pr] ( #11126 )
...
* move cpu_profile and shared ProfileEvents to helpers [pr]
* TestProfiler.test_cpu_profile
* update test_viz.py
* TestProfiler.test_profile_multiops ordering, it's different streams now
2025-07-08 12:14:03 +03:00
qazal
988540f401
support capturing cpu_profile on error ( #11078 )
...
* support capturing cpu_profile on error
* spacing
* pylint complains
2025-07-04 11:53:12 +03:00
George Hotz
32e9949052
rename lazydata to uop ( #10698 )
2025-06-08 08:42:22 -07:00
nimlgen
5afb0a4a81
metal: fix transfer profiling ( #8659 )
2025-01-17 23:47:01 +03:00
George Hotz
b71c51191b
tests from remove uop mutability [pr] ( #8442 )
...
* tests from remove uop mutability [pr]
* more test fix
* simpler test fix
* remove that
2024-12-29 12:14:10 -05:00
nimlgen
777d2aec05
metal profiler + cpu_profile ( #8291 )
...
* metal + cpu_profile
* gpt example
* linter + revert gpt2 for now
* a bit of readme
* linter
* unrelated
* tests
* linter
* b
2024-12-18 00:06:56 +03:00
nimlgen
af87e4b53c
viz profiler ( #8287 )
...
* only hcq
* fix get_metadata
* linter
* oops
* tiny
* linter
* time
* print pm
* hmm
* nits
2024-12-17 20:00:53 +03:00
George Hotz
e9ae2ccd09
_prg to match _buf [pr] ( #7816 )
2024-11-21 12:44:48 +08:00
George Hotz
c5d458ce02
BufferSpec and ProgramSpec [pr] ( #7814 )
...
* BufferSpec and ProgramSpec [pr]
* delete preallocate, it's unused
* Revert "delete preallocate, it's unused"
This reverts commit dcfcfaccde .
2024-11-21 12:18:05 +08:00
George Hotz
eb0bb7dc0b
final dname to device [pr] ( #7806 )
...
* final dname to device [pr]
* oops, fix nv
2024-11-20 20:20:28 +08:00
George Hotz
85a45164fb
remove pyint [pr] ( #7016 )
...
* remove pyint
* bump time on tp [pr]
* dont truncate in const fold
* remove dead code
* Revert "dont truncate in const fold"
This reverts commit 29c81db0f7 .
* remove define_var
2024-10-12 22:36:24 +08:00
George Hotz
6b063450df
move hcq device to runtime [pr] ( #6879 )
...
* things that are only used in one place don't belong in helpers [pr]
* start moving hcq device [pr]
* fix paths
2024-10-04 22:26:50 +08:00
George Hotz
cdff1d75b6
things that are only used in one place don't belong in helpers [pr] ( #6878 )
...
* things that are only used in one place don't belong in helpers [pr]
* pretty print moved
2024-10-04 17:27:38 +08:00
chenyu
62f9f273f7
increase test_profile_multidev_transfer threshold ( #6370 )
...
flaky, bumpped to 16000 for CI
2024-09-05 05:49:32 -04:00
nimlgen
78c94abe9c
raise time limit for ci in test_profile_multidev_transfer ( #6227 )
2024-08-21 22:42:03 +03:00
nimlgen
38d5eecc68
hcq profiler support args ( #5989 )
...
* hcq profiler support args
* bytes -> _bytes
* fix
* add test
* mypy
* not f strings
* percison
2024-08-09 00:18:36 +03:00
nimlgen
183c4c91a3
fix non-jitted transfers in profile ( #5980 )
...
* fix transfers in profile
* fix linter
* sync to be sure everythin is recorded
2024-08-08 17:58:08 +03:00