Adeeb Shihadeh
dca7fb0a49
qcom: make priority configurable ( #13120 )
2025-11-05 22:27:54 -08:00
nimlgen
442218266d
qcom: fix profiler ( #12778 )
...
* qcom: fix profiler
* this way
2025-10-19 01:27:59 +08:00
chenyu
0e266f376c
ops_gpu -> ops_cl ( #12103 )
2025-09-10 15:15:48 -04:00
nimlgen
fb96394ff5
auto-select available compilers ( #12094 )
...
* device: auto select compilers
* fix
* metal+opencl
* nv/cuda
* test without ptx
* ptx
* fix tests
* fix
* fix test
* rename
* test + cleaner
* xx
* ops
* better test
* win?
* um?
* types
* debug
* win??
* sep rung
* wtf?
* debug
* skip win
* revert this
* types
2025-09-10 19:52:01 +03:00
nimlgen
9a88bd841c
hcq: refactor into peer_groups ( #11277 )
...
* hcq: refactor into peer_groups
* fix fors
* fixes
* ooops
* mypy
* tiny fixes
2025-07-18 16:34:18 +03:00
nimlgen
cfb229473f
hcq: refactor buffer mapping ( #11271 )
...
* hcq: refactor buffer mapping
* fix
* fix mypy
2025-07-17 15:16:49 +03:00
nimlgen
a825608dc2
hcq: fix progs' __del__ when shutdown ( #10341 )
...
* debug ci
* better?
* and mute this?
* revrt that
2025-05-15 23:26:48 +03:00
nimlgen
5c7d004da5
hcq: refactor int ptrs to hcqbuffers ( #10105 )
...
* hcq: refactor int ptrs to hcqbuffers
* more refactors
* linter
* use in allocator
* test fiz
* fx
* ops
* final?
* simpler
* keep this for now
2025-04-30 00:12:18 +03:00
nimlgen
db51133537
rename HWInterface -> FileIOInterface ( #9989 )
...
* rename HWInterface -> FileIOInterface
* ugh
2025-04-22 22:18:57 +03:00
nimlgen
70db8c3003
hcq: dyn alloc signals ( #9238 )
...
* hcq: dyn alloc signals
* types and uniqueue devs
* typing
* mypy
* mypy one more time
* test
* make fds to not intersect in mockgpu between drivers
2025-02-25 17:22:24 +03:00
nimlgen
2f0856c1e2
qcom: use hwinterface ( #8565 )
...
* qcom: use hwinterface
* ops
* not needed anymore
2025-01-11 17:11:23 +03:00
nimlgen
5d37d33fc5
update typing.Optional to 3.10 for hcq ( #8479 )
2025-01-03 16:20:49 +03:00
nimlgen
a562ee2c6e
BumpAllocator rename start -> base ( #8415 )
2024-12-25 23:12:55 +03:00
chenyu
3f46425f1e
typos found by gemini [pr] ( #8400 )
...
not very effective... maybe due to tokenizer
2024-12-24 22:32:25 -05:00
chenyu
e63c7818dc
few type cleanups [pr] ( #8347 )
2024-12-20 01:56:01 -05:00
George Hotz
9c77e9f9b7
replace Tuple with tuple [pr] ( #8344 )
...
* replace Tuple with tuple [pr]
* replace List with list [pr]
* replace Dict with dict [pr]
* replace Set with set [pr]
2024-12-19 21:27:56 -08:00
nimlgen
3a7d64b96c
hcq remove update from args state ( #8104 )
...
* hcq remove update from args state
fix amd
ugh
qcom?
qcom ops
ops
qcom fix
qcom texture info
fx
qcom fix
qcom
qcom, sry
minor
works
* remove old code
* unrelated+sint
* qcom
* typing
* rm comments
2024-12-08 15:22:05 +03:00
nimlgen
d6e66095fd
hcq buffer is a class ( #8106 )
...
* hcq buffer is a class
* qcom
* no from_mv in qcom
* remove qcombuffer
* useless cast
* mypy
* qcom fix
* _md -> meta
2024-12-08 13:29:43 +03:00
nimlgen
d1282da7e8
hcq bump alloc ( #8078 )
...
* hcq bump alloc
* hm
* nv
* typo
2024-12-06 19:19:04 +03:00
nimlgen
c0240855b9
qcom has not transfer ( #8075 )
...
* qcom alloc is not hcq alloc
* maybe base?
* test
2024-12-06 14:45:01 +03:00
nimlgen
10f431b96d
hcq replace update with sint ( #7899 )
...
* try sym hcq
* start with amd
* move to nv
* nv works
* cache and qcom
* fixes
* signals
* fix nv
* qcom fixes
* linter
* linter
* cache + typings
* fixes
* tiny fixes
* linter
* linter
* lntr
* ugh
* comments
2024-11-29 20:08:13 +03:00
nimlgen
d3660ccc51
prereqs for hcq updates removal ( #7959 )
...
* hcq signals touch ups
* hcq compiled has device id
* helpers
* prreq hcq api
* oops
2024-11-29 18:20:07 +03:00
nimlgen
309dcb1044
hcq signal add sleep ( #7955 )
...
* hcqsignal sleep
* fixes
* typing
* time ms is int
2024-11-29 14:04:45 +03:00
nimlgen
84f96e48a1
hcq signal tiny refactor ( #7913 )
...
* hcq signal tiny refactor
* no mv
* fix
* fix2
* fix3
2024-11-26 21:48:38 +03:00
George Hotz
439911b2e6
disable disable_abstract_method [pr] ( #7815 )
2024-11-21 12:28:57 +08:00
George Hotz
c5d458ce02
BufferSpec and ProgramSpec [pr] ( #7814 )
...
* BufferSpec and ProgramSpec [pr]
* delete preallocate, it's unused
* Revert "delete preallocate, it's unused"
This reverts commit dcfcfaccde.
2024-11-21 12:18:05 +08:00
George Hotz
9df5a62c5e
unify to HWQueue [pr] ( #7812 )
...
* unify to HWCommandQueue [pr]
* all is HWQueue
2024-11-21 10:33:08 +08:00
George Hotz
0a74acd90e
add proper typing to HCQ [pr] ( #7803 )
...
* add proper typing to HCQ [pr]
* more types
* and qcom
* HCQProgram has device type
* typed allocator
2024-11-20 17:20:39 +08:00
George Hotz
6688539bc9
rename device to dev so Buffer can be Allocator [pr] ( #7799 )
...
* rename device to dev to Buffer can be Allocator [pr]
* missed those
* update the Program classes also
* more renames
* oops
2024-11-20 15:47:26 +08:00
George Hotz
d71fe7faa5
rename allocator methods to not conflict [pr] ( #7788 )
...
* rename allocator methods to not conflict [pr]
* forgot those
* transfer + offset
2024-11-20 00:10:29 +08:00
nimlgen
4d81b7952a
qcom match texture/sampler descriptors to OpenCL ( #7622 )
...
* qcom ioctl compare more regs
* bug fix
2024-11-11 21:56:51 +03:00
George Hotz
6bb230287b
pass the src into Metal [pr] ( #7518 )
...
* pass the src into Metal [pr]
* put that comment back
* keep old functionality
* move all to disassembler
* metal supports parallel beam
* touchups
* comment in correct place
2024-11-04 12:35:30 +08:00
nimlgen
4ed2c40d48
qcom a bit cleaner ( #7380 )
2024-10-29 23:50:28 +03:00
George Hotz
de7b9d7c42
improve pre-commit [pr] ( #7256 )
...
* improve pre-commit [pr]
* mypy passes on windows
2024-10-24 15:38:47 +08:00
nimlgen
21acfc39d4
qcom cleanup allocs ( #7200 )
...
* qcom cleanup allocs
* oops
2024-10-21 23:20:15 +03:00
nimlgen
942a17109a
qcom use QCOMBuffer for all allocated buffers ( #7023 )
...
* qcom use QCOMBuffer for all allocated buffers
* checks
2024-10-12 23:44:36 +03:00
nimlgen
f9d454aed5
correct kernargs alignment ( #6984 )
2024-10-11 00:06:28 +03:00
nimlgen
fad575ec76
qcom tiny cleanups ( #6973 )
2024-10-10 12:26:41 +03:00
nimlgen
42609300ff
hcq no timeline signals in init ( #6944 )
2024-10-07 23:36:19 +03:00
George Hotz
6b063450df
move hcq device to runtime [pr] ( #6879 )
...
* things that are only used in one place don't belong in helpers [pr]
* start moving hcq device [pr]
* fix paths
2024-10-04 22:26:50 +08:00
nimlgen
b95f47784a
qcom sleep when sync ( #6785 )
...
* qcom sleep when sync
* linter
* short
2024-09-27 19:14:10 +08:00
nimlgen
3c56aeee70
add Tensor.from_blob ( #6765 )
...
* draft tensor from pointer init
* some docs and types
* comment
* cleaner
* test
* malloc
* qcom cl interop
* jit example
* cleaner
* dealoc
* wording
* docs
2024-09-26 18:33:19 +08:00
nimlgen
e31552e2e0
qcom reinit queue on exec ( #6728 )
...
* qcom setup on exec as gpu=1
* linter
* gpulike
* offsets
2024-09-25 16:08:50 +08:00
nimlgen
e1caa24a92
qcom fix binded queue might be overwritten ( #6712 )
2024-09-25 12:45:23 +08:00
nimlgen
75b7627db7
qcom do not recreate memoryviews on updates ( #6701 )
2024-09-24 15:36:22 +08:00
nimlgen
d3ed50c769
fix typo in 'Too many resources requested for launch' ( #6705 )
2024-09-24 15:33:01 +08:00
nimlgen
94cbb1cd32
qcom image copyout ( #6667 )
...
* qcom copyout
* copyin
* linter
* fix
* linter
* myoy
2024-09-23 16:11:43 +08:00
nimlgen
8a9195d86e
qcom texs refactor ( #6613 )
...
* qcom texs refactor
* fix
* linter
* qcombuf
* linter
2024-09-23 09:03:17 +08:00
nimlgen
21f2d79461
qcom match gpu impl for reg a6xx_sp_cs_unknown_a9b1 ( #6631 )
2024-09-20 18:14:00 +08:00
Comma Device
76aa6416d7
qcom: add disassembler with DEBUG >= 5
2024-09-20 07:04:28 +00:00