George Hotz
32e9949052
rename lazydata to uop ( #10698 )
2025-06-08 08:42:22 -07:00
George Hotz
411392dfb7
move files into uop dir ( #10399 )
...
* move files into uop dir [pr]
* tinygrad.uop is a thing
* fix uop docs, no pr
* fix viz
2025-05-18 11:38:28 -07:00
Elnur Rakhmatullin
de2b323d97
Fixed a typo in "simplify" ( #10358 )
2025-05-16 14:45:14 -07:00
qazal
0bee225a58
Tensor.kernelize docs ( #9946 )
...
* Tensor.kernelize docs
* syntax
* test_kernelize_bw
* Tensor.kernelize docstring
* pruning
* tiny details
* details 2
* becomes_map terminology
* more changes to becomes
2025-04-21 16:34:03 +08:00
nimlgen
23b67f532c
amd: minor comments and readme updates ( #9865 )
2025-04-12 23:24:05 +03:00
uuuvn
c631c72f22
HCQ: Increment timeline signal before submitting ( #9550 )
...
`AMDComputeQueue.__del__` frees `hw_page` which is safe because
`AMDAllocator._free` does `self.dev.synchronize()` which is supposed
to wait for execution of IB to finish, however that doesn't happen if
AMDComputeQueue is dropped right after submit before timeline signal is
incremented, which it is in most places leading to a race if .bind() is
also used (required for multi-xcc because bug in mec fw treats all
PACKET3_PRED_EXECs outside IBs as if they had EXEC_COUNT of zero).
2025-03-23 18:30:38 +07:00
chenyu
2e7c2780a9
CLANG -> CPU ( #9189 )
2025-02-20 18:03:09 -05:00
George Hotz
a3c78d47b3
speed docs + upgrades [pr] ( #8964 )
...
* add some docs about speed [pr]
* better torch gemm
* enable locals on llvm/clang
* disable locals for beam speed on LLVM/CLANG
* 0x20 alignment in llvm allows ymm use
2025-02-08 17:28:52 +08:00
George Hotz
a6e496b195
remove Function class [pr] ( #8753 )
...
* remove Function class [pr]
* actually remove function
* fix docs
2025-01-26 18:58:02 +09:00
nimlgen
6733a3a96b
am: fix typo ( #8700 )
2025-01-21 14:35:15 +03:00
nimlgen
b3efeeb717
docs: start am docs ( #8638 )
...
* docs: init am docs
* missing
2025-01-16 00:22:35 +03:00
uuuvn
5ffc50d58c
Clang JIT ( #8481 )
...
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com >
2025-01-03 11:12:55 -05:00
George Hotz
803a47494e
Revert "Clang JIT ( #8312 )" ( #8452 )
...
This reverts commit b6266c8e41 .
2024-12-30 17:49:35 -05:00
uuuvn
b6266c8e41
Clang JIT ( #8312 )
...
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com >
2024-12-30 17:37:53 -05:00
George Hotz
8a04a3a77a
rename LazyBuffer -> UOp [pr] ( #8169 )
...
* rename LazyBuffer -> UOp [pr]
* fix docs
2024-12-11 16:15:52 -08:00
nimlgen
10f431b96d
hcq replace update with sint ( #7899 )
...
* try sym hcq
* start with amd
* move to nv
* nv works
* cache and qcom
* fixes
* signals
* fix nv
* qcom fixes
* linter
* linter
* cache + typings
* fixes
* tiny fixes
* linter
* linter
* lntr
* ugh
* comments
2024-11-29 20:08:13 +03:00
George Hotz
9df5a62c5e
unify to HWQueue [pr] ( #7812 )
...
* unify to HWCommandQueue [pr]
* all is HWQueue
2024-11-21 10:33:08 +08:00
George Hotz
c8bf09b7d4
s/UOps/Ops ( #7500 )
...
* s/UOps/Ops [pr]
* fix
2024-11-03 11:26:10 +08:00
George Hotz
4df5c7a4ef
move lazy to engine [pr] ( #6886 )
...
* move lazy to engine [pr]
* engine.lazy
2024-10-04 23:19:26 +08:00
George Hotz
6b063450df
move hcq device to runtime [pr] ( #6879 )
...
* things that are only used in one place don't belong in helpers [pr]
* start moving hcq device [pr]
* fix paths
2024-10-04 22:26:50 +08:00
nimlgen
9b616cb33e
HCQArgsState lifetime docs ( #6323 )
2024-08-30 00:31:49 +03:00
qazal
8c50ef8b7c
start uop docs ( #6291 )
...
* start uop docs
* only need show_labels
* sink comes first
* hotfix: invalid
* touchups
* 2 space indent works
* limit some buffer uops
* better BARRIER doc, Op -> UOp when it makes sense.
* make KernelInfo optional
* more work
relative links don't work
* this can be local in multi reduce+pads
* add UOps.SHAPETRACKER details
* UOps.CONST both types
* nit: local buffer isn't device Buffer, habit
* nit2: dtype -> DType
2024-08-29 15:22:39 +03:00
nimlgen
fa84e6ec48
init hcq args state ( #6046 )
...
* init hcq args state
* cleaner
* amd
* fillargs
* fixes
* myoy
* docs
* fix
* not needed
* spacing
2024-08-13 17:11:58 +03:00
chenyu
d82370f6ef
docs: fix broken links and update is_floating_point ( #6023 )
...
* docs: fix broken links and update is_floating_point
broken links would only show as INFO and not an error.
* make doc andhors warn
2024-08-10 15:58:48 -04:00
nimlgen
71e1472290
hcq more types ( #5791 )
...
* mhcq more types
* linter
* pylint
* docs: bind
2024-07-29 18:03:23 +03:00
nimlgen
fff19b961b
docs: user runtime docs ( #5756 )
2024-07-27 23:21:54 +03:00