George Hotz
|
36918a7d7a
|
cleanups
|
2025-10-27 18:29:17 +08:00 |
|
George Hotz
|
c5a420ed7e
|
optional forced_reshape
|
2025-10-27 18:22:11 +08:00 |
|
George Hotz
|
f55056af48
|
viz always
|
2025-10-27 18:19:13 +08:00 |
|
George Hotz
|
0b81d9b0d6
|
pyrender
|
2025-10-27 18:04:29 +08:00 |
|
George Hotz
|
ab7d8c24be
|
Merge branch 'master' into real_pyrender
|
2025-10-27 17:39:37 +08:00 |
|
George Hotz
|
533f81bfc8
|
kernelize
|
2025-10-27 17:38:30 +08:00 |
|
George Hotz
|
701a632907
|
move VECTORIZE/CONST (#12942)
|
2025-10-27 17:37:13 +08:00 |
|
nimlgen
|
95748a4518
|
nv: map vram after resets (#12938)
|
2025-10-27 17:17:07 +08:00 |
|
George Hotz
|
8084aa06b2
|
Merge branch 'master' into real_pyrender
|
2025-10-27 17:08:07 +08:00 |
|
George Hotz
|
8fb545c475
|
don't late simplify on marg (#12941)
|
2025-10-27 17:07:41 +08:00 |
|
George Hotz
|
1a3498e6dc
|
v const
|
2025-10-27 17:07:32 +08:00 |
|
George Hotz
|
8f9b09b4f6
|
.sintify
|
2025-10-27 17:00:02 +08:00 |
|
George Hotz
|
8743a21d3f
|
Merge branch 'master' into real_pyrender
|
2025-10-27 16:57:19 +08:00 |
|
George Hotz
|
7139e036c5
|
bugfixes from pyrender (#12940)
|
2025-10-27 16:56:53 +08:00 |
|
George Hotz
|
0b85d31a68
|
Merge branch 'master' into real_pyrender
|
2025-10-27 16:53:54 +08:00 |
|
George Hotz
|
804133cffd
|
rename RECIP to RECIPROCAL (#12939)
|
2025-10-27 16:53:13 +08:00 |
|
George Hotz
|
4c63cf3914
|
work
|
2025-10-27 16:40:50 +08:00 |
|
nimlgen
|
f4da94af28
|
system: reset is a method of pcidevice (#12936)
|
2025-10-27 16:21:10 +08:00 |
|
George Hotz
|
97a4296d8b
|
work
|
2025-10-27 16:04:19 +08:00 |
|
George Hotz
|
b987b8b22a
|
work
|
2025-10-27 15:29:26 +08:00 |
|
George Hotz
|
af3211f73c
|
test pyrender
|
2025-10-27 14:47:35 +08:00 |
|
wozeparrot
|
6b54378eba
|
working kitten matmul (#12935)
|
2025-10-26 23:40:49 -07:00 |
|
George Hotz
|
dc6d667941
|
pyrender always works with SPEC=3
|
2025-10-27 14:18:02 +08:00 |
|
qazal
|
189582db5e
|
viz graph drawing cleanups (#12933)
* viz: make node label dims optional
* inplace edge updates
* change that
|
2025-10-27 13:59:32 +08:00 |
|
qazal
|
70ba84eb04
|
viz: generic node label centering (#12925)
* viz: correct node label centering
* matches
* overlay
* the other way
|
2025-10-27 12:02:34 +08:00 |
|
Sieds Lykles
|
eaeaea2f9c
|
pyrender Ops.SPECIAL and use correct dtype for Ops.RANGE rendering (#12931)
|
2025-10-27 03:21:34 +01:00 |
|
nimlgen
|
8c1368cab6
|
system: class PCIBarInfo (#12930)
* system: class PCIBarInfo
* fix
|
2025-10-27 03:57:42 +08:00 |
|
nimlgen
|
f00009c731
|
hcq: drivers take pcidev (#12929)
* hcq: drivers take pcidev
* fix nv
|
2025-10-26 20:43:51 +08:00 |
|
ttomsa
|
99a519f068
|
linearizer cleanup (#12923)
* cleanup
* comments
* also this
|
2025-10-26 18:30:12 +08:00 |
|
George Hotz
|
c0c24d3a70
|
cleanup wmma (#12927)
* cleanup wmma
* fix test_ops failures on android
|
2025-10-26 18:26:47 +08:00 |
|
George Hotz
|
0a32ab0006
|
nitpicks from typecheckers (#12926)
* nitpicks from the typechecker
* more
|
2025-10-26 17:52:55 +08:00 |
|
George Hotz
|
db5c918215
|
source extra/cl_android.sh to fix opencl on android
|
2025-10-26 15:27:51 +08:00 |
|
qazal
|
c94e597b3e
|
viz ui selector cleanups (#12924)
|
2025-10-26 14:40:47 +08:00 |
|
chenyu
|
94701d4838
|
clean up divide_exact order [pr] (#12919)
do the const first since ADD can also call into that
|
2025-10-25 18:47:57 -04:00 |
|
chenyu
|
e18922f111
|
limit AND const min max to ints [pr] (#12918)
|
2025-10-25 16:07:52 -04:00 |
|
nimlgen
|
92324172be
|
amd: refactor usb into usbdevice (#12916)
* amd: refactor usb into usbdevice
* nu
* my bad
* ops
* my bad
|
2025-10-26 01:00:19 +08:00 |
|
qazal
|
3b192f5eac
|
split viz graph rendering from dag layout (#12914)
|
2025-10-25 15:36:44 +08:00 |
|
George Hotz
|
6415e3e8a7
|
use Ops.GROUP instead of Ops.NOOP for merging stores (#12912)
* use Ops.GROUP instead of Ops.NOOP for merging stores
* fs noop
|
2025-10-25 12:26:12 +08:00 |
|
George Hotz
|
b4f6a2c7a3
|
add kernel spec (#12911)
* add kernel spec
* fix kernel spec
|
2025-10-25 11:49:20 +08:00 |
|
George Hotz
|
8a941d95a4
|
SPEC=2 is full spec, SPEC=1 is default (#12910)
* SPEC=1 passes all tests
* just use SPEC, not __debug__
|
2025-10-25 11:10:43 +08:00 |
|
wozeparrot
|
456560c1ff
|
stateless tinyfs copyin (#12908)
|
2025-10-24 19:18:38 -07:00 |
|
wozeparrot
|
a5b0f57067
|
clean: cleanup tinyfs copyout (#12907)
|
2025-10-24 18:32:55 -07:00 |
|
chenyu
|
4b7329001d
|
clean up test_avg_pool3d (#12905)
|
2025-10-24 14:31:36 -04:00 |
|
George Hotz
|
6b35467f53
|
stores don't end ranges (#12902)
* early endrange
* bugfixes
|
2025-10-24 23:05:03 +08:00 |
|
nimlgen
|
5b5ba31a86
|
amd: make sqtt bufs uc (#12898)
|
2025-10-24 18:55:14 +08:00 |
|
Sieds Lykles
|
e1f8c82938
|
Onnx Layer/Group/RMS/Batch-Norm ReduceL2 fp32 intermediates for fp16 (#12109)
* match onnx spec
* use least_upper_dtype
* promote the square
* just cast before the square
|
2025-10-24 12:26:11 +02:00 |
|
George Hotz
|
0bde87d8d7
|
cleanups from flash attention branch (#12897)
|
2025-10-24 14:14:56 +08:00 |
|
wozeparrot
|
9dac505565
|
variable bs keccak (#10731)
|
2025-10-23 14:10:21 -07:00 |
|
chenyu
|
154b4f9f40
|
test FUSE_OPTIM=1 test/test_optim.py (#12895)
|
2025-10-23 15:54:27 -04:00 |
|
chenyu
|
6e4ee8deea
|
small heuristic cleanup [pr] (#12892)
|
2025-10-23 10:50:15 -04:00 |
|