nimlgen
7c81ab94a7
not weak
2026-04-28 18:43:23 +00:00
nimlgen
f8626197ce
remove
2026-04-28 17:37:29 +00:00
nimlgen
39993b644d
x
2026-04-28 17:28:38 +00:00
nimlgen
e3d2f7e32e
remove
2026-04-28 17:19:31 +00:00
nimlgen
128408339b
ugh
2026-04-28 16:18:17 +00:00
nimlgen
37ea7deab0
fix
2026-04-28 15:18:33 +00:00
nimlgen
7ce23b9f0d
slighly rename
2026-04-28 13:00:22 +00:00
nimlgen
abab355291
x
2026-04-28 12:47:53 +00:00
nimlgen
d14e1ae351
better
2026-04-28 11:43:01 +00:00
nimlgen
718b36dbd3
local optimize as rewrite
2026-04-28 10:51:35 +00:00
wozeparrot
5e861cd2c4
llama: move llama kernels to llama_kernels ( #15952 )
2026-04-27 22:48:53 -07:00
Christopher Milan
987b6dd193
python -m tinygrad.device prints interface info ( #15950 )
2026-04-27 22:15:38 -04:00
qazal
54f00e1013
sqtt: correct rdna4 structs ( #15948 )
2026-04-28 07:35:50 +09:00
Charlie Kerfoot
890d7be0c3
fix: muon not using device ( #15936 )
2026-04-27 14:56:48 -07:00
qazal
c58fd85a99
sqtt: add needs_rocprof decorator ( #15947 )
...
* sqtt: add needs_rocprof decorator
* version string
2026-04-28 06:22:50 +09:00
Christopher Milan
3f508810d8
cpu: lowercase arch ( #15943 )
2026-04-27 17:05:25 -04:00
chenyu
77f9125c21
move Tensor.pad to OpMixin ( #15946 )
2026-04-27 16:56:04 -04:00
nimlgen
4164666c72
programinfo ( #15942 )
...
* programinfo
* fix
* m
* x
* x
* changes
* x
* fix
* rm
2026-04-27 23:12:03 +03:00
chenyu
fe38d6de94
_pad_circular and _pad_reflect_replicate to mixin ( #15944 )
2026-04-27 16:07:05 -04:00
qazal
8c174bdad4
viz/sqtt: correct exec pipes ( #15885 )
...
* wmma
* p2
* test
* left
* work
* pickle
* handwritten failing tests
* start work
* test the pipes
* empirical evidence
* update rdna4 enum types
* VALU pipe 1
* TRANSCENDENTAL pipe
* transcendental function units
* reorder
* wmma pipe
* cleanup and notes
* smaller
* work
* diff cleanup
* pickle
* use se:1
* int
2026-04-28 05:05:49 +09:00
qazal
eeb8d5eb0c
viz: small ui changes ( #15940 )
...
* rename colors
* keep ctrl c
2026-04-27 04:00:13 +09:00
nimlgen
96165ff0d1
validate_with_cpu as rewrite ( #15938 )
...
* validate_with_cpu as rewrite
* compil
* x
* linter
* moved
* fix
2026-04-26 19:58:53 +03:00
nimlgen
117e9e22dd
estimates from graph ( #15937 )
...
* estimates from graph
* test
* x
2026-04-26 18:22:53 +03:00
chenyu
e9983e3516
remove unused QCOMTextureInfo, QueueType [pr] ( #15935 )
2026-04-25 14:32:31 -04:00
nimlgen
ac3494a7cc
remove some runners ( #15934 )
...
* remove runners
* mypy
2026-04-25 21:27:05 +03:00
nimlgen
bb652352c7
remove execitem ( #15932 )
...
* remove execitem
* f
* x
2026-04-25 19:33:04 +03:00
chenyu
e27444a0ff
remove unused UOp.shard_size [pr] ( #15933 )
2026-04-25 12:27:58 -04:00
nimlgen
e0ff6cc15c
remove old schedule ( #15930 )
...
* remove old schedule
* tests
* r
* x
2026-04-25 16:46:36 +03:00
qazal
9a23de7d27
viz/cli: unify profile and rewrites, -s ALL default ( #15931 )
...
* work
* workg
* better
* cleanup
* better defaults
* --ls
* better
* work
* update llama
* update
2026-04-25 22:31:24 +09:00
nimlgen
768106a542
remove schedule from extra/docs/examples ( #15929 )
...
* remove schedule from extra/docs/examples
* f
2026-04-25 14:09:12 +03:00
nimlgen
a5e9ea7a60
remove schedule batch 4 ( #15927 )
...
* remove schedule batch 4
* fini
2026-04-25 12:36:55 +03:00
nimlgen
d2ab6ea7a6
remove schedule batch 3 ( #15924 )
...
* remove shcedule batch 3
* batch 6
* batch 7
2026-04-25 11:53:16 +03:00
nimlgen
3c8a2db870
remove schedule() from tests batch 2 ( #15923 )
...
* remove schedule() from tests batch 2
* batch 4
2026-04-25 10:44:41 +03:00
Denys Melnyk
1fdcb13bfb
webgpu: fix weight lookup in export_model after compile_net key change ( #15919 )
...
* fix lookup site in export_model_webgpu after refactoring
webgpu (sd): fix export_model weight lookup after compile_net changes
fix lookup site in export_model_webgpu after refactoring
* add regression test
2026-04-25 10:04:55 +03:00
Christopher Milan
8b2826ef16
nv: fix shader local memory for NAK ( #15921 )
2026-04-25 01:03:11 -04:00
Christopher Milan
57fbaa3d49
amd: fallback to llvm when comgr is not available ( #15914 )
2026-04-24 23:30:16 -04:00
wozeparrot
4b908b6e2c
llama: fused ce loss ( #15920 )
2026-04-24 20:01:24 -07:00
nimlgen
d3378010ee
schedule() -> schedule_linear() in tests (batch 1) ( #15915 )
...
* schedule_with_vars -> linear_with_vars in tests
* tests batch 1
* batch 2
* estimate_uop
* simpler
* rm
2026-04-24 23:40:53 +03:00
chenyu
b501ba3e42
nll_loss to mixin ( #15918 )
2026-04-24 15:50:31 -04:00
chenyu
2f9fdb4a37
scatter to mixin ( #15917 )
2026-04-24 15:37:37 -04:00
nimlgen
f2751955cb
remove linear_to_schedule from tests ( #15912 )
...
* remove linear_to_schedule from tests
* x
2026-04-24 20:02:10 +03:00
nimlgen
56a9f1e3ff
remove last jit_cahce ( #15911 )
...
* remove last jit_cahce
* linter
2026-04-24 19:44:52 +03:00
chenyu
03a7604f76
sort argsort topk allclose to mixin ( #15910 )
2026-04-24 10:20:46 -04:00
nimlgen
4010aa4044
jit: no jit_cache in graphrunner ( #15907 )
...
* jit: no jit_cache in graphrunner
* m
2026-04-24 16:34:26 +03:00
chenyu
7a1adfd2aa
update Tensor.allclose to return Tensor ( #15904 )
...
matches jax
2026-04-24 08:27:17 -04:00
Eitan Turok
48d7ab2695
no uv.lock ( #15893 )
2026-04-24 20:07:07 +08:00
qazal
5eb641395a
viz/cli: select kernel events in -s DEV ( #15909 )
...
* simple test
* pass
2026-04-24 21:03:34 +09:00
nimlgen
c0f77c2e1c
hcq graph to linear ( #15888 )
...
* hcq
* f
* f
* linter
2026-04-24 12:42:49 +03:00
Christopher Milan
cbf4946ea6
usb: multiple gpus and better error messages ( #15900 )
2026-04-24 01:57:19 -04:00
wozeparrot
9d134a2848
llama: fix fakedata timing ( #15905 )
2026-04-23 21:37:03 -07:00