chenyu
db6b3e1edc
fix mixed setitem with both basic and tensor indexing ( #15050 )
2026-02-27 15:35:48 -05:00
chenyu
c9f6d8751b
don't remove_bufferize for Invalid ( #15053 )
...
* don't remove_bufferize for Invalid
* replaced
2026-02-27 15:16:09 -05:00
qazal
b8a55d5f68
sqtt: new packet types, add discovery script ( #14960 )
2026-02-28 04:27:27 +09:00
nimlgen
4e12fc3fe6
am: mi3xx recovery ( #15051 )
2026-02-27 22:10:47 +03:00
chenyu
81a35cef38
rearrange Tensor.getitem code ( #15049 )
...
no-op change to prepare setitem fix
2026-02-27 12:57:16 -05:00
chenyu
1406d49eef
failed test cases for advanced setitem ( #15048 )
2026-02-27 10:50:18 -05:00
qazal
ef1017f7ed
viz: skip drawing offscreen tracks in profiler ( #15047 )
2026-02-27 22:19:08 +09:00
qazal
ad99b77f6d
assembly/amd: add gfx12_asm_vflat llvm tests, disasm fixes ( #15046 )
...
* add gfx12_asm_vflat.s
* work
2026-02-27 20:20:31 +09:00
George Hotz
010d2790ce
fix multi minimal ( #15044 )
2026-02-27 14:31:58 +08:00
George Hotz
3e1e12528c
hotfix: disable tinyfs load test
2026-02-27 12:04:41 +08:00
George Hotz
d23b79530e
remove disk from GGUF GEMV test ( #15041 )
...
* remove disk from GGUF GEMV test
* keep copy
2026-02-27 12:03:00 +08:00
chenyu
d345f7f5dc
remove _pending_assigns ( #15040 )
2026-02-26 22:38:10 -05:00
George Hotz
37e31e7da4
gguf gemv test ( #15039 )
...
* add gemv tests
* gguf big
* skip
* make realize optional
2026-02-27 10:54:43 +08:00
Nick
af94bfc401
fix retinanet shared memory race condition in parallel tests ( #15030 )
...
Append PID to shared memory names in batch_load_retinanet to prevent
FileExistsError when pytest-xdist runs multiple test workers that each
call _setup_shared_mem with the same hardcoded name.
2026-02-27 08:36:24 +08:00
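A minimal sketch of the idea described in the commit above, assuming Python's standard `multiprocessing.shared_memory` module. The helper names `unique_shm_name`, `create_shared_block`, and `demo` are hypothetical and are not taken from the repository; the actual `_setup_shared_mem` code may differ.

```python
import os
from multiprocessing import shared_memory

def unique_shm_name(base: str) -> str:
    # Hypothetical helper: append the current PID so each worker process
    # gets its own shared memory name instead of one hardcoded name.
    return f"{base}_{os.getpid()}"

def create_shared_block(base: str, size: int) -> shared_memory.SharedMemory:
    # create=True raises FileExistsError if a block with this name already
    # exists, which is the failure mode when parallel workers share a name.
    return shared_memory.SharedMemory(name=unique_shm_name(base), create=True, size=size)

def demo() -> int:
    # Create a block, write one byte, read it back, then clean up.
    shm = create_shared_block("retinanet_demo", 1024)
    try:
        shm.buf[0] = 42
        return shm.buf[0]
    finally:
        shm.close()
        shm.unlink()
```

Because the PID differs between pytest-xdist workers, two workers calling `create_shared_block` with the same base name would not collide under this scheme.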
George Hotz
2bbf8bbefa
improve call/param rendering ( #15023 )
2026-02-27 08:35:04 +08:00
chenyu
0f94a4bb73
failed test case for early fixup const copy ( #15038 )
...
* failed test case for early fixup const copy
wrong with PAD
* test no copy
2026-02-26 19:09:33 -05:00
chenyu
3a4db53b43
raise RuntimeError in schedule for conflicted var_val [pr] ( #15031 )
2026-02-26 15:16:01 -05:00
qazal
d65db32395
viz: only compute aggregate memory graph, defer n² per buffer graph ( #15029 )
2026-02-27 04:14:51 +09:00
qazal
c61fe57cfd
viz: fix n² tiny device linking in profiler ( #15028 )
2026-02-27 02:25:39 +09:00
qazal
88d650d606
viz: clean up call node detection check ( #15025 )
2026-02-26 19:57:56 +09:00
qazal
1c09890f66
sqtt: map instructions in the command line tool ( #15024 )
2026-02-26 12:34:24 +02:00
George Hotz
fe3ee8c27e
fix symbolic shapes in calls ( #15021 )
...
* fix symbolic shapes in calls
* fix after in the big graph
* real tests
2026-02-26 17:17:18 +08:00
qazal
12d179f5f4
viz: brighter call.src[0] edge color ( #15022 )
...
* work
* 2
* better color
2026-02-26 16:07:22 +09:00
George Hotz
2655655a0c
call gradient creates a call ( #15020 )
...
* function creates a full subgraph
* tests
* fix var
* fix tests
* implicit assign/contig
* move kv init
2026-02-26 14:15:29 +08:00
Christopher Milan
94acd85285
fix typo in nn/__init__.py ( #15019 )
2026-02-25 20:01:32 -05:00
Christopher Milan
e5c0db66d1
num_batches_tracked does not need is_dtype_supported ( #15018 )
2026-02-25 19:50:57 -05:00
George Hotz
3244131f59
update dagre with more recursion fixes ( #15012 )
2026-02-26 08:35:05 +08:00
chenyu
ed9d475a12
assign tests with test_function ( #15015 )
2026-02-25 16:15:59 -05:00
nimlgen
faa66e0a61
mi350 hive_reset am repro ( #15014 )
2026-02-25 21:30:18 +03:00
nimlgen
8983830aa8
am: code style consistency ( #15013 )
2026-02-25 21:30:10 +03:00
George Hotz
0d35b67f2c
revert realize to only be buffers ( #15008 )
...
* revert realize to only be buffers
* fix that
* broken attention
* Revert "broken attention"
This reverts commit a23c3cd96c.
* and that
2026-02-25 22:43:06 +08:00
qazal
35f85c393f
viz: keep recursively nested call collapsed ( #15010 )
2026-02-25 22:45:18 +09:00
qazal
421b1d4a56
viz: monospace font for tags, no dy overrides ( #15009 )
...
* viz: monospace font for tags, no dy overrides
* str
2026-02-25 22:15:31 +09:00
qazal
448e997be4
gemm/asm: cleanup custom function args ( #15007 )
2026-02-25 22:05:56 +09:00
qazal
c58e91942c
viz: support collapsing individual CALL nodes ( #15006 )
...
* all
* contracted all by default
* simple call mask
* work
* minus not hyphen
* color / cleanup
* detail
2026-02-25 21:27:25 +09:00
George Hotz
68831cd852
add more tests to test_function ( #15003 )
...
* add more tests to test_function
* add function to llm
* function decorator on llm
* works
* symbolic fixups
* minimum change
* implicit inputs
* don't actually update llama yet
2026-02-25 18:42:06 +08:00
wozeparrot
d941dd5aeb
llama3: pad vocab when mp sharding ( #14998 )
2026-02-25 00:04:06 -08:00
wozeparrot
e1c9985715
llama3: better time keeping ( #14999 )
2026-02-24 22:42:05 -08:00
Christopher Milan
4a2fc7ecbb
autogen: cache downloads ( #14997 )
2026-02-25 01:34:27 -05:00
George Hotz
e3fa9896b7
start function and add walk rewrite ( #14992 )
...
* start function and add walk rewrite
* work
* add function on feed_forward
* llm progress
* stuff
* none of that
2026-02-25 13:56:27 +08:00
chenyu
fde7a40bb0
allow dtype mismatched assign on disk ( #14993 )
...
reverted #14473, that was a bad idea. also added a test that safe_save only has copy
2026-02-24 20:49:55 -05:00
chenyu
46d9a9a74f
minor indexing cleanups [pr] ( #14991 )
2026-02-24 16:49:35 -05:00
chenyu
8dae9be573
move realize_map fixup into realize_assign_src [pr] ( #14990 )
2026-02-24 15:51:40 -05:00
chenyu
9d9151a21e
remove const normalization in indexing [pr] ( #14989 )
...
rangeify can create const with device, and all is normalized in to_define_global
2026-02-24 15:09:11 -05:00
chenyu
f68a472244
end range for COPY/BUFFER_VIEW [pr] ( #14987 )
2026-02-24 13:33:35 -05:00
chenyu
e5d27a3773
remove BUFFER_VIEW from ended_ranges special case [pr] ( #14986 )
...
* remove BUFFER_VIEW from ended_ranges special case [pr]
* will fix later
2026-02-24 10:37:29 -05:00
chenyu
5fd4fc0c6d
fix tinyfs ( #14974 )
...
* fix tinyfs
* fix that
2026-02-24 08:50:53 -05:00
George Hotz
8a6dffc87e
Tensor.callify will be the JIT ( #14983 )
...
* close
* simple callify, support linear in the scheduler
* all tests pass
* everyone is happy
* dumb test
* Remove unnecessary blank line in rangeify.py
2026-02-24 18:42:24 +08:00
nimlgen
6f1cb6be86
am: tiny err handling cleanups ( #14981 )
...
* am: tiny err handling cleanups
* x
* x
2026-02-24 12:43:45 +03:00
George Hotz
b643fca51e
clean up complete_create_schedule_with_vars ( #14980 )
...
* clean up complete_create_schedule_with_vars
* transform_to_call
* update viz tests
2026-02-24 16:12:36 +08:00