nimlgen
118bd1cbed
hotfix: amd imports ( #9620 )
2025-03-29 20:19:53 +07:00
uuuvn
5908b89f71
MI300X support (WIP) ( #9585 )
2025-03-29 19:46:42 +08:00
George Hotz
77f0d09ecf
hotfix: HIP supports parallel BEAM search
2025-03-29 11:49:53 +08:00
chenyu
162f286a0e
add a few Tensor method to doc ( #9614 )
...
* add a few Tensor method to doc
* clone
2025-03-28 13:47:16 -04:00
uuuvn
dd9aae02c3
Refactor ops_amd.py (MI300X prereq) ( #9428 )
2025-03-29 00:17:20 +07:00
uuuvn
3e1168ff5e
am: module import in common ( #9615 )
2025-03-28 21:29:34 +07:00
nimlgen
a8ff85369e
cpugraph for dsp ( #9601 )
...
* cpugraph init
* fixes
* no cpu for now
* mypy
* fix
2025-03-28 19:06:31 +07:00
nimlgen
fa0ebbd237
jit: optimize before pickle ( #9611 )
...
* jit: optimize before pickle
* optimize weights
* fix
* mypy
* mypy2
2025-03-28 19:06:09 +07:00
George Hotz
392a311312
Revert "add copy button in VIZ code-block ( #9605 )" ( #9610 )
...
This reverts commit d1e8598c81 .
2025-03-28 17:05:44 +08:00
Harsh Natuskar
d1e8598c81
add copy button in VIZ code-block ( #9605 )
...
* works
* only second block has copy
* better function
* better
* ...
* smol function
* update copy-btn css
* updates
2025-03-28 16:52:21 +08:00
qazal
b4ea45b4a6
fix viz recenter + worker cleanup ( #9607 )
2025-03-28 15:24:53 +08:00
Andrew Furey
50dee4a7b3
add test for checking const gradients ( #9598 )
2025-03-27 15:17:37 -04:00
chenyu
5358b0904b
update uop_given_valid if a node becomes const ( #9604 )
...
* update uop_given_valid if a node becomes const
* cleanup
2025-03-27 14:57:46 -04:00
chenyu
a187dfd3df
bert BEAM_UOPS_MAX 3000->4000 ( #9603 )
...
more stable for the final step time
green 410ms (master) -> 397ms (BEAM=4) -> 392ms (this)
red 561ms (master) -> 550ms (this)
2025-03-27 11:58:47 -04:00
qazal
088a677e25
rescale to fit viz graph [pr] ( #9599 )
...
* zoom to fit the graph in viz [pr]
* always on screen fit graph
* space key recenters
2025-03-27 23:33:51 +08:00
nimlgen
3737821b9e
prepare for clang graph ( #9600 )
...
* prepare for clang graph
* emu
* ops
* ops2
* better type
* fix
2025-03-27 20:09:37 +07:00
qazal
bf94924d5a
fix viz with nested graph_rewrite ( #9595 )
2025-03-27 13:14:28 +08:00
qazal
c011751b41
statically define viz arrow heads ( #9594 )
2025-03-27 12:22:04 +08:00
qazal
0877497bad
hotfix: use captured uops in viz render [pr] ( #9593 )
...
* hotfix: use captured uops in viz render [pr]
* better error
2025-03-27 11:52:12 +08:00
qazal
e5ff7b23d7
refactor to @track_matches + add failing test_nested_rewrite ( #9592 )
...
* test_nested_rewrite
* refactor to track_matches
* positional arg
2025-03-27 11:11:56 +08:00
chenyu
62888614f6
lower bert eval bs to 24 ( #9590 )
...
oom during eval
2025-03-26 21:25:23 -04:00
nimlgen
dc9da1d917
memplan into one buffer ( #9526 )
...
* new memplanner
* new should works
* fix
* VALIDATE_MEMORY_PLANNER
* hm?
* ugh
* fix alignment
* fix2
* rm
* tiny fixes
* test
* comments and fixes
* fix2
* liiiinetr
* t
* fix
2025-03-27 01:46:50 +07:00
qazal
8b717c345c
cache viz worker at launch ( #9589 )
2025-03-27 01:10:02 +08:00
George Hotz
d62ced8981
symbolic -> symbolic_flat ( #9588 )
2025-03-26 23:34:43 +08:00
George Hotz
8aaa5e1ec5
generate the individual indexes ( #9587 )
2025-03-26 22:32:06 +08:00
George Hotz
5c6cd884e3
multiple simplifies is faster [pr] ( #9586 )
...
* multiple simplifies is faster [pr]
* cleanup
* cleanup
2025-03-26 21:42:52 +08:00
George Hotz
1e6e75e39a
little changes from dsp branch ( #9582 )
...
* little changes from dsp branch
* not that one
* need the where
* Revert "need the where"
This reverts commit 140f89c878 .
2025-03-26 20:01:21 +08:00
nimlgen
e88a640ca5
fix _access_resources for offset buffers ( #9580 )
...
* fix _access_resources for offset buffers
* test
2025-03-26 18:42:43 +07:00
Andrey
7b865ed03d
use tuple in isinstance for type checking ( #9583 )
2025-03-26 19:36:48 +08:00
George Hotz
9115ce8860
linearizer fixups from DSP branch ( #9581 )
2025-03-26 18:28:15 +08:00
qazal
e799df537e
prep viz UI cleanup for grid scales ( #9579 )
...
* less ways to make a button
* move collapse out
* work
* do not create extra resizers
* better
* ul
* safari
2025-03-26 17:48:15 +08:00
nimlgen
ccbcdca473
add memplanner tests ( #9577 )
2025-03-26 10:59:39 +07:00
qazal
c03dadfcb9
add TORCHVIZ=1 to beautiful_mnist_torch ( #9576 )
2025-03-26 11:17:08 +08:00
qazal
93bcb974c5
select torch device in examples/beautiful_mnist_torch.py ( #9575 )
2025-03-26 11:01:25 +08:00
uuuvn
2c32126fc8
am: AMRegister refactor ( #9572 )
2025-03-26 00:52:40 +07:00
chenyu
cddd750d68
add a failed test case for jit/nojit rand [pr] ( #9574 )
...
currently adding jit produced different rand values
2025-03-25 13:32:44 -04:00
nimlgen
4cf2b68ca8
am_smi: fix init for newer versions ( #9559 )
2025-03-25 23:48:05 +07:00
qazal
a6a5c0aec5
add NULL=1 backend ( #9573 )
...
* add NULL=1 backend
* NullAllocator
* line
* metadata should still work
* it shouldn't have memory usage
* Revert "it shouldn't have memory usage"
This reverts commit a9080fdd43 .
* back
* null flops
2025-03-25 22:20:52 +08:00
qazal
b60d9976b4
better yaxis formatting in viz memory graph ( #9570 )
...
* better bytes format
* pluralize
* 1 less line
2025-03-25 16:50:22 +08:00
qazal
faf3b5b245
display kernel metadata in memory viz ( #9569 )
...
* display kernel metadata in memory viz
* fix that
2025-03-25 13:14:54 +08:00
qazal
52301fe68e
move Buffer refcount increment out of schedule.py ( #9564 )
...
* move Buffer refcount increment out of schedule.py
* add TestGC.test_assign_refcount
* refcount refers to Ops.BUFFER UOps
2025-03-25 12:08:27 +08:00
qazal
262f5a2bd3
hotfix: replace link in viz/readme ( #9568 )
2025-03-25 10:24:49 +08:00
chenyu
6427272bf6
minor update to rand [pr] ( #9566 )
2025-03-24 18:49:50 -04:00
chenyu
b0e070e737
remove MOCKGPU workaround in rand ( #9565 )
...
also `requires_grad_` to save a line
2025-03-24 17:49:45 -04:00
qazal
d7c754ce49
failing test for UOp buffer ref count ( #9563 )
...
* failing test for UOp buffer ref count
* lint
2025-03-25 00:10:48 +08:00
b1tg
f90001e1a6
amd llvm render (no_comgr prereq) ( #9543 )
...
* amd llvm render
* skip test_div_rounding_mode
---------
Co-authored-by: b1tg <b1tg@users.noreply.github.com >
2025-03-24 22:50:51 +08:00
Priyank Patel
4f5e03bd60
better fix inplace detach ( #9557 )
2025-03-24 22:50:28 +08:00
qazal
1c40873962
show buffer info in memory viz ( #9562 )
2025-03-24 22:12:30 +08:00
qazal
efaee75656
start viz of memory usage ( #9561 )
...
* start viz of memory usage
* polygons/bars + use d3
2025-03-24 19:05:35 +08:00
qazal
1cfe6d02fe
refactor uop_to_json to return a dict [pr] ( #9560 )
2025-03-24 16:38:17 +08:00