Commit Graph

10417 Commits

Author SHA1 Message Date
nimlgen
118bd1cbed hotfix: amd imports (#9620) 2025-03-29 20:19:53 +07:00
uuuvn
5908b89f71 MI300X support (WIP) (#9585) 2025-03-29 19:46:42 +08:00
George Hotz
77f0d09ecf hotfix: HIP supports parallel BEAM search 2025-03-29 11:49:53 +08:00
chenyu
162f286a0e add a few Tensor method to doc (#9614)
* add a few Tensor method to doc

* clone
2025-03-28 13:47:16 -04:00
uuuvn
dd9aae02c3 Refactor ops_amd.py (MI300X prereq) (#9428) 2025-03-29 00:17:20 +07:00
uuuvn
3e1168ff5e am: module import in common (#9615) 2025-03-28 21:29:34 +07:00
nimlgen
a8ff85369e cpugraph for dsp (#9601)
* cpugraph init

* fixes

* no cpu for now

* mypy

* fix
2025-03-28 19:06:31 +07:00
nimlgen
fa0ebbd237 jit: optimize before pickle (#9611)
* jit: optimize before pickle

* optimize weights

* fix

* mypy

* mypy2
2025-03-28 19:06:09 +07:00
George Hotz
392a311312 Revert "add copy button in VIZ code-block (#9605)" (#9610)
This reverts commit d1e8598c81.
2025-03-28 17:05:44 +08:00
Harsh Natuskar
d1e8598c81 add copy button in VIZ code-block (#9605)
* works

* only second block has copy

* better function

* better

* ...

* smol function

* update copy-btn css

* updates
2025-03-28 16:52:21 +08:00
qazal
b4ea45b4a6 fix viz recenter + worker cleanup (#9607) 2025-03-28 15:24:53 +08:00
Andrew Furey
50dee4a7b3 add test for checking const gradients (#9598) 2025-03-27 15:17:37 -04:00
chenyu
5358b0904b update uop_given_valid if a node becomes const (#9604)
* update uop_given_valid if a node becomes const

* cleanup
2025-03-27 14:57:46 -04:00
chenyu
a187dfd3df bert BEAM_UOPS_MAX 3000->4000 (#9603)
more stable for the final step time

green 410ms (master) -> 397ms (BEAM=4) -> 392ms (this)
red 561ms (master) -> 550ms (this)
2025-03-27 11:58:47 -04:00
qazal
088a677e25 rescale to fit viz graph [pr] (#9599)
* zoom to fit the graph in viz [pr]

* always on screen fit graph

* space key recenters
2025-03-27 23:33:51 +08:00
nimlgen
3737821b9e prepare for clang graph (#9600)
* prepare for clang graph

* emu

* ops

* ops2

* better type

* fix
2025-03-27 20:09:37 +07:00
qazal
bf94924d5a fix viz with nested graph_rewrite (#9595) 2025-03-27 13:14:28 +08:00
qazal
c011751b41 statically define viz arrow heads (#9594) 2025-03-27 12:22:04 +08:00
qazal
0877497bad hotfix: use captured uops in viz render [pr] (#9593)
* hotfix: use captured uops in viz render [pr]

* better error
2025-03-27 11:52:12 +08:00
qazal
e5ff7b23d7 refactor to @track_matches + add failing test_nested_rewrite (#9592)
* test_nested_rewrite

* refactor to track_matches

* positional arg
2025-03-27 11:11:56 +08:00
chenyu
62888614f6 lower bert eval bs to 24 (#9590)
oom during eval
2025-03-26 21:25:23 -04:00
nimlgen
dc9da1d917 memplan into one buffer (#9526)
* new memplanner

* new should works

* fix

* VALIDATE_MEMORY_PLANNER

* hm?

* ugh

* fix alignment

* fix2

* rm

* tiny fixes

* test

* comments and fixes

* fix2

* liiiinetr

* t

* fix
2025-03-27 01:46:50 +07:00
qazal
8b717c345c cache viz worker at launch (#9589) 2025-03-27 01:10:02 +08:00
George Hotz
d62ced8981 symbolic -> symbolic_flat (#9588) 2025-03-26 23:34:43 +08:00
George Hotz
8aaa5e1ec5 generate the individual indexes (#9587) 2025-03-26 22:32:06 +08:00
George Hotz
5c6cd884e3 multiple simplifies is faster [pr] (#9586)
* multiple simplifies is faster [pr]

* cleanup

* cleanup
2025-03-26 21:42:52 +08:00
George Hotz
1e6e75e39a little changes from dsp branch (#9582)
* little changes from dsp branch

* not that one

* need the where

* Revert "need the where"

This reverts commit 140f89c878.
2025-03-26 20:01:21 +08:00
nimlgen
e88a640ca5 fix _access_resources for offset buffers (#9580)
* fix _access_resources for offset buffers

* test
2025-03-26 18:42:43 +07:00
Andrey
7b865ed03d use tuple in isinstance for type checking (#9583) 2025-03-26 19:36:48 +08:00
George Hotz
9115ce8860 linearizer fixups from DSP branch (#9581) 2025-03-26 18:28:15 +08:00
qazal
e799df537e prep viz UI cleanup for grid scales (#9579)
* less ways to make a button

* move collapse out

* work

* do not create extra resizers

* better

* ul

* safari
2025-03-26 17:48:15 +08:00
nimlgen
ccbcdca473 add memplanner tests (#9577) 2025-03-26 10:59:39 +07:00
qazal
c03dadfcb9 add TORCHVIZ=1 to beautiful_mnist_torch (#9576) 2025-03-26 11:17:08 +08:00
qazal
93bcb974c5 select torch device in examples/beautiful_mnist_torch.py (#9575) 2025-03-26 11:01:25 +08:00
uuuvn
2c32126fc8 am: AMRegister refactor (#9572) 2025-03-26 00:52:40 +07:00
chenyu
cddd750d68 add a failed test case for jit/nojit rand [pr] (#9574)
currently adding jit produced different rand values
2025-03-25 13:32:44 -04:00
nimlgen
4cf2b68ca8 am_smi: fix init for newer versions (#9559) 2025-03-25 23:48:05 +07:00
qazal
a6a5c0aec5 add NULL=1 backend (#9573)
* add NULL=1 backend

* NullAllocator

* line

* metadata should still work

* it shouldn't have memory usage

* Revert "it shouldn't have memory usage"

This reverts commit a9080fdd43.

* back

* null flops
2025-03-25 22:20:52 +08:00
qazal
b60d9976b4 better yaxis formatting in viz memory graph (#9570)
* better bytes format

* pluralize

* 1 less line
2025-03-25 16:50:22 +08:00
qazal
faf3b5b245 display kernel metadata in memory viz (#9569)
* display kernel metadata in memory viz

* fix that
2025-03-25 13:14:54 +08:00
qazal
52301fe68e move Buffer refcount increment out of schedule.py (#9564)
* move Buffer refcount increment out of schedule.py

* add TestGC.test_assign_refcount

* refcount refers to Ops.BUFFER UOps
2025-03-25 12:08:27 +08:00
qazal
262f5a2bd3 hotfix: replace link in viz/readme (#9568) 2025-03-25 10:24:49 +08:00
chenyu
6427272bf6 minor update to rand [pr] (#9566) 2025-03-24 18:49:50 -04:00
chenyu
b0e070e737 remove MOCKGPU workaround in rand (#9565)
also `requires_grad_` to save a line
2025-03-24 17:49:45 -04:00
qazal
d7c754ce49 failing test for UOp buffer ref count (#9563)
* failing test for UOp buffer ref count

* lint
2025-03-25 00:10:48 +08:00
b1tg
f90001e1a6 amd llvm render (no_comgr prereq) (#9543)
* amd llvm render

* skip test_div_rounding_mode

---------

Co-authored-by: b1tg <b1tg@users.noreply.github.com>
2025-03-24 22:50:51 +08:00
Priyank Patel
4f5e03bd60 better fix inplace detach (#9557) 2025-03-24 22:50:28 +08:00
qazal
1c40873962 show buffer info in memory viz (#9562) 2025-03-24 22:12:30 +08:00
qazal
efaee75656 start viz of memory usage (#9561)
* start viz of memory usage

* polygons/bars + use d3
2025-03-24 19:05:35 +08:00
qazal
1cfe6d02fe refactor uop_to_json to return a dict [pr] (#9560) 2025-03-24 16:38:17 +08:00