George Hotz
dbf8e641ac
binop also
2025-10-11 16:04:05 +08:00
George Hotz
ac53c8866e
improve typing
2025-10-11 15:57:37 +08:00
George Hotz
c260482d84
no need for Self yet
2025-10-11 15:53:47 +08:00
George Hotz
26d54530d6
improve typing and bump to 3.11
2025-10-11 15:46:39 +08:00
Sieds Lykles
4300ebc455
cache apply_movement_op ( #12609 )
...
* cache apply_movement_op
* pyling and clear cache
* fix types
* ignore
* cleanup
2025-10-11 08:53:10 +02:00
George Hotz
7596c1b8f5
TestOuterworldReduce works ( #12608 )
2025-10-10 20:06:41 +08:00
chenyu
001b3710d3
enable some test_ops tests ( #12607 )
2025-10-10 07:23:21 -04:00
qazal
a62dc9ceb5
viz: light up buffer path ( #12603 )
2025-10-10 14:07:30 +03:00
qazal
464c56862f
viz: update ansi regex ( #12605 )
...
* viz: update ansi regex
* better
* add ansi_colors_light
* javascript
2025-10-10 13:58:58 +03:00
George Hotz
ac96d98745
GROUP_REDUCE is now bright RED instead of green ( #12604 )
2025-10-10 18:23:57 +08:00
nimlgen
89be3590aa
amd: sqtt on gfx12 ( #12564 )
...
* amd: sqtt on gfx12
* cleaner
* thi
* and this
* ops
* ugh
* back
* rm this
* rm
2025-10-10 17:54:14 +08:00
chenyu
95ad047445
do not use sint_to_uop in renderer [pr] ( #12601 )
2025-10-10 05:29:10 -04:00
Sieds Lykles
e625c27598
update min step times openpilot ( #12600 )
2025-10-10 11:24:27 +02:00
nimlgen
6ec96f6088
amd: remove dup flags in sqtt ( #12595 )
2025-10-10 17:23:33 +08:00
wozeparrot
9471157346
feat: bump llvm version ( #12598 )
2025-10-10 02:20:22 -07:00
qazal
36c753bd63
viz: switch llvm mca info to tabulate ( #12596 )
2025-10-10 11:54:34 +03:00
qazal
b27470b6db
viz: add buffer details in the timeline sidebar ( #12591 )
2025-10-10 11:36:08 +03:00
chenyu
03ef5197fc
move get_contraction to helpers [pr] ( #12594 )
2025-10-10 04:28:57 -04:00
Sieds Lykles
965bd194f2
uop_given_valid cleanup ( #12592 )
...
* cleanup
* cleanup there
2025-10-10 10:18:53 +02:00
chenyu
af90dc00de
remove some View add logic [pr] ( #12584 )
...
no longer simplify the case of v0+v1 where v0 has a mask
2025-10-10 03:47:56 -04:00
wozeparrot
f12e2a75db
feat: add thunderkittens ( #12590 )
2025-10-10 00:32:33 -07:00
qazal
caae46cfba
fix process replay progress update ( #12587 )
2025-10-10 10:20:55 +03:00
nimlgen
1309cea247
rocprof parser in extra ( #12569 )
...
* rocprof parser
* viewer
* vw
* skip
2025-10-10 14:56:42 +08:00
Sieds Lykles
cbdc13279d
fix openpilot gated reads ( #12570 )
...
* fix gated image counts
* slice correctly
2025-10-10 04:52:57 +02:00
chenyu
c8dfd10257
ShapeTracker.real_strides -> is_expanded [pr] ( #12579 )
...
only keep the used part
2025-10-09 22:52:45 -04:00
qazal
88ce63a49a
remove outdated comment in multi [pr] ( #12580 )
2025-10-10 05:50:49 +03:00
George Hotz
5977df267f
outerworld uses expand ( #12578 )
2025-10-10 10:25:25 +08:00
chenyu
f2c3a72b0c
remove RANGEIFY flag [pr] ( #12577 )
2025-10-09 21:52:54 -04:00
George Hotz
9b66c2b0b7
fix weekly commits table (i didn't know we linted extra)
2025-10-10 09:23:33 +08:00
George Hotz
658b96cbfb
weekly commits table
2025-10-10 09:15:41 +08:00
qazal
b86ad6053a
test_schedule independent of RANGEIFY flag ( #12568 )
...
* test_schedule independent of RANGEIFY flag
* comment for expectedFailure + test_cast_padded_view
* test_cast_padded_const works
* don't use full_shape it's fine
* add todos for the rest
2025-10-09 20:00:50 +03:00
nimlgen
502e613c9c
amd: clean up uppercased vars ( #12571 )
2025-10-09 19:39:27 +08:00
George Hotz
840d2bf1ea
fix div rules ( #12567 )
...
* group div rules
* merge those pattern matchers
* revert
2025-10-09 19:28:21 +08:00
nimlgen
8a1c3dc1bf
amd: use soc headers from rocm ( #12566 )
2025-10-09 19:10:46 +08:00
qazal
e0694fdb8e
remove UPat.__repr__ [pr] ( #12565 )
2025-10-09 12:35:34 +03:00
chenyu
678f83e41b
delete ShapeTracker to_valid_uop and substitute [pr] ( #12563 )
2025-10-09 05:06:10 -04:00
nimlgen
a11b686c71
amd: sqtt for all gfx11 ( #12546 )
...
* amd: general sqtt for gfx11
* target
* ops
* no gfx12 here
2025-10-09 17:04:06 +08:00
chenyu
a0cbbc35ad
remove LLAMA_LAYERS in ci ( #12562 )
2025-10-09 04:46:41 -04:00
chenyu
fe94453d52
delete CONTIGUOUS with RANGE in st [pr] ( #12561 )
2025-10-09 04:32:31 -04:00
chenyu
f793cdeb87
clean up shape changing logic to not use st [pr] ( #12560 )
2025-10-09 04:13:02 -04:00
chenyu
1bcea19846
remove ShapeTracker.reduce [pr] ( #12559 )
2025-10-09 03:54:11 -04:00
chenyu
c1cc277fc3
don't call src[0].shape multiple times in MULTI st [pr] ( #12558 )
2025-10-09 03:40:17 -04:00
qazal
2551a60d97
viz: split out shape links ( #12557 )
2025-10-09 10:34:55 +03:00
George Hotz
e7aa26ed29
make remove bufferize fast ( #12555 )
...
* add more uop gc test
* make remove bufferize fast
* substitute is fast too
* fix tests
2025-10-09 15:20:02 +08:00
chenyu
cf8232ec6a
clean up more RANGEIFY flag ( #12556 )
2025-10-09 03:06:48 -04:00
nimlgen
658c566e22
vars in gated_read_image_count ( #12486 )
...
* vars in gated_read_image_count
* nc
2025-10-09 14:54:15 +08:00
George Hotz
a8a9ac0e95
add more uop gc test ( #12553 )
2025-10-09 14:49:32 +08:00
chenyu
250f05a776
run some hashing test only on METAL ( #12554 )
...
quite slow on CPU
2025-10-09 02:39:49 -04:00
qazal
da9425c1a7
viz: sum all buffers in zoomed out memory graph ( #11898 )
...
* viz: switch to transformation matrix
* simpler axes domains
* less domain
* split loops
* flatten
* tiny rects
* solid proxy but still too big
* cache FileNotFound
* gridlines instead of padding
* not this
* like METAL -> METAL memory -> graph
* less colors
* better
* more grid work
* glitch
* clamp
* add range index
* pixel grids
* set min width
* y cords
* pruning
* test: clip in world units
* keep linear scan
* switch to interval tree
* fps counter
* work
* visible is the easiest
* shapes api
* math
* test bitgrid
* checkout
* work
* simpler
* work
* draw
* it's just a polygon
* merge polygons
* cleanup old stuff
* switch to hashmap there too
* add tooltips
* fix that
* better color
* better
2025-10-09 09:30:37 +03:00
chenyu
ae51bdd06a
remove trivial use of RANGEIFY flag ( #12550 )
...
some tests need update still
2025-10-09 02:29:38 -04:00