qazal
|
7bd79f4922
|
pass viz render errors (#7369)
* pass viz render errors
* pcall
|
2024-10-29 22:48:27 +08:00 |
|
qazal
|
51c0c8d27e
|
cachable small graph rewrite (#7371)
|
2024-10-29 22:28:13 +08:00 |
|
chenyu
|
9b81931a36
|
make_pair -> make_tuple [pr] (#7372)
It's used more often as a generic tuple; also removed the default of 2.
|
2024-10-29 10:27:39 -04:00 |
|
qazal
|
d803a9c7c8
|
global metadata try 2 (#7367)
|
2024-10-29 20:21:00 +08:00 |
|
George Hotz
|
2cfc7b6695
|
Index everywhere 2 (#7363)
* indexing everywhere [pr]
* fix tests
|
2024-10-29 19:29:40 +08:00 |
|
qazal
|
7149eabb34
|
assert set equality in TestTensorMetadata [pr] (#7364)
|
2024-10-29 19:29:29 +08:00 |
|
qazal
|
0ebdb136e8
|
revert metadata with graph_rewrite (#7353) (#7362)
This reverts commit 540e4179e7.
|
2024-10-29 19:16:31 +08:00 |
|
qazal
|
f2044cfb22
|
hotfix: if getenv("RUN_PROCESS_REPLAY") (#7361)
|
2024-10-29 18:51:29 +08:00 |
|
George Hotz
|
0af1212164
|
use assertEqual with new style uops [pr] (#7360)
|
2024-10-29 18:43:21 +08:00 |
|
George Hotz
|
0beb2d8f84
|
ptx indexing (#7359)
* ptx indexing
* shorter
* fix load/store
|
2024-10-29 18:29:44 +08:00 |
|
George Hotz
|
572499c71a
|
add indexing to ops_python (#7358)
* add indexing to ops_python
* fix image
|
2024-10-29 18:11:03 +08:00 |
|
qazal
|
540e4179e7
|
global UOp to Metadata mapping + inverse DEBUG=2 metadata order [pr] (#7353)
* add ctx.buf_metadata [pr]
* revert metadata insertion order
* lint rename
|
2024-10-29 17:12:00 +08:00 |
|
George Hotz
|
2fdfcffe4c
|
improve ci speed [pr] (#7357)
|
2024-10-29 17:00:35 +08:00 |
|
qazal
|
8fab7b21df
|
everything is load (#7355)
* everything is load
* rename to ops
|
2024-10-29 16:47:33 +08:00 |
|
George Hotz
|
b647fa7514
|
rename MathTraits to maximum [pr] (#7356)
|
2024-10-29 16:43:04 +08:00 |
|
George Hotz
|
2bf55d8eda
|
make ops more like tensor [pr] (#7352)
* make ops more like tensor [pr]
* tensor is simple math trait
* no shifts
|
2024-10-29 16:23:41 +08:00 |
|
George Hotz
|
3989bd2682
|
idiv + reciprocal [pr] (#7354)
* idiv + reciprocal
* remove upcast from div
* fix docs
|
2024-10-29 15:54:19 +08:00 |
|
Bhavya Gada
|
3419ae282d
|
VIZ UI improvement: generic function for vscode opener (#7338)
* generic function for vscode opener
* eslint
* shorter
---------
Co-authored-by: qazal <qazal.software@gmail.com>
|
2024-10-29 15:02:04 +08:00 |
|
qazal
|
c03e1693fc
|
shorter gate folding [pr] (#7350)
|
2024-10-29 14:49:32 +08:00 |
|
George Hotz
|
3e8225299c
|
ext gate indexing (#7349)
* ext gate indexing
* copy paste better
|
2024-10-29 14:46:10 +08:00 |
|
Bhavya Gada
|
13ea4979d5
|
VIZ UI improvement: autoscroll kernel list when using arrow buttons (#7344)
|
2024-10-29 14:40:42 +08:00 |
|
George Hotz
|
d9d4dd6756
|
faster ci [pr] (#7348)
|
2024-10-29 14:01:44 +08:00 |
|
George Hotz
|
a5e0f59e41
|
move autogen to different CI runner [pr] (#7346)
* move autogen to different CI runner [pr]
* balance a bit
* readme back there
* compile enet in autogen
|
2024-10-29 13:35:22 +08:00 |
|
George Hotz
|
4cb236a495
|
index in cstyle (#7328)
* index only in cstyle
* fix prefix dtypes
* fix tests
* global indexing
* Revert "global indexing"
This reverts commit 4d507e8abb.
* fix image
* fix image
* ptx tests
* fix CUDA dtype rendering
|
2024-10-29 13:06:26 +08:00 |
|
George Hotz
|
f55c3dcff8
|
hotfix: bump ocelot
|
2024-10-29 12:46:24 +08:00 |
|
George Hotz
|
4fe1945df6
|
llvm if load (#7345)
* llvm if load
* unneeded line
* local llvm CI
|
2024-10-29 11:33:22 +08:00 |
|
chenyu
|
8625dd4eea
|
minor changes reading ops.py (#7343)
|
2024-10-28 19:04:12 -04:00 |
|
chenyu
|
6021bf87f4
|
unify T = TypeVar("T") (#7342)
|
2024-10-28 18:43:44 -04:00 |
|
chenyu
|
293adc141a
|
clean up get_shape [pr] (#7341)
* clean up get_shape [pr]
aapi is literal false
* more
|
2024-10-28 18:25:37 -04:00 |
|
chenyu
|
c398f2467c
|
test uop mul min/max do not have nan in 0*inf (#7340)
|
2024-10-28 17:52:01 -04:00 |
|
chenyu
|
0843734927
|
clean up nan handling in transcendental (#7332)
* clean up nan handling in transcendental
* skip remu crash
|
2024-10-28 16:21:49 -04:00 |
|
Sieds Lykles
|
75dcd98e79
|
Fix calculation of vmin and vmax in multiplication when one src is negative and the other src has negative min and positive max (#7333)
Co-authored-by: chenyu <chenyu@fastmail.com>
|
2024-10-28 16:01:46 -04:00 |
|
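The sign case named in this fix (one source strictly negative, the other spanning zero) is easy to get wrong when bounds are derived case by case. As a minimal sketch in plain Python, and not tinygrad's actual code, the bounds of a product of two intervals can be taken over all four corner products. The helper name `mul_bounds` is hypothetical.

```python
def mul_bounds(a_min, a_max, b_min, b_max):
    """Return (vmin, vmax) of x*y for x in [a_min, a_max] and y in [b_min, b_max].

    Hypothetical helper for illustration only; it is not part of tinygrad.
    """
    # The extremes of a product over two intervals always occur at corner
    # points, so checking all four corners is valid for every sign combination.
    corners = (a_min * b_min, a_min * b_max, a_max * b_min, a_max * b_max)
    return min(corners), max(corners)


# One operand strictly negative, the other with negative min and positive max.
print(mul_bounds(-3, -1, -2, 5))  # (-15, 6)
```

Here the minimum comes from `a_min * b_max` and the maximum from `a_min * b_min`, which is the combination a shortcut based only on the signs of the endpoints can miss.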
chenyu
|
603fcc96f2
|
limit UOps.ALU min/max to non-float only (#7336)
does this impact anything? some inf is incorrect now
|
2024-10-28 15:34:19 -04:00 |
|
ignaciosica
|
32fa297e6c
|
cleaner nan rendering (#7337)
|
2024-10-28 14:36:36 -04:00 |
|
qazal
|
00362a117c
|
scheduler bfs renames [pr] (#7335)
|
2024-10-29 00:24:23 +08:00 |
|
qazal
|
d8820644e0
|
split preschedule from ast rewrite [pr] (#7334)
|
2024-10-28 17:45:09 +02:00 |
|
chenyu
|
6b0e8cb04f
|
remove float_to_bits in transcendental [pr] (#7331)
it's just bitcast, and removed the weird bits_to_float indirection
|
2024-10-28 10:20:19 -04:00 |
|
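For context on why a dedicated helper is redundant: converting a float to its bit pattern is a reinterpretation of the same bytes, which is what a bitcast does. The following is a small sketch using only Python's standard `struct` module, assuming 32-bit IEEE 754 floats. The function names are illustrative and do not reflect tinygrad's implementation.

```python
import struct


def float_to_bits(x: float) -> int:
    # Pack as a 32-bit float, then reinterpret the same 4 bytes as an unsigned int.
    return struct.unpack("<I", struct.pack("<f", x))[0]


def bits_to_float(b: int) -> float:
    # Inverse direction: reinterpret a 32-bit unsigned int as a float.
    return struct.unpack("<f", struct.pack("<I", b))[0]


print(hex(float_to_bits(1.0)))    # 0x3f800000
print(bits_to_float(0x40000000))  # 2.0
```

No arithmetic is performed in either direction; only the interpretation of the bytes changes.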
qazal
|
b9b28e6883
|
viz stuff [pr] (#7330)
* viz stuff [pr]
* button
|
2024-10-28 21:46:18 +08:00 |
|
Bhavya Gada
|
9b7e76e508
|
VIZ UI improvement: resizable and collapsible sidebars (#7317)
* make left sidebar resizable
* add sidebar collapse/expand button
* refactor to reduce loc and make resize work correctly
* combine both resizers
|
2024-10-28 21:19:43 +08:00 |
|
qazal
|
e46edc22aa
|
use unittest helpers in TestTensorMetadata [pr] (#7329)
* use unittest helpers in TestTensorMetadata [pr]
* fix that
* 5 args
|
2024-10-28 18:38:30 +08:00 |
|
chenyu
|
96fcc47e27
|
touchup abstraction docs (#7327)
fix typing and use tinygrad tqdm
|
2024-10-27 22:29:55 -04:00 |
|
chenyu
|
cb5702f170
|
tiny cleanup to transcendental xexp2 (#7326)
also added test for exp and log of nan and inf
|
2024-10-27 21:54:20 -04:00 |
|
chenyu
|
4c855ae692
|
unit test transcendental helpers (#7325)
Added a test to run UOps with const inputs. There seem to be issues with both payne_hanek_reduction and cody_waite_reduction.
|
2024-10-27 19:55:00 -04:00 |
|
qazal
|
8d9459f281
|
always run process replay with contextvars (#7323)
* always run process replay with contextvars [pr]
* not the last two
* extra
* no pr
|
2024-10-27 20:44:42 +02:00 |
|
qazal
|
adcdaa17bb
|
map BUFFER to Metadata [pr] (#7324)
|
2024-10-27 20:31:04 +02:00 |
|
qazal
|
d634261c51
|
late buffer uops [pr] (#7322)
|
2024-10-27 19:34:01 +02:00 |
|
chenyu
|
cdbe08b94b
|
use UOp.render in colored_shape (#7321)
similar to function name, print rendered str instead of raw UOp
|
2024-10-27 11:42:31 -04:00 |
|
chenyu
|
4a03e00aa1
|
fix llama3 download_model assert (#7320)
false positive if download_model and model are not provided
|
2024-10-27 11:20:24 -04:00 |
|
talati
|
d4d201d87b
|
fixing branch condition on UOps.IF in the ptx renderer (#7315)
* fixing branch condition on UOps.IF in the ptx renderer
* ptx works
---------
Co-authored-by: Nick Talati <nick.talati@quantworks.com>
Co-authored-by: qazal <77887910+Qazalin@users.noreply.github.com>
Co-authored-by: qazal <qazal.software@gmail.com>
|
2024-10-27 14:27:38 +02:00 |
|
qazal
|
a410b46c1d
|
unskip test_gated_store_with_if [pr] (#7319)
|
2024-10-27 14:03:12 +02:00 |
|