qazal
199a36d079
add pagination to viz [pr] ( #8794 )
...
* add pagination to viz [pr]
* work
* lint
2025-01-29 04:21:53 +02:00
qazal
ba17786068
do not construct unmasked VALID ( #8759 )
...
* new lines that exist in codegen/ops
* update tests
* update sops.gz (13071 -> 13070 asts)
* fix viz too
* remove that TODO
* diff pruning
* mask assert + device
* work
* diff pruning
* re: fix viz too
---------
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
2025-01-28 20:51:21 +02:00
qazal
3417bc1814
fix ShapeTracker spec for const [pr] ( #8791 )
2025-01-28 19:53:36 +02:00
nimlgen
801ec9e697
am: no hardcoded clocks ( #8788 )
...
* am: no hardcoded clocks
* better
2025-01-28 20:18:46 +03:00
qazal
e724af74d7
allow VIEW source in DEFINE_VAR spec [pr] ( #8790 )
2025-01-28 17:42:14 +02:00
b1tg
da464d039f
fix windows ci cache ( #8787 )
...
Co-authored-by: b1tg <b1tg@users.noreply.github.com>
2025-01-28 13:22:15 +02:00
qazal
e8be8a5835
support lowering CONST(VIEW) in lowerer ( #8785 )
2025-01-28 12:04:41 +02:00
George Hotz
80089536e5
Revert "move llvm_bf16_cast to renderer for CLANG and LLVM [pr] ( #8720 )" ( #8786 )
...
This reverts commit af0452f116.
2025-01-28 18:59:02 +09:00
b1tg
5d62aa28dc
Support CLANG backend on Windows ( #8768 )
...
* Support CLANG on Windows
* Put both backends in a windows ci
* remove coff loader
* use memmove
---------
Co-authored-by: b1tg <b1tg@users.noreply.github.com>
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
2025-01-28 18:19:34 +09:00
mesozoic-egg
af0452f116
move llvm_bf16_cast to renderer for CLANG and LLVM [pr] ( #8720 )
...
* handle bf16 via bitcasting for CLANG and LLVM
* On LLVM, skip float16 cast
* float32 on llvm lite, float32 elsewhere
* code format
* trigger pr
* move to rewriter
---------
Co-authored-by: Mesozoic Egg <mesozoic.egg@proton.mail>
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
2025-01-28 18:16:43 +09:00
nimlgen
d66680b17e
hotfix: am: fix hang ( #8783 )
2025-01-28 11:54:19 +03:00
qazal
a65d2917cb
remove unused fields from viz uop_to_json [pr] ( #8782 )
2025-01-28 10:50:11 +02:00
qazal
aefbc2637f
test fixups from unmasked valid deletion [pr] ( #8776 )
2025-01-28 09:23:30 +02:00
qazal
ed672881b0
remove additions/deletion in pr + check uops are equal [pr] ( #8779 )
...
* use warnings there [pr]
* remove those + move assert_diff [pr]
* warn after log
* remove
* back
2025-01-28 08:57:34 +02:00
Ignacio Sica
2c71c60719
opt arg is int or tuple ( #8780 )
2025-01-28 11:02:32 +09:00
George Hotz
62655e4999
move multi into engine [pr] ( #8778 )
...
* move multi into engine [pr]
* all runtime is one sz
2025-01-28 09:15:29 +09:00
nimlgen
299fa8f37b
am: unset high clocks for sleep ( #8775 )
2025-01-28 01:15:56 +03:00
chenyu
c99ae81f63
update default resnet LOSS_SCALER to 256 [pr] ( #8774 )
2025-01-27 16:59:05 -05:00
nimlgen
1c608ae34f
am_smi: better spacing ( #8773 )
...
* am_smi: better spacing
* not used
2025-01-27 23:01:02 +03:00
Ignacio Sica
b240f12593
[TIP-9] rename Opt's amt to arg 2 ( #8770 )
...
* rename Opt amt to arg
* ignore_beam_cache for test_tiny
* move ignore_beam_cache to test_tiny
* move to separate pr
* revert space change
---------
Co-authored-by: chenyu <chenyu@fastmail.com>
2025-01-27 14:19:04 -05:00
chenyu
9760688e7f
use IGNORE_BEAM_CACHE in search [pr] ( #8772 )
2025-01-27 13:41:01 -05:00
Ignacio Sica
ed1b573868
ignore beam cache in test_tiny for stateless beam ( #8771 )
2025-01-27 12:56:30 -05:00
George Hotz
3ed146a5ff
Revert "rename Opt amt to arg ( #8767 )" ( #8769 )
...
This reverts commit bf041659a5.
2025-01-27 23:46:37 +09:00
Ignacio Sica
bf041659a5
rename Opt amt to arg ( #8767 )
2025-01-27 23:36:47 +09:00
George Hotz
96bff0b4f7
contiguous is no longer needed in SGD [pr] ( #8760 )
...
* contiguous is no longer needed in SGD [pr]
* add allow condition
2025-01-27 15:19:11 +09:00
b1tg
efc7971090
add windows test to ci ( #8761 )
...
Co-authored-by: b1tg <b1tg@users.noreply.github.com>
2025-01-27 14:53:21 +09:00
George Hotz
a9d9f98d05
hotfix: those tests fail locally on mac due to buffer count
2025-01-27 07:53:48 +09:00
George Hotz
2454bf01c3
hotfix: remove shapetracker spam in viz
2025-01-27 07:20:21 +09:00
qazal
d488bbb1ec
share merge_views/valid creation for CONST/DEFINE_VAR ( #8758 )
...
* share valid creation behavior for CONST/DEFINE_VAR
* work
2025-01-26 17:41:54 +02:00
qazal
bbb2dd8141
move VALID creation after merging the views ( #8757 )
...
* do valid creation later
* work for view_left
* only view(const) makes valids in view_left
* cleaner bind diff
2025-01-26 16:58:05 +02:00
George Hotz
a6e496b195
remove Function class [pr] ( #8753 )
...
* remove Function class [pr]
* actually remove function
* fix docs
2025-01-26 18:58:02 +09:00
qazal
ac70f63d4b
tensor_map cleanups [pr] ( #8754 )
...
* tensor_map cleanups [pr]
* update test_schedule too
2025-01-26 11:41:54 +02:00
George Hotz
b53fe7c2fc
remove unused ctx [pr] ( #8751 )
...
* remove unused ctx [pr]
* fix test
2025-01-26 17:59:15 +09:00
qazal
06b58aa7ec
move unneeded fields out of ScheduleContext [pr] ( #8752 )
2025-01-26 10:36:15 +02:00
George Hotz
1b4618e257
gradient cleanup ( #8750 )
...
* switch backward to use gradient [pr]
* set device correctly, dedup
* why does that fail?
* add noop cast
* simple backward
* fix beautiful_mnist
* touchups
* set in compute_gradient
* uop_count
* uop_count was wrong
* collections
* no note
* skip that test
* update sched kernel counts
* train mnist is 65
* fix metadata and gc
* fixes
* materialize_grads
* no pathlib stuff
* add contiguous_backward, fix bugs
* add some realize
* fix multi
* remove unused backward passes [pr]
* lower line count
2025-01-26 09:30:55 +09:00
George Hotz
b4bf6a7dea
switch backward to use gradient [pr] ( #8235 )
...
* switch backward to use gradient [pr]
* set device correctly, dedup
* why does that fail?
* add noop cast
* simple backward
* fix beautiful_mnist
* touchups
* set in compute_gradient
* uop_count
* uop_count was wrong
* collections
* no note
* skip that test
* update sched kernel counts
* train mnist is 65
* fix metadata and gc
* fixes
* materialize_grads
* no pathlib stuff
* add contiguous_backward, fix bugs
* add some realize
* fix multi
2025-01-26 09:12:16 +09:00
George Hotz
0ffd572e1e
fix multi with no real srcs ( #8749 )
2025-01-26 08:41:00 +09:00
qazal
0e42befc6e
viz cleanups 2 [pr] ( #8748 )
...
* viz cleanups 2 [pr]
* test_viz updates
2025-01-25 19:41:57 +02:00
nimlgen
c74c5901a8
am disable bind ( #8747 )
2025-01-25 19:06:35 +03:00
qazal
a037201168
test_viz cleanups + move to /unit directory ( #8746 )
...
* test_viz cleanups + move to /unit directory
* lint
2025-01-25 14:33:31 +02:00
chenyu
e2b380b743
make UOp.multi real a tuple instead of list [pr] ( #8744 )
...
tuple is immutable. also updated test_rand_like_from_alu test
2025-01-24 20:47:27 -05:00
George Hotz
cb0978b377
add Ops.CONTIGUOUS_BACKWARD ( #8743 )
2025-01-25 07:28:43 +09:00
nimlgen
2f06eccf1d
am: script and vfio msg ( #8742 )
...
* am: script and vfio msg
* use sysfs bars always for now
* tiny changes
2025-01-25 00:33:00 +03:00
chenyu
0c759e1ff6
add bert to benchmark ci ( #8741 )
...
with `DISABLE_DROPOUT=1 BERT_LAYERS=2` for now
2025-01-24 14:45:11 -05:00
chenyu
e0e176efbc
failed test case for multi rand_like [pr] ( #8740 )
...
new multi broke multi device dropout
2025-01-24 13:56:51 -05:00
nimlgen
dc10187fc0
am: add am_smi ( #8739 )
...
* am: start monitor
* cleanups
* fixes
* hmm
* progress
* cleanup
2025-01-24 20:16:19 +03:00
George Hotz
7a2223a6c6
add merge views to ops_folding [pr] ( #8051 )
...
Co-authored-by: qazal <qazal.software@gmail.com>
2025-01-24 17:45:11 +02:00
qazal
0814a79cb4
cleanup the merge_views upats [pr] ( #8738 )
2025-01-24 16:49:54 +02:00
qazal
07069b9988
rename to tensor_uop [pr] ( #8737 )
2025-01-24 13:42:25 +02:00
George Hotz
e82ba1454b
MultiLazyBuffer is UOp [pr] ( #8662 )
...
* MultiLazyBuffer is UOp [pr]
* this is new mlb
* this is the idea
* progress
* multitensor works
* more movement ops
* this
* MultiLazyBuffer is UOp
* cleanups
* multi axis
* fix more tests
* work
* not that
* add multi grad and move shard to ops
* mops not views
* no double contig
* sweet, all mt tests passing
* port old logic
* remove lbs
* fix realized
* whitespace
* assign tweak
* test_assign_kv_cache_multi passes
* fix is_realized
* fix JIT for multi
* just a few more lines i'll pay them back soon i swear please bro just a few more
* no split reduceop for multi
2025-01-24 13:28:55 +09:00