Commit Graph

7653 Commits

Author SHA1 Message Date
qazal
199a36d079 add pagination to viz [pr] (#8794)
* add pagination to viz [pr]

* work

* lint
2025-01-29 04:21:53 +02:00
qazal
ba17786068 do not construct unmasked VALID (#8759)
* new lines that exist in codegen/ops

* update tests

* update sops.gz (13071 -> 13070 asts)

* fix viz too

* remove that TODO

* diff pruning

* mask assert + device

* work

* diff pruning

* re: fix viz too

---------

Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
2025-01-28 20:51:21 +02:00
qazal
3417bc1814 fix ShapeTracker spec for const [pr] (#8791) 2025-01-28 19:53:36 +02:00
nimlgen
801ec9e697 am: no hardcoded clocks (#8788)
* am: no hardcoded clocks

* better
2025-01-28 20:18:46 +03:00
qazal
e724af74d7 allow VIEW source in DEFINE_VAR spec [pr] (#8790) 2025-01-28 17:42:14 +02:00
b1tg
da464d039f fix windows ci cache (#8787)
Co-authored-by: b1tg <b1tg@users.noreply.github.com>
2025-01-28 13:22:15 +02:00
qazal
e8be8a5835 support lowering CONST(VIEW) in lowerer (#8785) 2025-01-28 12:04:41 +02:00
George Hotz
80089536e5 Revert "move llvm_bf16_cast to renderer for CLANG and LLVM [pr] (#8720)" (#8786)
This reverts commit af0452f116.
2025-01-28 18:59:02 +09:00
b1tg
5d62aa28dc Support CLANG backend on Windows (#8768)
* Support CLANG on Windows

* Put both backends in a windows ci

* remove coff loader

* use memmove

---------

Co-authored-by: b1tg <b1tg@users.noreply.github.com>
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
2025-01-28 18:19:34 +09:00
mesozoic-egg
af0452f116 move llvm_bf16_cast to renderer for CLANG and LLVM [pr] (#8720)
* handle bf16 via bitcasting for CLANG and LLVM

* On LLVM, skip float16 cast

* float32 on llvm lite, float32 elsewhere

* code format

* trigger pr

* move to rewriter

---------

Co-authored-by: Mesozoic Egg <mesozoic.egg@proton.mail>
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
2025-01-28 18:16:43 +09:00
nimlgen
d66680b17e hotfix: am: fix hang (#8783) 2025-01-28 11:54:19 +03:00
qazal
a65d2917cb remove unused fields from viz uop_to_json [pr] (#8782) 2025-01-28 10:50:11 +02:00
qazal
aefbc2637f test fixups from unmasked valid deletion [pr] (#8776) 2025-01-28 09:23:30 +02:00
qazal
ed672881b0 remove additions/deletion in pr + check uops are equal [pr] (#8779)
* use warnings there [pr]

* remove those + move assert_diff [pr]

* warn after log

* remove

* back
2025-01-28 08:57:34 +02:00
Ignacio Sica
2c71c60719 opt arg is int or tuple (#8780) 2025-01-28 11:02:32 +09:00
George Hotz
62655e4999 move multi into engine [pr] (#8778)
* move multi into engine [pr]

* all runtime is one sz
2025-01-28 09:15:29 +09:00
nimlgen
299fa8f37b am: unset high clocks for sleep (#8775) 2025-01-28 01:15:56 +03:00
chenyu
c99ae81f63 update default resnet LOSS_SCALER to 256 [pr] (#8774) 2025-01-27 16:59:05 -05:00
nimlgen
1c608ae34f am_smi: better spacing (#8773)
* am_smi: better spacing

* not used
2025-01-27 23:01:02 +03:00
Ignacio Sica
b240f12593 [TIP-9] rename Opt's amt to arg 2 (#8770)
* rename Opt amt to arg

* ignore_beam_cache for test_tiny

* move ignore_beam_cache to test_tiny

* move to separate pr

* revert space change

---------

Co-authored-by: chenyu <chenyu@fastmail.com>
2025-01-27 14:19:04 -05:00
chenyu
9760688e7f use IGNORE_BEAM_CACHE in search [pr] (#8772) 2025-01-27 13:41:01 -05:00
Ignacio Sica
ed1b573868 ignore beam cache in test_tiny for stateless beam (#8771) 2025-01-27 12:56:30 -05:00
George Hotz
3ed146a5ff Revert "rename Opt amt to arg (#8767)" (#8769)
This reverts commit bf041659a5.
2025-01-27 23:46:37 +09:00
Ignacio Sica
bf041659a5 rename Opt amt to arg (#8767) 2025-01-27 23:36:47 +09:00
George Hotz
96bff0b4f7 contiguous is no longer needed in SGD [pr] (#8760)
* contiguous is no longer needed in SGD [pr]

* add allow condition
2025-01-27 15:19:11 +09:00
b1tg
efc7971090 add windows test to ci (#8761)
Co-authored-by: b1tg <b1tg@users.noreply.github.com>
2025-01-27 14:53:21 +09:00
George Hotz
a9d9f98d05 hotfix: those tests fail locally on mac due to buffer count 2025-01-27 07:53:48 +09:00
George Hotz
2454bf01c3 hotfix: remove shapetracker spam in viz 2025-01-27 07:20:21 +09:00
qazal
d488bbb1ec share merge_views/valid creation for CONST/DEFINE_VAR (#8758)
* share valid creation behavior for CONST/DEFINE_VAR

* work
2025-01-26 17:41:54 +02:00
qazal
bbb2dd8141 move VALID creation after merging the views (#8757)
* do valid creation later

* work for view_left

* only view(const) makes valids in view_left

* cleaner bind diff
2025-01-26 16:58:05 +02:00
George Hotz
a6e496b195 remove Function class [pr] (#8753)
* remove Function class [pr]

* actually remove function

* fix docs
2025-01-26 18:58:02 +09:00
qazal
ac70f63d4b tensor_map cleanups [pr] (#8754)
* tensor_map cleanups [pr]

* update test_schedule too
2025-01-26 11:41:54 +02:00
George Hotz
b53fe7c2fc remove unused ctx [pr] (#8751)
* remove unused ctx [pr]

* fix test
2025-01-26 17:59:15 +09:00
qazal
06b58aa7ec move unneeded fields out of ScheduleContext [pr] (#8752) 2025-01-26 10:36:15 +02:00
George Hotz
1b4618e257 gradient cleanup (#8750)
* switch backward to use gradient [pr]

* set device correctly, dedup

* why does that fail?

* add noop cast

* simple backward

* fix beautiful_mnist

* touchups

* set in compute_gradient

* uop_count

* uop_count was wrong

* collections

* no note

* skip that test

* update sched kernel counts

* train mnist is 65

* fix metadata and gc

* fixes

* materialize_grads

* no pathlib stuff

* add contiguous_backward, fix bugs

* add some realize

* fix multi

* remove unused backward passes [pr]

* lower line count
2025-01-26 09:30:55 +09:00
George Hotz
b4bf6a7dea switch backward to use gradient [pr] (#8235)
* switch backward to use gradient [pr]

* set device correctly, dedup

* why does that fail?

* add noop cast

* simple backward

* fix beautiful_mnist

* touchups

* set in compute_gradient

* uop_count

* uop_count was wrong

* collections

* no note

* skip that test

* update sched kernel counts

* train mnist is 65

* fix metadata and gc

* fixes

* materialize_grads

* no pathlib stuff

* add contiguous_backward, fix bugs

* add some realize

* fix multi
2025-01-26 09:12:16 +09:00
George Hotz
0ffd572e1e fix multi with no real srcs (#8749) 2025-01-26 08:41:00 +09:00
qazal
0e42befc6e viz cleanups 2 [pr] (#8748)
* viz cleanups 2 [pr]

* test_viz updates
2025-01-25 19:41:57 +02:00
nimlgen
c74c5901a8 am disable bind (#8747) 2025-01-25 19:06:35 +03:00
qazal
a037201168 test_viz cleanups + move to /unit directory (#8746)
* test_viz cleanups + move to /unit directory

* lint
2025-01-25 14:33:31 +02:00
chenyu
e2b380b743 make UOp.multi real a tuple instead of list [pr] (#8744)
tuple is immutable. also updated test_rand_like_from_alu test
2025-01-24 20:47:27 -05:00
George Hotz
cb0978b377 add Ops.CONTIGUOUS_BACKWARD (#8743) 2025-01-25 07:28:43 +09:00
nimlgen
2f06eccf1d am: script and vfio msg (#8742)
* am: script and vfio msg

* use sysfs bars always for now

* tiny chnages
2025-01-25 00:33:00 +03:00
chenyu
0c759e1ff6 add bert to bechmark ci (#8741)
with `DISABLE_DROPOUT=1 BERT_LAYERS=2` for now
2025-01-24 14:45:11 -05:00
chenyu
e0e176efbc failed test case for multi rand_like [pr] (#8740)
new multi broke multi device dropout
2025-01-24 13:56:51 -05:00
nimlgen
dc10187fc0 am: add am_smi (#8739)
* am: start monitor

* cleanups

* fixes

* hmm

* progress

* cleanup
2025-01-24 20:16:19 +03:00
George Hotz
7a2223a6c6 add merge views to ops_folding [pr] (#8051)
Co-authored-by: qazal <qazal.software@gmail.com>
2025-01-24 17:45:11 +02:00
qazal
0814a79cb4 cleanup the merge_views upats [pr] (#8738) 2025-01-24 16:49:54 +02:00
qazal
07069b9988 rename to tensor_uop [pr] (#8737) 2025-01-24 13:42:25 +02:00
George Hotz
e82ba1454b MultiLazyBuffer is UOp [pr] (#8662)
* MultiLazyBuffer is UOp [pr]

* this is new mlb

* this is the idea

* progress

* multitensor works

* more movement ops

* this

* MultiLazyBuffer is UOp

* cleanups

* multi axis

* fix more tests

* work

* not that

* add multi grad and move shard to ops

* mops not views

* no double contig

* sweet, all mt tests passing

* port old logic

* remove lbs

* fix realized

* whitespace

* assign tweak

* test_assign_kv_cache_multi passes

* fix is_realized

* fix JIT for multi

* just a few more lines i'll pay them back soon i swear please bro just a few more

* no split reduceop for multi
2025-01-24 13:28:55 +09:00