Francis Lata
99efa2cfde
Merge branch 'master' into retinanet_mlperf
2024-11-18 04:42:57 -08:00
ignaciosica
597a239e28
Remove UnaryOps, BinaryOps, TernaryOps, MetaOps [pr] ( #7725 )
...
* remove unaryops
* remove ternaryops
* remove metaops
* hotfix
* remove binaryops
* hotfix: test_pattern_matcher
---------
Co-authored-by: qazal <77887910+Qazalin@users.noreply.github.com >
2024-11-16 20:56:56 +08:00
geohotstan
f8056a74d6
combine pad2d with pad ( #7677 )
...
* I have pad2d, I have pad, uuh~, pad2dpad~
* fix some small things
* strategically placed cast hack
* fix more
* fix more more
* tests
* periods
2024-11-14 17:56:02 +08:00
Francis Lata
a0c0a77f54
Merge branch 'master' into retinanet_mlperf
2024-11-13 21:30:12 -08:00
chenyu
4c5f7ddf1f
flux set model path in args ( #7660 )
...
in addition to default downloading through fetch, add an arg to pass model path directly
2024-11-12 22:11:40 -05:00
Francis Lata
e807cf817d
add LR scheduler and the start of training step
2024-11-12 02:48:31 -08:00
Francis Lata
50abdc22c8
add proper training loop over the training dataset
2024-11-09 17:45:55 -08:00
Francis Lata
bf2dc3ae33
Merge branch 'master' into retinanet_mlperf
2024-11-09 17:00:30 -08:00
Harald Schäfer
e7cbc29f48
openpilot benchmark: add cast from numpy to benchmark ( #7593 )
...
* openpilot benchmark: add cast from numpy to benchmark
* whitespace
* comment
2024-11-08 19:31:00 +08:00
Anthony DeMattos
953ef1b57e
tinychat ui +/- 20 lines ( #7471 )
...
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com >
2024-11-06 14:23:55 +08:00
Francis Lata
bb6f27d2f3
Merge branch 'master' into retinanet_mlperf
2024-11-04 19:19:22 -08:00
George Hotz
c8bf09b7d4
s/UOps/Ops ( #7500 )
...
* s/UOps/Ops [pr]
* fix
2024-11-03 11:26:10 +08:00
George Hotz
72a9ac27e9
support image dtype in cloud [pr] ( #7482 )
...
* support image dtype in cloud [pr]
* remove outdated osx hack
* unused imports
2024-11-02 23:54:27 +08:00
Tobias Fischer
7c9a1d69f9
sdxl gen fix ( #7459 )
2024-11-01 13:57:01 -04:00
gonutz
e7cbc6dc23
Fix ValueError in Yolo 8 example ( #7387 )
...
Calling
python3 examples/yolov8.py ./test/models/efficientnet/Chicken.jpg
used to result in this error
ValueError: Calling nonzero on 0d arrays is not allowed.
Using np.atleast_1d makes sure we avoid a zero-dimension array.
Co-authored-by: gonutz <gonutz@fake.mail >
2024-10-30 10:18:39 +08:00
George Hotz
3989bd2682
idiv + reciprocal [pr] ( #7354 )
...
* idiv + reciprocal
* remove upcast from div
* fix docs
2024-10-29 15:54:19 +08:00
chenyu
4a03e00aa1
fix llama3 download_model assert ( #7320 )
...
false positive if download_model and model are not provided
2024-10-27 11:20:24 -04:00
eliotgolding
e920f1d663
Llama 3.2 1B load from GGUF ( #7295 )
...
* gguf 1b-instruct
* not needed
2024-10-27 09:29:02 +08:00
Francis Lata
8a5cbb14e4
Merge branch 'master' into retinanet_mlperf
2024-10-25 22:56:30 -07:00
Francis Lata
6e3efd4ed6
add validation set test
2024-10-25 22:55:49 -07:00
Francis Lata
65c561a618
update image to be float32
2024-10-25 21:18:34 -07:00
Francis Lata
4b21a8fb8d
got dataloader with normalize working
2024-10-25 20:25:07 -07:00
George Hotz
dc3148c677
hotfix: minor speed increase + stable diffusion relax
2024-10-25 16:27:21 +08:00
Francis Lata
967438ca71
Merge branch 'master' into retinanet_mlperf
2024-10-22 02:48:51 -07:00
leopf
87877d7a91
GGUF cleanup ( #7192 )
...
* cleanup
* remove vocab size hard code
2024-10-21 10:44:54 -04:00
leopf
b6d9b276bb
GGUF support ( #7046 )
...
* basic loader, untested
* testing
* remove utils import in test
* q8_0
* q4_1
* end to end testing
* minor cleanup
* fix casting
* moved to state
* move tests
* move dequant to fn
* fix lint elif
* remove gguf from extra
* fix dict union
* q6_k simpler
* naming and spacing
* gpt2-gguf example
* cleanup
* move gguf example
* minor cleanup
---------
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com >
2024-10-21 16:15:34 +08:00
Francis Lata
d9d65b9537
cleanup dataloader test and revert shm path
2024-10-19 17:32:58 -07:00
qazal
30989fb459
changes from the big graph branch [pr] ( #7160 )
...
* metaops srcs
* delete multioutput ctx var
* always has metadata
* shorter path for realized
* this still needs inputs
This reverts commit a59cbb2886 .
2024-10-19 16:22:37 +03:00
Francis Lata
4bebe61a9c
add dataloader + test
2024-10-16 15:38:47 -04:00
Francis Lata
3d857d758e
Merge branch 'master' into retinanet_mlperf
2024-10-16 15:36:37 -04:00
Francis Lata
90eff347e2
tinytqdm write support ( #6359 )
...
* add write support
* add test
* update test case to compare write outputs
* assert final write output
* flush when using write
* update write logic
* Revert "update write logic"
This reverts commit 5e0e611b46 .
---------
Co-authored-by: chenyu <chenyu@fastmail.com >
2024-10-16 14:51:41 -04:00
Francis Lata
498141c579
Merge branch 'master' into retinanet_mlperf
2024-10-16 10:14:39 -04:00
George Hotz
3169cb386d
remove graph [pr] ( #7085 )
2024-10-16 11:40:07 +08:00
George Hotz
26df50cf43
move memory_planner to memory.py [pr] ( #7079 )
2024-10-16 10:04:35 +08:00
Francis Lata
d5813a3c42
Merge branch 'master' into retinanet_mlperf
2024-10-12 22:04:58 -04:00
chenyu
ed1ed9e4ff
bert use BS=72 ( #7015 )
...
memory 131 -> 138
green tflops 201 -> 209
red tflops 160 -> 169
2024-10-12 09:41:56 -04:00
George Hotz
a71bb09ec3
remove symbolic file [pr] ( #7012 )
2024-10-12 18:44:44 +08:00
Francis Lata
1295a3020f
Merge branch 'master' into retinanet_mlperf
2024-10-11 23:08:17 -04:00
Francis Lata
b802f74cee
add dataloader
2024-10-11 23:04:21 -04:00
George Hotz
5c9f76e274
hotfix: openpilot compile3 compare to i==1
2024-10-12 09:44:24 +08:00
chenyu
36056e0760
update mlperf systems and copy 4.1 to 5.0 ( #7004 )
2024-10-11 16:20:34 -04:00
chenyu
0e42662f2a
log seed at the right place for bert ( #7000 )
2024-10-11 10:39:40 -04:00
nimlgen
5496a36536
update red mlperf bert readme ( #6969 )
2024-10-11 13:08:06 +03:00
Friedrich Carl Eichenroth
859d6d0407
Fix mypy examples/beautiful_*.py ( #6978 )
...
* fix mypy examples/beautiful_*.py
* backwards
* add test
* Revert "add test"
This reverts commit 4d88845ba3 .
---------
Co-authored-by: chenyu <chenyu@fastmail.com >
2024-10-10 11:34:29 -04:00
Kinvert
960c495755
added beautiful fashion mnist and example ( #6961 )
...
* added beautiful fashion mnist and example
* fixing whitespace
* refactor Fashion MNIST to fewer lines
* fix newline to reduce diff
* Update beautiful_mnist.py
* Update beautiful_mnist.py
---------
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com >
2024-10-10 12:01:07 +08:00
chenyu
b5546912e2
10% more TRAIN_STEPS for bert ( #6971 )
...
got two very close run, adding more steps for buffer
2024-10-09 19:21:43 -04:00
chenyu
35cf48659b
limit beam param for bert on green ( #6966 )
...
seems to mitigate the crash
2024-10-09 11:48:18 -04:00
chenyu
1ff2c98f8a
fix logfile name for bert red ( #6952 )
2024-10-08 05:37:52 -04:00
chenyu
a78c96273a
update bert epoch logging ( #6940 )
...
* update bert epoch logging
epoch for bert is simply number of examples seen (which is used for RCP check)
* update total steps too
* more changes
2024-10-08 00:34:06 -04:00
chenyu
102dfe5510
back to 2**10 for bert loss scaler ( #6934 )
...
getting 2 NaN for this, revert back to 2**10
2024-10-07 10:17:21 -04:00