Ean Garvey
19c0ae3702
Cleanup SD pipeline utils ( #1466 )
20230525.741
2023-05-25 12:50:11 -05:00
Ean Garvey
54e57f7771
Revive SD downloads from shark_tank. ( #1465 )
2023-05-25 12:03:21 -05:00
PhaneeshB
6d64b8e273
vic and slm common generation base
2023-05-25 20:29:41 +05:30
PhaneeshB
a8ea0326f5
correct SLM saved vmfb naming
2023-05-25 20:29:41 +05:30
PhaneeshB
58e9194553
add Lists import
2023-05-25 20:29:41 +05:30
PhaneeshB
eb360e255d
remove unused imports
2023-05-25 20:29:41 +05:30
PhaneeshB
a6f88d7f72
refactor mlir compile
2023-05-25 20:29:41 +05:30
Prashant Kumar
8e571d165f
Enable cpu f16 dtype tracing for the vicuna model. ( #1461 )
20230524.740
2023-05-24 09:37:57 -07:00
Ean Garvey
3cddd01b10
Update OPT tokenizer and xfail a few more large tests on macos CI ( #1459 )
...
* Update opt_torch_test.py
* Update all_models.csv
20230523.739
2023-05-23 14:36:57 -07:00
Chi_Liu
64c2b2d96b
Add gpt2 to stablehlo support in shark tank ( #1447 )
...
- Add torch decomposition support when generating shark tank
- Add gpt2 stablehlo
20230522.738
2023-05-22 10:45:51 -07:00
Phaneesh Barwaria
f5ce121988
SLM on Sharkstudio ( #1454 )
...
* localize import, fix file reading, device cpu
* extract out model args
20230521.737
20230520.736
20230519.735
20230519.734
2023-05-19 11:21:08 -07:00
Ean Garvey
991f144598
Add iree hidden imports to SD spec ( #1456 )
...
* Add iree hidden imports to SD spec
* Update shark_sd_cli.spec
2023-05-19 11:19:16 -07:00
PhaneeshB
09bea17e59
fix #2 SLM in SharkStudio
20230517.731
2023-05-18 00:56:22 +05:30
Daniel Garvey
aefcf80b48
swap to cpu and remove hardcoded paths ( #1448 )
...
Co-authored-by: powderluv <powderluv@users.noreply.github.com >
2023-05-17 10:53:34 -07:00
PhaneeshB
512235892e
fix SLM for SharkStudio
2023-05-17 22:34:30 +05:30
PhaneeshB
6602a2f5ba
add continuous output for CLI
2023-05-17 18:33:46 +05:30
Boian Petkantchin
20114deea0
In MiniLM JAX example verify MLIR result against JAX
20230516.730
2023-05-16 09:54:07 -07:00
Boian Petkantchin
9acf519078
Add option to skip venv creation in setup script
2023-05-16 09:54:07 -07:00
Boian Petkantchin
bdf37b5311
If device/backend is unknown pass it to IREE verbatim
2023-05-16 09:54:07 -07:00
powderluv
8ee2ac89f8
Rename sharded_vicuna_fp32_web.py to vicuna_web.py
2023-05-16 09:41:35 -07:00
powderluv
60cb48be2e
Rename sharded_vicuna_fp32.py to vicuna.py
2023-05-16 09:40:51 -07:00
powderluv
86a215b063
Delete sharded_vicunia.py
2023-05-16 09:37:39 -07:00
powderluv
d6e3a9a236
Delete standalone_vicuna.py
2023-05-16 09:37:26 -07:00
Chi_Liu
a0097a1ead
Add mlir_type for torch_model_list.csv ( #1428 )
...
- Enable stablehlo/tosa mlir output for torch model
- Add BERT stablehlo support
20230515.729
2023-05-15 10:23:54 -07:00
Ean Garvey
a9bae00606
Fix vulkan device selection at compile time and adapt to IREE python changes. ( #1407 )
...
* Add support for vulkan device selection at compile time.
* Don't convert device ID to int and fix .exe imports
20230514.728
20230513.727
2023-05-12 23:31:50 -07:00
Daniel Garvey
4731c1a835
prevent loading tokenizer on import ( #1432 )
...
also adds sentencepiece dep for exe
moved vicuna imports to after an if statement
in general we should avoid importing files that load whole models as global variables
20230512.725
20230512.724
2023-05-12 19:11:45 -07:00
Ean Garvey
4c07e47e8c
Specify a few models for expected failure on CUDA CI. ( #1430 )
2023-05-12 17:03:37 -05:00
Gaurav Shukla
e0cc2871bb
[SD] Yield 2 tokens at a time in vicuna
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
20230511.723
2023-05-11 23:49:01 +05:30
Gaurav Shukla
649f39408b
[SD] Fix vicuna response
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-05-11 18:06:21 +05:30
Gaurav Shukla
c142297d73
[SD] Fix gradio to 3.22.0 version
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-05-11 18:05:55 +05:30
Gaurav Shukla
9e07360b00
[SD] Standalone vicuna with web
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-05-11 17:23:44 +05:30
Gaurav Shukla
7b74c86e42
[SD] Fix SAMPLE_INPUT_LEN import issue
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-05-11 15:41:43 +05:30
Eliasj42
fa833f8366
fixed spacing issue with chat-bot ( #1417 )
...
Co-authored-by: Elias Joseph <elias@nod-labs.com >
20230510.722
2023-05-10 16:07:50 -07:00
Gaurav Shukla
fcb059aa38
[SD] Integrate vicuna in the web ( #1410 )
2023-05-10 11:30:22 -07:00
PhaneeshB
517c670f82
vicuna chat cli
2023-05-10 22:55:06 +05:30
Eliasj42
59df14f18b
added vicuna demo ( #1408 )
...
Co-authored-by: Elias Joseph <elias@nod-labs.com >
20230510.721
20230509.720
2023-05-09 21:18:20 -07:00
Ean Garvey
6c95ac0f37
Revert dialect registration in model annotator ( #1406 )
...
Matches https://github.com/nod-ai/SHARK-Runtime/pull/58
20230509.719
2023-05-09 11:50:19 -07:00
Daniel Garvey
7a4a51ae73
vulkan vic f16 ( #1404 )
...
Co-authored-by: dan <dan@nod-labs.com >
20230508.718
2023-05-08 16:46:53 -07:00
powderluv
d816cc015e
Revert "added standalone vicuna script ( #1399 )" ( #1402 )
...
This reverts commit 0e4a8ca240 .
20230507.717
20230506.716
20230505.715
2023-05-05 16:08:05 -07:00
Eliasj42
54ce3d48ca
added standalone vicuna script ( #1401 )
...
Co-authored-by: Elias Joseph <elias@nod-labs.com >
2023-05-05 18:05:52 -05:00
Eliasj42
0e4a8ca240
added standalone vicuna script ( #1399 )
...
Co-authored-by: Elias Joseph <elias@nod-labs.com >
2023-05-05 15:46:05 -07:00
Daniel Garvey
6ca1298675
maximizes window size for webview launch ( #1394 )
20230504.714
2023-05-04 20:43:06 -07:00
jinchen62
bbef7a6464
Redesign model manager webui ( #1391 )
2023-05-04 20:41:29 -07:00
Ean Garvey
cdf2d61d53
Remove imports from iree.compiler.transforms from model annotator. ( #1392 )
2023-05-04 20:40:19 -07:00
Ean Garvey
6c14847d1f
xfail some large tests on macOS builder and switch to hash updates. ( #1341 )
...
* Update test-models.yml
* Disable large tests on macOS builder
2023-05-04 19:47:03 -05:00
Gaurav Shukla
68ecdd2a73
[SD] Add LoRA as experimental tab
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
20230504.713
2023-05-04 22:30:25 +05:30
Gaurav Shukla
3f4d444d18
[SD] Fix stable LM chatbot
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-05-04 22:30:25 +05:30
m68k-fr
e473d0375b
[Web] Models folders cleanup ( #1365 )
20230503.712
2023-05-03 16:13:20 -05:00
Ean Garvey
e38d96850f
Fix input image loading in img2img rest API ( #1388 )
2023-05-03 15:51:00 -05:00
Gaurav Shukla
fed63dfd4b
[SD] Add stableLM chatbot ( #1383 )
...
Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com >
Co-authored-by: powderluv <powderluv@users.noreply.github.com >
2023-05-03 15:37:20 -05:00