Phaneesh Barwaria
436f58ddc4
cli using generate and mem fixes ( #1509 )
20230608.761
2023-06-08 13:13:32 -05:00
Phaneesh Barwaria
6b29bd17c8
Enable compilation vicuna ( #1507 )
...
* add cli for unsharded vic
* enable mlir download and compile
20230607.760
2023-06-07 13:08:22 -07:00
Ean Garvey
2c3485ca3e
Add standalone OPT sentence completion script. ( #1506 )
2023-06-07 10:58:03 -07:00
Daniel Garvey
f206ecc635
reenable compilation in vicuna pipeline, add flags ( #1505 )
...
* replace vicuna.py backend with pipeline
* add some memory management to fist vicuna compile
reenable compilation
2023-06-07 09:49:27 -07:00
Stefan Kapusniak
a187e05ae6
Prevent having no cuda devices breaking the UI ( #1503 )
...
Don't break the UI when the LLM tab only wants cuda devices but there
aren't any.
20230606.759
2023-06-06 11:41:16 -07:00
Gaurav Shukla
8c21960486
[vicuna] Set only cuda devices in vicuna UI for now
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-06-06 22:15:20 +05:30
Gaurav Shukla
be62fce676
[vicuna] Fix vicuna chatbot ( #1499 )
...
Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-06-06 09:23:32 -07:00
PhaneeshB
f23b778a6c
remove old vicuna scripts
2023-06-06 21:35:58 +05:30
PhaneeshB
436edf900d
add vic sharded pipeline
2023-06-06 21:35:58 +05:30
Gaurav Shukla
ed58c2553f
[vicuna] Integrate vicuna in shark studio
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-06-06 20:57:48 +05:30
Stefan Kapusniak
f2ca58e844
Add .csv and .json param info to output gallery ( #1495 )
2023-06-06 07:08:34 -07:00
Ean Garvey
1dbcc736eb
[SD] (RDNA2) Enable new tuning for sd1.4 ( #1498 )
2023-06-06 06:48:58 -07:00
Phaneesh Barwaria
a83808ddc5
Vicuna cuda on A100 40G ( #1496 )
...
* vic chat with memory management (precompiled vmfb)
* fix vmfb path and download
2023-06-06 15:10:33 +05:30
Ean Garvey
a07fe80530
Update OPT, ResNet example scripts. ( #1492 )
...
* Update API in OPT example.
* fix resnet50 script
* Add OPT1.3b test script.
20230605.757
20230605.756
2023-06-05 20:19:35 -07:00
Ean Garvey
d0ba3ef8fa
disable use_tuned on SD1.4 for rdna2 ( #1490 )
...
this is a temporary measure while we retune SD1.4 for rdna2. The current config fails during iree-compile.
2023-06-05 19:46:16 -05:00
Stefan Kapusniak
8400529c2c
Fix output gallery not using shark_tmp ( #1493 )
...
This fix the gallery component of the output gallery dumping temporary
files into the standard folders rather than shark_tmp so those files never
got cleared out on restart and would build up.
2023-06-05 16:23:49 -05:00
powderluv
7eaee9c242
update SHARK to nodai SHARK
20230605.755
2023-06-05 00:44:49 -07:00
powderluv
8230eebce5
Switch to CPU torch builds for shark.whl
20230605.754
2023-06-05 00:36:03 -07:00
Ean Garvey
6296ea4be9
fix config handling for sd1.4 on rdna2 ( #1489 )
2023-06-05 00:02:30 -07:00
Ean Garvey
4151ec3a8f
(pytest) tag efficientnet, mobilenet as xfails on vulkan ( #1488 )
2023-06-04 23:22:32 -07:00
powderluv
a2467e8d43
Enable SHARK whl packages
2023-06-04 23:21:22 -07:00
Ean Garvey
e677178bcc
Replace RDNA2 SD lowering configs. ( #1486 )
2023-06-05 00:57:43 -05:00
Anush Elangovan
7ef1bea953
XFAIL some macos tests
20230604.752
2023-06-04 15:27:03 -07:00
Chi_Liu
ad89bb1413
Add distilgpt2 to stablehlo in shark tank ( #1481 )
20230603.751
20230602.750
2023-06-02 16:44:46 -05:00
Ean Garvey
218ed78c40
Change instances of input_type='mhlo' to 'auto' ( #1482 )
2023-06-02 16:43:47 -05:00
Stefan Kapusniak
6046f36ab6
UI/Web: Fix upscaler stop button (mostly) ( #1479 )
...
* UI/Web: Fix upscaler stop button
* Hook the cancel_sd function up to the Stop button.
* Adds checks for SD_STATE_CANCEL in the upscaler ui inference function.
* Set and check for SD_STATE_IDLE, SD_STATE_CANCEL in the upscaler
pipeline.
* UI/Web: lint fixes for upscaler stop button fix
---------
Co-authored-by: powderluv <powderluv@users.noreply.github.com >
2023-06-01 22:26:55 -07:00
Foxlum
5915bf7de3
Add to and tweak vulkan configuration environments. ( #1475 )
...
* Update vulkan_target_env_utils.py
* Update vulkan_target_env_utils.py
Adjust target environment capabilities.
* Update vulkan_target_env_utils.py
black linted?
2023-06-01 22:25:20 -07:00
Phaneesh Barwaria
f0a4e59758
LLM Pipeline Wrapper ( #1477 )
...
* [LLM] Add LLM pipeline
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
* add base pipeline and stableLM
* StableLM on UI - full block
* add SLM default model name
* add vicuna with pipeline
* add one token gen api for vic
* Fix stableLM bugs
* debug vic memory
* lint fix
---------
Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com >
Co-authored-by: Gaurav Shukla <gaurav@nod-labs.com >
20230601.749
20230531.748
20230531.747
2023-05-31 10:17:20 -07:00
Stefan Kapusniak
1ddef26af5
Web/UI: Add an Output Gallery tab for SD ( #1470 )
...
* WebUI: Adds an Output Gallery tab
Adds an new Output Gallery tab to the ui/webui with these features:
* Subdirectory select dropdown listing subdirectories at any depth below
the <output_dir>/generated_imgs directory,
* Large, full height, gallery area displaying the images in the selected
subdirectory. Shows nod logo when no images are in the selected
subdirectory.
* Slider that changes the number of columns of images that the gallery
displays from between 1 to 16 columns (defaults to 4).
* Expandable parameter info panel showing any generation parameters
saved in the file of the selected image for PNGs, alternatively the
image's EXIF data for JPEGs
* Send to buttons for txt2img, img2img, inpaint, outpaint and upscaler.
* Auto update of gallery and gallery label (to show generation status),
when a new image is generated by any of the stable diffusion tabs, and
is outputted to the currently selected subdirectory.
* Command line option for enabling and disabling the output gallery
(defaults to enabled)
* Command line option for following symlinks when getting entries
for the subdirectory list (defaults to off, as Python os.walk doesn't
check for circular references if following symlinks)
* Reformat with black
Reformat changes with black and then adjust some places where black's
formatting then needed some rephrasing of the code to make things
clearer.
* Add back transformers and sd_cancel imports
Adds back the transformers import in index.py needed for .exe
generation. Add comment so it doesn't get mistakenly removed
next time.
Adds back sd_cancel import in upscaler.py that is currently unused
but should be being used for the 'Stop' button.
20230531.746
2023-05-30 13:47:48 -07:00
Chi_Liu
ba8eddb12f
Add GPT3/OPT to Stablehlo in shark tank ( #1468 )
...
Co-authored-by: AmosLewis <Amos_Lewsi@foxmail.com >
Co-authored-by: Ean Garvey <87458719+monorimet@users.noreply.github.com >
20230530.745
2023-05-29 21:58:39 -07:00
yzhang93
47b346d428
Modify the lowering config format for SPIRVMatmulPromoteVectorize pipeline ( #1471 )
2023-05-29 21:53:48 -07:00
Ean Garvey
1b4f4f5f4d
Fix download path for SD1.4 Unet. ( #1469 )
20230528.744
20230527.743
20230526.742
2023-05-26 11:59:51 -07:00
Elias Joseph
73cd7e8320
added full vicuna to vicuna.py
2023-05-26 22:06:40 +05:30
Ean Garvey
19c0ae3702
Cleanup SD pipeline utils ( #1466 )
20230525.741
2023-05-25 12:50:11 -05:00
Ean Garvey
54e57f7771
Revive SD downloads from shark_tank. ( #1465 )
2023-05-25 12:03:21 -05:00
PhaneeshB
6d64b8e273
vic and slm common generation base
2023-05-25 20:29:41 +05:30
PhaneeshB
a8ea0326f5
correct SLM saved vmfb naming
2023-05-25 20:29:41 +05:30
PhaneeshB
58e9194553
add Lists import
2023-05-25 20:29:41 +05:30
PhaneeshB
eb360e255d
remove unused imports
2023-05-25 20:29:41 +05:30
PhaneeshB
a6f88d7f72
refactor mlir compile
2023-05-25 20:29:41 +05:30
Prashant Kumar
8e571d165f
Enable cpu f16 dtype tracing for the vicuna model. ( #1461 )
20230524.740
2023-05-24 09:37:57 -07:00
Ean Garvey
3cddd01b10
Update OPT tokenizer and xfail a few more large tests on macos CI ( #1459 )
...
* Update opt_torch_test.py
* Update all_models.csv
20230523.739
2023-05-23 14:36:57 -07:00
Chi_Liu
64c2b2d96b
Add gpt2 to stablehlo support in shark tank ( #1447 )
...
- Add torch decomposition support when generating shark tank
- Add gpt2 stablehlo
20230522.738
2023-05-22 10:45:51 -07:00
Phaneesh Barwaria
f5ce121988
SLM on Sharkstudio ( #1454 )
...
* localize import, fix file reading, device cpu
* extract out model args
20230521.737
20230520.736
20230519.735
20230519.734
2023-05-19 11:21:08 -07:00
Ean Garvey
991f144598
Add iree hidden imports to SD spec ( #1456 )
...
* Add iree hidden imports to SD spec
* Update shark_sd_cli.spec
2023-05-19 11:19:16 -07:00
PhaneeshB
09bea17e59
fix #2 SLM in SharkStudio
20230517.731
2023-05-18 00:56:22 +05:30
Daniel Garvey
aefcf80b48
swap to cpu an remove hardcoded paths ( #1448 )
...
Co-authored-by: powderluv <powderluv@users.noreply.github.com >
2023-05-17 10:53:34 -07:00
PhaneeshB
512235892e
fix SLM for SharkStudio
2023-05-17 22:34:30 +05:30
PhaneeshB
6602a2f5ba
add continuous output for CLI
2023-05-17 18:33:46 +05:30
Boian Petkantchin
20114deea0
In MiniLM JAX example verify MLIR result against JAX
20230516.730
2023-05-16 09:54:07 -07:00