On Windows, this gets us all the way to failing in the iree compile with SD 2.1 base.
- Fix merge errors with sd right pane config UI tab.
- Remove non-requirement.txt install/build of torch/mlir/iree/SRT in setup_venv.ps1, fixing "torch.compile not supported on Windows" error.
- Fix gradio deprecation warning for `root=` FileExplorer kwarg.
- Comment out `precision` and `max_length` kwargs being passed to unet, as not yet supported on main Turbine branch. Avoids keyword argument error.
* Studio2/SD: Fix sd pipeline up to "Windows not supported"
A number of fixes to the SD pipeline as run from the UI, up until the point that dynamo
complains "Windows not yet supported for torch.compile".
* Remove separate install of iree-runtime and iree-compile in setup_venv.ps1, and rely on the
versions installed via the Turbine requirements.txt. Fixes #2063 for me.
* Replace any "None" strings with python None when pulling the config in the UI.
* Add 'hf_auth_token' param to api StableDiffusion class, defaulting to None, and then pass
that in to the various Models where it is required and wasn't already being done before.
* Fix clip custom_weight_params being passed to export_clip_model as "external_weight_file"
rather than "external_weights"
* Don't pass non-existing "custom_vae" parameter to the Turbine Vae Model, instead
pass custom_vae as the "hf_model_id" if it is set. (This may be wrong in the custom vae
case, but it stops the code *always* breaking.)
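
A minimal sketch of the custom_vae fallback described in the last item. The helper name `resolve_vae_model_id` and the example ids are hypothetical and only illustrate the selection rule; this is not the actual Studio or Turbine code.

```python
from typing import Optional


def resolve_vae_model_id(base_model_id: str, custom_vae: Optional[str] = None) -> str:
    """Pick the value to pass as hf_model_id (hypothetical helper).

    Uses custom_vae when it is set, otherwise falls back to the base model id.
    """
    if custom_vae:
        return custom_vae
    return base_model_id


# Illustrative ids only; they do not refer to real repositories.
default_id = resolve_vae_model_id("example/base-model")
custom_id = resolve_vae_model_id("example/base-model", "example/custom-vae")
```

As the item itself notes, substituting the id this way may not be correct for every custom VAE; it only avoids passing a parameter the model does not accept.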
* Studio2/SD/UI: Improve UI config None handling
* When populating the UI from a JSON Config, set controls to "None" for null/None
values.
* When generating a JSON Config from the UI, set props to null/None for controls
set to "None".
* Use null rather than the string 'None' in the default config
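
The None handling above could look roughly like the sketch below. It assumes a flat dict config; the helper names `config_to_ui` and `ui_to_config` and the example keys are hypothetical and are not taken from the actual change.

```python
import json

NONE_LABEL = "None"  # label a UI control shows for an unset value


def config_to_ui(config: dict) -> dict:
    """Map null/None config values to the "None" label used by UI controls."""
    return {key: NONE_LABEL if value is None else value for key, value in config.items()}


def ui_to_config(controls: dict) -> dict:
    """Map controls set to "None" back to a real None (null in JSON)."""
    return {key: None if value == NONE_LABEL else value for key, value in controls.items()}


# Hypothetical config fragment; the keys are illustrative only.
config = json.loads('{"custom_vae": null, "steps": 50}')
controls = config_to_ui(config)
round_tripped = json.dumps(ui_to_config(controls))
```

Keeping null in the stored JSON means downstream code can test `is None` directly instead of comparing against the string 'None'.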
---------
Co-authored-by: Ean Garvey <87458719+monorimet@users.noreply.github.com>
* HF-Reference LLM mode.
* Fixup test to match current output from Turbine.
* lint
* Fix test error message + Only initialize HF torch model when used.
* Remove redundant format_out change.
* Streaming LLM
* Update precision and add gpu support
* (studio2) Separate weights generation for quantization support
* Adapt prompt changes to studio flow
* Remove outdated flag from llm compile flags.
* (studio2) use turbine vmfbRunner
* tweaks to prompts
* Update CPU path and llm api test.
* Change device in test to cpu.
* Fixes to runner, device names, vmfb mgmt
* Use small test without external weights.
* Remove duplicate os import
* Remove duplicate parse_seed_input function
Migrating to JSON requests in SD UI
More UI and app flow improvements, logging, shared device cache
Model loading
Complete SD pipeline.
Tweaks to VAE, pipeline states
Pipeline tweaks, add cmd_opts parsing to sd api
UI/app structure and utility implementation.
- Initializers for webui/API launch
- Schedulers file for SD scheduling utilities
- Additions to API-level utilities
- Added embeddings module for LoRA, Lycoris, yada yada
- Added image_processing module for resamplers, resize tools,
transforms, and any image annotation (PNG metadata)
- shared_cmd_opts module -- sorry, this is stable_args.py. It lives on.
We still want to have some global control over the app exclusively
from the command-line. At least we will be free from shark_args.
- Moving around some utility pieces.
- Try to make api+webui concurrency possible in index.py
- SD UI -- this is just img2imgUI but hopefully a little better.
- UI utilities for your nod logos and your gradio temps.
Enable UI / bugfixes / tweaks