AMD-SHARK-Studio

mirror of https://github.com/nod-ai/AMD-SHARK-Studio.git synced 2026-04-03 03:00:17 -04:00

Author	SHA1	Message	Date
Ean Garvey	9f59a16596	fix formatting and disable explicit vulkan env settings.	2024-03-28 23:39:52 -05:00
Ean Garvey	0ade2ec00d	Cleanup sd model map.	2024-03-28 09:59:31 -05:00
gpetters-amd	44ef35f4db	Fix the .exe (#2101)	2024-03-21 19:55:13 -05:00
Ean Garvey	f7d1af46f4	Small fixes to sd, pin mpmath	2024-02-29 19:16:47 -06:00
Ean Garvey	60c013e4f0	Tweak compile-time flags for SD submodels.	2024-02-20 08:51:00 -06:00
Ean Garvey	92c11be5c6	Merge branch 'main' into sd-studio2	2024-02-19 11:23:36 -06:00
Stefan Kapusniak	c507f7d6f6	Studio2/SD/UI: Further sd ui pipeline fixes (#2091) On Windows, this gets us all the way to failing in the iree compile with SD 2.1 base. - Fix merge errors with sd right pane config UI tab. - Remove non-requirement.txt install/build of torch/mlir/iree/SRT in setup_venv.ps1, fixing "torch.compile not supported on Windows" error. - Fix gradio deprecation warning for `root=` FileExplorer kwarg. - Comment out `precision` and `max_length` kwargs being passed to unet, as not yet supported on main Turbine branch. Avoids keyword argument error.	2024-02-18 20:55:16 -06:00
Stefan Kapusniak	6dc39e6a66	Studio2/SD: Fix sd pipeline up to "Windows not supported" (#2082) * Studio2/SD: Fix sd pipeline up to "Windows not supported" A number of fixes to the SD pipeline as run from the UI, up until the point that dynamo complains "Windows not yet supported for torch.compile". * Remove separate install of iree-runtime and iree-compile in setup_venv.ps1, and rely on the versions installed via the Turbine requirements.txt. Fixes #2063 for me. * Replace any "None" strings with python None when pulling the config in the UI. * Add 'hf_auth_token' param to api StableDiffusion class, defaulting to None, and then pass that in to the various Models where it is required and wasn't already being done before. * Fix clip custom_weight_params being passed to export_clip_model as "external_weight_file" rather than "external_weights" * Don't pass non-existing "custom_vae" parameter to the Turbine Vae Model, instead pass custom_vae as the "hf_model_id" if it is set. (this may be wrong in the custom vae case, but stops the code always breaking). * Studio2/SD/UI: Improve UI config None handling * When populating the UI from a JSON Config set controls to "None" for null/None values. * When generating a JSON Config from the UI set props to null/None for controls set to "None". * Use null rather than the string 'None' in the default config --------- Co-authored-by: Ean Garvey <87458719+monorimet@users.noreply.github.com>	2024-02-17 21:37:29 -06:00
Ean Garvey	39ebc45393	Small fixes	2024-02-12 16:31:14 -06:00
Ean Garvey	5f675e18af	Formatting and init files.	2024-02-12 16:31:12 -06:00
Ean Garvey	be4c49a1b9	Add rest API endpoint from LanguageModel API	2024-02-12 16:24:55 -06:00
Stanley Winata	230638ab9a	HF-Reference LLM mode + Update test result to match latest Turbine. (#2080) * HF-Reference LLM mode. * Fixup test to match current output from Turbine. * lint * Fix test error message + Only initialize HF torch model when used. * Remove redundant format_out change.	2024-02-12 16:24:55 -06:00
Ean Garvey	25312cd791	Add StreamingLLM support to studio2 chat (#2060) * Streaming LLM * Update precision and add gpu support * (studio2) Separate weights generation for quantization support * Adapt prompt changes to studio flow * Remove outdated flag from llm compile flags. * (studio2) use turbine vmfbRunner * tweaks to prompts * Update CPU path and llm api test. * Change device in test to cpu. * Fixes to runner, device names, vmfb mgmt * Use small test without external weights.	2024-02-12 16:24:55 -06:00
Ean Garvey	019ba7051d	Small cleanup	2024-02-02 11:25:41 -06:00
Stanley Winata	6bf51f1f1d	HF-Reference LLM mode + Update test result to match latest Turbine. (#2080) * HF-Reference LLM mode. * Fixup test to match current output from Turbine. * lint * Fix test error message + Only initialize HF torch model when used. * Remove redundant format_out change.	2024-02-01 11:46:22 -06:00
Ean Garvey	05b498267e	Add StreamingLLM support to studio2 chat (#2060) * Streaming LLM * Update precision and add gpu support * (studio2) Separate weights generation for quantization support * Adapt prompt changes to studio flow * Remove outdated flag from llm compile flags. * (studio2) use turbine vmfbRunner * tweaks to prompts * Update CPU path and llm api test. * Change device in test to cpu. * Fixes to runner, device names, vmfb mgmt * Use small test without external weights.	2024-01-18 19:01:07 -06:00
Stefan Kapusniak	7a0017df33	Studio2: Remove duplications from api/utils.py (#2035) * Remove duplicate os import * Remove duplicate parse_seed_input function Migrating to JSON requests in SD UI More UI and app flow improvements, logging, shared device cache Model loading Complete SD pipeline. Tweaks to VAE, pipeline states Pipeline tweaks, add cmd_opts parsing to sd api	2024-01-17 12:14:39 -06:00
Ean Garvey	dbacc36a92	(WIP): Studio2 app infra and SD API UI/app structure and utility implementation. - Initializers for webui/API launch - Schedulers file for SD scheduling utilities - Additions to API-level utilities - Added embeddings module for LoRA, Lycoris, yada yada - Added image_processing module for resamplers, resize tools, transforms, and any image annotation (PNG metadata) - shared_cmd_opts module -- sorry, this is stable_args.py. It lives on. We still want to have some global control over the app exclusively from the command-line. At least we will be free from shark_args. - Moving around some utility pieces. - Try to make api+webui concurrency possible in index.py - SD UI -- this is just img2imgUI but hopefully a little better. - UI utilities for your nod logos and your gradio temps. Enable UI / bugfixes / tweaks	2024-01-17 12:14:36 -06:00
Ean Garvey	fa95ed30d1	Relocate quantized matmul reassociation flag (#2047) * Remove quantized matmul reassociation flag This flag should be a model/use-case specific addition, not a default CPU compile flag.	2023-12-20 12:48:40 -08:00
Daniel Garvey	ebfcfec338	remove shark 1.0 tests, add support for 2.0 llm * add support for external weights * add tests and edit deps	2023-12-14 21:44:37 -06:00
Ean Garvey	f6d41affd9	(SHARK Studio) Add Turbine-based llm chatbot. (#1933) * Dan shark studio (#1970) * Fix issue in Falcon-GPTQ * initial webui and llama2 --------- Co-authored-by: Vivek Khandelwal <vivekkhandelwal1424@gmail.com> * Fix formatting. --------- Co-authored-by: Daniel Garvey <34486624+dan-garvey@users.noreply.github.com> Co-authored-by: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2023-11-14 09:56:28 -06:00

21 Commits