SHARK-Studio

mirror of https://github.com/nod-ai/SHARK-Studio.git synced 2026-01-13 15:57:56 -05:00

Author	SHA1	Message	Date
Stefan Kapusniak	c2163488d8	SD/UI Restrict hires fix/img2img resamplers/schedulers (#1955 ) * Restrict resamplers for img2img and high res fix to the ones that PIL.Image actually supports, since it uses that to di the resampling. Removed: Antialias, Affine, Cubic. Added: Hamming. * Set list of available schedulers to CPU only when high res fix is selected in the web ui. Set list to all schdulers when high res fix is deselected. * Put hi res fix in its own Accordian in the txt2img UI instead of grouping it with Advanced Options.	2023-11-13 16:08:24 -06:00
PhaneeshB	32cab73a29	add iree-rocm-target-chip only if added by user	2023-11-13 23:35:55 +05:30
PhaneeshB	392bade0bf	enable non default rocm device selection for webui	2023-11-13 23:35:55 +05:30
Stefan Kapusniak	6285430d8a	UI: Fix webui launch on non-Windows (#1963 ) * Moves the imports of winreg and Tk, into the functions that use them, with winreg behind a guard clause. This should hopefully mean that if you're not on Window or not using `ui=app` we won't trip over either of these due to them not being there.	2023-11-10 16:38:32 -06:00
Phaneesh Barwaria	db89b1bdc1	Fix MacOS web execution flow (#1899 ) * fix metal device path for chatbot * single device remove indexing * lint fix	2023-11-09 10:59:29 -06:00
Stefan Kapusniak	fb30e8c226	UI: Fix some webui launch corner cases (#1952 ) * On windows insist on the presence of webview2 as the embeddable browser for `ui=app`. If we can't find it, effectively switch back to `ui=web`. This should prevent pywebview trying to use MSHTML, whilst saying its deprecated, and apparently we are too much for poor old IE11 * Add webview2 runtime droppings to .gitignore. * If we can't bind to args.server_port get another suitable port from the OS and advise the user that we did this in the UI. * Make `ui=web` mode use 'SHARK AI Studio' as its title. This makes it consistent with `ui=app`. * Replace the generic gradio favicon with a nod swirl one instead.	2023-11-09 10:53:28 -06:00
Ean Garvey	a07d542400	(Studio) Disable SD tunings and sub-model downloads (#1944 ) * sets --no-use_tuned and --import_mlir as defaults in SHARK Studio.	2023-11-07 15:55:30 -06:00
Stefan Kapusniak	ad55cb696f	SD/API: Add missing A1111 APIs to Shark to support koboldcpp image generation (#1924 ) * SD/API: Add missing a1111 API features for Koboldcpp * Refactors SD api functions into their own file * Adds the following apis implemented by a1111 as needed by koboldcpp: - adds /sdapi/v1/sd-models (lists available models) - adds /sdapi/v1/options (only the bare minimum needed) * Adds optional CORS support, use the '--api_accept_origin' command line argument to activate and configure. * Extends existing APIs to include optional sampler/scheduler selection * Extends /sdapi/v1/textimg to recognise the method used by koboldcpp to select the model. * Where possible take values not provided to the API in the request from the existing relevant command line parameters rather than hardcoding them. * return a 400 response when a request doesn't have required properties. * changed default schedulers and models for some apis to ones that actually seem to work. * Update api_test.py to include the new APIs. * Update api_test.py to include a '--verbose' command line option. * SD/API: Take more API values from args * Take LoRA from '--use_lora' command line arg if specified * Take device from '--device' command line arg if specified (substring match, so a short name such as 'vulkan://0' should work) * SD/API: add more endpoints and pydantic typing * Mount the whole of /sdapi from index.py as a FastAPI application, rather than each endpoint individually * Add the following additional API endpoints: * /sdapi/v1/samplers * /sdapi/v1/cmd-flags * Make scheduler/sampler selection checking and fallback much more robust. * Support aliasing some A1111 scheduler/sampler names to the diffusers ones we are using. * Expand response /sdapi/v1/options to add a few more things. * Split non-api functions and variables into their own utils.py file. * Support 'n_iter' request property and the return of multiple images from generation endpoints. Equivalent of '--batch_count', batch_size is stil hardcoded at 1 * Include (some) hires_fix request properties in txt2img endpoint * Rework endpoints using pydantic model classes for better request validation and so we get much improved swagger api docs at /sdapi/docs and redoc at /sdapi/redoc * SD/API Delete commented out code from index.py * Delete some code that is no longer needed by the SD API in index.py (and one line sdapi_v1.py) that I'd previously only commented out. * SD/UI: Add shark_sd_koboldcpp.md document * Add documentation on how to set up Koboldcpp with SHARK * Link this and the existing blender set up document from the main README.md * SD/API Improve stencil options in img2img endpoint In /sdapi/v1/img2img: * Add zoedepth to the controlnet use_stencil options * Require and use second image as stencil mask for controlnet scribble	2023-11-06 15:20:19 -06:00
gpetters94	72e27c96fc	Add ZoeDepth (#1834 ) * Add ZoeDepth * Add einops to Studio imports. * Specify ref for forked torch.hub repos. * Unpin timm. --------- Co-authored-by: Ean Garvey <87458719+monorimet@users.noreply.github.com> Co-authored-by: Ean Garvey <garveyej@gmail.com>	2023-10-30 11:57:45 -05:00
Phaneesh Barwaria	486202377a	update dependency on rocm/hip info command (#1900 ) * add support for rocm flags * add rocm target flag to chat args * rm rocm libs dependency message	2023-10-26 15:18:25 +05:30
Stefan Kapusniak	0361db46f9	SD: Fix unet untuned opt_flags (#1912 ) * correct my sloppy copy/paste for the untuned unet default compilation flags that introduced an extra 'detach' into what should have been 'iree-global-opt-convert-1x1-filter-conv2d-to-matmul'	2023-10-24 12:47:33 -05:00
xzuyn	a012433ffd	Save hiresfix info if used (#1914 )	2023-10-24 12:45:10 -05:00
xzuyn	5061193da3	Move Generate, Randomize Seed, & Stop Batch to same positions as txt2img (#1915 )	2023-10-24 12:44:39 -05:00
xzuyn	bff48924be	LLaMa 2 Chat template fix (#1913 )	2023-10-23 18:51:15 -05:00
Stefan Kapusniak	825b36cbdd	Fix MLIR Textual PassPipeline Error (#1910 )	2023-10-22 07:39:52 -07:00
Stefan Kapusniak	134441957d	SD - Fix civitai download on Windows +improvements (#1907 )	2023-10-21 11:17:41 -07:00
Stefan Kapusniak	7cd14fdc47	SD/UI: Use a single model selection box on UI tabs (#1906 ) * Allow entry of a huggingface model id or civitai download url to be done in the main model selection dropdown on SD tabs * Remove separate textbox for entering huggingface model id or civitai download url on SD Tabs * Remove 'None' option from the model selection dropdown (no longer needed) on SD tabs * Update png metadata drop zone on txt2img tab to work with a single argument for model selection * Update UI generate functions on SD tabs to work with single argument model selection * Update API code for changes to the UI generate functions * Move info about the custom model path to the logging textarea on SD tabs	2023-10-21 10:06:05 -07:00
Stefan Kapusniak	71d25ec5d8	SD: Fix repeatable seeds when intial seed is random (#1893 )	2023-10-14 22:50:42 -07:00
Stefan Kapusniak	a208302bb9	Fix repeatable seeds consistency over batch counts (#1889 ) * Set the input seed for the random number generator when generating repeatable seeds to exclude any negative numbers in the parsed seed input. The makes seeds generated for different batch counts consistent where they have the same input for the initial seed or set of seeds.	2023-10-12 17:15:19 -05:00
Phaneesh Barwaria	a731eb6ed4	Macos fixes (#1883 ) * fix venv setup for MacOS * allow stream fuse binding on mac * clean iree metal args	2023-10-09 23:36:12 -07:00
Ean Garvey	2004d16945	Revert "[SDXL] Add SDXL pipeline to SHARK (#1731 )" (#1882 ) This reverts commit `9f0a421764`.	2023-10-09 18:01:44 -07:00
Gaurav Shukla	77727d149c	[warning] Fix dropdown warning Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-10-10 05:18:43 +05:30
Abhishek Varma	9f0a421764	[SDXL] Add SDXL pipeline to SHARK (#1731 ) -- This commit adds SDXL pipeline to SHARK. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2023-10-09 13:01:37 -05:00
Gaurav Shukla	c28682110c	[chatbot] Flag to add system prompt Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-10-09 22:17:39 +05:30
Ean Garvey	caf6cc5d8f	Switch most compile flows to use ireec.compile_file. (#1863 ) * Switch most compile flows to use ireec.compile_file. * re-add input type to compile_str path. * Check if mlir_module exists before checking if it's a path or pyobject. * Fix some save_dir cases	2023-10-06 23:04:43 -05:00
Gaurav Shukla	e54517e967	[UI] Disable config generator, lora train and model manager (#1858 ) Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-10-02 22:34:40 -07:00
Ean Garvey	326327a799	Collect pipeline submodules for diffusers ckpt preprocessing. (#1859 )	2023-10-03 00:29:28 -04:00
Ean Garvey	785b65c7b0	Add flag for specifying device-local caching allocator heap key. (#1856 )	2023-10-03 00:28:39 -04:00
Gaurav Shukla	e930ba85b4	[os] Remove os dependency from vmfb naming (#1854 ) Also fixes a small ui issue for chatbot. Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-09-29 12:38:17 -05:00
Gaurav Shukla	cd732e7a38	[chatbot] split execution time to prefill and decode Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-09-29 13:18:03 +05:30
Gaurav Shukla	8e0f8b3227	[ui] Update chatbot UI Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-09-29 13:18:03 +05:30
Gaurav Shukla	b8210ef796	[chatbot] Re-instantiate the chatbot object if device id changes Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-09-29 13:18:03 +05:30
PhaneeshB	94594542a9	remove use of vulkaninfo	2023-09-28 21:57:00 +05:30
Gaurav Shukla	82f833e87d	[vulkan] Update vmfb naming Update vmfb naming for vulkan devices in order to resolve naming conflicts in the presence of multiple vulkan devices. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-09-28 14:52:11 +05:30
Nelson Sharpe	6773278ec2	Fix checkpoint_path unexpected argument (#1832 )	2023-09-24 14:17:52 -07:00
Abhishek Varma	9a0efffcca	[Llama2] Fix wrong Vulkan device ID + Add Vulkan compile flags -- This commit fixes the wrong Vulkan device being selected during runtime. -- It also adds couple of IREE compilation flags to target specific Vulkan device. -- It also changes the Vulkan device listing to be more in tune with lowering control flow. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2023-09-22 22:24:18 +05:30
zjgarvey	9eceba69b7	local_tank_cache included into clear_all (#1833 )	2023-09-18 00:27:23 -05:00
Ean Garvey	684943a4a6	(SD) Fix tokenizers imports in pyinstaller builds. (#1828 ) * Fix tokenizers metadata. * (SD) Disable VAE lowering configs (rdna3) and add versioned tunings. * Update sd_annotation.py * (SD) Add cv2 to spec. * Update stencil pipeline with the new img2img arg.	2023-09-12 12:23:48 -05:00
PhaneeshB	b817bb8455	add roles for llama2	2023-09-12 10:59:28 +05:30
Ean Garvey	780f520f02	Fix vk.target_env extensions and remove redundant SD imports. (#1826 ) * Remove redundant IREE runtime imports. * Fix vulkan target env extensions.	2023-09-11 13:42:52 -05:00
Abhishek Varma	bde63ee8ae	Add logging feature in WebUI (#1821 )	2023-09-08 05:48:05 -07:00
Gaurav Shukla	ede6bf83e2	[vicuna] Disabling the IR generation path Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-09-06 20:13:17 +05:30
Gaurav Shukla	d2f64eefa3	[chatbot] Remove few outdated models from list (#1814 )	2023-09-04 09:26:32 -07:00
jinchen62	4c3d8a0a7f	Enable downloading vmfb/mlir for webui (#1807 )	2023-08-31 11:05:47 -07:00
Ean Garvey	9c8cbaf498	Add support for ROCM (Windows) in Studio + compile utils (#1770 ) * WIP: MSVC ROCM support for SHARK Studio * Make get_iree_rocm_args platform-agnostic. * Update stable_args.py * Update rocm arg handling in SD utils * Guard quantization imports. Co-authored-by: jam https://github.com/jammm	2023-08-25 20:56:05 -07:00
Ean Garvey	9697981004	Pipe through a debug option to iree compile utils. (#1796 ) * Update compile_utils.py * Pipe through a flag to toggle debug options in compile utils. * Update SharkLLMBase.py	2023-08-25 07:11:11 -07:00
Ean Garvey	8e3860c9e6	Remove flags that are default in upstream IREE (#1785 ) * Remove index bits flags now set by default * Update shark_studio_imports.py	2023-08-24 11:57:54 -05:00
xzuyn	e37d6720eb	Add Hires Fix (#1787 ) * improper test hiresfix * add sliders & use `clear_cache` * add resample choices & fix step adjustment * add step adjustment to img2img * add resample options to img2img * simplify hiresfix - import `img2img_inf` from `img2img_ui.py` instead of just copying it into `txt2img_ui.py` * set `hri` to None after using * add more resample types, and don't show output until hiresfix is done * cleaner implementation * ran black * ran black again with jupyter dependencies	2023-08-24 09:01:41 -07:00
Abhishek Varma	db990826d3	Add Llama2 13B int4 fp16 support (#1784 ) Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2023-08-23 10:00:32 -07:00
gpetters94	7ee3e4ba5d	Add stencil_unet_512 support (#1778 ) This should fix any remaining issues with stencils and long prompts.	2023-08-22 12:23:46 -04:00

1 2 3 4 5 ...

401 Commits