SHARK-Studio

mirror of https://github.com/nod-ai/SHARK-Studio.git synced 2026-01-09 22:07:55 -05:00

Author	SHA1	Message	Date
Ean Garvey	d051c3a4a7	Use clean_device_info() by default and don't write .mlir to /tmp/ (#1984 ) * Move clean_device_info to compile_utils * Update compile_utils.py * Fix .mlir writes for some user-level permissions * Fix cases where full URI is given * Fix conditionals. * Fix device path handling in vulkan utils. 20231121.1037 20231120.1036 20231120.1035	2023-11-20 13:10:31 -06:00
Ean Garvey	1b11c82c9d	Small UI tweaks for chatbot, fix torchvision requirements (#1988 ) - add torchvision to setup_venv.ps1 -- we need this for the torchvision::nms that is now a dependency of controlnet features. - Don't have bad flashy orange updates when using the chatbot - Don't limit the height of the chatbot -- there's mixed opinions and solutions around this one. I think the default (400) is just way too small and LLMs generate plenty enough to justify matching the output.	2023-11-21 00:09:10 +05:30
gpetters94	80a33d427f	Save intermediate values of controlnet (#1981 ) 20231119.1034 20231118.1033 20231117.1032	2023-11-17 19:05:41 -05:00
Stefan Kapusniak	4125a26294	API/Docs: Fix incorrect cors arguments listing (#1983 ) * Replace `api_cors_origin` in the api/koboldcpp doc, with the correct `api_accept_origin`	2023-11-17 12:29:01 -06:00
Ean Garvey	905d0103ff	Revert "Re-enable SD tunings without matmuls. (#1976 )" (#1979 ) This reverts commit `70817bb50a`. 20231117.1028	2023-11-17 23:44:33 +05:30
Stefan Kapusniak	192b3b2c61	UI: Output gallery cleanups (#1959 ) * Workaround gradio bug that causes the parameters frame to always show scrollbars. * Remove the original funky method of setting the number of image columns in the gallery using _fn= javascript events. The version of gradio we now have pinned allows doing this by setting the property on the gallery directly and also doesn't keep resetting the columns on other events being fired. 20231116.1027 20231115.1026	2023-11-15 22:20:42 -06:00
Stefan Kapusniak	8f9adc4a2a	UI: Display top tag frequencies for selected LoRA (#1972 ) * Adds a function to webui utils to read metadata from .safetensors LoRA files, and do limited parsing of the format written out by the Kohya SS scripts (https://github.com/kohya-ss/sd-scripts) to get tag frequency and trained model information. * Adds a new common_ui_events.py file for gradio event handlers needed for multiple UI tabs, and adds an event handler for binding to the change event of the LoRA selection boxes, that outputs HTML to display the LoRA tag frequency and model information. * Adds an HTML gradio control to each of the SD tabs to show the LoRA model name, and most frequently trained tags. * Bind the change event of the LoRA selection box on each tab to our new event handler, with the output set to the relevant HTML control.	2023-11-15 22:19:54 -06:00
Ean Garvey	70817bb50a	Re-enable SD tunings without matmuls. (#1976 )	2023-11-15 20:42:53 -06:00
jinchen62	dd37c26d36	Update brevitas quant api (#1975 )	2023-11-15 10:04:07 -08:00
PhaneeshB	a708879c6c	fix iree version mismatch 20231114.1025	2023-11-15 01:24:42 +05:30
Ean Garvey	bb1b49eb6f	Add --no-index to setup_venv.sh runtime pip install.	2023-11-14 21:44:20 +05:30
Ean Garvey	f6d41affd9	(SHARK Studio) Add Turbine-based llm chatbot. (#1933 ) * Dan shark studio (#1970) * Fix issue in Falcon-GPTQ * initial webui and llama2 --------- Co-authored-by: Vivek Khandelwal <vivekkhandelwal1424@gmail.com> * Fix formatting. --------- Co-authored-by: Daniel Garvey <34486624+dan-garvey@users.noreply.github.com> Co-authored-by: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2023-11-14 09:56:28 -06:00
Stefan Kapusniak	c2163488d8	SD/UI Restrict hires fix/img2img resamplers/schedulers (#1955 ) * Restrict resamplers for img2img and high res fix to the ones that PIL.Image actually supports, since it uses that to do the resampling. Removed: Antialias, Affine, Cubic. Added: Hamming. * Set list of available schedulers to CPU only when high res fix is selected in the web ui. Set list to all schedulers when high res fix is deselected. * Put hi res fix in its own Accordion in the txt2img UI instead of grouping it with Advanced Options. 20231113.1024	2023-11-13 16:08:24 -06:00
PhaneeshB	54bff4611d	fix cli rocm device selection	2023-11-13 23:35:55 +05:30
PhaneeshB	11510d5111	add intra rocm vmfb differentiator	2023-11-13 23:35:55 +05:30
PhaneeshB	32cab73a29	add iree-rocm-target-chip only if added by user	2023-11-13 23:35:55 +05:30
PhaneeshB	392bade0bf	enable non default rocm device selection for webui	2023-11-13 23:35:55 +05:30
Stefan Kapusniak	91df5f0613	API/Docs: Fix an image link in koboldcpp doc (#1954 ) * Fix the image link for the koboldcpp style button pointing to the dialog image rather than the button image.	2023-11-13 11:14:29 -06:00
dependabot[bot]	df20cf9c8a	Bump langchain in /apps/language_models/langchain (#1968 ) Bumps [langchain](https://github.com/langchain-ai/langchain) from 0.0.325 to 0.0.329. - [Release notes](https://github.com/langchain-ai/langchain/releases) - [Commits](https://github.com/langchain-ai/langchain/compare/v0.0.325...v0.0.329) --- updated-dependencies: - dependency-name: langchain dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> 20231112.1023	2023-11-12 19:46:00 -08:00
Ean Garvey	c4a908c3ea	Pin pydantic to 2.4.1 in requirements (#1967 ) pyinstaller-hooks-contrib doesn't see beta versions of pydantic as versions greater than 2.0.0, and so it looks for an attribute `compile` only available in versions older than 2.0.0 if you have a beta version of pydantic. 20231111.1022 20231110.1021	2023-11-10 21:34:52 -06:00
Stefan Kapusniak	6285430d8a	UI: Fix webui launch on non-Windows (#1963 ) * Moves the imports of winreg and Tk, into the functions that use them, with winreg behind a guard clause. This should hopefully mean that if you're not on Windows or not using `ui=app` we won't trip over either of these due to them not being there.	2023-11-10 16:38:32 -06:00
PhaneeshB	51afe19e20	fix rocm arch selection	2023-11-10 13:22:51 +05:30
Ean Garvey	31005bcf73	Don't require vulkan installation to query devices. (#1953 )	2023-11-09 14:46:44 -06:00
dependabot[bot]	f41ad87ef6	Bump langchain in /apps/language_models/langchain (#1926 ) Bumps [langchain](https://github.com/langchain-ai/langchain) from 0.0.202 to 0.0.325. - [Release notes](https://github.com/langchain-ai/langchain/releases) - [Commits](https://github.com/langchain-ai/langchain/compare/v0.0.202...v0.0.325) --- updated-dependencies: - dependency-name: langchain dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-11-09 11:03:47 -06:00
dependabot[bot]	d811524a00	Bump pypdf from 3.12.2 to 3.17.0 in /apps/language_models/langchain (#1929 ) Bumps [pypdf](https://github.com/py-pdf/pypdf) from 3.12.2 to 3.17.0. - [Release notes](https://github.com/py-pdf/pypdf/releases) - [Changelog](https://github.com/py-pdf/pypdf/blob/main/CHANGELOG.md) - [Commits](https://github.com/py-pdf/pypdf/compare/3.12.2...3.17.0) --- updated-dependencies: - dependency-name: pypdf dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-11-09 11:02:43 -06:00
Sungsoon Cho	51e1bd1c5d	(OPT) Fix typo in the message; s/reponse/response (#1920 )	2023-11-09 11:00:48 -06:00
Phaneesh Barwaria	db89b1bdc1	Fix MacOS web execution flow (#1899 ) * fix metal device path for chatbot * single device remove indexing * lint fix	2023-11-09 10:59:29 -06:00
Huang Qi	2754e2e257	Fix wrong parameter index passed to 'compile_module_to_flatbuffer' (#1921 ) compile_str is always False in compile_module_to_flatbuffer since there is a parameter 'model_name' before 'debug'. This issue is related to https://github.com/nod-ai/SHARK/pull/1863. Then we can use mlir model buffer in RAM to run inference.	2023-11-09 10:58:05 -06:00
PhaneeshB	ab0e870c43	fix vicuna cli vulkan	2023-11-09 22:27:13 +05:30
Stefan Kapusniak	fb30e8c226	UI: Fix some webui launch corner cases (#1952 ) * On windows insist on the presence of webview2 as the embeddable browser for `ui=app`. If we can't find it, effectively switch back to `ui=web`. This should prevent pywebview trying to use MSHTML, whilst saying it's deprecated, and apparently we are too much for poor old IE11 * Add webview2 runtime droppings to .gitignore. * If we can't bind to args.server_port get another suitable port from the OS and advise the user that we did this in the UI. * Make `ui=web` mode use 'SHARK AI Studio' as its title. This makes it consistent with `ui=app`. * Replace the generic gradio favicon with a nod swirl one instead.	2023-11-09 10:53:28 -06:00
Ean Garvey	a07d542400	(Studio) Disable SD tunings and sub-model downloads (#1944 ) * sets --no-use_tuned and --import_mlir as defaults in SHARK Studio. 20231107.1016	2023-11-07 15:55:30 -06:00
Stefan Kapusniak	ad55cb696f	SD/API: Add missing A1111 APIs to Shark to support koboldcpp image generation (#1924 ) * SD/API: Add missing a1111 API features for Koboldcpp * Refactors SD api functions into their own file * Adds the following apis implemented by a1111 as needed by koboldcpp: - adds /sdapi/v1/sd-models (lists available models) - adds /sdapi/v1/options (only the bare minimum needed) * Adds optional CORS support, use the '--api_accept_origin' command line argument to activate and configure. * Extends existing APIs to include optional sampler/scheduler selection * Extends /sdapi/v1/textimg to recognise the method used by koboldcpp to select the model. * Where possible take values not provided to the API in the request from the existing relevant command line parameters rather than hardcoding them. * return a 400 response when a request doesn't have required properties. * changed default schedulers and models for some apis to ones that actually seem to work. * Update api_test.py to include the new APIs. * Update api_test.py to include a '--verbose' command line option. * SD/API: Take more API values from args * Take LoRA from '--use_lora' command line arg if specified * Take device from '--device' command line arg if specified (substring match, so a short name such as 'vulkan://0' should work) * SD/API: add more endpoints and pydantic typing * Mount the whole of /sdapi from index.py as a FastAPI application, rather than each endpoint individually * Add the following additional API endpoints: * /sdapi/v1/samplers * /sdapi/v1/cmd-flags * Make scheduler/sampler selection checking and fallback much more robust. * Support aliasing some A1111 scheduler/sampler names to the diffusers ones we are using. * Expand response /sdapi/v1/options to add a few more things. * Split non-api functions and variables into their own utils.py file. * Support 'n_iter' request property and the return of multiple images from generation endpoints. 
Equivalent of '--batch_count', batch_size is still hardcoded at 1 * Include (some) hires_fix request properties in txt2img endpoint * Rework endpoints using pydantic model classes for better request validation and so we get much improved swagger api docs at /sdapi/docs and redoc at /sdapi/redoc * SD/API Delete commented out code from index.py * Delete some code that is no longer needed by the SD API in index.py (and one line sdapi_v1.py) that I'd previously only commented out. * SD/UI: Add shark_sd_koboldcpp.md document * Add documentation on how to set up Koboldcpp with SHARK * Link this and the existing blender set up document from the main README.md * SD/API Improve stencil options in img2img endpoint In /sdapi/v1/img2img: * Add zoedepth to the controlnet use_stencil options * Require and use second image as stencil mask for controlnet scribble 20231106.1015	2023-11-06 15:20:19 -06:00
Jakub Kuderski	488a172292	[vicuna.py] Allow to pass extra arguments to iree-compile (#1935 ) Add a new flag `-Xiree_compile` to forward extra compiler arguments to `iree-compile`. This flag can be set multiple times to pass more than one extra argument.	2023-11-06 12:12:34 -05:00
Stanley Winata	500c4f2306	[compile utils] Fix ROCM to not expect config.id as a default. (#1939 )	2023-11-06 08:44:53 -08:00
Vivek Khandelwal	92b694db4d	Add support for Falcon-40b-GPTQ	2023-11-06 19:49:19 +05:30
Vivek Khandelwal	322874f7f9	Fix issue in Falcon-GPTQ 20231105.1014 20231104.1013 20231103.1012	2023-11-03 11:48:36 +05:30
Ean Garvey	5001db3415	Add 7800xt to target triples explicitly. (#1928 ) 20231102.1011 20231101.1010	2023-11-01 17:11:45 -05:00
Vivek Khandelwal	71846344a2	Add sharded Falcon-GPTQ support This commit adds the support for sharded Falcon-7b-GPTQ and Falcon-180B-GPTQ. This commit also adds the support for 4-way sharding of the Falcon model for the device ROCM. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-11-01 12:11:44 +05:30
gpetters94	72e27c96fc	Add ZoeDepth (#1834 ) * Add ZoeDepth * Add einops to Studio imports. * Specify ref for forked torch.hub repos. * Unpin timm. --------- Co-authored-by: Ean Garvey <87458719+monorimet@users.noreply.github.com> Co-authored-by: Ean Garvey <garveyej@gmail.com> 20231031.1009 20231030.1008	2023-10-30 11:57:45 -05:00
PhaneeshB	7963abb8ec	remove caching for rocm args 20231029.1007 20231028.1006	2023-10-29 07:07:57 +05:30
Ean Garvey	98244232dd	Add smoothquant OPT to examples. (#1922 ) 20231027.1005	2023-10-27 12:32:12 -05:00
PhaneeshB	679a452139	fix calls and remove unused imports for check_device_drivers 20231026.1004	2023-10-27 10:30:40 +05:30
PhaneeshB	72c0a8abc8	remove dependency on external commands for driver installation check	2023-10-27 10:30:40 +05:30
Vivek Khandelwal	ea920f2955	Add sharded Falcon support	2023-10-26 21:53:25 +05:30
Phaneesh Barwaria	486202377a	update dependency on rocm/hip info command (#1900 ) * add support for rocm flags * add rocm target flag to chat args * rm rocm libs dependency message	2023-10-26 15:18:25 +05:30
Sungsoon Cho	0c38c33d0a	Add opt_causallm_samples.py. (#1916 ) 20231025.1003	2023-10-25 11:52:51 -05:00
Ean Garvey	841773fa32	Updates to opt_causallm example (#1905 ) * Updates to opt_causallm example * Fixup opt_perf_comparison.py * Use same filenames across opt examples. 20231024.1002	2023-10-24 10:54:39 -07:00
Stefan Kapusniak	0361db46f9	SD: Fix unet untuned opt_flags (#1912 ) * correct my sloppy copy/paste for the untuned unet default compilation flags that introduced an extra 'detach' into what should have been 'iree-global-opt-convert-1x1-filter-conv2d-to-matmul'	2023-10-24 12:47:33 -05:00
xzuyn	a012433ffd	Save hiresfix info if used (#1914 )	2023-10-24 12:45:10 -05:00
xzuyn	5061193da3	Move Generate, Randomize Seed, & Stop Batch to same positions as txt2img (#1915 )	2023-10-24 12:44:39 -05:00


1773 Commits