SHARK-Studio

mirror of https://github.com/nod-ai/SHARK-Studio.git synced 2026-01-13 15:57:56 -05:00

Author	SHA1	Message	Date
Ean Garvey	ee0233e370	Fix formatting.	2023-11-13 20:01:28 -06:00
Daniel Garvey	a3deeec870	Dan shark studio (#1970 ) * Fix issue in Falcon-GPTQ * initial webui and llama2 --------- Co-authored-by: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2023-11-13 19:07:28 -06:00
Stefan Kapusniak	c2163488d8	SD/UI Restrict hires fix/img2img resamplers/schedulers (#1955 ) * Restrict resamplers for img2img and high res fix to the ones that PIL.Image actually supports, since it uses that to di the resampling. Removed: Antialias, Affine, Cubic. Added: Hamming. * Set list of available schedulers to CPU only when high res fix is selected in the web ui. Set list to all schdulers when high res fix is deselected. * Put hi res fix in its own Accordian in the txt2img UI instead of grouping it with Advanced Options. 20231113.1024	2023-11-13 16:08:24 -06:00
PhaneeshB	54bff4611d	fix cli rocm device selection	2023-11-13 23:35:55 +05:30
PhaneeshB	11510d5111	add intra rocm vmfb differentiator	2023-11-13 23:35:55 +05:30
PhaneeshB	32cab73a29	add iree-rocm-target-chip only if added by user	2023-11-13 23:35:55 +05:30
PhaneeshB	392bade0bf	enable non default rocm device selection for webui	2023-11-13 23:35:55 +05:30
Stefan Kapusniak	91df5f0613	API/Docs: Fix an image link in koboldcpp doc (#1954 ) * Fix the image link for the koboldcpp style button pointing to the dialog image rather than the button image.	2023-11-13 11:14:29 -06:00
dependabot[bot]	df20cf9c8a	Bump langchain in /apps/language_models/langchain (#1968 ) Bumps [langchain](https://github.com/langchain-ai/langchain) from 0.0.325 to 0.0.329. - [Release notes](https://github.com/langchain-ai/langchain/releases) - [Commits](https://github.com/langchain-ai/langchain/compare/v0.0.325...v0.0.329) --- updated-dependencies: - dependency-name: langchain dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> 20231112.1023	2023-11-12 19:46:00 -08:00
Ean Garvey	c4a908c3ea	Pin pydantic to 2.4.1 in requirements (#1967 ) pyinstaller-hooks-contrib doesn't see beta versions of pydantic as versions greater than 2.0.0, and so it looks for an attribute `compile` only available in versions older than 2.0.0 if you have a beta version of pydantic. 20231111.1022 20231110.1021	2023-11-10 21:34:52 -06:00
Stefan Kapusniak	6285430d8a	UI: Fix webui launch on non-Windows (#1963 ) * Moves the imports of winreg and Tk, into the functions that use them, with winreg behind a guard clause. This should hopefully mean that if you're not on Window or not using `ui=app` we won't trip over either of these due to them not being there.	2023-11-10 16:38:32 -06:00
PhaneeshB	51afe19e20	fix rocm arch selection	2023-11-10 13:22:51 +05:30
Ean Garvey	31005bcf73	Don't require vulkan installation to query devices. (#1953 )	2023-11-09 14:46:44 -06:00
dependabot[bot]	f41ad87ef6	Bump langchain in /apps/language_models/langchain (#1926 ) Bumps [langchain](https://github.com/langchain-ai/langchain) from 0.0.202 to 0.0.325. - [Release notes](https://github.com/langchain-ai/langchain/releases) - [Commits](https://github.com/langchain-ai/langchain/compare/v0.0.202...v0.0.325) --- updated-dependencies: - dependency-name: langchain dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-11-09 11:03:47 -06:00
dependabot[bot]	d811524a00	Bump pypdf from 3.12.2 to 3.17.0 in /apps/language_models/langchain (#1929 ) Bumps [pypdf](https://github.com/py-pdf/pypdf) from 3.12.2 to 3.17.0. - [Release notes](https://github.com/py-pdf/pypdf/releases) - [Changelog](https://github.com/py-pdf/pypdf/blob/main/CHANGELOG.md) - [Commits](https://github.com/py-pdf/pypdf/compare/3.12.2...3.17.0) --- updated-dependencies: - dependency-name: pypdf dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-11-09 11:02:43 -06:00
Sungsoon Cho	51e1bd1c5d	(OPT) Fix typo in the message; s/reponse/response (#1920 )	2023-11-09 11:00:48 -06:00
Phaneesh Barwaria	db89b1bdc1	Fix MacOS web execution flow (#1899 ) * fix metal device path for chatbot * single device remove indexing * lint fix	2023-11-09 10:59:29 -06:00
Huang Qi	2754e2e257	Fix wrong parameter index passed to 'compile_module_to_flatbuffer' (#1921 ) compile_str is always False in compile_module_to_flatbuffer since there is a parameter 'model_name' before 'debug'. This issue is relative to https://github.com/nod-ai/SHARK/pull/1863. Then we can use mlir model buffer in RAM to run inference.	2023-11-09 10:58:05 -06:00
PhaneeshB	ab0e870c43	fix vicuna cli vulkan	2023-11-09 22:27:13 +05:30
Stefan Kapusniak	fb30e8c226	UI: Fix some webui launch corner cases (#1952 ) * On windows insist on the presence of webview2 as the embeddable browser for `ui=app`. If we can't find it, effectively switch back to `ui=web`. This should prevent pywebview trying to use MSHTML, whilst saying its deprecated, and apparently we are too much for poor old IE11 * Add webview2 runtime droppings to .gitignore. * If we can't bind to args.server_port get another suitable port from the OS and advise the user that we did this in the UI. * Make `ui=web` mode use 'SHARK AI Studio' as its title. This makes it consistent with `ui=app`. * Replace the generic gradio favicon with a nod swirl one instead.	2023-11-09 10:53:28 -06:00
Ean Garvey	a07d542400	(Studio) Disable SD tunings and sub-model downloads (#1944 ) * sets --no-use_tuned and --import_mlir as defaults in SHARK Studio. 20231107.1016	2023-11-07 15:55:30 -06:00
Stefan Kapusniak	ad55cb696f	SD/API: Add missing A1111 APIs to Shark to support koboldcpp image generation (#1924 ) * SD/API: Add missing a1111 API features for Koboldcpp * Refactors SD api functions into their own file * Adds the following apis implemented by a1111 as needed by koboldcpp: - adds /sdapi/v1/sd-models (lists available models) - adds /sdapi/v1/options (only the bare minimum needed) * Adds optional CORS support, use the '--api_accept_origin' command line argument to activate and configure. * Extends existing APIs to include optional sampler/scheduler selection * Extends /sdapi/v1/textimg to recognise the method used by koboldcpp to select the model. * Where possible take values not provided to the API in the request from the existing relevant command line parameters rather than hardcoding them. * return a 400 response when a request doesn't have required properties. * changed default schedulers and models for some apis to ones that actually seem to work. * Update api_test.py to include the new APIs. * Update api_test.py to include a '--verbose' command line option. * SD/API: Take more API values from args * Take LoRA from '--use_lora' command line arg if specified * Take device from '--device' command line arg if specified (substring match, so a short name such as 'vulkan://0' should work) * SD/API: add more endpoints and pydantic typing * Mount the whole of /sdapi from index.py as a FastAPI application, rather than each endpoint individually * Add the following additional API endpoints: * /sdapi/v1/samplers * /sdapi/v1/cmd-flags * Make scheduler/sampler selection checking and fallback much more robust. * Support aliasing some A1111 scheduler/sampler names to the diffusers ones we are using. * Expand response /sdapi/v1/options to add a few more things. * Split non-api functions and variables into their own utils.py file. * Support 'n_iter' request property and the return of multiple images from generation endpoints. Equivalent of '--batch_count', batch_size is stil hardcoded at 1 * Include (some) hires_fix request properties in txt2img endpoint * Rework endpoints using pydantic model classes for better request validation and so we get much improved swagger api docs at /sdapi/docs and redoc at /sdapi/redoc * SD/API Delete commented out code from index.py * Delete some code that is no longer needed by the SD API in index.py (and one line sdapi_v1.py) that I'd previously only commented out. * SD/UI: Add shark_sd_koboldcpp.md document * Add documentation on how to set up Koboldcpp with SHARK * Link this and the existing blender set up document from the main README.md * SD/API Improve stencil options in img2img endpoint In /sdapi/v1/img2img: * Add zoedepth to the controlnet use_stencil options * Require and use second image as stencil mask for controlnet scribble 20231106.1015	2023-11-06 15:20:19 -06:00
Jakub Kuderski	488a172292	[vicuna.py] Allow to pass extra arguments to iree-compile (#1935 ) Add a new flag `-Xiree_compile` to forward extra compiler arguments to `iree-compile`. This flag can be set multiple times to pass more than one extra argument.	2023-11-06 12:12:34 -05:00
Stanley Winata	500c4f2306	[compile utils] Fix ROCM to not expect config.id as a default. (#1939 )	2023-11-06 08:44:53 -08:00
Vivek Khandelwal	92b694db4d	Add support for Falcon-40b-GPTQ	2023-11-06 19:49:19 +05:30
Vivek Khandelwal	322874f7f9	Fix issue in Falcon-GPTQ 20231105.1014 20231104.1013 20231103.1012	2023-11-03 11:48:36 +05:30
Ean Garvey	5001db3415	Add 7800xt to target triples explicitly. (#1928 ) 20231102.1011 20231101.1010	2023-11-01 17:11:45 -05:00
Vivek Khandelwal	71846344a2	Add sharded Falcon-GPTQ support This commit adds the support for sharded Falcon-7b-GPTQ and Falcon-180B-GPTQ. This commit also adds the support for 4-way sharding of the Falcon model for the device ROCM. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-11-01 12:11:44 +05:30
gpetters94	72e27c96fc	Add ZoeDepth (#1834 ) * Add ZoeDepth * Add einops to Studio imports. * Specify ref for forked torch.hub repos. * Unpin timm. --------- Co-authored-by: Ean Garvey <87458719+monorimet@users.noreply.github.com> Co-authored-by: Ean Garvey <garveyej@gmail.com> 20231031.1009 20231030.1008	2023-10-30 11:57:45 -05:00
PhaneeshB	7963abb8ec	remove caching for rocm args 20231029.1007 20231028.1006	2023-10-29 07:07:57 +05:30
Ean Garvey	98244232dd	Add smoothquant OPT to examples. (#1922 ) 20231027.1005	2023-10-27 12:32:12 -05:00
PhaneeshB	679a452139	fix calls and remove unused imports for check_device_drivers 20231026.1004	2023-10-27 10:30:40 +05:30
PhaneeshB	72c0a8abc8	remove dependency on external commands for driver installation check	2023-10-27 10:30:40 +05:30
Vivek Khandelwal	ea920f2955	Add sharded Falcon support	2023-10-26 21:53:25 +05:30
Phaneesh Barwaria	486202377a	update dependency on rocm/hip info command (#1900 ) * add support for rocm flags * add rocm target flag to chat args * rm rocm libs dependency message	2023-10-26 15:18:25 +05:30
Sungsoon Cho	0c38c33d0a	Add opt_causallm_samples.py. (#1916 ) 20231025.1003	2023-10-25 11:52:51 -05:00
Ean Garvey	841773fa32	Updates to opt_causallm example (#1905 ) * Updates to opt_causallm example * Fixup opt_perf_comparison.py * Use same filenames across opt examples. 20231024.1002	2023-10-24 10:54:39 -07:00
Stefan Kapusniak	0361db46f9	SD: Fix unet untuned opt_flags (#1912 ) * correct my sloppy copy/paste for the untuned unet default compilation flags that introduced an extra 'detach' into what should have been 'iree-global-opt-convert-1x1-filter-conv2d-to-matmul'	2023-10-24 12:47:33 -05:00
xzuyn	a012433ffd	Save hiresfix info if used (#1914 )	2023-10-24 12:45:10 -05:00
xzuyn	5061193da3	Move Generate, Randomize Seed, & Stop Batch to same positions as txt2img (#1915 )	2023-10-24 12:44:39 -05:00
xzuyn	bff48924be	LLaMa 2 Chat template fix (#1913 ) 20231023.1001	2023-10-23 18:51:15 -05:00
Stefan Kapusniak	825b36cbdd	Fix MLIR Textual PassPipeline Error (#1910 ) 20231022.1000	2023-10-22 07:39:52 -07:00
Stefan Kapusniak	134441957d	SD - Fix civitai download on Windows +improvements (#1907 ) 20231021.999	2023-10-21 11:17:41 -07:00
Stefan Kapusniak	7cd14fdc47	SD/UI: Use a single model selection box on UI tabs (#1906 ) * Allow entry of a huggingface model id or civitai download url to be done in the main model selection dropdown on SD tabs * Remove separate textbox for entering huggingface model id or civitai download url on SD Tabs * Remove 'None' option from the model selection dropdown (no longer needed) on SD tabs * Update png metadata drop zone on txt2img tab to work with a single argument for model selection * Update UI generate functions on SD tabs to work with single argument model selection * Update API code for changes to the UI generate functions * Move info about the custom model path to the logging textarea on SD tabs	2023-10-21 10:06:05 -07:00
Ean Garvey	e6cb5cef57	Add --additional_runtime_args option and use in OPT example. (#1855 ) * Add --additional_runtime_args option and use in OPT example. Fix the func name. (#1838) Co-authored-by: Sungsoon Cho <sungsoon.cho@gmail.com> 20231019.997 20231020.998	2023-10-19 13:29:39 -05:00
Huang Qi	66abee8e5b	SharkInference: Fix various examples and README.md (#1903 ) Follow https://github.com/nod-ai/SHARK/pull/708, remove parameter 'func_name' for SharkInference.	2023-10-19 09:28:36 -05:00
Ean Garvey	4797bb89f5	Stringify path for ireec.compile_file (#1901 ) * Stringify path for ireec.compile_file * Update test-models.yml 20231018.994	2023-10-18 14:59:23 -05:00
Vivek Khandelwal	205e57683a	Modify Falcon-180b-GPTQ sharded pipeline 20231017.993	2023-10-17 20:26:01 +05:30
Vivek Khandelwal	2866d665ee	Fix Sharded Falcon-180b-GPTQ Pipeline	2023-10-17 20:26:01 +05:30
Stefan Kapusniak	71d25ec5d8	SD: Fix repeatable seeds when intial seed is random (#1893 ) 20231016.992 20231015.991	2023-10-14 22:50:42 -07:00

1 2 3 4 5 ...

1663 Commits