SHARK-Studio

mirror of https://github.com/nod-ai/SHARK-Studio.git synced 2026-01-13 15:57:56 -05:00

Author	SHA1	Message	Date
Ean Garvey	ee0233e370	Fix formatting.	2023-11-13 20:01:28 -06:00
Daniel Garvey	a3deeec870	Dan shark studio (#1970 ) * Fix issue in Falcon-GPTQ * initial webui and llama2 --------- Co-authored-by: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2023-11-13 19:07:28 -06:00
Stefan Kapusniak	c2163488d8	SD/UI Restrict hires fix/img2img resamplers/schedulers (#1955 ) * Restrict resamplers for img2img and high res fix to the ones that PIL.Image actually supports, since it uses that to do the resampling. Removed: Antialias, Affine, Cubic. Added: Hamming. * Set list of available schedulers to CPU only when high res fix is selected in the web ui. Set list to all schedulers when high res fix is deselected. * Put hi res fix in its own Accordion in the txt2img UI instead of grouping it with Advanced Options.	2023-11-13 16:08:24 -06:00
PhaneeshB	54bff4611d	fix cli rocm device selection	2023-11-13 23:35:55 +05:30
PhaneeshB	11510d5111	add intra rocm vmfb differentiator	2023-11-13 23:35:55 +05:30
PhaneeshB	32cab73a29	add iree-rocm-target-chip only if added by user	2023-11-13 23:35:55 +05:30
PhaneeshB	392bade0bf	enable non default rocm device selection for webui	2023-11-13 23:35:55 +05:30
dependabot[bot]	df20cf9c8a	Bump langchain in /apps/language_models/langchain (#1968 ) Bumps [langchain](https://github.com/langchain-ai/langchain) from 0.0.325 to 0.0.329. - [Release notes](https://github.com/langchain-ai/langchain/releases) - [Commits](https://github.com/langchain-ai/langchain/compare/v0.0.325...v0.0.329) --- updated-dependencies: - dependency-name: langchain dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-11-12 19:46:00 -08:00
Stefan Kapusniak	6285430d8a	UI: Fix webui launch on non-Windows (#1963 ) * Moves the imports of winreg and Tk, into the functions that use them, with winreg behind a guard clause. This should hopefully mean that if you're not on Windows or not using `ui=app` we won't trip over either of these due to them not being there.	2023-11-10 16:38:32 -06:00
dependabot[bot]	f41ad87ef6	Bump langchain in /apps/language_models/langchain (#1926 ) Bumps [langchain](https://github.com/langchain-ai/langchain) from 0.0.202 to 0.0.325. - [Release notes](https://github.com/langchain-ai/langchain/releases) - [Commits](https://github.com/langchain-ai/langchain/compare/v0.0.202...v0.0.325) --- updated-dependencies: - dependency-name: langchain dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-11-09 11:03:47 -06:00
dependabot[bot]	d811524a00	Bump pypdf from 3.12.2 to 3.17.0 in /apps/language_models/langchain (#1929 ) Bumps [pypdf](https://github.com/py-pdf/pypdf) from 3.12.2 to 3.17.0. - [Release notes](https://github.com/py-pdf/pypdf/releases) - [Changelog](https://github.com/py-pdf/pypdf/blob/main/CHANGELOG.md) - [Commits](https://github.com/py-pdf/pypdf/compare/3.12.2...3.17.0) --- updated-dependencies: - dependency-name: pypdf dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-11-09 11:02:43 -06:00
Phaneesh Barwaria	db89b1bdc1	Fix MacOS web execution flow (#1899 ) * fix metal device path for chatbot * single device remove indexing * lint fix	2023-11-09 10:59:29 -06:00
PhaneeshB	ab0e870c43	fix vicuna cli vulkan	2023-11-09 22:27:13 +05:30
Stefan Kapusniak	fb30e8c226	UI: Fix some webui launch corner cases (#1952 ) * On windows insist on the presence of webview2 as the embeddable browser for `ui=app`. If we can't find it, effectively switch back to `ui=web`. This should prevent pywebview trying to use MSHTML, whilst saying it's deprecated, and apparently we are too much for poor old IE11 * Add webview2 runtime droppings to .gitignore. * If we can't bind to args.server_port get another suitable port from the OS and advise the user that we did this in the UI. * Make `ui=web` mode use 'SHARK AI Studio' as its title. This makes it consistent with `ui=app`. * Replace the generic gradio favicon with a nod swirl one instead.	2023-11-09 10:53:28 -06:00
Ean Garvey	a07d542400	(Studio) Disable SD tunings and sub-model downloads (#1944 ) * sets --no-use_tuned and --import_mlir as defaults in SHARK Studio.	2023-11-07 15:55:30 -06:00
Stefan Kapusniak	ad55cb696f	SD/API: Add missing A1111 APIs to Shark to support koboldcpp image generation (#1924 ) * SD/API: Add missing a1111 API features for Koboldcpp * Refactors SD api functions into their own file * Adds the following apis implemented by a1111 as needed by koboldcpp: - adds /sdapi/v1/sd-models (lists available models) - adds /sdapi/v1/options (only the bare minimum needed) * Adds optional CORS support, use the '--api_accept_origin' command line argument to activate and configure. * Extends existing APIs to include optional sampler/scheduler selection * Extends /sdapi/v1/textimg to recognise the method used by koboldcpp to select the model. * Where possible take values not provided to the API in the request from the existing relevant command line parameters rather than hardcoding them. * return a 400 response when a request doesn't have required properties. * changed default schedulers and models for some apis to ones that actually seem to work. * Update api_test.py to include the new APIs. * Update api_test.py to include a '--verbose' command line option. * SD/API: Take more API values from args * Take LoRA from '--use_lora' command line arg if specified * Take device from '--device' command line arg if specified (substring match, so a short name such as 'vulkan://0' should work) * SD/API: add more endpoints and pydantic typing * Mount the whole of /sdapi from index.py as a FastAPI application, rather than each endpoint individually * Add the following additional API endpoints: * /sdapi/v1/samplers * /sdapi/v1/cmd-flags * Make scheduler/sampler selection checking and fallback much more robust. * Support aliasing some A1111 scheduler/sampler names to the diffusers ones we are using. * Expand response /sdapi/v1/options to add a few more things. * Split non-api functions and variables into their own utils.py file. * Support 'n_iter' request property and the return of multiple images from generation endpoints. 
Equivalent of '--batch_count', batch_size is still hardcoded at 1 * Include (some) hires_fix request properties in txt2img endpoint * Rework endpoints using pydantic model classes for better request validation and so we get much improved swagger api docs at /sdapi/docs and redoc at /sdapi/redoc * SD/API Delete commented out code from index.py * Delete some code that is no longer needed by the SD API in index.py (and one line sdapi_v1.py) that I'd previously only commented out. * SD/UI: Add shark_sd_koboldcpp.md document * Add documentation on how to set up Koboldcpp with SHARK * Link this and the existing blender set up document from the main README.md * SD/API Improve stencil options in img2img endpoint In /sdapi/v1/img2img: * Add zoedepth to the controlnet use_stencil options * Require and use second image as stencil mask for controlnet scribble	2023-11-06 15:20:19 -06:00
Jakub Kuderski	488a172292	[vicuna.py] Allow to pass extra arguments to iree-compile (#1935 ) Add a new flag `-Xiree_compile` to forward extra compiler arguments to `iree-compile`. This flag can be set multiple times to pass more than one extra argument.	2023-11-06 12:12:34 -05:00
Vivek Khandelwal	92b694db4d	Add support for Falcon-40b-GPTQ	2023-11-06 19:49:19 +05:30
Vivek Khandelwal	322874f7f9	Fix issue in Falcon-GPTQ	2023-11-03 11:48:36 +05:30
Vivek Khandelwal	71846344a2	Add sharded Falcon-GPTQ support This commit adds the support for sharded Falcon-7b-GPTQ and Falcon-180B-GPTQ. This commit also adds the support for 4-way sharding of the Falcon model for the device ROCM. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-11-01 12:11:44 +05:30
gpetters94	72e27c96fc	Add ZoeDepth (#1834 ) * Add ZoeDepth * Add einops to Studio imports. * Specify ref for forked torch.hub repos. * Unpin timm. --------- Co-authored-by: Ean Garvey <87458719+monorimet@users.noreply.github.com> Co-authored-by: Ean Garvey <garveyej@gmail.com>	2023-10-30 11:57:45 -05:00
Vivek Khandelwal	ea920f2955	Add sharded Falcon support	2023-10-26 21:53:25 +05:30
Phaneesh Barwaria	486202377a	update dependency on rocm/hip info command (#1900 ) * add support for rocm flags * add rocm target flag to chat args * rm rocm libs dependency message	2023-10-26 15:18:25 +05:30
Stefan Kapusniak	0361db46f9	SD: Fix unet untuned opt_flags (#1912 ) * correct my sloppy copy/paste for the untuned unet default compilation flags that introduced an extra 'detach' into what should have been 'iree-global-opt-convert-1x1-filter-conv2d-to-matmul'	2023-10-24 12:47:33 -05:00
xzuyn	a012433ffd	Save hiresfix info if used (#1914 )	2023-10-24 12:45:10 -05:00
xzuyn	5061193da3	Move Generate, Randomize Seed, & Stop Batch to same positions as txt2img (#1915 )	2023-10-24 12:44:39 -05:00
xzuyn	bff48924be	LLaMa 2 Chat template fix (#1913 )	2023-10-23 18:51:15 -05:00
Stefan Kapusniak	825b36cbdd	Fix MLIR Textual PassPipeline Error (#1910 )	2023-10-22 07:39:52 -07:00
Stefan Kapusniak	134441957d	SD - Fix civitai download on Windows +improvements (#1907 )	2023-10-21 11:17:41 -07:00
Stefan Kapusniak	7cd14fdc47	SD/UI: Use a single model selection box on UI tabs (#1906 ) * Allow entry of a huggingface model id or civitai download url to be done in the main model selection dropdown on SD tabs * Remove separate textbox for entering huggingface model id or civitai download url on SD Tabs * Remove 'None' option from the model selection dropdown (no longer needed) on SD tabs * Update png metadata drop zone on txt2img tab to work with a single argument for model selection * Update UI generate functions on SD tabs to work with single argument model selection * Update API code for changes to the UI generate functions * Move info about the custom model path to the logging textarea on SD tabs	2023-10-21 10:06:05 -07:00
Vivek Khandelwal	205e57683a	Modify Falcon-180b-GPTQ sharded pipeline	2023-10-17 20:26:01 +05:30
Vivek Khandelwal	2866d665ee	Fix Sharded Falcon-180b-GPTQ Pipeline	2023-10-17 20:26:01 +05:30
Stefan Kapusniak	71d25ec5d8	SD: Fix repeatable seeds when initial seed is random (#1893 )	2023-10-14 22:50:42 -07:00
Vivek Khandelwal	202ffff67b	Add support for sharded Falcon model	2023-10-13 22:05:10 +05:30
Stefan Kapusniak	a208302bb9	Fix repeatable seeds consistency over batch counts (#1889 ) * Set the input seed for the random number generator when generating repeatable seeds to exclude any negative numbers in the parsed seed input. This makes seeds generated for different batch counts consistent where they have the same input for the initial seed or set of seeds.	2023-10-12 17:15:19 -05:00
Vivek Khandelwal	b83d32fafe	Fix Falcon GPTQ Pipeline	2023-10-11 20:09:32 +05:30
Vivek Khandelwal	0a618e1863	Add support for Falcon GPTQ	2023-10-11 10:47:48 +05:30
Phaneesh Barwaria	a731eb6ed4	Macos fixes (#1883 ) * fix venv setup for MacOS * allow stream fuse binding on mac * clean iree metal args	2023-10-09 23:36:12 -07:00
Ean Garvey	2004d16945	Revert "[SDXL] Add SDXL pipeline to SHARK (#1731 )" (#1882 ) This reverts commit `9f0a421764`.	2023-10-09 18:01:44 -07:00
Gaurav Shukla	6e409bfb77	fix else if syntax error Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-10-10 06:23:56 +05:30
Gaurav Shukla	77727d149c	[warning] Fix dropdown warning Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-10-10 05:18:43 +05:30
Ean Garvey	66f6e79d68	Split CPU/GPU definitions conditionally outside of torch contexts. (#1879 )	2023-10-09 16:46:41 -07:00
Ean Garvey	3b825579a7	(LLaMa-2) Point to int4 + f32 acc .mlir for cpu (#1878 ) - fixes some issues with non-system prompt invocation Co-authored-by: Gaurav Shukla <gauravshukla789@gmail.com>	2023-10-09 14:37:35 -05:00
Abhishek Varma	9f0a421764	[SDXL] Add SDXL pipeline to SHARK (#1731 ) -- This commit adds SDXL pipeline to SHARK. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2023-10-09 13:01:37 -05:00
Gaurav Shukla	c28682110c	[chatbot] Flag to add system prompt Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-10-09 22:17:39 +05:30
Ean Garvey	caf6cc5d8f	Switch most compile flows to use ireec.compile_file. (#1863 ) * Switch most compile flows to use ireec.compile_file. * re-add input type to compile_str path. * Check if mlir_module exists before checking if it's a path or pyobject. * Fix some save_dir cases	2023-10-06 23:04:43 -05:00
Ean Garvey	8614a18474	Remove tf dependencies from importer path. (#1874 ) * Remove tf dependencies from import path. * Fix formatting.	2023-10-06 12:27:12 -07:00
Jakub Kuderski	86c1c0c215	Add aggregate statistics to microbenchmark (#1871 ) Print averaged results at the end of all iterations. Increase the default number of iterations to 5. Example: ``` Number of iterations: 5 Prefill: avg. 0.03 s, stddev 0.00 Decode: avg. 43.34 tokens/s, stdev 0.13 ``` Also remove the -2 in the number of generated tokens -- I did not find any evidence we need it.	2023-10-06 10:03:07 -07:00
Daniel Garvey	8bb364bcb8	enforce fp32 accumulates for cpu (#1873 )	2023-10-06 11:34:49 -05:00
Daniel Garvey	7abddd01ec	argmax inside model + brevitas pin (#1872 )	2023-10-05 20:15:21 -07:00
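
Commit fb30e8c226 above mentions falling back to an OS-assigned port when `args.server_port` cannot be bound. The following is only a rough sketch of that general technique, not the code from the commit: the function name `pick_port`, the default host, and the returned flag are all hypothetical, and the actual SHARK Studio implementation may differ.

```python
import socket
from typing import Tuple


def pick_port(preferred: int, host: str = "127.0.0.1") -> Tuple[int, bool]:
    """Return (port, fell_back).

    Hypothetical helper: try the preferred port first; if binding fails,
    ask the OS for any free port by binding to port 0.
    """
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as probe:
        try:
            probe.bind((host, preferred))
            return preferred, False
        except OSError:
            # Preferred port is unavailable (in use, or not permitted).
            pass

    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as probe:
        # Port 0 tells the OS to choose an unused ephemeral port.
        probe.bind((host, 0))
        return probe.getsockname()[1], True
```

A caller could use the returned flag to tell the user that a different port was chosen. Note that the probe socket is closed before the server starts, so another process could in principle claim the port in between; a real launcher would need to tolerate that.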


518 Commits