AMD-SHARK-Studio

mirror of https://github.com/nod-ai/AMD-SHARK-Studio.git synced 2026-02-19 11:56:43 -05:00

Author	SHA1	Message	Date
Ean Garvey	2004d16945	Revert "[SDXL] Add SDXL pipeline to SHARK (#1731 )" (#1882 ) This reverts commit `9f0a421764`.	2023-10-09 18:01:44 -07:00
Gaurav Shukla	77727d149c	[warning] Fix dropdown warning Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-10-10 05:18:43 +05:30
Abhishek Varma	9f0a421764	[SDXL] Add SDXL pipeline to SHARK (#1731 ) -- This commit adds SDXL pipeline to SHARK. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2023-10-09 13:01:37 -05:00
Gaurav Shukla	c28682110c	[chatbot] Flag to add system prompt Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-10-09 22:17:39 +05:30
Gaurav Shukla	e54517e967	[UI] Disable config generator, lora train and model manager (#1858 ) Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-10-02 22:34:40 -07:00
Gaurav Shukla	e930ba85b4	[os] Remove os dependency from vmfb naming (#1854 ) Also fixes a small ui issue for chatbot. Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-09-29 12:38:17 -05:00
Gaurav Shukla	cd732e7a38	[chatbot] split execution time to prefill and decode Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-09-29 13:18:03 +05:30
Gaurav Shukla	8e0f8b3227	[ui] Update chatbot UI Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-09-29 13:18:03 +05:30
Gaurav Shukla	b8210ef796	[chatbot] Re-instantiate the chatbot object if device id changes Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-09-29 13:18:03 +05:30
Gaurav Shukla	82f833e87d	[vulkan] Update vmfb naming Update vmfb naming for vulkan devices in order to resolve naming conflicts in the presence of multiple vulkan devices. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-09-28 14:52:11 +05:30
Abhishek Varma	9a0efffcca	[Llama2] Fix wrong Vulkan device ID + Add Vulkan compile flags -- This commit fixes the wrong Vulkan device being selected during runtime. -- It also adds couple of IREE compilation flags to target specific Vulkan device. -- It also changes the Vulkan device listing to be more in tune with lowering control flow. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2023-09-22 22:24:18 +05:30
PhaneeshB	b817bb8455	add roles for llama2	2023-09-12 10:59:28 +05:30
Abhishek Varma	bde63ee8ae	Add logging feature in WebUI (#1821 )	2023-09-08 05:48:05 -07:00
Gaurav Shukla	ede6bf83e2	[vicuna] Disabling the IR generation path Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-09-06 20:13:17 +05:30
Gaurav Shukla	d2f64eefa3	[chatbot] Remove few outdated models from list (#1814 )	2023-09-04 09:26:32 -07:00
jinchen62	4c3d8a0a7f	Enable downloading vmfb/mlir for webui (#1807 )	2023-08-31 11:05:47 -07:00
Ean Garvey	9c8cbaf498	Add support for ROCM (Windows) in Studio + compile utils (#1770 ) * WIP: MSVC ROCM support for SHARK Studio * Make get_iree_rocm_args platform-agnostic. * Update stable_args.py * Update rocm arg handling in SD utils * Guard quantization imports. Co-authored-by: jam https://github.com/jammm	2023-08-25 20:56:05 -07:00
xzuyn	e37d6720eb	Add Hires Fix (#1787 ) * improper test hiresfix * add sliders & use `clear_cache` * add resample choices & fix step adjustment * add step adjustment to img2img * add resample options to img2img * simplify hiresfix - import `img2img_inf` from `img2img_ui.py` instead of just copying it into `txt2img_ui.py` * set `hri` to None after using * add more resample types, and don't show output until hiresfix is done * cleaner implementation * ran black * ran black again with jupyter dependencies	2023-08-24 09:01:41 -07:00
Abhishek Varma	db990826d3	Add Llama2 13B int4 fp16 support (#1784 ) Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2023-08-23 10:00:32 -07:00
Gaurav Shukla	9ae8bc921e	[chatbot] Fix chatbot cli and webview warning Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-08-17 21:27:10 +05:30
Gaurav Shukla	32eb78f0f9	[chatbot] Fix switching parameters in chatbot Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-08-17 19:14:17 +05:30
Ean Garvey	9dee7ae652	fix tkinter window (#1766 )	2023-08-15 13:23:09 -07:00
Ean Garvey	343dfd901c	Update SHARK-Runtime links to SRT (#1765 ) * Update nightly.yml * Update setup_venv.ps1 * Update CMakeLists.txt * Update shark_iree_profiling.md * Update setup_venv.sh * Update README.md * Update .gitmodules * Update CMakeLists.txt * Update README.md * fix signtool flags * Update nightly.yml * Update benchmark_utils.py * uncomment tkinter launch	2023-08-15 12:40:44 -07:00
PhaneeshB	4f61d69d86	add support passing iree flags for LLMs	2023-08-15 00:22:56 +05:30
Gaurav Shukla	4dc9c59611	[chatbot] Add tokens generated per second (#1753 )	2023-08-13 11:25:41 -07:00
Gaurav Shukla	18801dcabc	[chat] Update chatbot ui Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-08-13 18:39:22 +05:30
Gaurav Shukla	3c577f7168	[vicuna] fix shard config generator script (#1747 ) Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-08-10 11:26:03 -07:00
Stefan Kapusniak	f5e4fa6ffe	UI/Web - Revert tab order (#1724 ) * Revert ui tab order * Reverts the tab order, so that SD, LLM, and Experimental are grouped together again as far as is possible. * Labelled "Generate Sharding Config" as experimental as pressing the 'Get Model Config' errors for me. * Fix formatting in index.py	2023-08-10 11:25:36 -07:00
Gaurav Shukla	8e90f1b81a	[vicuna] add default config in case of sharded vicuna Signed-Off-by: Gaurav Shukla<gaurav@nod-labs.com>	2023-08-10 21:28:08 +05:30
Vivek Khandelwal	e4d7abb519	Final patch for fixing Langchain token streaming issue (#1744 )	2023-08-09 10:09:41 -07:00
Daniel Garvey	a686d7d89f	temporarily disable langchain stuff in webui (#1719 ) its breaking the exe	2023-08-04 12:48:06 -07:00
Eliasj42	ed484b8253	added functionality for int8 vicuna and 4 shards (#1712 ) combined vicuna_4_shards.py and vicuna.py to reduce code duplication Co-authored-by: Elias Joseph <elias@nod-labs.com>	2023-08-04 14:05:05 -05:00
gpetters94	7fe57ebaaf	Add vector database and add support on the web UI (#1699 )	2023-08-04 13:47:19 -04:00
Gaurav Shukla	51ec1a1360	[vicuna] Integrate sharded vicuna in web (#1717 ) Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-08-04 11:46:53 -05:00
Gaurav Shukla	bd30044c0b	[Shard] Add sharding generation in shark studio Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-08-04 21:51:14 +05:30
Vivek Khandelwal	a5b13fcc2f	[Langchain] Patch for fixing streaming of tokens (#1709 )	2023-08-03 10:06:49 -07:00
Stefan Kapusniak	206c1b70f4	UI/Web: Reorder tabs to separate SD and LLM (#1701 ) Shuffle the tabs around so that: * All the SD tabs are together * All the LLM tabs are together * All the experimental tabs are together	2023-07-29 22:25:30 -04:00
Vivek Khandelwal	776a9c2293	Fix for Langchain (#1694 ) For CPU, remove max time stopping criteria Fix web UI issue	2023-07-26 09:00:23 -07:00
Abhishek Varma	47f8a79c75	[MiniGPT4] Add MiniGPT4 to SHARK (#1554 ) * [MiniGPT4] Add MiniGPT4 to SHARK -- This is the first installment of MiniGPT4 in SHARK. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com> * Add int8 support for MiniGPT4 -- This commit adds int8 support for MiniGPT4. Signed-off-by: Abhishek Varma <abhishek@nod-lab.com> * Update .spec for MiniGPT4's config files * black format MiniGPT4 --------- Signed-off-by: Abhishek Varma <abhishek@nod-labs.com> Signed-off-by: Abhishek Varma <abhishek@nod-lab.com>	2023-07-25 09:42:27 -07:00
Stefan Kapusniak	289f983f41	SD - Implement seed arrays for batch runs (#1690 ) * SD Scripts and UI tabs that support batch_count can now take a string containing a JSON array, or a list of integers, as their seed input. * Each batch in a run will now take the seed specified at the corresponding array index if one exists. If there is no seed at that index, the seed value will be treated as -1 and a random seed will be assigned at that position. If an integer rather than a list or json array has been, everything works as before. * UI seed input controls are now Textboxes with info lines about the seed formats allowed. * UI error handling updated to be more helpful if the seed input is invalid.	2023-07-24 19:22:34 -07:00
Gaurav Shukla	5497af1f56	[config] Add support for uploading sharding config file in chatbot (#1689 ) Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-07-24 10:18:03 -07:00
Vivek Khandelwal	f3cb63fc9c	Fix Langchain multiple device isssue (#1688 )	2023-07-24 08:03:46 -07:00
Vivek Khandelwal	d7092aafaa	Fix multiple issue for Langchain This commit fixes the following issue for the Langchain: 1.) Web UI not able to fetch results. 2.) For each query model getting reloaded. 3.) SHARK module not using user provided device and precision. 4.) Create a class for main Langchain code. 5.) Misc issues	2023-07-21 21:56:27 +05:30
Vivek Khandelwal	a415f3f70e	Fix Langchain Prompt issue and add web UI support (#1682 )	2023-07-21 06:36:55 -07:00
Vivek Khandelwal	03c4d9e171	Add support for Llama-2-70b for web and cli, and for hf_auth_token	2023-07-20 14:57:48 +05:30
Vivek Khandelwal	db3f222933	Revert "Add Llama2 70B option in CLI and WebUI (#1673 )" (#1679 ) This reverts commit `41e5088908`.	2023-07-19 22:02:48 -07:00
Stefan Kapusniak	68b3021325	Fixes cosmetic problems with Gradio 3.37.0 (#1676 ) * Fix nod-ai logo having a white border * Fix control labels having a black background * Remove extra lower border below Save Prompt checkboxes in Txt2Img UI	2023-07-19 17:28:53 -07:00
Abhishek Varma	41e5088908	Add Llama2 70B option in CLI and WebUI (#1673 )	2023-07-19 10:41:42 -07:00
Vivek Khandelwal	4be80f7158	Add support for the Llama-2 model	2023-07-19 20:57:08 +05:30
Daniel Garvey	8c317e4809	fix cli for vicuna (#1666 )	2023-07-18 10:03:40 -07:00

1 2 3 4 5

206 Commits