AMD-SHARK-Studio

mirror of https://github.com/nod-ai/AMD-SHARK-Studio.git synced 2026-02-19 11:56:43 -05:00

Author	SHA1	Message	Date
Ean Garvey	9697981004	Pipe through a debug option to iree compile utils. (#1796 ) * Update compile_utils.py * Pipe through a flag to toggle debug options in compile utils. * Update SharkLLMBase.py	2023-08-25 07:11:11 -07:00
Ean Garvey	450c231171	Add tokenizers to requirements.txt (#1790 ) * Add tokenizers to requirements and pin version * Update process_skipfiles.py 20230824.911	2023-08-24 19:44:04 -05:00
Ean Garvey	07f6f4a2f7	Add a short README for the OPT examples and small tweaks. (#1793 ) * Small changes to OPT example. * Update opt README. * Add a few modes to batch script. * Update README.md	2023-08-24 17:26:11 -07:00
jinchen62	610813c72f	Add iree flag to strip assertions (#1791 )	2023-08-24 10:51:19 -07:00
Ean Garvey	8e3860c9e6	Remove flags that are default in upstream IREE (#1785 ) * Remove index bits flags now set by default * Update shark_studio_imports.py	2023-08-24 11:57:54 -05:00
xzuyn	e37d6720eb	Add Hires Fix (#1787 ) * improper test hiresfix * add sliders & use `clear_cache` * add resample choices & fix step adjustment * add step adjustment to img2img * add resample options to img2img * simplify hiresfix - import `img2img_inf` from `img2img_ui.py` instead of just copying it into `txt2img_ui.py` * set `hri` to None after using * add more resample types, and don't show output until hiresfix is done * cleaner implementation * ran black * ran black again with jupyter dependencies 20230824.909	2023-08-24 09:01:41 -07:00
Vivek Khandelwal	16160d9a7d	Fix combine mlir script	2023-08-24 19:10:49 +05:30
Sungsoon Cho	79075a1a07	Opt perf (#1786 ) * Define command line args, model-name, max-seq-len, platform, etc. * Add usage example. * Add opt_perf_comparision_batch.py. * Use shlex instead.	2023-08-24 08:33:12 -05:00
Abhishek Varma	db990826d3	Add Llama2 13B int4 fp16 support (#1784 ) Signed-off-by: Abhishek Varma <abhishek@nod-labs.com> 20230823.908	2023-08-23 10:00:32 -07:00
gpetters94	7ee3e4ba5d	Add stencil_unet_512 support (#1778 ) This should fix any remaining issues with stencils and long prompts. 20230822.907	2023-08-22 12:23:46 -04:00
Vivek Khandelwal	05889a8fe1	Add LLaMa2-int4-fp16 support (#1782 ) 20230822.906	2023-08-22 07:45:50 -07:00
jinchen62	b87efe7686	Fix venv setup for brevitas (#1779 ) 20230821.905	2023-08-21 11:58:51 -07:00
gpetters94	82b462de3a	Fix stencils for long prompts (#1777 ) 20230820.903 20230820.904 20230819.902 20230819.901	2023-08-19 00:26:51 -07:00
Daniel Garvey	d8f0f7bade	replace public with private (#1776 ) unload footguns 20230818.899	2023-08-18 14:22:46 -07:00
gpetters94	79bd0b84a1	Fix an issue with diffusers>0.19.3 (#1775 )	2023-08-18 14:06:06 -04:00
jinchen62	8738571d1e	Adapt the change of brevitas custom op name (#1772 ) 20230818.898 20230817.897 20230817.896	2023-08-17 14:24:43 -07:00
Gaurav Shukla	a4c354ce54	[version] Pin diffusers==0.19.3 Once the latest works with LORA train, unpin it. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-08-17 21:27:10 +05:30
Gaurav Shukla	cc53efa89f	[cli] Fix chatbot cli Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-08-17 21:27:10 +05:30
Gaurav Shukla	9ae8bc921e	[chatbot] Fix chatbot cli and webview warning Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-08-17 21:27:10 +05:30
Gaurav Shukla	32eb78f0f9	[chatbot] Fix switching parameters in chatbot Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-08-17 19:14:17 +05:30
Ean Garvey	cb509343d9	Fix pytest benchmarks and shark_tank generation. (#1632 ) - fix setup_venv.sh for benchmarks/imports etc. - fix torch benchmarks in SharkBenchmarkRunner - generate SD artifacts using build_tools/stable_diffusion_testing.py and --import_mlir - decouple SD gen from tank/generate_sharktank for now 20230816.895	2023-08-16 17:48:47 -05:00
powderluv	6da391c9b1	update signtool to use /fd certHash 20230816.894 20230815.893 20230815.892	2023-08-15 15:11:40 -07:00
Ean Garvey	9dee7ae652	fix tkinter window (#1766 )	2023-08-15 13:23:09 -07:00
Ean Garvey	343dfd901c	Update SHARK-Runtime links to SRT (#1765 ) * Update nightly.yml * Update setup_venv.ps1 * Update CMakeLists.txt * Update shark_iree_profiling.md * Update setup_venv.sh * Update README.md * Update .gitmodules * Update CMakeLists.txt * Update README.md * fix signtool flags * Update nightly.yml * Update benchmark_utils.py * uncomment tkinter launch	2023-08-15 12:40:44 -07:00
Ean Garvey	57260b9c37	(Studio) Add hf-hub to pyinstaller metadata (#1761 ) 20230814.887	2023-08-14 23:01:50 -05:00
Ean Garvey	18e7d2d061	Enable vae tunings for rdna3. (#1764 )	2023-08-14 21:00:14 -07:00
Stanley Winata	51a1009796	Add Forward method to SHARKRunner and fix examples. (#1756 )	2023-08-14 19:20:37 -07:00
Daniel Garvey	045c3c3852	enable iree-opt-const-expr-hoisting in vicuna (#1742 ) Co-authored-by: powderluv <powderluv@users.noreply.github.com>	2023-08-14 18:43:42 -07:00
Ean Garvey	0139dd58d9	Specify max allocation size in IREE compile args. (#1760 )	2023-08-14 15:43:09 -05:00
Ean Garvey	c96571855a	prevents recompiles for cuda benchmarks + update benchmark_module path (#1759 ) * xfail resnet50_fp16 * Fix cuda benchmarks and prevent recompilation.	2023-08-14 15:30:32 -05:00
PhaneeshB	4f61d69d86	add support passing iree flags for LLMs	2023-08-15 00:22:56 +05:30
Phaneesh Barwaria	531d447768	set default allocator for metal device creation (#1755 )	2023-08-14 06:17:52 -07:00
Vivek Khandelwal	16f46f8de9	Update langchain_requirements.txt	2023-08-14 14:32:19 +05:30
Vivek Khandelwal	c4723f469f	Update langchain_requirements.txt	2023-08-14 14:32:19 +05:30
Vivek Khandelwal	d804f45a61	Update langchain_requirements.txt	2023-08-14 14:32:19 +05:30
Vivek Khandelwal	d22177f936	Update requirements.txt	2023-08-14 14:32:19 +05:30
George Petterson	75e68f02f4	Remove CUDNN	2023-08-14 14:32:19 +05:30
Gaurav Shukla	4dc9c59611	[chatbot] Add tokens generated per second (#1753 ) 20230813.883	2023-08-13 11:25:41 -07:00
Gaurav Shukla	18801dcabc	[chat] Update chatbot ui Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-08-13 18:39:22 +05:30
Gaurav Shukla	3c577f7168	[vicuna] fix shard config generator script (#1747 ) Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com> 20230812.882 20230811.881 20230810.880	2023-08-10 11:26:03 -07:00
Stefan Kapusniak	f5e4fa6ffe	UI/Web - Revert tab order (#1724 ) * Revert ui tab order * Reverts the tab order, so that SD, LLM, and Experimental are grouped together again as far as is possible. * Labelled "Generate Sharding Config" as experimental as pressing the 'Get Model Config' errors for me. * Fix formatting in index.py	2023-08-10 11:25:36 -07:00
powderluv	48de445325	Enable caching and disable vma (#1746 ) * Enable caching allocator by default Going to toggle VMA off too and this is required for performance. Will have to monitor in the wild reports. * Disable VMA Disable VMA	2023-08-10 10:49:44 -07:00
Gaurav Shukla	8e90f1b81a	[vicuna] add default config in case of sharded vicuna Signed-Off-by: Gaurav Shukla<gaurav@nod-labs.com>	2023-08-10 21:28:08 +05:30
Vivek Khandelwal	e8c1203be2	Fix vicuna script (#1745 ) 20230810.879	2023-08-10 06:11:14 -07:00
Vivek Khandelwal	e4d7abb519	Final patch for fixing Langchain token streaming issue (#1744 ) 20230809.878	2023-08-09 10:09:41 -07:00
powderluv	96185c9dc1	pin safetensors to 0.3.1 (#1740 ) 20230808.876 20230809.877	2023-08-08 19:24:44 -07:00
powderluv	bc22a81925	re-enable constant folding (#1739 ) Tested and works well. (modulo unrelated driver issue)	2023-08-08 17:17:38 -07:00
Eliasj42	5203679f1f	Bandaid fix 2 (#1728 ) * download all mlirs * fixed install method * download all mlirs (#1727) Co-authored-by: Elias Joseph <elias@nod-labs.com> * added taggs * fix name check for file existence * Remove SD from all_models.csv (#1706) Removes SD from pytests as it has its own test suite. * gpt_langchain.py fixes for pydantic (#1722) * removed dead code --------- Co-authored-by: Elias Joseph <elias@nod-labs.com> Co-authored-by: PhaneeshB <b.phaneesh@gmail.com> Co-authored-by: Ean Garvey <87458719+monorimet@users.noreply.github.com> Co-authored-by: Stefan Kapusniak <121311569+one-lithe-rune@users.noreply.github.com>	2023-08-08 12:14:57 -05:00
Vivek Khandelwal	bf073f8f37	[Langchain] Expand pipelines to fix token streaming issue 20230807.875	2023-08-08 10:27:23 +05:30
Stella Laurenzo	cec6eda6b4	Optimize device enumeration overhead and log details on long operations. (#1734 ) * Optimize device enumeration overhead and log details on long operations. * Various fixes to add `@functools.cache` to what should be one time, expensive, device enumeration and setup activities. Cuts several seconds off of initialization on my machine. * Add detailed tracing to actual invocations if they exceed a certain timeout or have an exception. * Add detailed tracing to loading status. * By default detail logging is only printed if an operation takes an excessive amount of time. All logging/timing can be printed by setting the variable `$env:SHARK_DETAIL_TRACE = "1"` * Remove cache from unhashable functions 20230807.874	2023-08-07 17:20:53 -07:00

... 3 4 5 6 7 ...

1743 Commits