AMD-SHARK-Studio

mirror of https://github.com/nod-ai/AMD-SHARK-Studio.git synced 2026-02-19 11:56:43 -05:00

Author	SHA1	Message	Date
Ean Garvey	68e9281778	(Studio2) Refactors SD pipeline to rely on turbine-models pipeline, fixes to LLM, gitignore (#2129) * Shark Studio SDXL support, HIP driver support, simpler device info, small fixes * Fixups to llm API/UI and ignore user config files. * Small fixes for unifying pipelines. * Update requirements.txt for iree-turbine (#2130) * Fix Llama2 on CPU (#2133) * Filesystem cleanup and custom model fixes (#2127) * Fix some formatting issues * Remove IREE pin (fixes exe issue) (#2126) * Update find links for IREE packages (#2136) * Shark Studio SDXL support, HIP driver support, simpler device info, small fixes * Abstract out SD pipelines from Studio Webui (WIP) * Switch from pin to minimum torch version and fix index url * Fix device parsing. * Fix linux setup * Fix custom weights. --------- Co-authored-by: saienduri <77521230+saienduri@users.noreply.github.com> Co-authored-by: gpetters-amd <159576198+gpetters-amd@users.noreply.github.com> Co-authored-by: gpetters94 <gpetters@protonmail.com>	2024-05-28 13:18:31 -04:00
Stefan Kapusniak	58f194a450	Fix _IREE_TARGET_MAP (#2103) - Change target passed to iree for vulkan from 'vulkan' to 'vulkan-spirv', as 'vulkan' is not a valid value for --iree-hal-target-backends with the current iree compiler.	2024-03-18 00:21:44 -05:00
PhaneeshB	51afe19e20	fix rocm arch selection	2023-11-10 13:22:51 +05:30
PhaneeshB	72c0a8abc8	remove dependency on external commands for driver installation check	2023-10-27 10:30:40 +05:30
Phaneesh Barwaria	486202377a	update dependency on rocm/hip info command (#1900) * add support for rocm flags * add rocm target flag to chat args * rm rocm libs dependency message	2023-10-26 15:18:25 +05:30
Ean Garvey	9c8cbaf498	Add support for ROCM (Windows) in Studio + compile utils (#1770) * WIP: MSVC ROCM support for SHARK Studio * Make get_iree_rocm_args platform-agnostic. * Update stable_args.py * Update rocm arg handling in SD utils * Guard quantization imports. Co-authored-by: jam https://github.com/jammm	2023-08-25 20:56:05 -07:00
Stella Laurenzo	cec6eda6b4	Optimize device enumeration overhead and log details on long operations. (#1734) * Optimize device enumeration overhead and log details on long operations. * Various fixes to add `@functools.cache` to what should be one time, expensive, device enumeration and setup activities. Cuts several seconds off of initialization on my machine. * Add detailed tracing to actual invocations if they exceed a certain timeout or have an exception. * Add detailed tracing to loading status. * By default detail logging is only printed if an operation takes an excessive amount of time. All logging/timing can be printed by setting the variable `$env:SHARK_DETAIL_TRACE = "1"` * Remove cache from unhashable functions	2023-08-07 17:20:53 -07:00
PhaneeshB	28e0919321	Add AMD cpu device	2023-06-23 18:47:04 +05:30
Ranvir Singh Virk	18c8e9e51e	Metal typo fix (#1572) * fixing typos for metal changes * black formatting	2023-06-21 21:56:11 -07:00
Ranvir Singh Virk	07c1e1d712	Adding metal_utils for iree_utils (#1561) * Adding metal_utils for iree_utils * Add patch for making compile API work for both MEGABYTE and MiniGPT4 (#1559) -- It also modifies the mega_test.py script Signed-off-by: Abhishek Varma <abhishek@nod-labs.com> * [SD] Update unet in_channels API and add PIL metadata to spec. (#1560) * Fix deprecation warning for unet config. * Include PIL metadata instead of hidden imports in SD spec. * Fixing iree-metal-target-platform * adding metal to txt2img pipeline * Fixing Copyright date * removing debug prints * black lint formatting * fixing device dump --------- Signed-off-by: Abhishek Varma <abhishek@nod-labs.com> Co-authored-by: Abhishek Varma <avarma094@gmail.com> Co-authored-by: Ean Garvey <87458719+monorimet@users.noreply.github.com> Co-authored-by: powderluv <powderluv@users.noreply.github.com>	2023-06-21 19:09:03 -07:00
Phaneesh Barwaria	1980d7b2c3	Cpu device map (#1515) * update cpu iree device * fix vmfb paths vic unsharded	2023-06-09 11:27:02 -05:00
Boian Petkantchin	bdf37b5311	If device/backend is unknown pass it to IREE verbatim	2023-05-16 09:54:07 -07:00
Phoenix Meadowlark	d319f4684e	Add peak memory reporting for IREE, TF and PyTorch (#1216)	2023-03-20 15:40:49 -05:00
Daniel Garvey	bdbe992769	Add IREE_SAVE_TEMPS for import_debug command (#1184) based on hf_model_id. Works on windows	2023-03-14 11:40:23 -07:00
Ean Garvey	a90812133b	Enable pytests on Windows (#901)	2023-02-01 18:36:41 -06:00
Phaneesh Barwaria	831f206cd0	Revert "Add target triple selection for multiple cards" (#655) This reverts commit `acb905f0cc`.	2022-12-16 15:01:45 -08:00
PhaneeshB	acb905f0cc	Add target triple selection for multiple cards	2022-12-17 02:24:37 +05:30
Boian Petkantchin	aaf60bdee6	Simplify iree_device_map	2022-12-13 13:21:51 -08:00
Phaneesh Barwaria	749a2c2dec	add support for choosing vulkan device (#439)	2022-11-12 14:00:41 -08:00
erman-gurses	fc8aa6ae63	Add ROCM parameters (#335)	2022-09-16 09:12:19 -07:00
Ean Garvey	6cf5564c84	Remove "gpu" device alias and migrate to using "cuda" for NVIDIA GPU. (#325) * Replace instances of "gpu" alias for devices with "cuda"	2022-09-13 01:16:56 -05:00
Stanley Winata	55bcb2eb3c	Level Zero Backend (#280)	2022-08-17 19:19:27 -07:00
powderluv	db6e2207ed	Update _common.py	2022-08-13 13:49:01 -07:00
Daniel Garvey	7975087ee2	change backend name (#265)	2022-08-13 12:01:12 -07:00
Prashant Kumar	fa7ee7e099	Update pytorch tests to support vulkan and cuda. All the model validation pass except distilbert which is failing in torch-mlir lowering. Also, added the mobilebert-uncased model to the torch test suite.	2022-07-08 14:40:13 +05:30
Anush Elangovan	a7435973d9	Fix black formatting	2022-06-30 20:42:02 +00:00
Prashant Kumar	b07377cbfd	Refactor the shark_runner shark_inference to only support mlir_modules. 1. The shark_inference is divided into shark_importer and shark_inference. 2. All the tank/pytorch tests have been updated.	2022-06-28 18:46:18 +05:30
Prashant Kumar	e8aa105b2a	Divide iree_utils and do module imports on function calls.	2022-06-22 14:17:33 +05:30

28 Commits