AMD-SHARK-Studio

mirror of https://github.com/nod-ai/AMD-SHARK-Studio.git synced 2026-04-03 03:00:17 -04:00

Author	SHA1	Message	Date
Gaurav Shukla	3c577f7168	[vicuna] fix shard config generator script (#1747 ) Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-08-10 11:26:03 -07:00
powderluv	48de445325	Enable caching and disable vma (#1746 ) * Enable caching allocator by default Going to toggle VMA off too and this is required for performance. Will have to monitor in the wild reports. * Disable VMA Disable VMA	2023-08-10 10:49:44 -07:00
Gaurav Shukla	8e90f1b81a	[vicuna] add default config in case of sharded vicuna Signed-Off-by: Gaurav Shukla<gaurav@nod-labs.com>	2023-08-10 21:28:08 +05:30
Stella Laurenzo	cec6eda6b4	Optimize device enumeration overhead and log details on long operations. (#1734 ) * Optimize device enumeration overhead and log details on long operations. * Various fixes to add `@functools.cache` to what should be one time, expensive, device enumeration and setup activities. Cuts several seconds off of initialization on my machine. * Add detailed tracing to actual invocations if they exceed a certain timeout or have an exception. * Add detailed tracing to loading status. * By default detail logging is only printed if an operation takes an excessive amount of time. All logging/timing can be printed by setting the variable `$env:SHARK_DETAIL_TRACE = "1"` * Remove cache from unhashable functions	2023-08-07 17:20:53 -07:00
Stella Laurenzo	9e37e03741	Clearly differentiate phases of loading modules to better understand if things are taking a long time. (#1733 )	2023-08-07 14:03:12 -07:00
PhaneeshB	872bd72d0b	fix name check for file existence	2023-08-05 21:33:53 +05:30
Nithin Meganathan	c287fd2be8	Add GPU ID's in model_confg.json by default for manual annotation (#1718 )	2023-08-04 12:46:27 -05:00
Gaurav Shukla	bd30044c0b	[Shard] Add sharding generation in shark studio Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-08-04 21:51:14 +05:30
PhaneeshB	4684afad34	update upscalar example	2023-07-28 21:06:28 +05:30
PhaneeshB	8d65456b7a	Move vulkan runtime flags to shark_args	2023-07-28 21:06:28 +05:30
jinchen62	4d5c55dd9f	Fix vicuna script (#1697 )	2023-07-27 17:24:26 -05:00
Abhishek Varma	47f8a79c75	[MiniGPT4] Add MiniGPT4 to SHARK (#1554 ) * [MiniGPT4] Add MiniGPT4 to SHARK -- This is the first installment of MiniGPT4 in SHARK. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com> * Add int8 support for MiniGPT4 -- This commit adds int8 support for MiniGPT4. Signed-off-by: Abhishek Varma <abhishek@nod-lab.com> * Update .spec for MiniGPT4's config files * black format MiniGPT4 --------- Signed-off-by: Abhishek Varma <abhishek@nod-labs.com> Signed-off-by: Abhishek Varma <abhishek@nod-lab.com>	2023-07-25 09:42:27 -07:00
Vivek Khandelwal	11f62d7fac	Minor fixes for MiniLM Training	2023-07-18 17:16:44 +05:30
Ranvir Singh Virk	9fcae4f808	Metal testing (#1595 ) * Fixing metal_platform and device selection * fixing for metal platform * fixed for black lint formating	2023-07-08 15:22:53 -07:00
powderluv	c41f58042a	Update compile_utils.py (#1617 ) * Update compile_utils.py * Update compile_utils.py * Update compile_utils.py	2023-07-05 10:06:48 -07:00
jinchen62	bc6fee1a0c	Add int4/int8 vicuna (#1598 )	2023-07-05 07:01:51 -07:00
Prashant Kumar	1c32915570	Add the shark compile downstream due to https://github.com/pytorch/pytorch/pull/104185#issuecomment-1615110613 (#1615 )	2023-07-01 08:30:58 -07:00
Abhishek Varma	d496053590	[SHARK] Add a compile API to use for quick testing of inference (#1606 )	2023-06-28 08:40:28 -07:00
Daniel Garvey	75672c0e28	set task_topology_max_group to cpu_count (#1594 ) by default. Can be overriden with a flag of the same str	2023-06-26 14:54:06 -07:00
Prashant Kumar	74a7202173	Make the tensors contiguous.	2023-06-26 17:29:54 +05:30
Prashant Kumar	27a08735db	Add the shark backend for torch.compile API. (#1596 )	2023-06-26 03:53:32 -07:00
Prashant Kumar	6ef78ee7ba	Add cpu compile time flags. (#1585 )	2023-06-23 07:23:26 -07:00
PhaneeshB	28e0919321	Add AMD cpu device	2023-06-23 18:47:04 +05:30
Daniel Garvey	28f4d44a6b	downloader was double downloading (#1580 )	2023-06-22 18:30:27 -07:00
Daniel Garvey	0ca3b9fce3	fix some mmap and vicuna bugs (#1576 )	2023-06-22 17:39:55 -05:00
Nithin Meganathan	045f2bb147	Add dispatch-level config file generator for manual annotation (#1566 )	2023-06-22 15:11:41 -07:00
Prashant Kumar	a811b867b9	Add shark_eager mode. -- Eager mode with step by step op compilation and execution.	2023-06-22 22:59:14 +05:30
Abhishek Varma	cdd505e2dd	[SharkInference-SharkRuntime] Adds capability to mmap vmfbs -- This commit is based on [VmModule.mmap() API](https://github.com/openxla/iree/pull/14124). -- It thereby adds capability to mmap vmfbs in SHARK. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2023-06-22 20:43:40 +05:30
Ranvir Singh Virk	18c8e9e51e	Metal typo fix (#1572 ) * fixing typos for metal changes * black formating	2023-06-21 21:56:11 -07:00
Ranvir Singh Virk	07c1e1d712	Adding metal_utils for iree_utils (#1561 ) * Adding metal_utils for iree_utils * Add patch for making compile API work for both MEGABYTE and MiniGPT4 (#1559) -- It also modifies the mega_test.py script Signed-off-by: Abhishek Varma <abhishek@nod-labs.com> * [SD] Update unet in_channels API and add PIL metadata to spec. (#1560) * Fix deprecation warning for unet config. * Include PIL metadata instead of hidden imports in SD spec. * Fixing iree-metal-target-platform * adding metal to txt2img pipeline * Fixing Copyright date * removing debug prints * black lint formating * fixing device dump --------- Signed-off-by: Abhishek Varma <abhishek@nod-labs.com> Co-authored-by: Abhishek Varma <avarma094@gmail.com> Co-authored-by: Ean Garvey <87458719+monorimet@users.noreply.github.com> Co-authored-by: powderluv <powderluv@users.noreply.github.com>	2023-06-21 19:09:03 -07:00
Abhishek Varma	3fb72e192e	Add patch for making compile API work for both MEGABYTE and MiniGPT4 (#1559 ) -- It also modifies the mega_test.py script Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2023-06-20 10:04:17 -07:00
powderluv	a544f30a8f	Move mega to the shark examples (#1555 )	2023-06-19 11:10:51 -07:00
Vivek Khandelwal	015d0132c3	Modify falcon pipeline to add fp16 support (#1551 )	2023-06-19 09:57:13 -07:00
Chi_Liu	f60ab1f4fa	Add Deberta to stablehlo in shark tank (#1545 )	2023-06-16 13:24:44 -07:00
Surya Jasper	b203779462	Added Adreno target triples to vulkan_utils (#1543 )	2023-06-15 16:42:59 -07:00
Ean Garvey	eb6d11cfed	Change mlir dialects for tf tests to stablehlo. (#1535 ) * Change mlir dialects for tf tests to stablehlo * Update shark_runner.py	2023-06-14 10:43:49 -07:00
dan	bec82a665f	mega vicuna merge single endpoint in apps/language/models/scripts/vicuna.py removed main functions from pipelines replaced divergent utils compile with shark_importer adds support for different precisions	2023-06-14 19:06:29 +05:30
Ean Garvey	9551490341	Remove deprecared --iree-mhlo-demote-164-to-132 flag usage. (#1533 )	2023-06-13 22:40:47 -05:00
Ean Garvey	44273b0791	Fix conditional in transform_fx() (#1530 )	2023-06-13 16:24:53 -05:00
Prashant Kumar	0a4c8fcb3e	Minor changes in the fx transforms.	2023-06-13 21:23:35 +05:30
Stefan Kapusniak	2fec3c8169	re-indents add_upcast in shark importer (#1523 ) * The two with blocks in add_upcast appear to be underindented making SD 1.4 break on rdna3, I've pushed them out one more tab, and then everything appears to work again.	2023-06-12 14:41:10 -05:00
Prashant Kumar	b6dbd20250	Modify the fx transforms. (#1521 ) - The bounds are set properly. - The upcasting and downcasting is done for vicuna.	2023-06-12 09:40:14 -07:00
Nithin Meganathan	34f1295349	Add a model config generator (#1511 ) Model config generator takes a PyTorch model as input and generates a JSON file with model layers and other propperties that define sharding on a particular hardware.	2023-06-09 15:32:00 -07:00
Phaneesh Barwaria	1980d7b2c3	Cpu device map (#1515 ) * update cpu iree device * fix vmfb paths vic unsharded	2023-06-09 11:27:02 -05:00
Ean Garvey	a07fe80530	Update OPT, ResNet example scripts. (#1492 ) * Update API in OPT example. * fix resnet50 script * Add OPT1.3b test script.	2023-06-05 20:19:35 -07:00
Ean Garvey	218ed78c40	Change instances of input_type='mhlo' to 'auto' (#1482 )	2023-06-02 16:43:47 -05:00
Foxlum	5915bf7de3	Add to and tweak vulkan configuration environments. (#1475 ) * Update vulkan_target_env_utils.py * Update vulkan_target_env_utils.py Adjust target environment capabilities. * Update vulkan_target_env_utils.py black linted?	2023-06-01 22:25:20 -07:00
yzhang93	47b346d428	Modify the lowering config format for SPIRVMatmulPromoteVectorize pipeline (#1471 )	2023-05-29 21:53:48 -07:00
Ean Garvey	54e57f7771	Revive SD downloads from shark_tank. (#1465 )	2023-05-25 12:03:21 -05:00
Prashant Kumar	8e571d165f	Enable cpu f16 dtype tracing for the vicuna model. (#1461 )	2023-05-24 09:37:57 -07:00

1 2 3 4 5 ...

554 Commits