AMD-SHARK-Studio

mirror of https://github.com/nod-ai/AMD-SHARK-Studio.git synced 2026-04-03 03:00:17 -04:00

Author	SHA1	Message	Date
AyaanShah2204	d61b6641fb	Rest API: Resolved Generator Object not Subscriptable error (#1556 ) 20230620.779	2023-06-20 19:27:41 -07:00
Phaneesh Barwaria	88cc2423cc	Enable Vicuna fp16 cpu (#1562 ) * fix second vic mlir gen * fp16 mlir/vmfb download from shark_tank	2023-06-20 13:43:21 -05:00
Ean Garvey	ccf944c1bd	Enable tuner for upscaler unet. (#1563 )	2023-06-20 13:40:13 -05:00
Ean Garvey	0def74f520	[SD] Update unet in_channels API and add PIL metadata to spec. (#1560 ) * Fix deprecation warning for unet config. * Include PIL metadata instead of hidden imports in SD spec.	2023-06-20 10:26:36 -07:00
Abhishek Varma	3fb72e192e	Add patch for making compile API work for both MEGABYTE and MiniGPT4 (#1559 ) -- It also modifies the mega_test.py script Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2023-06-20 10:04:17 -07:00
Vivek Khandelwal	855435ee24	Fix for the user input for Falcon pipeline	2023-06-20 18:09:32 +05:30
Elias Joseph	6f9f868fc0	fixed a bug where designating device for vicuna didn't work	2023-06-20 17:09:32 +05:30
powderluv	fb865f1b99	Move to checkout@v3 This will break Windows again but we have to fix it up since the old node.js is now deprecated. 20230619.777	2023-06-19 18:44:36 -07:00
rprasad2	3e5c50f07b	changes for tuning (#1542 ) * Add tuning sizes for rdna3	2023-06-19 15:29:08 -05:00
powderluv	a544f30a8f	Move mega to the shark examples (#1555 )	2023-06-19 11:10:51 -07:00
Abhishek Varma	1fe56d460a	[MEGABYTE] Add script to compile MEGABYTE through SHARK (#1553 ) -- Usage: `python mega_test.py`. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2023-06-19 11:00:35 -07:00
Vivek Khandelwal	fafd713141	Minor change to falcon pipeline	2023-06-19 22:36:32 +05:30
Vivek Khandelwal	015d0132c3	Modify falcon pipeline to add fp16 support (#1551 )	2023-06-19 09:57:13 -07:00
powderluv	20ddd96ef7	unpin diffusers (#1550 )	2023-06-18 13:45:55 -07:00
powderluv	ee33cfd2d1	Add PIL in main index.py (#1549 ) * Add PIL in main index.py This is to ensure pyinstaller picks it up * Update index.py 20230618.775	2023-06-18 11:51:44 -07:00
Stefan Kapusniak	a3cba21d5b	Fix load of unet512 vmfb fail on get of iree opts (#1546 ) * Change retrieval of Iree options used when loading an existing unet512 vmfb to look up the "unet" options rather than attempt to find a non-existent set of options for "unet512" Co-authored-by: powderluv <powderluv@users.noreply.github.com> 20230618.774	2023-06-18 06:42:20 -07:00
Stefan Kapusniak	a7b6ec4095	Fix unet512 always being used when --max_length=77 (#1547 ) * Switches a few places in the SD pipeline where an assumption of max_length=64 was being made, to using the actual max_length as passed into the pipeline. This prevents unet512 always being used and producing different images than previously when --max_length=77	2023-06-18 06:41:25 -07:00
Ean Garvey	d80b087d95	Add PIL hidden imports to sd spec. (#1544 ) Co-authored-by: powderluv <powderluv@users.noreply.github.com>	2023-06-18 06:39:08 -07:00
Stefan Kapusniak	297a209608	Remove workarounds for gradio tempfile bugs (#1548 ) 20230617.773	2023-06-17 19:50:36 -07:00
gpetters94	b204113563	Add UNet512 (#1504 ) Co-authored-by: Ean Garvey <87458719+monorimet@users.noreply.github.com>	2023-06-17 03:46:25 -04:00
Chi_Liu	f60ab1f4fa	Add Deberta to stablehlo in shark tank (#1545 ) 20230616.772	2023-06-16 13:24:44 -07:00
Surya Jasper	b203779462	Added Adreno target triples to vulkan_utils (#1543 ) 20230615.770	2023-06-15 16:42:59 -07:00
Stefan Kapusniak	38570a9bbb	Some Fixes for update to gradio 3.34.0 (#1538 ) * Fixes randomize seed buttons that stopped working. * Update now deprecated method to set initial columns for output gallery to the newer undeprecated one.	2023-06-15 01:10:36 -07:00
dependabot[bot]	a5c882f296	Bump gradio from 3.15.0 to 3.34.0 (#1518 ) Bumps [gradio](https://github.com/gradio-app/gradio) from 3.15.0 to 3.34.0. - [Release notes](https://github.com/gradio-app/gradio/releases) - [Changelog](https://github.com/gradio-app/gradio/blob/main/CHANGELOG.md) - [Commits](https://github.com/gradio-app/gradio/compare/v3.15.0...v3.34.0) --- updated-dependencies: - dependency-name: gradio dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> 20230614.769	2023-06-14 18:13:48 -07:00
Ean Garvey	eb6d11cfed	Change mlir dialects for tf tests to stablehlo. (#1535 ) * Change mlir dialects for tf tests to stablehlo * Update shark_runner.py	2023-06-14 10:43:49 -07:00
Vivek Khandelwal	46184a81ac	Add Falcon pipeline (#1534 ) 20230614.768	2023-06-14 09:39:16 -07:00
PhaneeshB	149165a2f0	add multi-device multi-precision vmfb names	2023-06-14 22:08:24 +05:30
dan	bec82a665f	mega vicuna merge single endpoint in apps/language/models/scripts/vicuna.py removed main functions from pipelines replaced divergent utils compile with shark_importer adds support for different precisions	2023-06-14 19:06:29 +05:30
Ean Garvey	9551490341	Remove deprecated --iree-mhlo-demote-i64-to-i32 flag usage. (#1533 ) 20230613.767	2023-06-13 22:40:47 -05:00
Ean Garvey	49b3ecdbca	(pytest) don't run redundant tests in cpu suite (#1532 )	2023-06-13 22:40:33 -05:00
Ean Garvey	f53e3594c3	OPT Refactor (#1516 ) * Change script to 1.3b model and add pytorch comparison * fix CLI command * Match OPT transformers model updates + numerics against latest version * Cleanup OPT sentence completion script. * Fix formatting and add standalone validation scripts. * Add minimal OPT wrapper and example with import_with_fx * Rename OPT full model wrapper. * Cleanup test scripts for OPT.	2023-06-13 22:40:07 -05:00
Ean Garvey	5562d1dfda	Fix xfails for cpu pytest cases (#1527 ) Adding cpu-sync and cpu-task device configs was allowing respective tests to bypass the xfail conditional for cpu pytests marked in tank/all_models.csv. This commit updates the conditional to xfail those cases for cpu-sync and cpu-task as well.	2023-06-13 17:01:51 -07:00
Stefan Kapusniak	c7b0c2961e	UI/Web Improve output gallery temp file handling (#1531 ) * On startup report that cleaning up of temp files is taking place, in case it takes a long time. * Have the output gallery tab delete any zero length temporary files generated by gradio < 3.32.0 for its gallery control whenever it needs to update that control with images. This prevents such files multiplying out of control.	2023-06-13 16:25:37 -05:00
Ean Garvey	44273b0791	Fix conditional in transform_fx() (#1530 )	2023-06-13 16:24:53 -05:00
Prashant Kumar	0a4c8fcb3e	Minor changes in the fx transforms. 20230613.766	2023-06-13 21:23:35 +05:30
Stefan Kapusniak	2fec3c8169	re-indents add_upcast in shark importer (#1523 ) * The two with blocks in add_upcast appear to be underindented making SD 1.4 break on rdna3, I've pushed them out one more tab, and then everything appears to work again.	2023-06-12 14:41:10 -05:00
Gaurav Shukla	5e7d5930dd	[vicuna] Add device and precision propagation in vicuna (#1520 ) Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-06-12 12:14:43 -05:00
Prashant Kumar	b6dbd20250	Modify the fx transforms. (#1521 ) - The bounds are set properly. - The upcasting and downcasting is done for vicuna.	2023-06-12 09:40:14 -07:00
Nithin Meganathan	34f1295349	Add a model config generator (#1511 ) Model config generator takes a PyTorch model as input and generates a JSON file with model layers and other properties that define sharding on a particular hardware.	2023-06-09 15:32:00 -07:00
Phaneesh Barwaria	1980d7b2c3	Cpu device map (#1515 ) * update cpu iree device * fix vmfb paths vic unsharded	2023-06-09 11:27:02 -05:00
powderluv	2cfacc5051	fix osx torch_mlir (#1513 ) * fix osx torch_mlir * Update index.py * Update index.py	2023-06-09 00:57:26 -07:00
Phaneesh Barwaria	436f58ddc4	cli using generate and mem fixes (#1509 ) 20230608.761	2023-06-08 13:13:32 -05:00
Phaneesh Barwaria	6b29bd17c8	Enable compilation vicuna (#1507 ) * add cli for unsharded vic * enable mlir download and compile 20230607.760	2023-06-07 13:08:22 -07:00
Ean Garvey	2c3485ca3e	Add standalone OPT sentence completion script. (#1506 )	2023-06-07 10:58:03 -07:00
Daniel Garvey	f206ecc635	reenable compilation in vicuna pipeline, add flags (#1505 ) * replace vicuna.py backend with pipeline * add some memory management to first vicuna compile reenable compilation	2023-06-07 09:49:27 -07:00
Stefan Kapusniak	a187e05ae6	Prevent having no cuda devices breaking the UI (#1503 ) Don't break the UI when the LLM tab only wants cuda devices but there aren't any. 20230606.759	2023-06-06 11:41:16 -07:00
Gaurav Shukla	8c21960486	[vicuna] Set only cuda devices in vicuna UI for now Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-06-06 22:15:20 +05:30
Gaurav Shukla	be62fce676	[vicuna] Fix vicuna chatbot (#1499 ) Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-06-06 09:23:32 -07:00
PhaneeshB	f23b778a6c	remove old vicuna scripts	2023-06-06 21:35:58 +05:30
PhaneeshB	436edf900d	add vic sharded pipeline	2023-06-06 21:35:58 +05:30

1357 Commits