Commit Graph

747 Commits

Author SHA1 Message Date
Prashant Kumar
b466b51247 Update opt_params.py 2022-12-19 23:39:04 +05:30
PhaneeshB
a17800da00 Add 64 len f16 untuned mlir 2022-12-19 22:53:17 +05:30
Prashant Kumar
059c1b3a19 Disable vae --use_tuned version. 20221219.398 2022-12-19 22:45:45 +05:30
Stanley Winata
9a36816d27 [SD][CLI] Add a warmup phase (#670) 2022-12-20 00:14:23 +07:00
Gaurav Shukla
7986b9b20b [SD][WEB] Update VAE model and wrapper
This commit updates the VAE model, which significantly improves performance
by roughly 300ms.

Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-19 22:32:05 +05:30
Gaurav Shukla
b2b3a0a62b [SD] Move initial latent generation out of inference time
The initial random latent generation is no longer counted toward
total SD inference time.

Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-19 22:32:05 +05:30
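The change above can be sketched as follows; `generate_latents` is a hypothetical stand-in, not the repository's actual API, and the point is only that the random draw happens before the timer starts:

```python
import time
import numpy as np

def generate_latents(seed, shape=(1, 4, 64, 64)):
    # Hypothetical helper: draw the initial random latent tensor.
    rng = np.random.default_rng(seed)
    return rng.standard_normal(shape)

# Latent generation happens BEFORE the timed region, so the reported
# inference time covers only the denoising loop itself.
latents = generate_latents(seed=42)
start = time.time()
# ... denoising iterations would run here ...
elapsed = time.time() - start
```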
Prashant Kumar
3173b7d1d9 Update VAE model and wrapper. 2022-12-19 19:54:50 +05:30
Gaurav Shukla
9d716d70d6 [SD][web] Fix performance issues on shark scheduler
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
20221219.397
2022-12-19 17:44:37 +05:30
Stanley Winata
e1901a8608 [SD][CL] Disable print at every iteration. (#664)
Printing at every iteration can add to runtime, so we add a flag to hide it. To disable printing, pass `--hide_steps`.

Co-authored-by: Stanley <stanley@MacStudio.lan>
2022-12-19 15:39:57 +07:00
Quinn Dawkins
7d0cbd8d90 [SD][web] Set default tuned unet to v2 (#663) 20221219.396 2022-12-19 11:50:08 +07:00
Quinn Dawkins
59358361f9 [SD] Make clip batch 2 for positive and negative prompts (#662)
Combines the forward passes for each input prompt type into a single batched CLIP pass.
2022-12-18 23:46:21 -05:00
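The batching idea in #662 can be illustrated with a toy encoder; `encode`, the token arrays, and the linear map are all illustrative stand-ins, not the actual CLIP model:

```python
import numpy as np

def encode(token_batch):
    # Stand-in for a CLIP text-encoder forward pass; one call per batch.
    # (A simple linear map so the batching equivalence is easy to check.)
    return token_batch @ np.ones((3, 4))

pos_tokens = np.array([[1.0, 2.0, 3.0]])   # positive prompt
neg_tokens = np.array([[0.5, 0.5, 0.5]])   # negative prompt

# Before: two separate forward passes, one per prompt type.
separate = np.concatenate([encode(pos_tokens), encode(neg_tokens)])

# After: one forward pass over a batch of 2.
batched = encode(np.concatenate([pos_tokens, neg_tokens]))

assert np.allclose(separate, batched)
```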
Quinn Dawkins
7fea2d3b68 [SD] update default large heap size for web as well (#661) 20221219.395 2022-12-18 21:50:26 -05:00
Quinn Dawkins
b6d3ff26bd [SD] Change default VMA large heap block size (#660) 2022-12-18 21:41:46 -05:00
Stella Laurenzo
523e63f5c1 Fix NoneType exception if vulkan tuning flags not detected. (#659)
(This goes on to produce compilation errors, but one step at a time)
2022-12-18 16:40:56 -08:00
Stella Laurenzo
10630ab597 Add config stanza for NVIDIA RTX 2080. (#658)
Just happened to have this card on my Windows machine and verified that the SD demo works on it.

```
Average step time: 144.26142692565918ms/it
Clip Inference Avg time (ms) = (205.001 + 44.000) / 2 = 124.501
VAE Inference time (ms): 281.001

Total image generation time: 7.856997728347778sec
```

I'd love to add an API upstream to derive compiler tuning flags from a host device.
2022-12-18 16:40:47 -08:00
Quinn Dawkins
2bc6de650d [SD] Add support for a compiled version of the discrete Euler scheduler (#657)
* Add Shark version of euler scheduler

* Add Shark version of euler scheduler to web ui
20221218.394
2022-12-17 19:25:43 -08:00
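For reference, a single discrete Euler denoising step of the kind this scheduler compiles can be sketched as below; this is a minimal standalone sketch assuming epsilon prediction, not SHARK's implementation:

```python
import numpy as np

def euler_step(sample, noise_pred, sigma, sigma_next):
    # One discrete Euler update: estimate the denoised sample from the
    # predicted noise, form the derivative, and step to the next sigma.
    pred_original = sample - sigma * noise_pred
    derivative = (sample - pred_original) / sigma
    return sample + derivative * (sigma_next - sigma)

x = np.ones(4)
# With zero predicted noise, the sample passes through unchanged.
assert np.allclose(euler_step(x, np.zeros(4), 1.0, 0.5), x)
```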
powderluv
ffef1681e3 Update stable_diffusion_amd.md 2022-12-17 03:40:08 -08:00
yzhang93
d935006a4a Update Unet tuned model to v2 (#656) 2022-12-16 22:10:15 -08:00
powderluv
660cb5946e Update to 392 release 20221217.393 2022-12-16 16:00:49 -08:00
Gaurav Shukla
10160a066a [SD][WEB] Add vae tuned model in the SD web (#653)
1. Add tuned vae model in the SD web.
2. Use tuned models in case of rdna3 cards.

Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>
20221216.392
2022-12-16 15:29:48 -08:00
Anush Elangovan
72976a2ece Import env vars first 20221216.391 2022-12-16 15:12:28 -08:00
Phaneesh Barwaria
831f206cd0 Revert "Add target triple selection for multiple cards" (#655)
This reverts commit acb905f0cc.
2022-12-16 15:01:45 -08:00
Gaurav Shukla
72648aa9f2 Revert "[SD][WEB] Deduce vulkan-target-triple in the presence of multiple cards"
This reverts commit 35e623deaf.
2022-12-17 04:28:18 +05:30
Gaurav Shukla
35e623deaf [SD][WEB] Deduce vulkan-target-triple in the presence of multiple cards
1. Get the correct vulkan-target-triple for a specified device in the
   presence of multiple cards.
2. Use tuned unet model for rdna3 cards.

Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-17 03:04:47 +05:30
Anush Elangovan
6263636738 Fix more lints 2022-12-16 13:26:15 -08:00
Anush Elangovan
535d012ded Fix lint 2022-12-16 13:24:51 -08:00
yzhang93
c73eed2e51 Add VAE winograd tuned model (#647) 20221216.390 2022-12-16 13:01:45 -08:00
Anush Elangovan
30fdc99f37 Set to enable llpc
Use an env var to enable llpc
2022-12-16 12:57:30 -08:00
PhaneeshB
acb905f0cc Add target triple selection for multiple cards 2022-12-17 02:24:37 +05:30
Gaurav Shukla
bba06d0142 [SD][WEB] Avoid passing args to utils APIs
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-17 01:41:33 +05:30
Ean Garvey
a14a47af12 Move most xfails to entries in tank/all_models.csv and temporarily remove multiprocessing and TF gpu support. (#646)
- Adds the date variable back to nightly.yml so shark_tank uploads are dated again.
- Adds a specification for nightly pytests to not run tests on metal (vulkan is sufficient).
- Adds some paths/filetypes to be ignored when triggering workflow runs (no test-models on changes to .md files or anything in the shark/examples/ directory or its subdirectories).
- pytest only picks up tank/test_models.py, so there is no need to specify which file to run when running pytest from the SHARK base directory.
- Cleans up xfails so that they can be added to models as csv entries. Columns 7-9 in all_models.csv trigger xfails with cpu, cuda, and vulkan, respectively, and column 10 can be populated with a reason for the xfails.
- Fixes a few defaults for shark_args and pytest args (defined in conftest.py).
- Fixes the --update_tank option in shark_downloader.
- Removes some multiprocessing in pytest / TF+CUDA support because it breaks pytest and causes false passes, leaving regressions at large.
- Adds xfails for albert torch and removes it from the gen_sharktank list (tank/torch_model_list.csv).
- Cleans up xfails for cpu, cuda, and vulkan (removing old ones).
20221216.389
2022-12-16 12:56:32 +05:30
Phaneesh Barwaria
73457336bc add flag for toggling vulkan validation layers (#624)
* add vulkan_validation_layers flag

* categorize SD flags

* stringify true and false for flag
20221216.388
2022-12-15 20:40:59 -06:00
Ean Garvey
a14c53ad31 Remove albert-base-v2 since it fails torch_mlir.compile() (#644) 20221215.387 2022-12-15 16:05:19 -06:00
Gaurav Shukla
e7e763551a [WEB][SD] Make unet tuned model default for rdna3 devices (#642) 2022-12-15 12:02:03 -08:00
nirvedhmeshram
2928179331 Add more NVIDIA targets (#640) 2022-12-15 11:24:38 -06:00
Stanley Winata
24a16a4cfe [Stable Diffusion] Disable binding fusion to work with moltenVK on mac. (#639)
Co-authored-by: Stanley <stanley@MacStudio.lan>
2022-12-16 00:22:49 +07:00
Phaneesh Barwaria
6aed4423b2 add vulkan lib path (#638) 2022-12-15 19:48:29 +07:00
yzhang93
6508e3fcc9 Update tuned model SD v2.1base (#634) 20221215.386 2022-12-14 16:02:35 -05:00
Gaurav Shukla
a15cb140ae [WEB] Display the 512x512 image size
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-14 22:43:03 +05:30
Prashant Kumar
898bc9e009 Add the stable diffusion v2.1 version. 2022-12-14 20:19:41 +05:30
Gaurav Shukla
e67ea31ee2 [SHARK][SD] Add --local_tank_cache flag in the stable diffusion
This flag can be used to set the local shark_tank cache directory.

Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-14 20:00:25 +05:30
Gaurav Shukla
986c126a5c [SHARK][SD] Add support for negative prompts
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-14 18:20:09 +05:30
Gaurav Shukla
0eee7616b9 [WEB] Launch only one SD version at a time
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-14 17:30:24 +05:30
powderluv
5ddce749b8 lint fix 2022-12-13 22:02:32 -08:00
powderluv
d946cffabc Revert "Move most xfails to entries in tank/all_models.csv and temporarily remove multiprocessing and TF gpu support. (#602)" (#622)
This reverts commit fe618811ee.
2022-12-13 21:49:46 -08:00
Ean Garvey
fe618811ee Move most xfails to entries in tank/all_models.csv and temporarily remove multiprocessing and TF gpu support. (#602)
* Move most xfails to entries in tank/all_models.csv

* enable usage of pytest without specifying tank/test_models.py

* add dict_configs.py to gitignore.

* Pin versions for runtimes and torch-mlir for setup.
20221214.385
2022-12-13 18:11:17 -08:00
powderluv
09c45bfb80 clean up cache printf 2022-12-13 14:11:14 -08:00
Boian Petkantchin
e9e9ccd379 Add stress test 2022-12-13 13:21:51 -08:00
Boian Petkantchin
a9b27c78a3 Return dynamic model if specified when downloading from the tank 2022-12-13 13:21:51 -08:00
Boian Petkantchin
bc17c29b2e In get_iree_runtime_config get the specific device instead of the default 2022-12-13 13:21:51 -08:00