AMD-SHARK-Studio

mirror of https://github.com/nod-ai/AMD-SHARK-Studio.git synced 2026-04-03 03:00:17 -04:00

Author	SHA1	Message	Date
Prashant Kumar	b466b51247	Update opt_params.py	2022-12-19 23:39:04 +05:30
PhaneeshB	a17800da00	Add 64 len f16 untuned mlir	2022-12-19 22:53:17 +05:30
Prashant Kumar	059c1b3a19	Disable vae --use_tuned version.	2022-12-19 22:45:45 +05:30
Stanley Winata	9a36816d27	[SD][CLI] Add a warmup phase (#670 )	2022-12-20 00:14:23 +07:00
Gaurav Shukla	b2b3a0a62b	[SD] Move initial latent generation out of inference time The initial random latent generation is not taken into account for total SD inference time. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-12-19 22:32:05 +05:30
Prashant Kumar	3173b7d1d9	Update VAE model and wrapper.	2022-12-19 19:54:50 +05:30
Gaurav Shukla	9d716d70d6	[SD][web] Fix performance issues on shark scheduler Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-12-19 17:44:37 +05:30
Stanley Winata	e1901a8608	[SD][CL] Disable print at every iteration. (#664 ) Printing might incur extra time to runtime. Hence, we add a flag to hide it. To disable printing please set this flag `--hide_steps`. Co-authored-by: Stanley <stanley@MacStudio.lan>	2022-12-19 15:39:57 +07:00
Quinn Dawkins	59358361f9	[SD] Make clip batch 2 for positive and negative prompts (#662 ) Combines the forward passes for each input prompt type into a single batched clip pass.	2022-12-18 23:46:21 -05:00
Quinn Dawkins	b6d3ff26bd	[SD] Change default VMA large heap block size (#660 )	2022-12-18 21:41:46 -05:00
Quinn Dawkins	2bc6de650d	[SD] Add support for a compiled version of the discrete Euler scheduler (#657 ) * Add Shark version of euler scheduler * Add Shark version of euler scheduler to web ui	2022-12-17 19:25:43 -08:00
powderluv	ffef1681e3	Update stable_diffusion_amd.md	2022-12-17 03:40:08 -08:00
yzhang93	d935006a4a	Update Unet tuned model to v2 (#656 )	2022-12-16 22:10:15 -08:00
powderluv	660cb5946e	Update to 392 release	2022-12-16 16:00:49 -08:00
Anush Elangovan	72976a2ece	Import env vars first	2022-12-16 15:12:28 -08:00
Phaneesh Barwaria	831f206cd0	Revert "Add target triple selection for multiple cards" (#655 ) This reverts commit `acb905f0cc`.	2022-12-16 15:01:45 -08:00
Gaurav Shukla	72648aa9f2	Revert "[SD][WEB] Deduce vulkan-target-triple in the presence of multiple cards" This reverts commit `35e623deaf`.	2022-12-17 04:28:18 +05:30
Gaurav Shukla	35e623deaf	[SD][WEB] Deduce vulkan-target-triple in the presence of multiple cards 1. Get the correct vulkan-target-triple for a specified device in the presence of multiple cards. 2. Use tuned unet model for rdna3 cards. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-12-17 03:04:47 +05:30
Anush Elangovan	6263636738	Fix more lints	2022-12-16 13:26:15 -08:00
yzhang93	c73eed2e51	Add VAE winograd tuned model (#647 )	2022-12-16 13:01:45 -08:00
Anush Elangovan	30fdc99f37	Set to enable llpc Use an env var to enable llpc	2022-12-16 12:57:30 -08:00
PhaneeshB	acb905f0cc	Add target triple selection for multiple cards	2022-12-17 02:24:37 +05:30
Gaurav Shukla	bba06d0142	[SD][WEB] Avoid passing args to utils APIs Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-12-17 01:41:33 +05:30
Phaneesh Barwaria	73457336bc	add flag for toggling vulkan validation layers (#624 ) * add vulkan_validation_layers flag * categorize SD flags * stringify true and false for flag	2022-12-15 20:40:59 -06:00
Stanley Winata	24a16a4cfe	[Stable Diffusion] Disable binding fusion to work with moltenVK on mac. (#639 ) Co-authored-by: Stanley <stanley@MacStudio.lan>	2022-12-16 00:22:49 +07:00
yzhang93	6508e3fcc9	Update tuned model SD v2.1base (#634 )	2022-12-14 16:02:35 -05:00
Prashant Kumar	898bc9e009	Add the stable diffusion v2.1 version.	2022-12-14 20:19:41 +05:30
Gaurav Shukla	e67ea31ee2	[SHARK][SD] Add `--local_tank_cache` flag in the stable diffusion This flag can be used to set local shark_tank cache directory. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-12-14 20:00:25 +05:30
Gaurav Shukla	986c126a5c	[SHARK][SD] Add support for negative prompts Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-12-14 18:20:09 +05:30
Gaurav Shukla	d913453e57	[WEB] Update models to 8dec and also default values (#620 ) 1. Update the models to 8 dec. 2. precision is default to `fp16` in CLI. 3. version is default to `v2.1base` in CLI as well as web. 4. The default scheduler is set to `EulerDiscrete` now. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com> Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-12-13 13:08:33 -08:00
powderluv	08e373aef4	Update stable_diffusion_amd.md	2022-12-13 11:47:29 -08:00
Prashant Kumar	4cb50a3d06	Update the models to 8th Dec version.	2022-12-14 00:01:46 +05:30
Prashant Kumar	8ae76d18b5	Add euler scheduler. Also, make it default for sd2.1.	2022-12-13 00:03:45 +05:30
Prashant Kumar	e5be1790e5	Enable the v2.1 base version with `--version="v2.1base"`. (#611 )	2022-12-12 07:02:01 -08:00
powderluv	2a59c4f670	Update stable_diffusion_amd.md	2022-12-09 16:54:47 -08:00
Ean Garvey	0225292a44	Remove print statements from compile utils (#593 )	2022-12-08 13:40:47 -08:00
Quinn Dawkins	b3a42cd0b1	Don't do nchw-to-nhwc transpose for stable diffusion models (#592 )	2022-12-08 12:19:23 -05:00
Phaneesh Barwaria	1096936a15	Enable f32 path for SD (#567 )	2022-12-06 19:29:12 +05:30
powderluv	2b8d784660	update latest sd build	2022-12-05 22:16:13 -08:00
Daniel Garvey	8859853b41	Revert "Revert "find gsutil on linux (#557 )" (#560 )" (#561 ) This reverts commit `3c46021102`.	2022-12-05 20:27:43 -08:00
Daniel Garvey	3c46021102	Revert "find gsutil on linux (#557 )" (#560 ) This reverts commit `bba8646669`.	2022-12-05 21:53:47 -06:00
Daniel Garvey	bba8646669	find gsutil on linux (#557 ) * find gsutil on linux * cleaned up downloader and ditched gsutil Co-authored-by: dan <dan@nod-labs.com>	2022-12-05 19:03:48 -08:00
Quinn Dawkins	e19a97f316	Don't do a numpy copy on the results from compiled vm (#543 )	2022-12-05 14:21:47 -05:00
Harish Anand	482ffd6275	Move discord link from advanced instructions (#542 )	2022-12-04 06:15:34 -08:00
Quinn Dawkins	5117e50602	Revert "Enable the clip f16 model." until correctness is fixed	2022-12-04 19:17:34 +05:30
Quinn Dawkins	1870cb4557	Add a note to the Stable Diffusion README about clearing vulkan cache (#545 )	2022-12-03 15:12:45 -08:00
Prashant Kumar	42ad5b9c5c	Enable the clip f16 model. -- Enabled the clip f16 model. -- Updated the location of sdv2 model.	2022-12-03 18:50:40 +05:30
yzhang93	333975eb8f	Update Unet fp16 tuned model and Vae flag (#539 )	2022-12-02 23:21:18 -05:00
Gaurav Shukla	aa0195e4ef	[SHARK] Add vae encoder wrapper Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-12-03 08:42:25 +05:30
powderluv	e74046478b	Update stable_diffusion_amd.md	2022-12-02 13:57:03 -08:00

1 2 3 4

179 Commits