Gaurav Shukla
dfd6ba67b3
[SD] Update SD CLI to use model_db.json
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-22 02:13:04 +05:30
yzhang93
1595254eab
Modify model annotation tool to walk through ops by shape (#692)
2022-12-21 10:46:30 -08:00
PhaneeshB
6964c5eeba
encapsulate relevant methods in one method
20221221.402
2022-12-21 23:56:17 +05:30
PhaneeshB
2befe771b3
Add support for automatic target triple selection for SD
2022-12-21 22:38:06 +05:30
Prashant Kumar
b133a035a4
Add the download progress bar.
2022-12-21 15:47:33 +05:30
Gaurav Shukla
726c062327
[SD] Update spec files
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-21 14:16:04 +05:30
Gaurav Shukla
9083672de3
[SD][web] Tuned models only for stablediffusion/fp16 and rdna3 cards
...
Currently, tuned models are available only for stablediffusion/fp16 and
rdna3 cards.
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-21 14:15:39 +05:30
Quinn Dawkins
cdbaf880af
[SD] [web] Add model variants to web
2022-12-21 13:42:22 +05:30
Quinn Dawkins
9434981cdc
Add random seed generation for seed = -1 in cli (#689)
2022-12-20 17:15:22 -05:00
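The seed = -1 behavior named in commit 9434981cdc can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the function name `resolve_seed` and the 32-bit range are assumptions.

```python
import random


def resolve_seed(seed: int) -> int:
    """Return `seed` unchanged, unless it is -1, in which case pick a random one.

    Hypothetical helper: the name and the 32-bit range are assumptions.
    """
    if seed == -1:
        # -1 is treated as "no seed given": draw a fresh non-negative value.
        return random.randint(0, 2**32 - 1)
    return seed
```

Returning the resolved seed lets the caller print it, so a randomly seeded run can be reproduced later.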
Phaneesh Barwaria
8b3706f557
Add Anything v3 and AnalogDiffusion variants of SD (#685)
...
* base support for anythingv3
* add analogdiffusion
* Update readme
* keep max len 77 till support for 64 added for variants
* lint fix
2022-12-20 13:08:13 -08:00
Gaurav Shukla
0d5173833d
[SD] Add a json file for model names information. (#687)
...
This commit simplifies the code to identify the model name for a
particular set of flags. This is achieved by introducing a json file
that stores the model names information. The models are uploaded to
gcloud under these names.
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-20 11:47:31 -08:00
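The idea behind commit 0d5173833d, resolving a model name from a set of flags through a JSON table rather than branching code, might look like the sketch below. The log does not show the real file's schema, so the key layout, the example names, and the function `model_name` are all hypothetical.

```python
import json

# Hypothetical contents: the actual JSON schema is not shown in the log.
EXAMPLE_DB = json.loads("""
{
  "unet/fp16/length_64": "example_unet_fp16_64",
  "vae/fp16/length_64": "example_vae_fp16_64"
}
""")


def model_name(db: dict, model: str, precision: str, max_length: int) -> str:
    """Look up the uploaded model name for one combination of flags."""
    key = f"{model}/{precision}/length_{max_length}"
    try:
        return db[key]
    except KeyError:
        # Fail loudly for unsupported flag combinations.
        raise ValueError(f"no model registered for {key!r}") from None
```

With a table like this, supporting a new variant means adding an entry to the JSON file rather than another conditional.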
powderluv
bf1178eb79
roll to build 400
2022-12-20 10:34:31 -08:00
yzhang93
abcd3fa94a
[SD] Set model max length 64 as default (#681)
20221220.400
2022-12-19 21:13:04 -08:00
Quinn Dawkins
62aa1614b6
[SD] Add --use_base_vae flag to do conversion to pixel space on cpu (#682)
2022-12-19 21:09:39 -08:00
Quinn Dawkins
7027356126
[SD] Fix warmup for max length 64 (#680)
2022-12-19 21:04:44 -05:00
yzhang93
5ebe13a13d
Add Unet len 64 tuned model (#679)
2022-12-19 16:24:08 -08:00
Gaurav Shukla
c3bed9a2b7
[SD][web] Add flag to disable the progress bar animation
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-20 02:50:04 +05:30
yzhang93
f865222882
Update VAE 19dec tuned model (#676)
2022-12-19 12:42:28 -08:00
powderluv
e2fe2e4095
Point to 398
2022-12-19 12:08:30 -08:00
powderluv
0532a95f08
Update stable_diffusion_amd.md
2022-12-19 12:04:42 -08:00
Quinn Dawkins
ff536f6015
[SD] Deduplicate initial noise generation (#677)
2022-12-19 14:38:41 -05:00
Gaurav Shukla
097d0f27bb
[SD][web] Add 64 max_length support in SD web
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-20 00:00:58 +05:30
Prashant Kumar
2257f87edf
Update opt_params.py
2022-12-19 23:43:30 +05:30
PhaneeshB
a17800da00
Add 64 len f16 untuned mlir
2022-12-19 22:53:17 +05:30
Prashant Kumar
059c1b3a19
Disable vae --use_tuned version.
20221219.398
2022-12-19 22:45:45 +05:30
Stanley Winata
9a36816d27
[SD][CLI] Add a warmup phase (#670)
2022-12-20 00:14:23 +07:00
Gaurav Shukla
7986b9b20b
[SD][WEB] Update VAE model and wrapper
...
This commit updates the VAE model, which improves performance
by roughly 300ms.
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-19 22:32:05 +05:30
Gaurav Shukla
b2b3a0a62b
[SD] Move initial latent generation out of inference time
...
The initial random latent generation is excluded from the
total SD inference time.
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-19 22:32:05 +05:30
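The measurement change described in commit b2b3a0a62b amounts to creating the latents before the timer starts. A minimal sketch, assuming stand-in functions (`generate_latents`, `run_inference`, and `timed_generation` are hypothetical names, not the project's API):

```python
import random
import time


def generate_latents(n: int, seed: int) -> list:
    """Stand-in for the initial random latent generation."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


def run_inference(latents: list) -> float:
    """Stand-in for the denoising loop; just reduces the latents."""
    return sum(latents)


def timed_generation(n: int, seed: int):
    # Latents are created before the timer starts, so their cost
    # is not included in the reported inference time.
    latents = generate_latents(n, seed)
    start = time.perf_counter()
    result = run_inference(latents)
    elapsed = time.perf_counter() - start
    return result, elapsed
```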
Prashant Kumar
3173b7d1d9
Update VAE model and wrapper.
2022-12-19 19:54:50 +05:30
Gaurav Shukla
9d716d70d6
[SD][web] Fix performance issues on shark scheduler
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
20221219.397
2022-12-19 17:44:37 +05:30
Stanley Winata
e1901a8608
[SD][CL] Disable print at every iteration. (#664)
...
Printing at every iteration may add to the runtime, so this commit adds a flag to hide it. To disable printing, pass `--hide_steps`.
Co-authored-by: Stanley <stanley@MacStudio.lan>
2022-12-19 15:39:57 +07:00
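A flag like `--hide_steps` from commit e1901a8608 could be wired up as in the sketch below. Only the flag name comes from the log. Using `argparse` with `store_true`, and the helper names `build_parser` and `run_steps`, are assumptions for illustration.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser()
    # Flag name taken from the commit message; the definition is assumed.
    parser.add_argument(
        "--hide_steps",
        action="store_true",
        help="do not print progress at every iteration",
    )
    return parser


def run_steps(num_steps: int, hide_steps: bool, log=print) -> None:
    for step in range(num_steps):
        # ... one denoising step would run here ...
        if not hide_steps:
            log(f"step {step}")
```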
Quinn Dawkins
7d0cbd8d90
[SD][web] Set default tuned unet to v2 (#663)
20221219.396
2022-12-19 11:50:08 +07:00
Quinn Dawkins
59358361f9
[SD] Make clip batch 2 for positive and negative prompts (#662)
...
Combines the forward passes for each input prompt type into a single batched clip pass.
2022-12-18 23:46:21 -05:00
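The batching described in commit 59358361f9 replaces two encoder calls with one call on a batch of two. The sketch below shows only the call pattern. `CountingEncoder` is a toy stand-in, not the CLIP model, and `encode_prompts` is a hypothetical name.

```python
class CountingEncoder:
    """Toy stand-in for a text encoder that records how often it is called."""

    def __init__(self):
        self.calls = 0

    def __call__(self, batch):
        self.calls += 1
        # One fake "embedding" per prompt in the batch.
        return [[float(len(prompt))] for prompt in batch]


def encode_prompts(encoder, prompt: str, negative_prompt: str):
    # A single forward pass over a batch of 2, instead of one pass per prompt.
    embeddings = encoder([prompt, negative_prompt])
    return embeddings[0], embeddings[1]
```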
Quinn Dawkins
7fea2d3b68
[SD] update default large heap size for web as well (#661)
20221219.395
2022-12-18 21:50:26 -05:00
Quinn Dawkins
b6d3ff26bd
[SD] Change default VMA large heap block size (#660)
2022-12-18 21:41:46 -05:00
Stella Laurenzo
523e63f5c1
Fix NoneType exception if vulkan tuning flags not detected. (#659)
...
(This goes on to produce compilation errors, but one step at a time)
2022-12-18 16:40:56 -08:00
Stella Laurenzo
10630ab597
Add config stanza for NVIDIA RTX 2080. (#658)
...
Just happened to have this card on my Windows machine and verified that the SD demo works on it.
```
Average step time: 144.26142692565918ms/it
Clip Inference Avg time (ms) = (205.001 + 44.000) / 2 = 124.501
VAE Inference time (ms): 281.001
Total image generation time: 7.856997728347778sec
```
I'd love to add an API upstream to derive compiler tuning flags from a host device.
2022-12-18 16:40:47 -08:00
Quinn Dawkins
2bc6de650d
[SD] Add support for a compiled version of the discrete Euler scheduler (#657)
...
* Add Shark version of euler scheduler
* Add Shark version of euler scheduler to web ui
20221218.394
2022-12-17 19:25:43 -08:00
powderluv
ffef1681e3
Update stable_diffusion_amd.md
2022-12-17 03:40:08 -08:00
yzhang93
d935006a4a
Update Unet tuned model to v2 (#656)
2022-12-16 22:10:15 -08:00
powderluv
660cb5946e
Update to 392 release
20221217.393
2022-12-16 16:00:49 -08:00
Gaurav Shukla
10160a066a
[SD][WEB] Add vae tuned model in the SD web (#653)
...
1. Add tuned vae model in the SD web.
2. Use tuned models in case of rdna3 cards.
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>
20221216.392
2022-12-16 15:29:48 -08:00
Anush Elangovan
72976a2ece
Import env vars first
20221216.391
2022-12-16 15:12:28 -08:00
Phaneesh Barwaria
831f206cd0
Revert "Add target triple selection for multiple cards" (#655)
...
This reverts commit acb905f0cc.
2022-12-16 15:01:45 -08:00
Gaurav Shukla
72648aa9f2
Revert "[SD][WEB] Deduce vulkan-target-triple in the presence of multiple cards"
...
This reverts commit 35e623deaf.
2022-12-17 04:28:18 +05:30
Gaurav Shukla
35e623deaf
[SD][WEB] Deduce vulkan-target-triple in the presence of multiple cards
...
1. Get the correct vulkan-target-triple for a specified device in the
presence of multiple cards.
2. Use tuned unet model for rdna3 cards.
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2022-12-17 03:04:47 +05:30
Anush Elangovan
6263636738
Fix more lints
2022-12-16 13:26:15 -08:00
Anush Elangovan
535d012ded
Fix lint
2022-12-16 13:24:51 -08:00
yzhang93
c73eed2e51
Add VAE winograd tuned model (#647)
20221216.390
2022-12-16 13:01:45 -08:00
Anush Elangovan
30fdc99f37
Set to enable llpc
...
Use an env var to enable llpc
2022-12-16 12:57:30 -08:00