Merge branch 'development' into development

2026-04-23 03:00:31 -04:00 · 2022-09-21 03:10:49 -04:00
parent 408e3774e0 6987c77e2a
commit 60f0090786
29 changed files with 1038 additions and 484 deletions
--- a/docs/assets/outpainting/elven_princess.outpainted.png
+++ b/docs/assets/outpainting/elven_princess.outpainted.png
--- a/docs/assets/outpainting/elven_princess.png
+++ b/docs/assets/outpainting/elven_princess.png
--- a/docs/features/EMBIGGEN.md
+++ b/docs/features/EMBIGGEN.md
@@ -121,19 +121,34 @@ tiles:
 dream> a photo of puffy clouds over a forest at sunset -s 100 -W 512 -H 512 -I outputs/000002.seed.png -f 0.5 -embiggen_tiles 1 2 3
 ```

-!!! note
+## Fixing Previously-Generated Images

-    Because the same prompt is used on all the tiled images, and the model
-    doesn't have the context of anything outside the tile being run - it
-    can end up creating repeated pattern (also called 'motifs') across all
-    the tiles based on that prompt. The best way to combat this is
-    lowering the `--strength` (`-f`) to stay more true to the init image,
-    and increasing the number of steps so there is more compute-time to
-    create the detail.  Anecdotally `--strength` 0.35-0.45 works pretty
-    well on most things. It may also work great in some examples even with
-    the `--strength` set high for patterns, landscapes, or subjects that
-    are more abstract. Because this is (relatively) fast, you can also
-    always create a few Embiggen'ed images and manually composite them to
-    preserve the best parts from each.
+It is easy to apply embiggen to any previously-generated file without having to
+look up the original prompt and provide an initial image. Just use the
+syntax `!fix path/to/file.png <embiggen>`. For example, you can rewrite the
+previous command to look like this:
+
+~~~~
+dream> !fix ./outputs/000002.seed.png -embiggen_tiles 1 2 3
+~~~~
+
+A new file named `000002.seed.fixed.png` will be created in the output directory. Note that
+the `!fix` command does not replace the original file, unlike the behavior at generate time.
+You do not need to provide the prompt, and `!fix` automatically selects a good strength for
+embiggen-ing.
+
+
+**Note**
+Because the same prompt is used on all the tiled images, and the model
+doesn't have the context of anything outside the tile being run - it
+can end up creating repeated pattern (also called 'motifs') across all
+the tiles based on that prompt. The best way to combat this is
+lowering the `--strength` (`-f`) to stay more true to the init image,
+and increasing the number of steps so there is more compute-time to
+create the detail.  Anecdotally `--strength` 0.35-0.45 works pretty
+well on most things. It may also work great in some examples even with
+the `--strength` set high for patterns, landscapes, or subjects that
+are more abstract. Because this is (relatively) fast, you can also
+preserve the best parts from each.

 Author: [Travco](https://github.com/travco)
--- a/docs/features/INPAINTING.md
+++ b/docs/features/INPAINTING.md
@@ -49,40 +49,28 @@ We are hoping to get rid of the need for this workaround in an upcoming release.

 1. Open image in Photoshop

-    <figure markdown>
-    ![step1](../assets/step1.png)
-    </figure>
+![step1](../assets/step1.png)

 2. Use any of the selection tools (Marquee, Lasso, or Wand) to select the area you desire to inpaint.

-    <figure markdown>
-    ![step2](../assets/step2.png)
-    </figure>
+![step2](../assets/step2.png)

 3. Because we'll be applying a mask over the area we want to preserve, you should now select the inverse by using the ++shift+ctrl+i++ shortcut, or right clicking and using the "Select Inverse" option.

 4. You'll now create a mask by selecting the image layer, and Masking the selection. Make sure that you don't delete any of the undrlying image, or your inpainting results will be dramatically impacted.

-    <figure markdown>
-    ![step4](../assets/step4.png)
-    </figure>
+![step4](../assets/step4.png)

 5. Make sure to hide any background layers that are present. You should see the mask applied to your image layer, and the image on your canvas should display the checkered background.

-    <figure markdown>
-    ![step5](../assets/step5.png)
-    </figure>
+![step5](../assets/step5.png)

 6. Save the image as a transparent PNG by using the "Save a Copy" option in the File menu, or using the Alt + Ctrl + S keyboard shortcut

-    <figure markdown>
-    ![step6](../assets/step6.png)
-    </figure>
+![step6](../assets/step6.png)

 7. After following the inpainting instructions above (either through the CLI or the Web UI), marvel at your newfound ability to selectively dream. Lookin' good!

-    <figure markdown>
-    ![step7](../assets/step7.png)
-    </figure>
+![step7](../assets/step7.png)

 8. In the export dialogue, Make sure the "Save colour values from transparent pixels" checkbox is selected.  
--- a/docs/features/OUTPAINTING.md
+++ b/docs/features/OUTPAINTING.md
@@ -0,0 +1,78 @@
+---
+title: Outpainting
+---
+
+# :octicons-paintbrush-16: Outpainting
+
+## Continous outpainting
+
+This extension uses the inpainting code to extend an existing image to
+any direction of "top", "right", "bottom" or "left". To use it you
+need to provide an initial image with -I and an extension direction
+with -D (direction). When extending using outpainting a higher img2img
+strength value of 0.83 is the default.
+
+The code is not foolproof. Sometimes it will do a good job extending
+the image, and other times it will generate ghost images and other
+artifacts. In addition, the code works best on images that were
+generated by dream.py, because it will be able to recover the original
+prompt that generated the file and "understand" what you are trying to
+achieve.
+
+### Basic Usage
+
+To illustrate, consider this image generated with the prompt "fantasy
+portrait of eleven princess." It's nice, but rather annoying that the
+top of the head has been cropped off.
+
+![elven_princess](../assets/outpainting/elven_princess.png)
+
+We can fix that using the `!fix` command!
+
+~~~~
+dream> !fix my_images/elven_princess.png -D top 50
+~~~~
+
+This is telling dream.py to open up a rectangle 50 pixels high at the
+top of the image and outpaint into it. The result is:
+
+![elven_princess.fixed](../assets/outpainting/elven_princess.outpainted.png)
+
+Viola! You can similarly specify `bottom`, `left` or `right` to
+outpaint into these margins.
+
+There are some limitations to be aware of:
+
+1. You cannot change the size of the image rectangle. In the example,
+   notice that the whole image is shifted downwards by 50 pixels, rather
+   than the top being extended upwards.
+
+2. Attempting to outpaint larger areas will frequently give rise to ugly
+   ghosting effects.
+
+3. For best results, try increasing the step number.
+
+4. If you don't specify a pixel value in -D, it will default to half
+   of the whole image, which is likely not what you want.
+
+You can do more with `!fix` including upscaling and facial
+reconstruction of previously-generated images. See
+[./UPSCALE.md#fixing-previously-generated-images] for the details.
+
+### Advanced Usage
+
+For more control over the outpaintihg process, you can provide the
+`-D` option at image generation time. This allows you to apply all the
+controls, including the ability to resize the image and apply face-fixing
+and upscaling. For example:
+
+~~~~
+dream> man with cat on shoulder -I./images/man.png -D bottom 100 -W960 -H960 -fit
+~~~~
+
+Or even shorter, since the prompt is read from the metadata of the old image:
+
+~~~~
+dream> -I./images/man.png -D bottom 100 -W960 -H960 -fit -U2 -G1
+~~~~
+
--- a/docs/features/PROMPTS.md
+++ b/docs/features/PROMPTS.md
@@ -62,33 +62,25 @@ original prompt:

 `#!bash "A fantastical translucent poney made of water and foam, ethereal, radiant, hyperalism, scottish folklore, digital painting, artstation, concept art, smooth, 8 k frostbite 3 engine, ultra detailed, art by artgerm and greg rutkowski and magali villeneuve" -s 20 -W 512 -H 768 -C 7.5 -A k_euler_a -S 1654590180`

-<figure markdown>
 ![step1](../assets/negative_prompt_walkthru/step1.png)
-</figure>

 That image has a woman, so if we want the horse without a rider, we can influence the image not to have a woman by putting [woman] in the prompt, like this:

 `#!bash "A fantastical translucent poney made of water and foam, ethereal, radiant, hyperalism, scottish folklore, digital painting, artstation, concept art, smooth, 8 k frostbite 3 engine, ultra detailed, art by artgerm and greg rutkowski and magali villeneuve [woman]" -s 20 -W 512 -H 768 -C 7.5 -A k_euler_a -S 1654590180`

-<figure markdown>
 ![step2](../assets/negative_prompt_walkthru/step2.png)
-</figure>

 That's nice - but say we also don't want the image to be quite so blue. We can add "blue" to the list of negative prompts, so it's now [woman blue]:

 `#!bash "A fantastical translucent poney made of water and foam, ethereal, radiant, hyperalism, scottish folklore, digital painting, artstation, concept art, smooth, 8 k frostbite 3 engine, ultra detailed, art by artgerm and greg rutkowski and magali villeneuve [woman blue]" -s 20 -W 512 -H 768 -C 7.5 -A k_euler_a -S 1654590180`

-<figure markdown>
 ![step3](../assets/negative_prompt_walkthru/step3.png)
-</figure>

 Getting close - but there's no sense in having a saddle when our horse doesn't have a rider, so we'll add one more negative prompt: [woman blue saddle].

 `#!bash "A fantastical translucent poney made of water and foam, ethereal, radiant, hyperalism, scottish folklore, digital painting, artstation, concept art, smooth, 8 k frostbite 3 engine, ultra detailed, art by artgerm and greg rutkowski and magali villeneuve [woman blue saddle]" -s 20 -W 512 -H 768 -C 7.5 -A k_euler_a -S 1654590180`

-<figure markdown>
 ![step4](../assets/negative_prompt_walkthru/step4.png)
-</figure>

 !!! notes "Notes about this feature:"

--- a/docs/features/UPSCALE.md
+++ b/docs/features/UPSCALE.md
@@ -2,12 +2,16 @@
 title: Upscale
 ---

-# :material-image-size-select-large: Upscale 
+## Intro

-## **GFPGAN and Real-ESRGAN Support**
+The script provides the ability to restore faces and upscale. You can apply these operations
+at the time you generate the images, or at any time to a previously-generated PNG file, using 
+the [!fix](#fixing-previously-generated-images) command.

-The script also provides the ability to do face restoration and upscaling with the help of GFPGAN
-and Real-ESRGAN respectively.
+## Face Fixing
+
+The default face restoration module is GFPGAN. The default upscale is Real-ESRGAN. For an alternative
+face restoration module, see [CodeFormer Support] below.

 As of version 1.14, environment.yaml will install the Real-ESRGAN package into the standard install
 location for python packages, and will put GFPGAN into a subdirectory of "src" in the
@@ -40,11 +44,11 @@ other GFPGAN related boot arguments if you wish to customize further._
    may run `python3 scripts/preload_models.py` after you have installed GFPGAN and all its
    dependencies.

-## **Usage**
+## Usage

 You will now have access to two new prompt arguments.

-### **Upscaling**
+### Upscaling

 `-U : <upscaling_factor> <upscaling_strength>`

@@ -58,7 +62,7 @@ retain some of those for natural looking results, we recommend using values betw

 If you do not explicitly specify an upscaling_strength, it will default to 0.75.

-### **Face Restoration**
+### Face Restoration

 `-G : <gfpgan_strength>`

@@ -75,7 +79,7 @@ When you use either `-U` or `-G`, the final result you get is upscaled or face m
 to save the original Stable Diffusion generation, you can use the `-save_orig` prompt argument to
 save the original unaffected version too.

-### **Example Usage**
+### Example Usage

 ```bash
 dream> superman dancing with a panda bear -U 2 0.6 -G 0.4
@@ -115,13 +119,13 @@ saving it to `ldm/restoration/codeformer/weights` folder.
 You can use `-ft` prompt argument to swap between CodeFormer and the default GFPGAN. The above
 mentioned `-G` prompt argument will allow you to control the strength of the restoration effect.

-### **Usage:**
+### Usage:

 The following command will perform face restoration with CodeFormer instead of the default gfpgan.

 `<prompt> -G 0.8 -ft codeformer`

-**Other Options:**
+### Other Options:

 - `-cf` - cf or CodeFormer Fidelity takes values between `0` and `1`. 0 produces high quality
  results but low accuracy and 1 produces lower quality results but higher accuacy to your original
@@ -137,3 +141,21 @@ that is the best restoration possible. This may deviate slightly from the origin
 excellent option to use in situations when there is very little facial data to work with.

 `<prompt> -G 1.0 -ft codeformer -cf 0.1`
+
+## Fixing Previously-Generated Images
+
+It is easy to apply face restoration and/or upscaling to any previously-generated file. Just use the
+syntax `!fix path/to/file.png <options>`. For example, to apply GFPGAN at strength 0.8 and upscale 2X
+for a file named `./outputs/img-samples/000044.2945021133.png`, just run:
+
+~~~~
+dream> !fix ./outputs/img-samples/000044.2945021133.png -G 0.8 -U 2
+~~~~
+
+A new file named `000044.2945021133.fixed.png` will be created in the output directory. Note that
+the `!fix` command does not replace the original file, unlike the behavior at generate time.
+
+### Disabling:
+
+If, for some reason, you do not wish to load the GFPGAN and/or ESRGAN libraries, you can disable them 
+on the dream.py command line with the `--no_restore` and `--no_upscale` options, respectively.
--- a/docs/features/VARIATIONS.md
+++ b/docs/features/VARIATIONS.md
@@ -45,10 +45,7 @@ Outputs:
 ./outputs/Xena/000001.3357757885.png: "prompt" -s50 -W512 -H512 -C7.5 -Ak_lms -S3357757885
 ```

-<figure markdown>
-  ![var1](../assets/variation_walkthru/000001.3357757885.png)
-  <figcaption>Seed 3357757885 looks nice</figcaption>
-</figure>
+![var1](../assets/variation_walkthru/000001.3357757885.png)

 ---

@@ -80,15 +77,9 @@ used to generate it.
 This gives us a series of closely-related variations, including the two shown
 here.

-<figure markdown>
-  ![var2](../assets/variation_walkthru/000002.3647897225.png)
-  <figcaption>subseed 3647897225</figcaption>
-</figure>
+![var2](../assets/variation_walkthru/000002.3647897225.png)

-<figure markdown>
-  ![var3](../assets/variation_walkthru/000002.1614299449.png)
-  <figcaption>subseed 1614299449</figcaption>
-</figure>
+![var3](../assets/variation_walkthru/000002.1614299449.png)

 I like the expression on Xena's face in the first one (subseed 3647897225), and
 the armor on her shoulder in the second one (subseed 1614299449). Can we combine
@@ -106,9 +97,7 @@ Outputs:
 Here we are providing equal weights (0.1 and 0.1) for both the subseeds. The
 resulting image is close, but not exactly what I wanted:

-<figure markdown>
-  ![var4](../assets/variation_walkthru/000003.1614299449.png)
-</figure>
+![var4](../assets/variation_walkthru/000003.1614299449.png)

 We could either try combining the images with different weights, or we can
 generate more variations around the almost-but-not-quite image. We do the
@@ -129,10 +118,7 @@ Outputs:
 This produces six images, all slight variations on the combination of the chosen
 two images. Here's the one I like best:

-<figure markdown>
-  ![var5](../assets/variation_walkthru/000004.3747154981.png)
-  <figcaption>000004.3747154981.png</figcaption>
-</figure>
+![var5](../assets/variation_walkthru/000004.3747154981.png)

 As you can see, this is a very powerful tool, which when combined with subprompt
 weighting, gives you great control over the content and quality of your
--- a/docs/other/CONTRIBUTORS.md
+++ b/docs/other/CONTRIBUTORS.md
@@ -56,6 +56,7 @@ We thank them for all of their time and hard work.
 - [Matthias Wild](https://github.com/mauwii)
 - [Kyle Schouviller](https://github.com/kyle0654)
 - [rabidcopy](https://github.com/rabidcopy)
+- [Dominic Letz](https://github.com/dominicletz)

 ## **Original CompVis Authors:**