InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI.git synced 2026-02-02 17:05:18 -05:00

Author	SHA1	Message	Date
Brandon Rising	69f080fb75	Move flux step callback code into the step_callback util scripts, use other services within the invocation context	2024-09-03 14:04:16 -04:00
Brandon Rising	04272a7cc8	Initial attempt at preview images	2024-09-03 14:04:16 -04:00
Lincoln Stein	8d35af946e	[MM] add API routes for getting & setting MM cache sizes (#6523 ) * [MM] add API routes for getting & setting MM cache sizes, and retrieving MM stats * Update invokeai/app/api/routers/model_manager.py Co-authored-by: Ryan Dick <ryanjdick3@gmail.com> * code cleanup after @ryand review * Update invokeai/app/api/routers/model_manager.py Co-authored-by: Ryan Dick <ryanjdick3@gmail.com> * fix merge conflicts; tested and working --------- Co-authored-by: Lincoln Stein <lstein@gmail.com> Co-authored-by: Ryan Dick <ryanjdick3@gmail.com>	2024-09-02 12:18:21 -04:00
Ryan Dick	24065ec6b6	Add FLUX image-to-image and inpainting (#6798 ) ## Summary This PR adds support for Image-to-Image and inpainting workflows with the FLUX model. Full changelog: - Split out `FLUX VAE Encode` and `FLUX VAE Decode` nodes - Renamed `FLUX Text-to-Image` node to `FLUX Denoise` (since it now supports image-to-image too). This is a workflow-breaking change. - Added support for FLUX image-to-image via the `Latents` param on the FLUX denoising node. - Added support for FLUX masked inpainting via the `Denoise Mask` param on the FLUX denoising node. - Added "Denoise Start" and "Denoise End" params to the "FLUX Denoise" node. - Updated the "FLUX Text to Image" default workflow. - Added a "FLUX Image to Image" default workflow. ### Example FLUX inpainting workflow <img width="1282" alt="image" src="https://github.com/user-attachments/assets/86fc1170-e620-4412-8fd8-e119f875fc2e"> Input image ![image](https://github.com/user-attachments/assets/9c381b86-9f87-4257-bd2e-da22c56ca26c) Mask ![image](https://github.com/user-attachments/assets/8f774c5c-2a25-45fe-9d4b-b233e3d58d2c) Output image ![image](https://github.com/user-attachments/assets/8576a630-24ce-4a00-8052-e86bab59c855) ### Callouts for reviewers: - I renamed FLUXTextToImageInvocation -> FLUXDenoisingInvocation. This is, of course, a breaking change. It feels like the right move and now is the right time to do it. Any objection? - I added new `FLUX VAE Encode` and `FLUX VAE Decode` nodes. Alternatively, I could have tried to match these names to the corresponding SD nodes (e.g. `FLUX Image to Latents`, `FLUX Latents to Image`). Personally, I prefer the current names, but want to hear other opinions. ### Usage notes: - With the default dev timestep scheduler, the image structure is largely determined in the first ~3 steps. A consequence of this is that the denoise_start parameter provides limited 'granularity' of control. This will likely be improved in the future as we add more scheduler options. In the meantime, you will likely want to use small values for `denoise_start` (e.g. 0.03) to start denoising on step ~1-4 out of ~30. - Currently, there is no 'noise' parameter on the `FLUX Denoise` node, so the `denoise_end` parameter has limited utility. This will be added in the future. ## QA Instructions Test the following workflows: - [x] Vanilla FLUX text-to-image behaviour is unchanged - [x] Image-to-image with FLUX dev, no mask - [x] Image-to-image with FLUX dev, with mask - [x] Image-to-image with FLUX schnell, no mask (smoke test, not expected to work well) ## Merge Plan No special instructions. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [x] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_	2024-09-02 09:50:31 -04:00
Ryan Dick	627b0bf644	Expose all FLUX model params in the default FLUX models.	2024-09-02 09:38:17 -04:00
Ryan Dick	b43da46b82	Rename 'FLUX VAE Encode'/'FLUX VAE Decode' to 'FLUX Image to Latents'/'FLUX Latents to Image'	2024-09-02 09:38:17 -04:00
Ryan Dick	4255a01c64	Restore line that was accidentally removed during development.	2024-09-02 09:38:17 -04:00
Ryan Dick	23adbd4002	Update schema.ts.	2024-09-02 09:38:17 -04:00
Ryan Dick	fb5a24fcc6	Update default workflows for FLUX.	2024-09-02 09:38:17 -04:00
Ryan Dick	cfdd5a1900	Rename flux_text_to_image.py -> flex_denoise.py	2024-09-02 09:38:17 -04:00
Ryan Dick	2313f326df	Add denoise_end param to FluxDenoiseInvocation.	2024-09-02 09:38:17 -04:00
Ryan Dick	2e092a2313	Rename FluxTextToImageInvocation -> FluxDenoiseInvocation.	2024-09-02 09:38:17 -04:00
Ryan Dick	763ef06c18	Use the existence of initial latents to decide whether we are doing image-to-image in the FLUX denoising node. Previously we were using the denoising_start value, but in some cases with an inpaintin mask you may want to run image-to-image from densoising_start=0.	2024-09-02 09:38:17 -04:00
Ryan Dick	8292f6cd42	Code cleanup and documentation around FLUX inpainting.	2024-09-02 09:38:17 -04:00
Ryan Dick	278bba499e	Split FLUX VAE decoding out into its own node from LatentsToImageInvocation.	2024-09-02 09:38:17 -04:00
Ryan Dick	dd99ed28e0	Split FLUX VAE encoding out into its own node from ImageToLatentsInvocation.	2024-09-02 09:38:17 -04:00
Ryan Dick	9a8aca69bf	Get a rough version of FLUX inpainting working.	2024-09-02 09:38:17 -04:00
Ryan Dick	7ad62512eb	Update MaskTensorToImageInvocation to support input mask tensors with or without a channel dimension.	2024-09-02 09:38:17 -04:00
Ryan Dick	bd466661ec	Remove unused vae field from FLUXTextToImageInvocation.	2024-09-02 09:38:17 -04:00
Ryan Dick	7ebb509d05	Bump FLUX node versions after splitting out VAE encode/decode.	2024-09-02 09:38:17 -04:00
Ryan Dick	0aa13c046c	Split VAE decoding out from the FLUXTextToImageInvocation.	2024-09-02 09:38:17 -04:00
Ryan Dick	a7a33d73f5	Get FLUX non-masked image-to-image working - still rough.	2024-09-02 09:38:17 -04:00
Ryan Dick	ffa39857d3	Add FLUX VAE decoding support to LatentsToImageInvocation.	2024-09-02 09:38:17 -04:00
Ryan Dick	e85c3bc465	Add FLUX VAE support to ImageToLatentsInvocation.	2024-09-02 09:38:17 -04:00
psychedelicious	8185ba7054	scripts: add allocate_vram script Allocates the specified amount of VRAM, or allocates enough VRAM such that you have the specified amount of VRAM free. Useful to simulate an environment with a specific amount of VRAM.	2024-09-02 18:18:26 +10:00
Lincoln Stein	d501865bec	add a new FAQ for converting safetensors (#6736 ) Co-authored-by: Lincoln Stein <lstein@gmail.com>	2024-08-31 18:56:08 +00:00
Brandon Rising	d62310bb5f	Support HF repos with subfolders in source on windows OS	2024-08-30 19:31:42 -04:00
Brandon Rising	1835bff196	Fix source string in hugging face installs with subfolders	2024-08-30 19:31:42 -04:00
Ryan Dick	87261bdbc9	FLUX memory management improvements (#6791 ) ## Summary This PR contains several improvements to memory management for FLUX workflows. It is now possible to achieve better FLUX model caching performance, but this still requires users to manually configure their `ram`/`vram` settings. E.g. a `vram` setting of 16.0 should allow for all quantized FLUX models to be kept in memory on the GPU. Changes: - Check the size of a model on disk and free the requisite space in the model cache before loading it. (This behaviour existed previously, but was removed in https://github.com/invoke-ai/InvokeAI/pull/6072/files. The removal did not seem to be intentional). - Removed the hack to free 24GB of space in the cache before loading the FLUX model. - Split the T5 embedding and CLIP embedding steps into separate functions so that the two models don't both have to be held in RAM at the same time. - Fix a bug in `InvokeLinear8bitLt` that was causing some tensors to be left on the GPU when the model was offloaded to the CPU. (This class is getting very messy due to the non-standard state_dict handling in `bnb.nn.Linear8bitLt`. ) - Tidy up some dtype handling in FluxTextToImageInvocation to avoid situations where we hold references to two copies of the same tensor unnecessarily. - (minor) Misc cleanup of ModelCache: improve docs and remove unused vars. Future: We should revisit our default ram/vram configs. The current defaults are very conservative, and users could see major performance improvements from tuning these values. ## QA Instructions I tested the FLUX workflow with the following configurations and verified that the cache hit rates and memory usage matched the expected behaviour: - `ram = 16` and `vram = 16` - `ram = 16` and `vram = 1` - `ram = 1` and `vram = 1` Note that the changes in this PR are not isolated to FLUX. Since we now check the size of models on disk, we may see slight changes in model cache offload patterns for other models as well. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [x] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_	2024-08-29 15:17:45 -04:00
Ryan Dick	4e4b6c6dbc	Tidy variable management and dtype handling in FluxTextToImageInvocation.	2024-08-29 19:08:18 +00:00
Ryan Dick	5e8cf9fb6a	Remove hack to clear cache from the FluxTextToImageInvocation. We now clear the cache based on the on-disk model size.	2024-08-29 19:08:18 +00:00
Ryan Dick	c738fe051f	Split T5 encoding and CLIP encoding into separate functions to ensure that all model references are locally-scoped so that the two models don't have to be help in memory at the same time.	2024-08-29 19:08:18 +00:00
Ryan Dick	29fe1533f2	Fix bug in InvokeLinear8bitLt that was causing old state information to persist after loading from a state dict. This manifested as state tensors being left on the GPU even when a model had been offloaded to the CPU cache.	2024-08-29 19:08:18 +00:00
Ryan Dick	77090070bd	Check the size of a model on disk and make room for it in the cache before loading it.	2024-08-29 19:08:18 +00:00
Ryan Dick	6ba9b1b6b0	Tidy up GIG -> GB and remove unused GIG constant.	2024-08-29 19:08:18 +00:00
Ryan Dick	c578b8df1e	Improve ModelCache docs.	2024-08-29 19:08:18 +00:00
Ryan Dick	cad9a41433	Remove unused MOdelCache.exists(...) function.	2024-08-29 19:08:18 +00:00
Ryan Dick	5fefb3b0f4	Remove unused param from ModelCache.	2024-08-29 19:08:18 +00:00
Ryan Dick	5284a870b0	Remove unused constructor params from ModelCache.	2024-08-29 19:08:18 +00:00
Ryan Dick	e064377c05	Remove default model cache sizes from model_cache_default.py. These defaults were misleading, because the config defaults take precedence over them.	2024-08-29 19:08:18 +00:00
Mary Hipp	3e569c8312	feat(ui): add fields for CLIP embed models and Flux VAE models in workflows	2024-08-29 11:52:51 -04:00
maryhipp	16825ee6e9	feat(nodes): bump version of flux model node, update default workflow	2024-08-29 11:52:51 -04:00
Mary Hipp	3f5340fa53	feat(nodes): add submodels as inputs to FLUX main model node instead of hardcoded names	2024-08-29 11:52:51 -04:00
chainchompa	f2a1a39b33	Add selectedStylePreset to app parameters (#6787 ) ## Summary - Add selectedStylePreset to app parameters <!--A description of the changes in this PR. Include the kind of change (fix, feature, docs, etc), the "why" and the "how". Screenshots or videos are useful for frontend changes.--> ## Related Issues / Discussions <!--WHEN APPLICABLE: List any related issues or discussions on github or discord. If this PR closes an issue, please use the "Closes #1234" format, so that the issue will be automatically closed when the PR merges.--> ## QA Instructions <!--WHEN APPLICABLE: Describe how you have tested the changes in this PR. Provide enough detail that a reviewer can reproduce your tests.--> ## Merge Plan <!--WHEN APPLICABLE: Large PRs, or PRs that touch sensitive things like DB schemas, may need some care when merging. For example, a careful rebase by the change author, timing to not interfere with a pending release, or a message to contributors on discord after merging.--> ## Checklist - [ ] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [ ] _Documentation added / updated (if applicable)_	2024-08-28 10:53:07 -04:00
chainchompa	326de55d3e	remove api changes and only preselect style preset	2024-08-28 09:53:29 -04:00
chainchompa	b2df909570	added selectedStylePreset to preload presets when app loads	2024-08-28 09:50:44 -04:00
chainchompa	026ac36b06	Revert "added selectedStylePreset to preload presets when app loads" This reverts commit `e97fd85904`.	2024-08-28 09:44:08 -04:00
chainchompa	92125e5fd2	bug fixes	2024-08-27 16:13:38 -04:00
chainchompa	c0c139da88	formatting ruff	2024-08-27 15:46:51 -04:00
chainchompa	404ad6a7fd	cleanup	2024-08-27 15:42:42 -04:00

... 8 9 10 11 12 ...

13138 Commits