InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI.git synced 2026-01-15 02:18:02 -05:00

Author	SHA1	Message	Date
Brandon Rising	125b459e56	chore: 4.2.9rc2 version bump v4.2.9rc2	2024-09-04 10:42:16 -04:00
Brandon Rising	33edee1ba6	Delete all flux bundle state dict keys when extracting the transformer state dict	2024-09-04 09:36:23 -04:00
Brandon Rising	d20335dabc	convert_bundle_to_flux_transformer_checkpoint now removes processed keys to decrease memory usage	2024-09-04 09:36:23 -04:00
Brandon Rising	d10d258213	Add a comment for why we're converting scale tensors in flux models to bfloat16	2024-09-04 09:36:23 -04:00
Brandon	d57ba1ed8b	Update invokeai/backend/model_manager/probe.py Co-authored-by: Ryan Dick <ryanjdick3@gmail.com>	2024-09-04 09:36:23 -04:00
Brandon Rising	2d0e34e57b	Support non-quantized bundles	2024-09-04 09:36:23 -04:00
Brandon Rising	a005d06255	feat: support checkpoint bundles containing more than just the transformer	2024-09-04 09:36:23 -04:00
Eugene Brodsky	a301ef5a5a	chore(ci): update github action version pins in container build workflow	2024-09-03 16:01:58 -04:00
Eugene Brodsky	9422df2737	feat(ci): enable a checkbox to push the container image when manually building via workflow dispatch	2024-09-03 16:01:58 -04:00
Lincoln Stein	6dabe4d3ca	assign T5 encoder to base type "Any"	2024-09-03 15:55:51 -04:00
Lincoln Stein	00e4652d30	add more reliable fallback method for determining BnbQuantizedLlmInt8b	2024-09-03 15:55:51 -04:00
Lincoln Stein	b6434c5318	correct modelformat probe for t5 encoders	2024-09-03 15:55:51 -04:00
Lincoln Stein	3f7f9f8d61	add probes for T5_encoder and ClipTextModel	2024-09-03 15:55:51 -04:00
Brandon Rising	f3bb592544	Update latents used for preview images in flux	2024-09-03 14:04:16 -04:00
Brandon Rising	69f080fb75	Move flux step callback code into the step_callback util scripts, use other services within the invocation context	2024-09-03 14:04:16 -04:00
Brandon Rising	04272a7cc8	Initial attempt at preview images	2024-09-03 14:04:16 -04:00
Lincoln Stein	8d35af946e	[MM] add API routes for getting & setting MM cache sizes (#6523 ) * [MM] add API routes for getting & setting MM cache sizes, and retrieving MM stats * Update invokeai/app/api/routers/model_manager.py Co-authored-by: Ryan Dick <ryanjdick3@gmail.com> * code cleanup after @ryand review * Update invokeai/app/api/routers/model_manager.py Co-authored-by: Ryan Dick <ryanjdick3@gmail.com> * fix merge conflicts; tested and working --------- Co-authored-by: Lincoln Stein <lstein@gmail.com> Co-authored-by: Ryan Dick <ryanjdick3@gmail.com>	2024-09-02 12:18:21 -04:00
Ryan Dick	24065ec6b6	Add FLUX image-to-image and inpainting (#6798 ) ## Summary This PR adds support for Image-to-Image and inpainting workflows with the FLUX model. Full changelog: - Split out `FLUX VAE Encode` and `FLUX VAE Decode` nodes - Renamed `FLUX Text-to-Image` node to `FLUX Denoise` (since it now supports image-to-image too). This is a workflow-breaking change. - Added support for FLUX image-to-image via the `Latents` param on the FLUX denoising node. - Added support for FLUX masked inpainting via the `Denoise Mask` param on the FLUX denoising node. - Added "Denoise Start" and "Denoise End" params to the "FLUX Denoise" node. - Updated the "FLUX Text to Image" default workflow. - Added a "FLUX Image to Image" default workflow. ### Example FLUX inpainting workflow <img width="1282" alt="image" src="https://github.com/user-attachments/assets/86fc1170-e620-4412-8fd8-e119f875fc2e"> Input image ![image](https://github.com/user-attachments/assets/9c381b86-9f87-4257-bd2e-da22c56ca26c) Mask ![image](https://github.com/user-attachments/assets/8f774c5c-2a25-45fe-9d4b-b233e3d58d2c) Output image ![image](https://github.com/user-attachments/assets/8576a630-24ce-4a00-8052-e86bab59c855) ### Callouts for reviewers: - I renamed FLUXTextToImageInvocation -> FLUXDenoisingInvocation. This is, of course, a breaking change. It feels like the right move and now is the right time to do it. Any objection? - I added new `FLUX VAE Encode` and `FLUX VAE Decode` nodes. Alternatively, I could have tried to match these names to the corresponding SD nodes (e.g. `FLUX Image to Latents`, `FLUX Latents to Image`). Personally, I prefer the current names, but want to hear other opinions. ### Usage notes: - With the default dev timestep scheduler, the image structure is largely determined in the first ~3 steps. A consequence of this is that the denoise_start parameter provides limited 'granularity' of control. This will likely be improved in the future as we add more scheduler options. In the meantime, you will likely want to use small values for `denoise_start` (e.g. 0.03) to start denoising on step ~1-4 out of ~30. - Currently, there is no 'noise' parameter on the `FLUX Denoise` node, so the `denoise_end` parameter has limited utility. This will be added in the future. ## QA Instructions Test the following workflows: - [x] Vanilla FLUX text-to-image behaviour is unchanged - [x] Image-to-image with FLUX dev, no mask - [x] Image-to-image with FLUX dev, with mask - [x] Image-to-image with FLUX schnell, no mask (smoke test, not expected to work well) ## Merge Plan No special instructions. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [x] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_	2024-09-02 09:50:31 -04:00
Ryan Dick	627b0bf644	Expose all FLUX model params in the default FLUX models.	2024-09-02 09:38:17 -04:00
Ryan Dick	b43da46b82	Rename 'FLUX VAE Encode'/'FLUX VAE Decode' to 'FLUX Image to Latents'/'FLUX Latents to Image'	2024-09-02 09:38:17 -04:00
Ryan Dick	4255a01c64	Restore line that was accidentally removed during development.	2024-09-02 09:38:17 -04:00
Ryan Dick	23adbd4002	Update schema.ts.	2024-09-02 09:38:17 -04:00
Ryan Dick	fb5a24fcc6	Update default workflows for FLUX.	2024-09-02 09:38:17 -04:00
Ryan Dick	cfdd5a1900	Rename flux_text_to_image.py -> flex_denoise.py	2024-09-02 09:38:17 -04:00
Ryan Dick	2313f326df	Add denoise_end param to FluxDenoiseInvocation.	2024-09-02 09:38:17 -04:00
Ryan Dick	2e092a2313	Rename FluxTextToImageInvocation -> FluxDenoiseInvocation.	2024-09-02 09:38:17 -04:00
Ryan Dick	763ef06c18	Use the existence of initial latents to decide whether we are doing image-to-image in the FLUX denoising node. Previously we were using the denoising_start value, but in some cases with an inpaintin mask you may want to run image-to-image from densoising_start=0.	2024-09-02 09:38:17 -04:00
Ryan Dick	8292f6cd42	Code cleanup and documentation around FLUX inpainting.	2024-09-02 09:38:17 -04:00
Ryan Dick	278bba499e	Split FLUX VAE decoding out into its own node from LatentsToImageInvocation.	2024-09-02 09:38:17 -04:00
Ryan Dick	dd99ed28e0	Split FLUX VAE encoding out into its own node from ImageToLatentsInvocation.	2024-09-02 09:38:17 -04:00
Ryan Dick	9a8aca69bf	Get a rough version of FLUX inpainting working.	2024-09-02 09:38:17 -04:00
Ryan Dick	7ad62512eb	Update MaskTensorToImageInvocation to support input mask tensors with or without a channel dimension.	2024-09-02 09:38:17 -04:00
Ryan Dick	bd466661ec	Remove unused vae field from FLUXTextToImageInvocation.	2024-09-02 09:38:17 -04:00
Ryan Dick	7ebb509d05	Bump FLUX node versions after splitting out VAE encode/decode.	2024-09-02 09:38:17 -04:00
Ryan Dick	0aa13c046c	Split VAE decoding out from the FLUXTextToImageInvocation.	2024-09-02 09:38:17 -04:00
Ryan Dick	a7a33d73f5	Get FLUX non-masked image-to-image working - still rough.	2024-09-02 09:38:17 -04:00
Ryan Dick	ffa39857d3	Add FLUX VAE decoding support to LatentsToImageInvocation.	2024-09-02 09:38:17 -04:00
Ryan Dick	e85c3bc465	Add FLUX VAE support to ImageToLatentsInvocation.	2024-09-02 09:38:17 -04:00
psychedelicious	8185ba7054	scripts: add allocate_vram script Allocates the specified amount of VRAM, or allocates enough VRAM such that you have the specified amount of VRAM free. Useful to simulate an environment with a specific amount of VRAM.	2024-09-02 18:18:26 +10:00
Lincoln Stein	d501865bec	add a new FAQ for converting safetensors (#6736 ) Co-authored-by: Lincoln Stein <lstein@gmail.com>	2024-08-31 18:56:08 +00:00
Brandon Rising	d62310bb5f	Support HF repos with subfolders in source on windows OS	2024-08-30 19:31:42 -04:00
Brandon Rising	1835bff196	Fix source string in hugging face installs with subfolders	2024-08-30 19:31:42 -04:00
Ryan Dick	87261bdbc9	FLUX memory management improvements (#6791 ) ## Summary This PR contains several improvements to memory management for FLUX workflows. It is now possible to achieve better FLUX model caching performance, but this still requires users to manually configure their `ram`/`vram` settings. E.g. a `vram` setting of 16.0 should allow for all quantized FLUX models to be kept in memory on the GPU. Changes: - Check the size of a model on disk and free the requisite space in the model cache before loading it. (This behaviour existed previously, but was removed in https://github.com/invoke-ai/InvokeAI/pull/6072/files. The removal did not seem to be intentional). - Removed the hack to free 24GB of space in the cache before loading the FLUX model. - Split the T5 embedding and CLIP embedding steps into separate functions so that the two models don't both have to be held in RAM at the same time. - Fix a bug in `InvokeLinear8bitLt` that was causing some tensors to be left on the GPU when the model was offloaded to the CPU. (This class is getting very messy due to the non-standard state_dict handling in `bnb.nn.Linear8bitLt`. ) - Tidy up some dtype handling in FluxTextToImageInvocation to avoid situations where we hold references to two copies of the same tensor unnecessarily. - (minor) Misc cleanup of ModelCache: improve docs and remove unused vars. Future: We should revisit our default ram/vram configs. The current defaults are very conservative, and users could see major performance improvements from tuning these values. ## QA Instructions I tested the FLUX workflow with the following configurations and verified that the cache hit rates and memory usage matched the expected behaviour: - `ram = 16` and `vram = 16` - `ram = 16` and `vram = 1` - `ram = 1` and `vram = 1` Note that the changes in this PR are not isolated to FLUX. Since we now check the size of models on disk, we may see slight changes in model cache offload patterns for other models as well. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [x] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_	2024-08-29 15:17:45 -04:00
Ryan Dick	4e4b6c6dbc	Tidy variable management and dtype handling in FluxTextToImageInvocation.	2024-08-29 19:08:18 +00:00
Ryan Dick	5e8cf9fb6a	Remove hack to clear cache from the FluxTextToImageInvocation. We now clear the cache based on the on-disk model size.	2024-08-29 19:08:18 +00:00
Ryan Dick	c738fe051f	Split T5 encoding and CLIP encoding into separate functions to ensure that all model references are locally-scoped so that the two models don't have to be help in memory at the same time.	2024-08-29 19:08:18 +00:00
Ryan Dick	29fe1533f2	Fix bug in InvokeLinear8bitLt that was causing old state information to persist after loading from a state dict. This manifested as state tensors being left on the GPU even when a model had been offloaded to the CPU cache.	2024-08-29 19:08:18 +00:00
Ryan Dick	77090070bd	Check the size of a model on disk and make room for it in the cache before loading it.	2024-08-29 19:08:18 +00:00
Ryan Dick	6ba9b1b6b0	Tidy up GIG -> GB and remove unused GIG constant.	2024-08-29 19:08:18 +00:00
Ryan Dick	c578b8df1e	Improve ModelCache docs.	2024-08-29 19:08:18 +00:00

1 2 3 4 5 ...

12702 Commits