InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI.git synced 2026-02-15 14:45:06 -05:00

Author	SHA1	Message	Date
Ryan Dick	debcbd6e2c	Support FLUX OneTrainer LoRA formats (incl. DoRA) (#7590 ) ## Summary This PR adds support for the FLUX LoRA model format produced by OneTrainer. Specifically, this PR adds: - Support for DoRA patches - Support for patch models that modify the FLUX T5 encoder - Probing / loading support for OneTrainer models ## Known limitations - DoRA patches cannot currently be applied to base weights that are quantized with `bitsandbytes`. The DoRA algorithm requires accessing the original model weight in order to compute the patch diff, and the bitsandbytes quantization layers make this difficult. DoRA patches can be applied to non-quantized and GGUF-quantized layers without issue. - This PR results in a slight speed regression for a very particular inference combination: quantized base model + LoRA with diffusers keys (i.e. uses the `MergedLayerPatch`). Now that more LoRA formats are using the `MergedLayerPatch`, it was becoming too much work to maintain this optimization. Regression from ~1.7 it/s to ~1.4 it/s. ## Future Notes - We may want to consider dropping support for bitsandbytes quantization. It is very difficult to maintain compatibility for across features like partial-loading and LoRA patching. - At a future time, we should refactor the LoRA parsing logic to be more generalized rather than handling each format independently. - There are some redundant device casts and dequantizations in `autocast_linear_forward_sidecar_patches(...)` (and its sub-calls). Optimizing this is left for future work. ## Related Issues / Discussions - This PR should address a handful of the LoRAs reported in https://github.com/invoke-ai/InvokeAI/issues/7131 (specifically, most of the `envy*` LoRAs). - This PR should address the example in https://github.com/invoke-ai/InvokeAI/issues/6912 (though the intended effect of that LoRA is not totally clear, so its hard to verify with full confidence). ## QA Instructions OneTrainer test models: - https://civitai.com/models/844821/envy-flux-dark-watercolor-01?modelVersionId=945159 (DoRA, transformer only) - https://civitai.com/models/836757/envy-flux-digital-brush-01?modelVersionId=936167 (hada, transformer only) - ball_flux from https://github.com/invoke-ai/InvokeAI/issues/6912 (DoRA, transformer/clip/t5) The following tests were repeated with each of the OneTrainer test models: - [x] Test with non-quantized base model - [x] Test with GGUF-quantized base model - [x] Test with BnB-quantized base model - [x] Test with non-quantized base model that is partially-loaded onto the GPU Other regression test: - [x] Test some SD1 LoRAs - [x] Test some SDXL LoRAs - [x] Test a variety of existing FLUX LoRA formats - [x] Test a FLUX Control LoRA on all base model quantization formats. ## Merge Plan No special instructions. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [x] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_ - [ ] _Updated `What's New` copy (if doing a release after this PR)_	2025-01-28 12:50:52 -05:00
Ryan Dick	229834a5e8	Performance optimizations for LoRAs applied on top of GGML-quantized tensors.	2025-01-28 14:51:35 +00:00
Ryan Dick	6c919e1bca	Handle DoRA layer device casting when model is partially-loaded.	2025-01-28 14:51:35 +00:00
Ryan Dick	5357d6e08e	Rename ConcatenatedLoRALayer to MergedLayerPatch. And other minor cleanup.	2025-01-28 14:51:35 +00:00
Ryan Dick	7fef569e38	Update frontend graph building logic to support FLUX LoRAs that modify the T5 encoder weights.	2025-01-28 14:51:35 +00:00
Ryan Dick	e7fb435cc5	Update DoRALayer with a custom get_parameters() override that 1) applies alpha scaling to delta_v, and 2) warns if the base model is incompatible.	2025-01-28 14:51:35 +00:00
Ryan Dick	5d472ac1b8	Move quantized weight handling for patch layers up from ConcatenatedLoRALayer to CustomModuleMixin.	2025-01-28 14:51:35 +00:00
Ryan Dick	28514ba59a	Update ConcatenatedLoRALayer to work with all sub-layer types.	2025-01-28 14:51:35 +00:00
Ryan Dick	5ea7953537	Update GGMLTensor with ops necessary to work with ConcatenatedLoRALayer.	2025-01-28 14:51:35 +00:00
Ryan Dick	0db6639b4b	Add FLUX OneTrainer model probing.	2025-01-28 14:51:35 +00:00
Ryan Dick	b8eed2bdcb	Relax lora_layers_from_flux_diffusers_grouped_state_dict(...) so that it can work with more LoRA variants (e.g. hada)	2025-01-28 14:51:35 +00:00
Ryan Dick	1054283f5c	Fix bug in FLUX T5 Koyha-style LoRA key parsing.	2025-01-28 14:51:35 +00:00
Ryan Dick	f4a0b78a8d	Update FLUX invocations to support LoRAs that modify the T5 text encoder.	2025-01-28 14:51:35 +00:00
Ryan Dick	409b69ee5d	Fix typo in DoRALayer.	2025-01-28 14:51:35 +00:00
Ryan Dick	206f261e45	Add utils for loading FLUX OneTrainer DoRA models.	2025-01-28 14:51:35 +00:00
Ryan Dick	7eee4da896	Further updates to lora_model_from_flux_diffusers_state_dict() so that it can be re-used for OneTrainer LoRAs.	2025-01-28 14:51:35 +00:00
Ryan Dick	908976ac08	Add support for LyCoris-style LoRA keys in lora_model_from_flux_diffusers_state_dict(). Previously, it only supported PEFT-style LoRA keys.	2025-01-28 14:51:35 +00:00
Ryan Dick	dfa253e75b	Add utils for working with Kohya LoRA keys.	2025-01-28 14:51:35 +00:00
Ryan Dick	4f369e3dfb	First draft of DoRALayer. Not tested yet.	2025-01-28 14:51:35 +00:00
Ryan Dick	faa4fa02c0	Expand unit tests to test for confusion between FLUX LoRA formats.	2025-01-28 14:51:35 +00:00
Ryan Dick	5bd6428fdd	Add is_state_dict_likely_in_flux_onetrainer_format() util function.	2025-01-28 14:51:35 +00:00
Ryan Dick	8b4f411f7b	Add a test state dict for the OneTrainer DoRA format.	2025-01-28 14:51:35 +00:00
Ryan Dick	9d2f8b4ac8	Improve MaskOutput dimension consistency (#7591 ) ## Summary This PR fixes an issue with mask dimension consistency. Prior to this change, the following workflow would fail with `tuple out of range` error: <img width="1072" alt="image" src="https://github.com/user-attachments/assets/d0a9e658-1d64-4db4-adee-973bbdaca745" /> ### Before this PR Dimension compatibility for invocations that take a mask input: - `ApplyMaskTensorToImageInvocation`: 2 or 3 - `MaskTensorToImageInvocation`: 2 or 3 - `InvertTensorMaskInvocation`: 3 Mask dimension for invocations that produce a MaskOutput: - `RectangleMaskInvocation`: 3 - `AlphaMaskToTensorInvocation`: 3 - `InvertTensorMaskInvocation`: 3 - `ImageMaskToTensorInvocation`: 3 - `SegmentAnythingInvocation`: 2 ### After this PR (changes in bold) Dimension compatibility for invocations that take a mask input: - `ApplyMaskTensorToImageInvocation`: 2 or 3 - `MaskTensorToImageInvocation`: 2 or 3 - `InvertTensorMaskInvocation`: 2 or 3 <---------------- Mask dimension for invocations that produce a MaskOutput: - `RectangleMaskInvocation`: 3 - `AlphaMaskToTensorInvocation`: 3 - `InvertTensorMaskInvocation`: 3 - `ImageMaskToTensorInvocation`: 3 - `SegmentAnythingInvocation`: 3 <------------------- ## QA Instructions I tested the workflow in the PR description and this workflow: <img width="872" alt="image" src="https://github.com/user-attachments/assets/20496860-ce81-47c0-a46a-a611b73faa22" /> ## Merge Plan No special instructions. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [x] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_ - [ ] _Updated `What's New` copy (if doing a release after this PR)_	2025-01-28 09:42:39 -05:00
Ryan Dick	80c3d8bc5c	pnpm typegen	2025-01-28 14:30:15 +00:00
Ryan Dick	b681132da4	Update InvertTensorMaskInvocation to handle mask tensors with dim 2 or 3.	2025-01-24 22:04:46 +00:00
Ryan Dick	f60a5a5015	Update SegmentAnythingInvocation invocations to return masks with a channel dimension of size 1. This is the convention used by other nodes that produce a MaskOutput.	2025-01-24 22:04:10 +00:00
psychedelicious	6efd108481	docs: typo in manual docs install command Thanks to ShaneDK on discord for catching this.	2025-01-23 14:57:22 +11:00
Ryan Dick	f88c1ba0c3	Fix bug with some LoRA variants when applied to a BnB NF4 quantized model. Note the previous commit which added a unit test to trigger this bug.	2025-01-22 09:20:40 +11:00
Ryan Dick	e2f05d0800	Add unit tests for LoKR patch layers. The new tests trigger a bug when LoKR layers are applied to BnB-quantized layers (also impacts several other LoRA variant types).	2025-01-22 09:20:40 +11:00
psychedelicious	83e33a4810	chore: bump version to v5.6.0 v5.6.0	2025-01-21 17:58:47 +11:00
psychedelicious	e635028477	chore(ui): update whats new copy	2025-01-21 17:58:47 +11:00
psychedelicious	b7b8f8a9e5	fix(nodes): remove WithMetadata from non-image-outputting node	2025-01-21 17:58:47 +11:00
psychedelicious	e926d2f24b	fix(nodes): add beta classification to new inpainting support nodes	2025-01-21 17:58:47 +11:00
psychedelicious	ad8885c456	chore(ui): typegen	2025-01-21 17:45:32 +11:00
psychedelicious	cf4c79fe2e	feat(nodes): add PasteImageIntoBoundingBoxInvocation	2025-01-21 17:45:32 +11:00
psychedelicious	e0edfe6c40	feat(nodes): add CropImageToBoundingBoxInvocation	2025-01-21 17:45:32 +11:00
psychedelicious	8a0a37191a	feat(nodes): add GetMaskBoundingBoxInvocation	2025-01-21 17:45:32 +11:00
psychedelicious	7dbd5f150a	feat(nodes): add BoundingBoxField.tuple() to get bbox as PIL tuple	2025-01-21 17:45:32 +11:00
psychedelicious	1ad65ffd53	feat(nodes): re-title "Mask from ID" -> "Mask from Segmented Image"	2025-01-21 17:45:32 +11:00
psychedelicious	14b5c871dc	feat(nodes): simplify MaskFromIDInvocation	2025-01-21 17:45:32 +11:00
psychedelicious	8d2b4e2bf5	feat(nodes): support FLUX, SD3 in ideal_size	2025-01-21 17:45:32 +11:00
psychedelicious	aba70eacab	fix(ui): field handle positioning for non-batch fields Accidentally overwrote some reactflow styles which caused field handles to be positioned differently for non-batch fields. Just a minor visual issue.	2025-01-21 11:49:49 +11:00
Riccardo Giovanetti	4b67175b1b	translationBot(ui): update translation (Italian) Currently translated at 99.1% (1690 of 1704 strings) Co-authored-by: Riccardo Giovanetti <riccardo.giovanetti@gmail.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/it/ Translation: InvokeAI/Web UI	2025-01-21 09:12:45 +11:00
Hosted Weblate	e3423d1ba8	translationBot(ui): update translation files Updated by "Cleanup translation files" hook in Weblate. Co-authored-by: Hosted Weblate <hosted@weblate.org> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/ Translation: InvokeAI/Web UI	2025-01-21 09:12:45 +11:00
Linos	87fb00ff5d	translationBot(ui): update translation (Vietnamese) Currently translated at 100.0% (1697 of 1697 strings) translationBot(ui): update translation (Vietnamese) Currently translated at 99.2% (1684 of 1697 strings) translationBot(ui): update translation (Vietnamese) Currently translated at 99.7% (1676 of 1681 strings) translationBot(ui): update translation (Vietnamese) Currently translated at 99.3% (1670 of 1681 strings) translationBot(ui): update translation (Vietnamese) Currently translated at 99.5% (1658 of 1666 strings) translationBot(ui): update translation (Vietnamese) Currently translated at 100.0% (1652 of 1652 strings) Co-authored-by: Linos <linos.coding@gmail.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/vi/ Translation: InvokeAI/Web UI	2025-01-21 09:12:45 +11:00
Riccardo Giovanetti	d99a9ffb72	translationBot(ui): update translation (Italian) Currently translated at 99.3% (1642 of 1652 strings) Co-authored-by: Riccardo Giovanetti <riccardo.giovanetti@gmail.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/it/ Translation: InvokeAI/Web UI	2025-01-21 09:12:45 +11:00
Hosted Weblate	7964f438dc	translationBot(ui): update translation files Updated by "Cleanup translation files" hook in Weblate. Co-authored-by: Hosted Weblate <hosted@weblate.org> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/ Translation: InvokeAI/Web UI	2025-01-21 09:12:45 +11:00
Linos	b130a3a9ee	translationBot(ui): update translation (Vietnamese) Currently translated at 100.0% (1652 of 1652 strings) Co-authored-by: Linos <linos.coding@gmail.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/vi/ Translation: InvokeAI/Web UI	2025-01-21 09:12:45 +11:00
Riccardo Giovanetti	a6b32160b2	translationBot(ui): update translation (Italian) Currently translated at 99.3% (1642 of 1652 strings) translationBot(ui): update translation (Italian) Currently translated at 99.3% (1641 of 1652 strings) Co-authored-by: Riccardo Giovanetti <riccardo.giovanetti@gmail.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/it/ Translation: InvokeAI/Web UI	2025-01-21 09:12:45 +11:00
psychedelicious	7d110cc9d3	fix(ui): disable dynamic prompts generators pending resolution of infinite recursion issue Dynamic prompts string generators can cause an infinite feedback loop when added to the linear view. The root cause is how these generators handle "resolving" their collections. They hit the dynamic prompts HTTP API within the view component to get the prompts, then set the batch node's internal state with those values. When the same generator is rendered in both the node editor view and linear view and the timing is just right, that state update causes an infinite feedback loop between the two components as they respond to the state updates from the other component. The other generators never store the generated values in the batch node's internal state. The values are "resolved" just-in-time as they are needed. To fix this, the batch value "resolver" utilities could be made async and hit the API. But there's a problem - the resolver utilities are used within the "are we ready to invoke? are there any problems with the current settings?" redux selectors, which are strictly synchronous. To fix that, we can refactor that "are we ready to invoke?" logic to not use redux selectors, so the whole thing could be async. It's not a big change but I'm not going to spend time on it at the moment. So, until I address this, the dynamic prompts generators are disabled.	2025-01-21 09:00:40 +11:00

... 30 31 32 33 34 ...

17015 Commits