InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI.git synced 2026-02-02 07:25:07 -05:00

Author	SHA1	Message	Date
psychedelicious	8fc5d3dd20	chore(nodes): bump versions of changed nodes	2025-02-04 12:06:17 +11:00
dunkeroni	6f1a198af4	better granularity on image adjust slider	2025-02-04 12:06:17 +11:00
dunkeroni	9c7bac693b	fix image adjust hue handling	2025-02-04 12:06:17 +11:00
dunkeroni	8c9fc45341	add labels	2025-02-04 12:06:17 +11:00
dunkeroni	f93571f7ef	update default filter	2025-02-04 12:06:17 +11:00
dunkeroni	cc27730cb4	fix: image channel invocations respect alpha	2025-02-04 12:06:17 +11:00
dunkeroni	fdf9740f3c	fix: offets to integers	2025-02-04 12:06:17 +11:00
dunkeroni	58255ab7ba	add adjust image filter to canvas	2025-02-04 12:06:17 +11:00
Mary Hipp	64475b8f21	feat(ui): add button to clear model cache	2025-01-30 09:18:28 -05:00
Ryan Dick	cc9d215a9b	Add endpoint for emptying the model cache. Also, adds a threading lock to the ModelCache to make it thread-safe.	2025-01-30 09:18:28 -05:00
Ryan Dick	f7315f0432	Make the default max RAM cache size more conservative.	2025-01-30 08:46:59 -05:00
Ryan Dick	285313b282	Fix T5EncoderField initialization in SD3 model loader.	2025-01-29 09:27:52 -05:00
Ryan Dick	debcbd6e2c	Support FLUX OneTrainer LoRA formats (incl. DoRA) (#7590 ) ## Summary This PR adds support for the FLUX LoRA model format produced by OneTrainer. Specifically, this PR adds: - Support for DoRA patches - Support for patch models that modify the FLUX T5 encoder - Probing / loading support for OneTrainer models ## Known limitations - DoRA patches cannot currently be applied to base weights that are quantized with `bitsandbytes`. The DoRA algorithm requires accessing the original model weight in order to compute the patch diff, and the bitsandbytes quantization layers make this difficult. DoRA patches can be applied to non-quantized and GGUF-quantized layers without issue. - This PR results in a slight speed regression for a very particular inference combination: quantized base model + LoRA with diffusers keys (i.e. uses the `MergedLayerPatch`). Now that more LoRA formats are using the `MergedLayerPatch`, it was becoming too much work to maintain this optimization. Regression from ~1.7 it/s to ~1.4 it/s. ## Future Notes - We may want to consider dropping support for bitsandbytes quantization. It is very difficult to maintain compatibility for across features like partial-loading and LoRA patching. - At a future time, we should refactor the LoRA parsing logic to be more generalized rather than handling each format independently. - There are some redundant device casts and dequantizations in `autocast_linear_forward_sidecar_patches(...)` (and its sub-calls). Optimizing this is left for future work. ## Related Issues / Discussions - This PR should address a handful of the LoRAs reported in https://github.com/invoke-ai/InvokeAI/issues/7131 (specifically, most of the `envy*` LoRAs). - This PR should address the example in https://github.com/invoke-ai/InvokeAI/issues/6912 (though the intended effect of that LoRA is not totally clear, so its hard to verify with full confidence). ## QA Instructions OneTrainer test models: - https://civitai.com/models/844821/envy-flux-dark-watercolor-01?modelVersionId=945159 (DoRA, transformer only) - https://civitai.com/models/836757/envy-flux-digital-brush-01?modelVersionId=936167 (hada, transformer only) - ball_flux from https://github.com/invoke-ai/InvokeAI/issues/6912 (DoRA, transformer/clip/t5) The following tests were repeated with each of the OneTrainer test models: - [x] Test with non-quantized base model - [x] Test with GGUF-quantized base model - [x] Test with BnB-quantized base model - [x] Test with non-quantized base model that is partially-loaded onto the GPU Other regression test: - [x] Test some SD1 LoRAs - [x] Test some SDXL LoRAs - [x] Test a variety of existing FLUX LoRA formats - [x] Test a FLUX Control LoRA on all base model quantization formats. ## Merge Plan No special instructions. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [x] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_ - [ ] _Updated `What's New` copy (if doing a release after this PR)_	2025-01-28 12:50:52 -05:00
Ryan Dick	229834a5e8	Performance optimizations for LoRAs applied on top of GGML-quantized tensors.	2025-01-28 14:51:35 +00:00
Ryan Dick	6c919e1bca	Handle DoRA layer device casting when model is partially-loaded.	2025-01-28 14:51:35 +00:00
Ryan Dick	5357d6e08e	Rename ConcatenatedLoRALayer to MergedLayerPatch. And other minor cleanup.	2025-01-28 14:51:35 +00:00
Ryan Dick	7fef569e38	Update frontend graph building logic to support FLUX LoRAs that modify the T5 encoder weights.	2025-01-28 14:51:35 +00:00
Ryan Dick	e7fb435cc5	Update DoRALayer with a custom get_parameters() override that 1) applies alpha scaling to delta_v, and 2) warns if the base model is incompatible.	2025-01-28 14:51:35 +00:00
Ryan Dick	5d472ac1b8	Move quantized weight handling for patch layers up from ConcatenatedLoRALayer to CustomModuleMixin.	2025-01-28 14:51:35 +00:00
Ryan Dick	28514ba59a	Update ConcatenatedLoRALayer to work with all sub-layer types.	2025-01-28 14:51:35 +00:00
Ryan Dick	5ea7953537	Update GGMLTensor with ops necessary to work with ConcatenatedLoRALayer.	2025-01-28 14:51:35 +00:00
Ryan Dick	0db6639b4b	Add FLUX OneTrainer model probing.	2025-01-28 14:51:35 +00:00
Ryan Dick	b8eed2bdcb	Relax lora_layers_from_flux_diffusers_grouped_state_dict(...) so that it can work with more LoRA variants (e.g. hada)	2025-01-28 14:51:35 +00:00
Ryan Dick	1054283f5c	Fix bug in FLUX T5 Koyha-style LoRA key parsing.	2025-01-28 14:51:35 +00:00
Ryan Dick	f4a0b78a8d	Update FLUX invocations to support LoRAs that modify the T5 text encoder.	2025-01-28 14:51:35 +00:00
Ryan Dick	409b69ee5d	Fix typo in DoRALayer.	2025-01-28 14:51:35 +00:00
Ryan Dick	206f261e45	Add utils for loading FLUX OneTrainer DoRA models.	2025-01-28 14:51:35 +00:00
Ryan Dick	7eee4da896	Further updates to lora_model_from_flux_diffusers_state_dict() so that it can be re-used for OneTrainer LoRAs.	2025-01-28 14:51:35 +00:00
Ryan Dick	908976ac08	Add support for LyCoris-style LoRA keys in lora_model_from_flux_diffusers_state_dict(). Previously, it only supported PEFT-style LoRA keys.	2025-01-28 14:51:35 +00:00
Ryan Dick	dfa253e75b	Add utils for working with Kohya LoRA keys.	2025-01-28 14:51:35 +00:00
Ryan Dick	4f369e3dfb	First draft of DoRALayer. Not tested yet.	2025-01-28 14:51:35 +00:00
Ryan Dick	faa4fa02c0	Expand unit tests to test for confusion between FLUX LoRA formats.	2025-01-28 14:51:35 +00:00
Ryan Dick	5bd6428fdd	Add is_state_dict_likely_in_flux_onetrainer_format() util function.	2025-01-28 14:51:35 +00:00
Ryan Dick	8b4f411f7b	Add a test state dict for the OneTrainer DoRA format.	2025-01-28 14:51:35 +00:00
Ryan Dick	9d2f8b4ac8	Improve MaskOutput dimension consistency (#7591 ) ## Summary This PR fixes an issue with mask dimension consistency. Prior to this change, the following workflow would fail with `tuple out of range` error: <img width="1072" alt="image" src="https://github.com/user-attachments/assets/d0a9e658-1d64-4db4-adee-973bbdaca745" /> ### Before this PR Dimension compatibility for invocations that take a mask input: - `ApplyMaskTensorToImageInvocation`: 2 or 3 - `MaskTensorToImageInvocation`: 2 or 3 - `InvertTensorMaskInvocation`: 3 Mask dimension for invocations that produce a MaskOutput: - `RectangleMaskInvocation`: 3 - `AlphaMaskToTensorInvocation`: 3 - `InvertTensorMaskInvocation`: 3 - `ImageMaskToTensorInvocation`: 3 - `SegmentAnythingInvocation`: 2 ### After this PR (changes in bold) Dimension compatibility for invocations that take a mask input: - `ApplyMaskTensorToImageInvocation`: 2 or 3 - `MaskTensorToImageInvocation`: 2 or 3 - `InvertTensorMaskInvocation`: 2 or 3 <---------------- Mask dimension for invocations that produce a MaskOutput: - `RectangleMaskInvocation`: 3 - `AlphaMaskToTensorInvocation`: 3 - `InvertTensorMaskInvocation`: 3 - `ImageMaskToTensorInvocation`: 3 - `SegmentAnythingInvocation`: 3 <------------------- ## QA Instructions I tested the workflow in the PR description and this workflow: <img width="872" alt="image" src="https://github.com/user-attachments/assets/20496860-ce81-47c0-a46a-a611b73faa22" /> ## Merge Plan No special instructions. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [x] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_ - [ ] _Updated `What's New` copy (if doing a release after this PR)_	2025-01-28 09:42:39 -05:00
Ryan Dick	80c3d8bc5c	pnpm typegen	2025-01-28 14:30:15 +00:00
Ryan Dick	b681132da4	Update InvertTensorMaskInvocation to handle mask tensors with dim 2 or 3.	2025-01-24 22:04:46 +00:00
Ryan Dick	f60a5a5015	Update SegmentAnythingInvocation invocations to return masks with a channel dimension of size 1. This is the convention used by other nodes that produce a MaskOutput.	2025-01-24 22:04:10 +00:00
psychedelicious	6efd108481	docs: typo in manual docs install command Thanks to ShaneDK on discord for catching this.	2025-01-23 14:57:22 +11:00
Ryan Dick	f88c1ba0c3	Fix bug with some LoRA variants when applied to a BnB NF4 quantized model. Note the previous commit which added a unit test to trigger this bug.	2025-01-22 09:20:40 +11:00
Ryan Dick	e2f05d0800	Add unit tests for LoKR patch layers. The new tests trigger a bug when LoKR layers are applied to BnB-quantized layers (also impacts several other LoRA variant types).	2025-01-22 09:20:40 +11:00
psychedelicious	83e33a4810	chore: bump version to v5.6.0 v5.6.0	2025-01-21 17:58:47 +11:00
psychedelicious	e635028477	chore(ui): update whats new copy	2025-01-21 17:58:47 +11:00
psychedelicious	b7b8f8a9e5	fix(nodes): remove WithMetadata from non-image-outputting node	2025-01-21 17:58:47 +11:00
psychedelicious	e926d2f24b	fix(nodes): add beta classification to new inpainting support nodes	2025-01-21 17:58:47 +11:00
psychedelicious	ad8885c456	chore(ui): typegen	2025-01-21 17:45:32 +11:00
psychedelicious	cf4c79fe2e	feat(nodes): add PasteImageIntoBoundingBoxInvocation	2025-01-21 17:45:32 +11:00
psychedelicious	e0edfe6c40	feat(nodes): add CropImageToBoundingBoxInvocation	2025-01-21 17:45:32 +11:00
psychedelicious	8a0a37191a	feat(nodes): add GetMaskBoundingBoxInvocation	2025-01-21 17:45:32 +11:00
psychedelicious	7dbd5f150a	feat(nodes): add BoundingBoxField.tuple() to get bbox as PIL tuple	2025-01-21 17:45:32 +11:00

1 2 3 4 5 ...

15476 Commits