InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI.git synced 2026-02-01 11:44:59 -05:00

Author	SHA1	Message	Date
psychedelicious	6cfeb71bed	feat(nodes): add expand_mask_with_fade to better handle canvas compositing needs Previously we used erode/dilate and a Gaussian blur to expand and fade the edges of Canvas masks. The implementation a number of problems: - Erode/dilate kernel sizes were not calculated correctly, and extra iterations were run to compensate. The result is the blur size, which should have been pixels, was very inaccurate and unreliable. - What we want is to add a "soft bleed" - like a drop shadow with no offset - starting from the edge of the mask, extending out by however many pixels. But Gaussian blur does not do this. The blurred area starts _inside_ the mask and extends outside it. So it kinda blurs inwards and outwards. We compensated for this by expanding the mask. - Using a Gaussian blur can cause banding artifacts. Gaussian blur doesn't have a "size" or "radius" parameter in the sense that you think it should. It's a convolution matrix and there are _no non-zero values in the result_. This means that, far away from the mask, once compositing completes, we have some values that are very close to zero but not quite zero. These values are quantized by HTML Canvas, resulting in banding artifacts where you'd expect the blur to have faded to 0% alpha. At least, that is my understanding of why the banding artifacts occur. The new node uses a better strategy to expand the mask and add the fade out effect: - Calculate the distance from each white pixel to the nearest black pixel. - Normalize this distance by dividing by the fade size in px, then clip the values to 0 - 1. The result represents the distance of each white pixel to its nearest black pixel as a percentage of the fade size. At this point, it is a linear distribution. - Create a polynomial to describe the fade's intensity so that we can have a smooth transition from the masked region (black) to unmasked (white). There are some magic numbers here, deterined experimentally. - Evaluate the polynomial over the normalized distances, so we now have a matrix representing the fade intensity for every pixel - Convert this matrix back to uint8 and apply it to the mask This works soooo much better than the previous method. Not only does it fix the banding issues, but when we enable "output only generated regions", we get a much smaller image. Will add images to the PR to clarify.	2025-03-21 10:24:03 +11:00
psychedelicious	534f993023	feat(nodes): add `apply_mask_to_image` node It simply applies the mask to an image.	2025-03-21 10:24:03 +11:00
psychedelicious	67f9b6420c	fix(nodes): ensure alpha mask is opened as RGBA	2025-03-21 10:24:03 +11:00
psychedelicious	61bf065237	feat(nodes): rename "FLUX Fill" -> "FLUX Fill Conditioning"	2025-03-21 10:24:03 +11:00
psychedelicious	e78cf889ee	fix(ui): clip shift-draw strokes to bbox when clip to bbox enabled Closes #7809	2025-03-21 08:14:20 +11:00
psychedelicious	5d13f0ba15	tidy(ui): remove recommended flag from workflow (believe was for testing purposes)	2025-03-20 08:50:01 -04:00
psychedelicious	633b9afa46	fix(ui): recommended star stretches tag list layout	2025-03-20 08:50:01 -04:00
psychedelicious	f1889b259d	tidy(ui): split browse workflows button into own component	2025-03-20 08:50:01 -04:00
psychedelicious	ed21d0b57e	tidy(ui): remove noop useEffect	2025-03-20 08:50:01 -04:00
Mary Hipp	df90da28e1	tsc fix	2025-03-20 15:43:57 +11:00
Mary Hipp	702054aa62	make sure browse is selected	2025-03-20 15:43:57 +11:00
Mary Hipp	636ec1de6e	add viewAllWorkflowsRecommended to studio init action to show library with only recomended workflows	2025-03-20 15:43:57 +11:00
Mary Hipp	063d07fd41	switch to using recommended with star insteaed of auto-selecting	2025-03-20 15:43:57 +11:00
Mary Hipp	c78eac624e	update workflow tag/categories so that we can pass in 1+ selected tags to start with	2025-03-20 15:43:57 +11:00
Mary Hipp	05de3b7a84	workflow library UI updates: scrollbar to make obvious its overflowing, move deselecet all tags to be next to browse button	2025-03-20 15:43:57 +11:00
Ryan Dick	9cc2232b6f	Bump FluxDenoise invocation version and typegen.	2025-03-19 14:45:18 +11:00
Ryan Dick	9fdc06b447	Add FLUX Fill input validation and error/warning reporting.	2025-03-19 14:45:18 +11:00
Ryan Dick	5ea3ec5cc8	Get FLUX Fill working. Note: To use FLUX Fill, set guidance to ~30.	2025-03-19 14:45:18 +11:00
Ryan Dick	f13a07ba6a	WIP on updating FluxDenoise to support FLUX Fill.	2025-03-19 14:45:18 +11:00
Ryan Dick	a913f0163d	WIP - Add FluxFillInvocation	2025-03-19 14:45:18 +11:00
Ryan Dick	f7cfbd1323	Add FLUX Fill starter model.	2025-03-19 14:45:18 +11:00
Ryan Dick	2806b60701	Add logic to probe FLUX variant (NORMAL vs INPAINT).	2025-03-19 14:45:18 +11:00
psychedelicious	d8c3af624b	Use git-lfs for larger assets (#7804 ) ## Summary - Integrate Git LFS to our automated Python tests in CI - Add stripped model files with git-lfs - `README.md` instructions to install and configure git-lfs - Unrelated change (skip hashing to make unit test run faster) ## Related Issues / Discussions <!--WHEN APPLICABLE: List any related issues or discussions on github or discord. If this PR closes an issue, please use the "Closes #1234" format, so that the issue will be automatically closed when the PR merges.--> ## QA Instructions <!--WHEN APPLICABLE: Describe how you have tested the changes in this PR. Provide enough detail that a reviewer can reproduce your tests.--> ## Merge Plan <!--WHEN APPLICABLE: Large PRs, or PRs that touch sensitive things like DB schemas, may need some care when merging. For example, a careful rebase by the change author, timing to not interfere with a pending release, or a message to contributors on discord after merging.--> ## Checklist - [ ] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [ ] _Documentation added / updated (if applicable)_ - [ ] _Updated `What's New` copy (if doing a release after this PR)_	2025-03-19 09:53:26 +11:00
psychedelicious	feed44b68d	Stripped models (#7797 ) ## Summary Problem We want to have automated tests for model classification/probing, but model files are too large to include in the source. Proposed Solution Classification/probing only requires metadata (key names, tensor shapes), not weights. This PR introduces "stripped" models - lightweight versions that retains only essential metadata. - Added script to strip models - Added stripped models to automated tests Model size before and after "stripping": ``` LLaVA Onevision Qwen2 0.5b-ov-hf before: 1.8 GB, after: 11.6 MB text_encoder before: 246.1 MB, after: 35.6 kB llava-onevision-qwen2-7b-si-hf before: 16.1 GB, after: 11.7 MB RealESRGAN_x2plus.pth before: 67.1 MB, after: 143.0 kB IP Adapter SD1 before: 2.5 GB, after: 94.9 kB Hard Edge Detection (canny) before: 722.6 MB, after: 63.6 kB Lineart before: 722.6 MB, after: 63.6 kB Segmentation Map before: 722.6 MB, after: 63.6 kB EasyNegative before: 24.7 kB, after: 151 Bytes Face Reference (IP Adapter Plus Face) before: 98.2 MB, after: 13.7 kB Standard Reference (IP Adapter) before: 44.6 MB, after: 6.0 kB shinkai_makoto_offset before: 151.1 MB, after: 160.0 kB thickline_fp16 before: 151.1 MB, after: 160.0 kB Alien Style before: 228.5 MB, after: 582.6 kB Noodles Style before: 228.5 MB, after: 582.6 kB Juggernaut XL v9 before: 6.9 GB, after: 3.7 MB dreamshaper-8 before: 168.9 MB, after: 1.6 MB ``` ## Related Issues / Discussions <!--WHEN APPLICABLE: List any related issues or discussions on github or discord. If this PR closes an issue, please use the "Closes #1234" format, so that the issue will be automatically closed when the PR merges.--> ## QA Instructions <!--WHEN APPLICABLE: Describe how you have tested the changes in this PR. Provide enough detail that a reviewer can reproduce your tests.--> ## Merge Plan <!--WHEN APPLICABLE: Large PRs, or PRs that touch sensitive things like DB schemas, may need some care when merging. For example, a careful rebase by the change author, timing to not interfere with a pending release, or a message to contributors on discord after merging.--> ## Checklist - [ ] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [ ] _Documentation added / updated (if applicable)_ - [ ] _Updated `What's New` copy (if doing a release after this PR)_	2025-03-19 08:13:10 +11:00
Billy	247f3b5d67	Merge branch 'stripped-models' into git-lfs	2025-03-19 07:53:27 +11:00
Billy	8e14f9d971	Merge branch 'main' into stripped-models	2025-03-19 07:52:56 +11:00
Billy	bdb44ee48d	Merge branch 'git-lfs' of github.com:invoke-ai/InvokeAI into git-lfs	2025-03-19 07:30:34 +11:00
Billy	b57f5330c5	Pin action to commit	2025-03-19 07:28:28 +11:00
jazzhaiku	ade3c015b4	Update docs/contributing/dev-environment.md Co-authored-by: Eugene Brodsky <ebr@users.noreply.github.com>	2025-03-19 07:23:23 +11:00
psychedelicious	7fe4d4c21a	feat(app): better errors when scanning models with picklescan Differentiate between malware detection and scan error.	2025-03-19 07:20:25 +11:00
psychedelicious	133a7fde55	Model classification api (#7742 ) ## Summary The _goal_ of this PR is to make it easier to add an new config type. This _scope_ of this PR is to integrate the API and does not include adding new configs (outside tests) or porting existing ones. One of the glaring issues of the existing legacy probe is that the logic for each type is spread across multiple classes and intertwined with the other configs. This means that adding a new config type (or modifying an existing one) is complex and error prone. This PR attempts to remedy this by providing a new API for adding configs that: - Is backwards compatible with the existing probe. - Encapsulates fields and logic in a single class, keeping things self-contained and easy to modify safely. Below is a minimal toy example illustrating the proposed new structure: ```python class MinimalConfigExample(ModelConfigBase): type: ModelType = ModelType.Main format: ModelFormat = ModelFormat.Checkpoint fun_quote: str @classmethod def matches(cls, mod: ModelOnDisk) -> bool: return mod.path.suffix == ".json" @classmethod def parse(cls, mod: ModelOnDisk) -> dict[str, Any]: with open(mod.path, "r") as f: contents = json.load(f) return { "fun_quote": contents["quote"], "base": BaseModelType.Any, } ``` To create a new config type, one needs to inherit from `ModelConfigBase` and implement its interface. The code falls back to the legacy model probe for existing models using the old API. This allows us to incrementally port the configs one by one. ## Related Issues / Discussions <!--WHEN APPLICABLE: List any related issues or discussions on github or discord. If this PR closes an issue, please use the "Closes #1234" format, so that the issue will be automatically closed when the PR merges.--> ## QA Instructions <!--WHEN APPLICABLE: Describe how you have tested the changes in this PR. Provide enough detail that a reviewer can reproduce your tests.--> ## Merge Plan <!--WHEN APPLICABLE: Large PRs, or PRs that touch sensitive things like DB schemas, may need some care when merging. For example, a careful rebase by the change author, timing to not interfere with a pending release, or a message to contributors on discord after merging.--> ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [x] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_ - [ ] _Updated `What's New` copy (if doing a release after this PR)_	2025-03-18 15:25:56 +11:00
Billy	6375214878	Merge branch 'stripped-models' into git-lfs	2025-03-18 14:57:58 +11:00
Billy	b9972be7f1	Merge branch 'model-classification-api' into stripped-models	2025-03-18 14:57:23 +11:00
Billy	e61c5a3f26	Merge	2025-03-18 14:55:11 +11:00
Billy	8c633786f6	Remove accidently included files	2025-03-18 14:16:51 +11:00
Billy	8703eea49b	LFS cache	2025-03-18 14:08:21 +11:00
Billy	c8888be4c3	Formatting	2025-03-18 13:10:07 +11:00
Billy	11963a65a4	CI/CD	2025-03-18 12:56:28 +11:00
Billy	ab6422fdf7	Add to README.md	2025-03-18 12:37:32 +11:00
psychedelicious	1f8632029e	fix(nodes): add validator to vllm node images field to handle single image field inputs	2025-03-18 11:53:06 +11:00
Ryan Dick	88a762474d	typegen	2025-03-18 11:53:06 +11:00
Ryan Dick	e6dd721e33	Add max_length=3 to the LLaVA OneVision image input field.	2025-03-18 11:53:06 +11:00
Billy	2a09604baf	Formatting	2025-03-18 11:53:06 +11:00
Billy	f94f00ede0	Ruff formatting	2025-03-18 11:53:06 +11:00
Billy	37af281299	WIP - model selection for LLaVA	2025-03-18 11:53:06 +11:00
Billy	fc82775d7a	WIP - model selection for LLaVA	2025-03-18 11:53:06 +11:00
Billy	9ed46f60b7	Add LLaVA OneVision to Config dropdown in UI	2025-03-18 11:53:06 +11:00
Ryan Dick	9a389e6b93	Add a LLaVA OneVision starter model.	2025-03-18 11:53:06 +11:00
Ryan Dick	2ef1ecf381	Fix copy-paste errors.	2025-03-18 11:53:06 +11:00
Ryan Dick	41de112932	Make LLaVA Onevision node work with 0 images, and other minor improvements.	2025-03-18 11:53:06 +11:00

1 2 3 4 5 ...

16214 Commits