mirror of https://github.com/invoke-ai/InvokeAI.git synced 2026-04-23 03:00:31 -04:00

Go to file

psychedelicious 8399de9c25 refactor(mm): simplify model classification process

Previously, we had a multi-phase strategy to identify models from their
files on disk:
1. Run each model config classes' `matches()` method on the files. It
checks if the model could possibly be an identified as the candidate
model type. This was intended to be a quick check. Break on the first
match.
2. If we have a match, run the config class's `parse()` method. It
derive some additional model config attrs from the model files. This was
intended to encapsulate heavier operations that may require loading the
model into memory.
3. Derive the common model config attrs, like name, description,
calculate the hash, etc. Some of these are also heavier operations.

This strategy has some issues:
- It is not clear how the pieces fit together. There is some
back-and-forth between different methods and the config base class. It
is hard to trace the flow of logic until you fully wrap your head around
the system and therefore difficult to add a model architecture to the
probe.
- The assumption that we could do quick, lightweight checks before
heavier checks is incorrect. We often _must_ load the model state dict
in the `matches()` method. So there is no practical perf benefit to
splitting up the responsibility of `matches()` and `parse()`.
- Sometimes we need to do the same checks in `matches()` and `parse()`.
In these cases, splitting the logic is has a negative perf impact
because we are doing the same work twice.
- As we introduce the concept of an "unknown" model config (i.e. a model
that we cannot identify, but still record in the db; see #8582), we will
_always_ run _all_ the checks for every model. Therefore we need not try
to defer heavier checks or resource-intensive ops like hashing. We are
going to do them anyways.
- There are situations where a model may match multiple configs. One
known case are SD pipeline models with merged LoRAs. In the old probe
API, we relied on the implicit order of checks to know that if a model
matched for pipeline _and_ LoRA, we prefer the pipeline match. But, in
the new API, we do not have this implicit ordering of checks. To resolve
this in a resilient way, we need to get all matches up front, then use
tie-breaker logic to figure out which should win (or add "differential
diagnosis" logic to the matchers).
- Field overrides weren't handled well by this strategy. They were only
applied at the very end, if a model matched successfully. This means we
cannot tell the system "Hey, this model is type X with base Y. Trust me
bro.". We cannot override the match logic. As we move towards letting
users correct mis-identified models (see #8582), this is a requirement.

We can simplify the process significantly and better support "unknown"
models.

Firstly, model config classes now have a single `from_model_on_disk()`
method that attempts to construct an instance of the class from the
model files. This replaces the `matches()` and `parse()` methods.

If we fail to create the config instance, a special exception is raised
that indicates why we think the files cannot be identified as the given
model config class.

Next, the flow for model identification is a bit simpler:
- Derive all the common fields up-front (name, desc, hash, etc).
- Merge in overrides.
- Call `from_model_on_disk()` for every config class, passing in the
fields. Overrides are handled in this method.
- Record the results for each config class and choose the best one.

The identification logic is a bit more verbose, with the special
exceptions and handling of overrides, but it is very clear what is
happening.

The one downside I can think of for this strategy is we do need to check
every model type, instead of stopping at the first match. It's a bit
less efficient. In practice, however, this isn't a hot code path, and
the improved clarity is worth far more than perf optimizations that the
end user will likely never notice.

2025-10-13 10:30:05 +11:00

.dev_scripts

Apply black

2023-07-27 10:54:01 -04:00

.github

Revert "ci: add translation string check to frontend checks"

2025-09-08 11:20:55 +10:00

coverage

combine pytest.ini with pyproject.toml

2023-03-05 17:00:08 +00:00

docker

bugfix(docker) Ensure the correct extra install.

2025-07-17 04:19:22 +00:00

docs

docs: add BiRefNet and Image Export to communityNodes.md

2025-10-06 10:08:29 +11:00

invokeai

refactor(mm): simplify model classification process

2025-10-13 10:30:05 +11:00

scripts

chore: fix some comments

2025-08-14 09:32:54 +10:00

tests

refactor: port MM probes to new api

2025-10-13 10:30:05 +11:00

.dockerignore

refactor Dockerfile; get rid of multi-stage build; upgrade to python 3.12

2025-04-04 18:42:13 +11:00

.editorconfig

Merge dev into main for 2.2.0 (#1642 )

2022-11-30 16:12:23 -05:00

.git-blame-ignore-revs

Git blame ignore revs

2025-03-26 12:56:04 +11:00

.gitattributes

Stripped model data

2025-03-18 09:51:10 +11:00

.gitignore

repo: update ignores

2025-07-18 08:08:15 -04:00

.gitmodules

remove src directory, which is gumming up conda installs; addresses issue #77

2022-08-25 10:43:05 -04:00

.nvmrc

update nodes schema / typegen

2025-04-04 18:42:13 +11:00

.pre-commit-config.yaml

chore: update pre-commit syntax; add check for uv.lock needing an update

2025-04-15 07:41:32 +10:00

.prettierrc.yaml

feat: automated releases via github action

2024-02-29 21:57:20 -05:00

flake.lock

update flake (#7032 )

2024-10-08 10:55:49 +11:00

flake.nix

update flake (#7032 )

2024-10-08 10:55:49 +11:00

InvokeAI_Statement_of_Values.md

Add @ebr to Contributors (#2095 )

2022-12-21 14:33:08 -05:00

LICENSE

Update LICENSE

2023-07-05 23:46:27 -04:00

LICENSE-SD1+SD2.txt

updated LICENSE files and added information about watermarking

2023-07-26 17:27:33 -04:00

LICENSE-SDXL.txt

updated LICENSE files and added information about watermarking

2023-07-26 17:27:33 -04:00

Makefile

build: remove installer & convert installer build script to only build the wheel

2025-04-04 18:42:13 +11:00

mkdocs.yml

docs: add docs for low vram mode

2025-01-09 11:20:05 +11:00

pins.json

chore: bump torch to 2.7.0

2025-05-19 12:29:51 +10:00

pyproject.toml

fix: schema generation bug in fastapi 0.119.0

2025-10-12 08:18:03 -04:00

README.md

Update README.md

2024-12-20 17:01:34 +11:00

SECURITY.md

Create SECURITY.md

2024-11-25 04:10:03 -08:00

Stable_Diffusion_v1_Model_Card.md

Global replace [ \t]+$, add "GB" (#1751 )

2022-12-19 16:36:39 +00:00

test.json

refactor(ui): move model categorisation-ish logic to central location, simplify model manager models list

2025-10-13 10:30:03 +11:00

uv.lock

chore: uv lock

2025-10-12 08:18:03 -04:00

README.md

Invoke - Professional Creative AI Tools for Visual Media

To learn more about Invoke, or implement our Business solutions, visit invoke.com

Invoke is a leading creative engine built to empower professionals and enthusiasts alike. Generate and create stunning visual media using the latest AI-driven technologies. Invoke offers an industry leading web-based UI, and serves as the foundation for multiple commercial products.

Invoke is available in two editions:

Community Edition	Professional Edition
For users looking for a locally installed, self-hosted and self-managed service	For users or teams looking for a cloud-hosted, fully managed service
- Free to use under a commercially-friendly license	- Monthly subscription fee with three different plan levels
- Download and install on compatible hardware	- Offers additional benefits, including multi-user support, improved model training, and more
- Includes all core studio features: generate, refine, iterate on images, and build workflows	- Hosted in the cloud for easy, secure model access and scalability
Quick Start -> Installation and Updates	More Information -> www.invoke.com/pricing

Documentation

Quick Links
Installation and Updates - Documentation and Tutorials - Bug Reports - Contributing

Installation

To get started with Invoke, Download the Installer.

For detailed step by step instructions, or for instructions on manual/docker installations, visit our documentation on Installation and Updates

Troubleshooting, FAQ and Support

Please review our FAQ for solutions to common installation problems and other issues.

For more help, please join our Discord.

Features

Full details on features can be found in our documentation.

Web Server & UI

Invoke runs a locally hosted web server & React UI with an industry-leading user experience.

Unified Canvas

The Unified Canvas is a fully integrated canvas implementation with support for all core generation capabilities, in/out-painting, brush tools, and more. This creative tool unlocks the capability for artists to create with AI as a creative collaborator, and can be used to augment AI-generated imagery, sketches, photography, renders, and more.

Workflows & Nodes

Invoke offers a fully featured workflow management solution, enabling users to combine the power of node-based workflows with the easy of a UI. This allows for customizable generation pipelines to be developed and shared by users looking to create specific workflows to support their production use-cases.

Board & Gallery Management

Invoke features an organized gallery system for easily storing, accessing, and remixing your content in the Invoke workspace. Images can be dragged/dropped onto any Image-base UI element in the application, and rich metadata within the Image allows for easy recall of key prompts or settings used in your workflow.

Other features

Support for both ckpt and diffusers models
SD1.5, SD2.0, SDXL, and FLUX support
Upscaling Tools
Embedding Manager & Support
Model Manager & Support
Workflow creation & management
Node-Based Architecture

Contributing

Anyone who wishes to contribute to this project - whether documentation, features, bug fixes, code cleanup, testing, or code reviews - is very much encouraged to do so.

Get started with contributing by reading our contribution documentation, joining the #dev-chat or the GitHub discussion board.

We hope you enjoy using Invoke as much as we enjoy creating it, and we hope you will elect to become part of our community.

Thanks

Invoke is a combined effort of passionate and talented people from across the world. We thank them for their time, hard work and effort.