InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI.git synced 2026-04-23 03:00:31 -04:00

Author	SHA1	Message	Date
psychedelicious	454d05bbde	refactor: model manager v3 (#8607 ) * feat(mm): add UnknownModelConfig * refactor(ui): move model categorisation-ish logic to central location, simplify model manager models list * refactor(ui)refactor(ui): more cleanup of model categories * refactor(ui): remove unused excludeSubmodels I can't remember what this was for and don't see any reference to it. Maybe it's just remnants from a previous implementation? * feat(nodes): add unknown as model base * chore(ui): typegen * feat(ui): add unknown model base support in ui * feat(ui): allow changing model type in MM, fix up base and variant selects * feat(mm): omit model description instead of making it "base type filename model" * feat(app): add setting to allow unknown models * feat(ui): allow changing model format in MM * feat(app): add the installed model config to install complete events * chore(ui): typegen * feat(ui): toast warning when installed model is unidentified * docs: update config docstrings * chore(ui): typegen * tests(mm): fix test for MM, leave the UnknownModelConfig class in the list of configs * tidy(ui): prefer types from zod schemas for model attrs * chore(ui): lint * fix(ui): wrong translation string * feat(mm): normalized model storage Store models in a flat directory structure. Each model is in a dir named its unique key (a UUID). Inside that dir is either the model file or the model dir. * feat(mm): add migration to flat model storage * fix(mm): normalized multi-file/diffusers model installation no worky now worky * refactor: port MM probes to new api - Add concept of match certainty to new probe - Port CLIP Embed models to new API - Fiddle with stuff * feat(mm): port TIs to new API * tidy(mm): remove unused probes * feat(mm): port spandrel to new API * fix(mm): parsing for spandrel * fix(mm): loader for clip embed * fix(mm): tis use existing weight_files method * feat(mm): port vae to new API * fix(mm): vae class inheritance and config_path * tidy(mm): patcher types and import paths * feat(mm): better errors when invalid model config found in db * feat(mm): port t5 to new API * feat(mm): make config_path optional * refactor(mm): simplify model classification process Previously, we had a multi-phase strategy to identify models from their files on disk: 1. Run each model config classes' `matches()` method on the files. It checks if the model could possibly be an identified as the candidate model type. This was intended to be a quick check. Break on the first match. 2. If we have a match, run the config class's `parse()` method. It derive some additional model config attrs from the model files. This was intended to encapsulate heavier operations that may require loading the model into memory. 3. Derive the common model config attrs, like name, description, calculate the hash, etc. Some of these are also heavier operations. This strategy has some issues: - It is not clear how the pieces fit together. There is some back-and-forth between different methods and the config base class. It is hard to trace the flow of logic until you fully wrap your head around the system and therefore difficult to add a model architecture to the probe. - The assumption that we could do quick, lightweight checks before heavier checks is incorrect. We often _must_ load the model state dict in the `matches()` method. So there is no practical perf benefit to splitting up the responsibility of `matches()` and `parse()`. - Sometimes we need to do the same checks in `matches()` and `parse()`. In these cases, splitting the logic is has a negative perf impact because we are doing the same work twice. - As we introduce the concept of an "unknown" model config (i.e. a model that we cannot identify, but still record in the db; see #8582), we will _always_ run _all_ the checks for every model. Therefore we need not try to defer heavier checks or resource-intensive ops like hashing. We are going to do them anyways. - There are situations where a model may match multiple configs. One known case are SD pipeline models with merged LoRAs. In the old probe API, we relied on the implicit order of checks to know that if a model matched for pipeline _and_ LoRA, we prefer the pipeline match. But, in the new API, we do not have this implicit ordering of checks. To resolve this in a resilient way, we need to get all matches up front, then use tie-breaker logic to figure out which should win (or add "differential diagnosis" logic to the matchers). - Field overrides weren't handled well by this strategy. They were only applied at the very end, if a model matched successfully. This means we cannot tell the system "Hey, this model is type X with base Y. Trust me bro.". We cannot override the match logic. As we move towards letting users correct mis-identified models (see #8582), this is a requirement. We can simplify the process significantly and better support "unknown" models. Firstly, model config classes now have a single `from_model_on_disk()` method that attempts to construct an instance of the class from the model files. This replaces the `matches()` and `parse()` methods. If we fail to create the config instance, a special exception is raised that indicates why we think the files cannot be identified as the given model config class. Next, the flow for model identification is a bit simpler: - Derive all the common fields up-front (name, desc, hash, etc). - Merge in overrides. - Call `from_model_on_disk()` for every config class, passing in the fields. Overrides are handled in this method. - Record the results for each config class and choose the best one. The identification logic is a bit more verbose, with the special exceptions and handling of overrides, but it is very clear what is happening. The one downside I can think of for this strategy is we do need to check every model type, instead of stopping at the first match. It's a bit less efficient. In practice, however, this isn't a hot code path, and the improved clarity is worth far more than perf optimizations that the end user will likely never notice. * refactor(mm): remove unused methods in config.py * refactor(mm): add model config parsing utils * fix(mm): abstractmethod bork * tidy(mm): clarify that model id utils are private * fix(mm): fall back to UnknownModelConfig correctly * feat(mm): port CLIPVisionDiffusersConfig to new api * feat(mm): port SigLIPDiffusersConfig to new api * feat(mm): make match helpers more succint * feat(mm): port flux redux to new api * feat(mm): port ip adapter to new api * tidy(mm): skip optimistic override handling for now * refactor(mm): continue iterating on config * feat(mm): port flux "control lora" and t2i adapter to new api * tidy(ui): use Extract to get model config types * fix(mm): t2i base determination * feat(mm): port cnet to new api * refactor(mm): add config validation utils, make it all consistent and clean * feat(mm): wip port of main models to new api * feat(mm): wip port of main models to new api * feat(mm): wip port of main models to new api * docs(mm): add todos * tidy(mm): removed unused model merge class * feat(mm): wip port main models to new api * tidy(mm): clean up model heuristic utils * tidy(mm): clean up ModelOnDisk caching * tidy(mm): flux lora format util * refactor(mm): make config classes narrow Simpler logic to identify, less complexity to add new model, fewer useless attrs that do not relate to the model arch, etc * refactor(mm): diffusers loras w * feat(mm): consistent naming for all model config classes * fix(mm): tag generation & scattered probe fixes * tidy(mm): consistent class names * refactor(mm): split configs into separate files * docs(mm): add comments for identification utils * chore(ui): typegen * refactor(mm): remove legacy probe, new configs dir structure, update imports * fix(mm): inverted condition * docs(mm): update docsstrings in factory.py * docs(mm): document flux variant attr * feat(mm): add helper method for legacy configs * feat(mm): satisfy type checker in flux denoise * docs(mm): remove extraneous comment * fix(mm): ensure unknown model configs get unknown attrs * fix(mm): t5 identification * fix(mm): sdxl ip adapter identification * feat(mm): more flexible config matching utils * fix(mm): clip vision identification * feat(mm): add sanity checks before probing paths * docs(mm): add reminder for self for field migrations * feat(mm): clearer naming for main config class hierarchy * feat(mm): fix clip vision starter model bases, add ref to actual models * feat(mm): add model config schema migration logic * fix(mm): duplicate import * refactor(mm): split big migration into 3 Split the big migration that did all of these things into 3: - Migration 22: Remove unique contraint on base/name/type in models table - Migration 23: Migrate configs to v6.8.0 schemas - Migration 24: Normalize file storage * fix(mm): pop base/type/format when creating unknown model config * fix(db): migration 22 insert only real cols * fix(db): migration 23 fall back to unknown model when config change fails * feat(db): run migrations 23 and 24 * fix(mm): false negative on flux lora * fix(mm): vae checkpoint probe checking for dir instead of file * fix(mm): ModelOnDisk skips dirs when looking for weights Previously a path w/ any of the known weights suffixes would be seen as a weights file, even if it was a directory. We now check to ensure the candidate path is actually a file before adding it to the list of weights. * feat(mm): add method to get main model defaults from a base * feat(mm): do not log when multiple non-unknown model matches * refactor(mm): continued iteration on model identifcation * tests(mm): refactor model identification tests Overhaul of model identification (probing) tests. Previously we didn't test the correctness of probing except in a few narrow cases - now we do. See tests/model_identification/README.md for a detailed overview of the new test setup. It includes instructions for adding a new test case. In brief: - Download the model you want to add as a test case - Run a script against it to generate the test model files - Fill in the expected model type/format/base/etc in the generated test metadata JSON file Included test cases: - All starter models - A handful of other models that I had installed - Models present in the previous test cases as smoke tests, now also tested for correctness * fix(mm): omit type/format/base when creating unknown config instance * feat(mm): use ValueError for model id sanity checks * feat(mm): add flag for updating models to allow class changes * tests(mm): fix remaining MM tests * feat: allow users to edit models freely * feat(ui): add warning for model settings edit * tests(mm): flux state dict tests * tidy: remove unused file * fix(mm): lora state dict loading in model id * feat(ui): use translation string for model edit warning * docs(db): update version numbers in migration comments * chore: bump version to v6.9.0a1 * docs: update model id readme * tests(mm): attempt to fix windows model id tests * fix(mm): issue with deleting single file models * feat(mm): just delete the dir w/ rmtree when deleting model * tests(mm): windows CI issue * fix(ui): typegen schema sync * fix(mm): fixes for migration 23 - Handle CLIP Embed and Main SD models missing variant field - Handle errors when calling the discriminator function, previously only handled ValidationError but it could be a ValueError or something else - Better logging for config migration * chore: bump version to v6.9.0a2 * chore: bump version to v6.9.0a3	2025-10-15 10:18:53 +11:00
psychedelicious	3b0d59e459	tests(app): update mm tests to test updated behaviour	2025-07-29 16:08:15 +10:00
Kent Keirsey	070eef3eff	remove whitespace	2025-07-22 07:33:48 +10:00
Kent Keirsey	dd35ab026a	update logic and remove bad test	2025-07-22 07:33:48 +10:00
Cursor Agent	7fc06db8ad	Add LoRA model metadata extraction from JSON and PNG files Co-authored-by: kent <kent@invoke.ai>	2025-07-22 07:33:48 +10:00
psychedelicious	df81f3274a	feat(nodes): improved pydantic type annotation massaging When we do our field type overrides to allow invocations to be instantiated without all required fields, we were not modifying the annotation of the field but did set the default value of the field to `None`. This results in an error when doing a ser/de round trip. Here's what we end up doing: ```py from pydantic import BaseModel, Field class MyModel(BaseModel): foo: str = Field(default=None) ``` And here is a simple round-trip, which should not error but which does: ```py MyModel(**MyModel().model_dump()) # ValidationError: 1 validation error for MyModel # foo # Input should be a valid string [type=string_type, input_value=None, input_type=NoneType] # For further information visit https://errors.pydantic.dev/2.11/v/string_type ``` To fix this, we now check every incoming field and update its annotation to match its default value. In other words, when we override the default field value to `None`, we make its type annotation `<original type> \| None`. This prevents the error during deserialization. This slightly alters the schema for all invocations and outputs - the values of all fields without default values are now typed as `<original type> \| None`, reflecting the overrides. This means the autogenerated types for fields have also changed for fields without defaults: ```ts // Old image?: components["schemas"]["ImageField"]; // New image?: components["schemas"]["ImageField"] \| null; ``` This does not break anything on the frontend.	2025-05-14 06:55:15 +10:00
Kevin Turner	52a8ad1c18	chore: rename model.size to model.file_size to disambiguate from RAM size or pixel size	2025-04-10 09:53:03 +10:00
Kevin Turner	98260a8efc	test: add size field to test model configs	2025-04-10 09:53:03 +10:00
Billy	a6b94e8ca4	Revert some files	2025-03-26 13:18:50 +11:00
Billy	182580ff69	Imports	2025-03-26 12:55:10 +11:00
Billy	8e9d5c1187	Ruff formatting	2025-03-26 12:30:31 +11:00
Billy	99aac5870e	Remove star imports	2025-03-26 12:27:00 +11:00
psychedelicious	b9c7bc8b0e	chore: ruff	2025-03-07 11:45:49 +11:00
psychedelicious	0f45ee04a2	tests: fix test_extract_valid_metadata_from_image to accomodate prev commit	2025-03-07 11:45:49 +11:00
psychedelicious	8e46b03f09	tests: add tests for extract_metadata_from_image	2025-03-07 11:45:49 +11:00
psychedelicious	cf0cbaf0ae	chore: ruff (more)	2025-03-06 10:57:54 +11:00
psychedelicious	ac6fc6eccb	chore: ruff	2025-03-06 10:57:54 +11:00
Billy	f2689598c0	Formatting	2025-03-06 09:11:00 +11:00
psychedelicious	c1cf01a038	tests: use dangerously_run_function_in_subprocess to fix configure_torch_cuda_allocator tests	2025-03-06 07:49:35 +11:00
psychedelicious	d037d8f9aa	tests: update tests for configure_torch_cuda_allocator	2025-03-06 07:49:35 +11:00
Ryan Dick	49191709a0	Mark test_configure_torch_cuda_allocator_raises_if_torch_is_already_imported() to only run if CUDA is available.	2025-02-28 21:39:09 +00:00
Ryan Dick	a36a627f83	Switch from use_cuda_malloc flag to a general pytorch_cuda_alloc_conf config field that allows full customization of the CUDA allocator.	2025-02-28 21:39:09 +00:00
Ryan Dick	b31c71f302	Simplify is_torch_cuda_malloc_enabled() implementation and add unit tests.	2025-02-28 21:39:09 +00:00
psychedelicious	868e06eb8b	tests: fix test_model_install.py	2025-01-03 11:21:23 -05:00
Mary Hipp	1640ea0298	(pytest) add missing arg for mocked context	2024-11-15 05:49:05 +11:00
Mary Hipp	0c32ae9775	(pytest) fix import	2024-11-15 05:49:05 +11:00
Mary Hipp	571faf6d7c	(pytest) add queue_item and invocation to data in context for test	2024-11-15 05:49:05 +11:00
Ryan Dick	d20b894a61	Add cfg_scale_start_step and cfg_scale_end_step to FLUX Denoise node.	2024-10-23 07:59:48 +11:00
psychedelicious	5440c03767	fix(app): directory traversal when deleting images	2024-10-18 14:27:41 +11:00
Lincoln Stein	633bbb4e85	[MM2] Use typed ModelRecordChanges for model_install() rather than untyped dict (#6645 ) * [MM2] replace untyped config dict passed to install_model with typed ModelRecordChanges - adjusted frontend to work with new schema - used this facility to assign "starter model" names and descriptions to the installed models. * documentation fix * [MM2] replace untyped config dict passed to install_model with typed ModelRecordChanges - adjusted frontend to work with new schema - used this facility to assign "starter model" names and descriptions to the installed models. * documentation fix * remove v9 pnpm lockfile * [MM2] replace untyped config dict passed to install_model with typed ModelRecordChanges - adjusted frontend to work with new schema - used this facility to assign "starter model" names and descriptions to the installed models. * [MM2] replace untyped config dict passed to install_model with typed ModelRecordChanges - adjusted frontend to work with new schema - used this facility to assign "starter model" names and descriptions to the installed models. * remove v9 pnpm lockfile * regenerate schema.ts * prettified --------- Co-authored-by: Lincoln Stein <lstein@gmail.com>	2024-07-23 21:41:00 +00:00
Ryan Dick	69af099532	Warn on invalid model configs in the DB rather than crashing.	2024-07-11 21:05:55 -04:00
Lincoln Stein	0fe92cd406	[MM bugfix] Put model install errors on the event bus (#6578 ) * fix access token lookup * fix bug preventing model install error events from being reported --------- Co-authored-by: Lincoln Stein <lstein@gmail.com>	2024-07-03 22:44:34 -04:00
psychedelicious	41f46b846b	chore: ruff	2024-06-28 10:36:05 +10:00
psychedelicious	6dfc406c52	tests: update `test_bulk_download.py` after addition of `archived` field	2024-06-28 10:36:05 +10:00
psychedelicious	e26125b734	tests: fix test_model_install.py	2024-06-17 10:57:11 +10:00
Lincoln Stein	dc134935c8	replace load_and_cache_model() with load_remote_model() and load_local_odel()	2024-06-07 14:12:16 +10:00
Lincoln Stein	9f9379682e	ruff fixes	2024-06-07 13:54:41 +10:00
Lincoln Stein	f81b8bc9f6	add support for generic loading of diffusers directories	2024-06-07 13:54:30 +10:00
Lincoln Stein	ead1748c54	issue a download progress event when install download starts	2024-05-28 19:30:42 -04:00
Lincoln Stein	34e1eb19f9	merge with main and resolve conflicts	2024-05-27 22:20:34 -04:00
psychedelicious	5b390bb11c	tests: clean up tests after events changes	2024-05-27 09:06:02 +10:00
psychedelicious	4756920282	tests: move fixtures import to conftest.py	2024-05-27 09:06:02 +10:00
psychedelicious	a876675448	tests: update tests to use new events	2024-05-27 09:06:02 +10:00
psychedelicious	9bd78823a3	refactor(events): use pydantic schemas for events Our events handling and implementation has a couple pain points: - Adding or removing data from event payloads requires changes wherever the events are dispatched from. - We have no type safety for events and need to rely on string matching and dict access when interacting with events. - Frontend types for socket events must be manually typed. This has caused several bugs. `fastapi-events` has a neat feature where you can create a pydantic model as an event payload, give it an `__event_name__` attr, and then dispatch the model directly. This allows us to eliminate a layer of indirection and some unpleasant complexity: - Event handler callbacks get type hints for their event payloads, and can use `isinstance` on them if needed. - Event payload construction is now the responsibility of the event itself (a pydantic model), not the service. Every event model has a `build` class method, encapsulating this logic. The build methods are provided as few args as possible. For example, `InvocationStartedEvent.build()` gets the invocation instance and queue item, and can choose the data it wants to include in the event payload. - Frontend event types may be autogenerated from the OpenAPI schema. We use the payload registry feature of `fastapi-events` to collect all payload models into one place, making it trivial to keep our schema and frontend types in sync. This commit moves the backend over to this improved event handling setup.	2024-05-27 09:06:02 +10:00
Lincoln Stein	e77c7e40b7	fix ruff error	2024-05-17 22:53:45 -04:00
Lincoln Stein	8aebc29b91	fix test to run on 32bit cpu	2024-05-17 22:48:54 -04:00
Lincoln Stein	d968c6f379	refactor multifile download code	2024-05-17 22:29:19 -04:00
Lincoln Stein	2dae5eb7ad	more refactoring; HF subfolders not working	2024-05-16 22:26:18 -04:00
Lincoln Stein	911a24479b	add tests for model install file size reporting	2024-05-16 07:18:33 -04:00
Lincoln Stein	f29c406fed	refactor model_install to work with refactored download queue	2024-05-13 22:49:15 -04:00

1 2 3

149 Commits