InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI.git synced 2026-01-25 13:57:57 -05:00

Author	SHA1	Message	Date
Ryan Dick	a36a627f83	Switch from use_cuda_malloc flag to a general pytorch_cuda_alloc_conf config field that allows full customization of the CUDA allocator.	2025-02-28 21:39:09 +00:00
Ryan Dick	b31c71f302	Simplify is_torch_cuda_malloc_enabled() implementation and add unit tests.	2025-02-28 21:39:09 +00:00
Ryan Dick	5302d4890f	Add use_cuda_malloc config option.	2025-02-28 21:39:09 +00:00
Ryan Dick	766b752572	Add utils for configuring the torch CUDA allocator.	2025-02-28 21:39:09 +00:00
Ryan Dick	da2b6815ac	Make InvokeAILogger an inline import in startup_utils.py in response to review comment.	2025-02-28 20:10:24 +00:00
Ryan Dick	38991ffc35	Add register_mime_types() startup util.	2025-02-28 20:10:24 +00:00
Ryan Dick	f345c0fabc	Create an apply_monkeypatches() start util.	2025-02-28 20:10:24 +00:00
Ryan Dick	35910d3952	Move check_cudnn() and jurigged setup to startup_utils.py.	2025-02-28 20:08:53 +00:00
Ryan Dick	6f1dcf385b	Move find_port() util to its own file.	2025-02-28 20:08:53 +00:00
Ryan Dick	b301785dc8	Normalize the T5 model identifiers so that a FLUX T5 or an SD3 T5 model can be used interchangeably.	2025-01-16 08:33:58 +11:00
Brandon Rising	e75903389f	Run ruff, fix bug in hf downloading code which failed to download parts of a model	2024-11-04 12:42:09 -05:00
Brandon Rising	27567052f2	Create new latent factors for sd35	2024-11-04 12:42:09 -05:00
Ryan Dick	9361ed9d70	Add progress images to SD3 and make denoising cancellable.	2024-11-04 12:42:09 -05:00
psychedelicious	a6f93d3862	feat(app): use new signal_progress for denoising - Update the step callback methods in the invocation API to use the new signal_progress API - Copy and update the `calc_percentage`, reducing special handling for step and total_steps - a followup commit will fix callers of the step callbacks	2024-09-22 21:20:32 +03:00
Brandon Rising	69f080fb75	Move flux step callback code into the step_callback util scripts, use other services within the invocation context	2024-09-03 14:04:16 -04:00
psychedelicious	f66584713c	fix(api): sort OpenAPI schema properties for InvocationOutputMap This makes the schema output deterministic!	2024-08-10 07:45:23 -04:00
Ryan Dick	9da5925287	Add ruff rule to disallow relative parent imports.	2024-07-04 09:35:37 -04:00
Ryan Dick	5301770525	Add naive ControlNet support to TiledStableDiffusionRefineInvocation	2024-06-25 11:31:52 -07:00
Lincoln Stein	2276f327e5	Merge branch 'main' into lstein/feat/simple-mm2-api	2024-06-02 09:45:31 -04:00
psychedelicious	5beec8211a	feat(api): sort openapi schemas Reduces the constant changes to the frontend client types due to inconsistent ordering of pydantic models.	2024-05-30 12:03:38 +10:00
psychedelicious	2f9ebdec69	fix(app): openapi schema generation Some tech debt related to dynamic pydantic schemas for invocations became problematic. Including the invocations and results in the event schemas was breaking pydantic's handling of ref schemas. I don't really understand why - I think it's a pydantic bug in a remote edge case that we are hitting. After many failed attempts I landed on this implementation, which is actually much tidier than what was in there before. - Create pydantic-enabled types for `AnyInvocation` and `AnyInvocationOutput` and use these in place of the janky dynamic unions. Actually, they are kinda the same, but better encapsulated. Use these in `Graph`, `GraphExecutionState`, `InvocationEventBase` and `InvocationCompleteEvent`. - Revise the custom openapi function to work with the new models. - Split out the custom openapi function to a separate file. Add a `post_transform` callback so consumers can customize the output schema. - Update makefile scripts.	2024-05-30 12:03:03 +10:00
Lincoln Stein	34e1eb19f9	merge with main and resolve conflicts	2024-05-27 22:20:34 -04:00
psychedelicious	0f733c42fc	fix(events): fix denoise progress percentage - Restore calculation of step percentage but in the backend instead of client - Simplify signatures for denoise progress event callbacks - Clean up `step_callback.py` (types, do not recreate constant matrix on every step, formatting)	2024-05-27 09:06:02 +10:00
psychedelicious	9bd78823a3	refactor(events): use pydantic schemas for events Our events handling and implementation has a couple pain points: - Adding or removing data from event payloads requires changes wherever the events are dispatched from. - We have no type safety for events and need to rely on string matching and dict access when interacting with events. - Frontend types for socket events must be manually typed. This has caused several bugs. `fastapi-events` has a neat feature where you can create a pydantic model as an event payload, give it an `__event_name__` attr, and then dispatch the model directly. This allows us to eliminate a layer of indirection and some unpleasant complexity: - Event handler callbacks get type hints for their event payloads, and can use `isinstance` on them if needed. - Event payload construction is now the responsibility of the event itself (a pydantic model), not the service. Every event model has a `build` class method, encapsulating this logic. The build methods are provided as few args as possible. For example, `InvocationStartedEvent.build()` gets the invocation instance and queue item, and can choose the data it wants to include in the event payload. - Frontend event types may be autogenerated from the OpenAPI schema. We use the payload registry feature of `fastapi-events` to collect all payload models into one place, making it trivial to keep our schema and frontend types in sync. This commit moves the backend over to this improved event handling setup.	2024-05-27 09:06:02 +10:00
Lincoln Stein	bb04f496e0	Merge branch 'main' into lstein/feat/simple-mm2-api	2024-04-28 11:33:26 -04:00
psychedelicious	398f37c0ed	tidy(backend): clean up controlnet_utils - Use our adaptation of the HWC3 function with better types - Extract some of the util functions, name them better, add comments - Improve type annotations - Remove unreachable codepaths	2024-04-25 13:20:09 +10:00
psychedelicious	5b8f77f990	tidy(nodes): move cnet mode literals to utils Now they can be used in type signatures without circular imports.	2024-04-25 13:20:09 +10:00
Lincoln Stein	3ddd7ced49	change names of convert and download caches and add migration script	2024-04-14 15:57:33 -04:00
psychedelicious	9a5575b46b	feat(mm): move HF token helper to route	2024-03-20 15:05:25 +11:00
psychedelicious	813e679b77	feat: add `hf_login` util This provides a simple way to provide a HF token. If HF reports no valid token, one is prompted for until a valid token is provided, or the user presses Ctrl + C to cancel.	2024-03-20 15:05:25 +11:00
psychedelicious	857e9c9b5f	feat: add `SuppressOutput` util This context manager suppresses/hides stdout.	2024-03-20 15:05:25 +11:00
psychedelicious	fabef8b45b	feat(mm): download upscaling & lama models as they are requested	2024-03-20 15:05:25 +11:00
psychedelicious	528ac5dd25	refactor(nodes): model identifiers - All models are identified by a key and optionally a submodel type via new model `ModelField`. Previously, a few model types had their own class, but not all of them. This inconsistency just added complexity without any benefit. - Update all invocation to use the new format. - In the node API, models are loaded by key or an instance of `ModelField` as a convenience. - Add an enriched model schema for metadata. It includes key, hash, name, base and type.	2024-03-07 10:56:59 +11:00
psychedelicious	44c40d7d1a	refactor(mm): remove unused metadata logic, fix tests - Metadata is merged with the config. We can simplify the MM substantially and remove the handling for metadata. - Per discussion, we don't have an ETA for frontend implementation of tags, and with the realization that the tags from CivitAI are largely useless, there's no reason to keep tags in the MM right now. When we are ready to implement tags on the frontend, we can refer back to the implementation here and use it if it supports the design. - Fix all tests.	2024-03-05 23:50:19 +11:00
psychedelicious	0b0128647b	feat(nodes): revise model load API args	2024-03-01 10:42:33 +11:00
Brandon Rising	c670dacc29	Ruff format	2024-03-01 10:42:33 +11:00
Brandon Rising	f475b78734	Ruff check	2024-03-01 10:42:33 +11:00
Brandon Rising	ca9b815c89	Extract TI loading logic into util, disallow it from ever failing a generation	2024-03-01 10:42:33 +11:00
psychedelicious	18adcc1dd2	feat(nodes): add whole queue_item to InvocationContextData No reason to not have the whole thing in there.	2024-03-01 10:42:33 +11:00
psychedelicious	725c03cf87	refactor(nodes): merge processors Consolidate graph processing logic into session processor. With graphs as the unit of work, and the session queue distributing graphs, we no longer need the invocation queue or processor. Instead, the session processor dequeues the next session and processes it in a simple loop, greatly simplifying the app. - Remove `graph_execution_manager` service. - Remove `queue` (invocation queue) service. - Remove `processor` (invocation processor) service. - Remove queue-related logic from `Invoker`. It now only starts and stops the services, providing them with access to other services. - Remove unused `invocation_retrieval_error` and `session_retrieval_error` events, these are no longer needed. - Clean up stats service now that it is less coupled to the rest of the app. - Refactor cancellation logic - cancellations now originate from session queue (i.e. HTTP cancel endpoint) and are emitted as events. Processor gets the events and sets the canceled event. Access to this event is provided to the invocation context for e.g. the step callback. - Remove `sessions` router; it provided access to `graph_executions` but that no longer exists.	2024-03-01 10:42:33 +11:00
psychedelicious	539570cc7a	feat(nodes): update invocation context for mm2, update nodes model usage	2024-03-01 10:42:33 +11:00
Lincoln Stein	3e330d7d9d	fix a number of typechecking errors	2024-03-01 10:42:33 +11:00
psychedelicious	8637c40661	feat(nodes): update all invocations to use new invocation context Update all invocations to use the new context. The changes are all fairly simple, but there are a lot of them. Supporting minor changes: - Patch bump for all nodes that use the context - Update invocation processor to provide new context - Minor change to `EventServiceBase` to accept a node's ID instead of the dict version of a node - Minor change to `ModelManagerService` to support the new wrapped context - Finagling of imports to avoid circular dependencies	2024-03-01 10:42:33 +11:00
psychedelicious	3d98446d5d	feat(nodes): restricts invocation context power Creates a low-power `InvocationContext` with simplified methods and data. See `invocation_context.py` for detailed comments.	2024-03-01 10:42:33 +11:00
psychedelicious	4602efd598	feat: add profiler util (#5601) * feat(config): add profiling config settings - `profile_graphs` enables graph profiling with cProfile - `profiles_dir` sets the output for profiles * feat(nodes): add Profiler util Simple wrapper around cProfile. * feat(nodes): use Profiler in invocation processor * scripts: add generate_profile_graphs.sh script Helper to generate graphs for profiles. * pkg: add snakeviz and gprof2dot to dev deps These are useful for profiling. * tests: add tests for profiler util * fix(profiler): handle previous profile not stopped cleanly * feat(profiler): add profile_prefix config setting The prefix is used when writing profile output files. Useful to organise profiles into sessions. * tidy(profiler): add `_` to private API * feat(profiler): simplify API * feat(profiler): use child logger for profiler logs * chore(profiler): update docstrings * feat(profiler): stop() returns output path * chore(profiler): fix docstring * tests(profiler): update tests * chore: ruff	2024-01-31 10:51:57 +00:00
Brandon	32ad742f3e	Ti trigger from prompt util (#5294) * Pull logic for extracting TI triggers into a util function * Remove duplicate regex for ti triggers * Fix linting for ruff * Remove unused imports	2023-12-22 03:04:44 +00:00
psychedelicious	513fceac82	chore: ruff check - fix pycodestyle	2023-11-11 10:55:33 +11:00
psychedelicious	99a8ebe3a0	chore: ruff check - fix flake8-bugbear	2023-11-11 10:55:28 +11:00
psychedelicious	c238a7f18b	feat(api): chore: pydantic & fastapi upgrade Upgrade pydantic and fastapi to latest. - pydantic~=2.4.2 - fastapi~=103.2 - fastapi-events~=0.9.1 Big Changes There are a number of logic changes needed to support pydantic v2. Most changes are very simple, like using the new methods to serialize and deserialize models, but there are a few more complex changes. Invocations The biggest change relates to invocation creation, instantiation and validation. Because pydantic v2 moves all validation logic into the rust pydantic-core, we may no longer directly stick our fingers into the validation pie. Previously, we (ab)used models and fields to allow invocation fields to be optional at instantiation, but required when `invoke()` is called. We directly manipulated the fields and invocation models when calling `invoke()`. With pydantic v2, this is much more involved. Changes to the python wrapper do not propagate down to the rust validation logic - you have to rebuild the model. This causes problems with concurrent access to the invocation classes and is not a free operation. This logic has been totally refactored and we do not need to change the model any more. The details are in `baseinvocation.py`, in the `InputField` function and `BaseInvocation.invoke_internal()` method. In the end, this implementation is cleaner. Invocation Fields In pydantic v2, you can no longer directly add or remove fields from a model. Previously, we did this to add the `type` field to invocations. Invocation Decorators With pydantic v2, we instead use the imperative `create_model()` API to create a new model with the additional field. This is done in `baseinvocation.py` in the `invocation()` wrapper. A similar technique is used for `invocation_output()`. Minor Changes There are a number of minor changes around the pydantic v2 models API. 
Protected `model_` Namespace All models' pydantic-provided methods and attributes are prefixed with `model_` and this is considered a protected namespace. This causes some conflict, because "model" means something to us, and we have a ton of pydantic models with attributes starting with "model_". Fortunately, there are no direct conflicts. However, in any pydantic model where we define an attribute or method that starts with "model_", we must set the protected namespaces to an empty tuple. ```py class IPAdapterModelField(BaseModel): model_name: str = Field(description="Name of the IP-Adapter model") base_model: BaseModelType = Field(description="Base model") model_config = ConfigDict(protected_namespaces=()) ``` Model Serialization Pydantic models no longer have `Model.dict()` or `Model.json()`. Instead, we use `Model.model_dump()` or `Model.model_dump_json()`. Model Deserialization Pydantic models no longer have `Model.parse_obj()` or `Model.parse_raw()`, and there are no `parse_raw_as()` or `parse_obj_as()` functions. Instead, you need to create a `TypeAdapter` object to parse python objects or JSON into a model. ```py adapter_graph = TypeAdapter(Graph) deserialized_graph_from_json = adapter_graph.validate_json(graph_json) deserialized_graph_from_dict = adapter_graph.validate_python(graph_dict) ``` Field Customisation Pydantic `Field`s no longer accept arbitrary args. Now, you must put all additional arbitrary args in a `json_schema_extra` arg on the field. Schema Customisation FastAPI and pydantic schema generation now follows the OpenAPI version 3.1 spec. This necessitates two changes: - Our schema customization logic has been revised - Schema parsing to build node templates has been revised The specifics aren't important, but this does present additional surface area for bugs. Performance Improvements Pydantic v2 is a full rewrite with a rust backend. This offers a substantial performance improvement (pydantic claims 5x to 50x depending on the task). 
We'll notice this the most during serialization and deserialization of sessions/graphs, which happens very very often - a couple times per node. I haven't done any benchmarks, but anecdotally, graph execution is much faster. Also, very large graphs - like with massive iterators - are much, much faster.	2023-10-17 14:59:25 +11:00
psychedelicious	402cf9b0ee	feat: refactor services folder/module structure Refactor services folder/module structure. Motivation While working on our services I've repeatedly encountered circular imports and a general lack of clarity regarding where to put things. The structure introduced goes a long way towards resolving those issues, setting us up for a clean structure going forward. Services Services are now in their own folder with a few files: - `services/{service_name}/__init__.py`: init as needed, mostly empty now - `services/{service_name}/{service_name}_base.py`: the base class for the service - `services/{service_name}/{service_name}_{impl_type}.py`: the default concrete implementation of the service - typically one of `sqlite`, `default`, or `memory` - `services/{service_name}/{service_name}_common.py`: any common items - models, exceptions, utilities, etc Though it's a bit verbose to have the service name both as the folder name and the prefix for files, I found it is _extremely_ confusing to have all of the base classes just be named `base.py`. So, at the cost of some verbosity when importing things, I've included the service name in the filename. There are some minor logic changes. For example, in `InvocationProcessor`, instead of assigning the model manager service to a variable to be used later in the file, the service is used directly via the `Invoker`. Shared Things that are used across disparate services are in `services/shared/`: - `default_graphs.py`: previously in `services/` - `graphs.py`: previously in `services/` - `pagination`: generic pagination models used in a few services - `sqlite`: the `SqliteDatabase` class, other sqlite-specific things	2023-10-12 12:15:06 -04:00
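Commit 4602efd598 above describes the profiler util as a simple wrapper around cProfile whose `stop()` returns the output path, with a prefix applied to output file names and handling for a previous profile that was not stopped cleanly. The following is only a minimal sketch of that idea, not the code from the commit: the class layout, the `output_dir` and `prefix` parameter names, and the `.prof` file naming are all assumptions made for illustration.

```python
import cProfile
import tempfile
from pathlib import Path
from typing import Optional


class Profiler:
    """Hypothetical thin wrapper around cProfile (illustrative, not InvokeAI's implementation)."""

    def __init__(self, output_dir: Path, prefix: Optional[str] = None) -> None:
        self._output_dir = output_dir  # assumed stand-in for the `profiles_dir` setting
        self._prefix = prefix  # assumed stand-in for the `profile_prefix` setting
        self._profiler: Optional[cProfile.Profile] = None
        self._profile_id: Optional[str] = None

    def start(self, profile_id: str) -> None:
        # If a previous profile was never stopped, stop it first so its data is written out.
        if self._profiler is not None:
            self.stop()
        self._profile_id = profile_id
        self._profiler = cProfile.Profile()
        self._profiler.enable()

    def stop(self) -> Path:
        if self._profiler is None or self._profile_id is None:
            raise RuntimeError("Profiler was not started")
        self._profiler.disable()
        name = f"{self._prefix}_{self._profile_id}" if self._prefix else self._profile_id
        path = self._output_dir / f"{name}.prof"
        self._output_dir.mkdir(parents=True, exist_ok=True)
        self._profiler.dump_stats(str(path))
        # Reset state so the wrapper can be reused for the next profile.
        self._profiler = None
        self._profile_id = None
        return path


# Usage: profile a small workload and get back the path of the written stats file.
profiler = Profiler(output_dir=Path(tempfile.mkdtemp()), prefix="session1")
profiler.start("graph_abc")
sum(i * i for i in range(10_000))
output_path = profiler.stop()
print(output_path.name)  # session1_graph_abc.prof
```

The resulting file is in the standard cProfile stats format, so it can be opened with `pstats` or with the snakeviz and gprof2dot tools the commit adds to the dev dependencies.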

77 Commits