InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI.git synced 2026-04-23 03:00:31 -04:00

Author	SHA1	Message	Date
mauwii	39715017f9	update pyproject.toml	2023-02-26 21:26:44 +01:00
mauwii	35518542f8	add .vscode files	2023-02-26 21:25:45 +01:00
mauwii	0aa1106c96	update .editorconfig	2023-02-26 21:25:45 +01:00
Jordan	9cf7e5f634	Merge branch 'main' into add_lora_support	2023-02-25 19:21:31 -08:00
Jordan	d9c46277ea	add peft setup (need to install huggingface/peft)	2023-02-25 20:21:20 -07:00
blessedcoolant	33f832e6ab	[ui]: 2.3 hotfixes (#2806 ) - Updated Spanish translation - Updated Portuguese (Brazil) translation - Fix a number of translation issues and add missing strings - Fix vertical symmetry and symmetry steps issue when generation steps is adjusted	2023-02-26 12:30:59 +13:00
blessedcoolant	c22d529528	Add node-based invocation system (#1650 ) This PR adds the core of the node-based invocation system first discussed in https://github.com/invoke-ai/InvokeAI/discussions/597 and implements it through a basic CLI and API. This supersedes #1047, which was too far behind to rebase. ## Architecture ### Invocations The core of the new system is invocations, found in `/ldm/invoke/app/invocations`. These represent individual nodes of execution, each with inputs and outputs. Core invocations are already implemented (`txt2img`, `img2img`, `upscale`, `face_restore`) as well as a debug invocation (`show_image`). To implement a new invocation, all that is required is to add a new implementation in this folder (there is a markdown document describing the specifics, though it is slightly out-of-date). ### Sessions Invocations and links between them are maintained in a session. These can be queued for invocation (either the next ready node, or all nodes). Some notes: * Sessions may be added to at any time (including after invocation), but may not be modified. * Links are always added with a node, and are always links from existing nodes to the new node. These links can be relative "history" links, e.g. `-1` to link from a previously executed node, and can link either specific outputs, or can opportunistically link all matching outputs by name and type by using ``. There are no iteration/looping constructs. Most needs for this could be solved by either duplicating nodes or cloning sessions. This is open for discussion, but is a difficult problem to solve in a way that doesn't make the code even more complex/confusing (especially regarding node ids and history). ### Services These make up the core the invocation system, found in `/ldm/invoke/app/services`. One of the key design philosophies here is that most components should be replaceable when possible. For example, if someone wants to use cloud storage for their images, they should be able to replace the image storage service easily. The services are broken down as follows (several of these are intentionally implemented with an initial simple/naïve approach): * Invoker: Responsible for creating and executing sessions and managing services used to do so. * Session Manager: Manages session history. An on-disk implementation is provided, which stores sessions as json files on disk, and caches recently used sessions for quick access. * Image Storage: Stores images of multiple types. An on-disk implementation is provided, which stores images on disk and retains recently used images in an in-memory cache. * Invocation Queue: Used to queue invocations for execution. An in-memory implementation is provided. * Events: An event system, primarily used with socket.io to support future web UI integration. ## Apps Apps are available through the `/scripts/invoke-new.py` script (to-be integrated/renamed). ### CLI ``` python scripts/invoke-new.py ``` Implements a simple CLI. The CLI creates a single session, and automatically links all inputs to the previous node's output. Commands are automatically generated from all invocations, with command options being automatically generated from invocation inputs. Help is also available for the cli and for each command, and is very verbose. Additionally, the CLI supports command piping for single-line entry of multiple commands. Example: ``` > txt2img --prompt "a cat eating sushi" --steps 20 --seed 1234 \| upscale \| show_image ``` ### API ``` python scripts/invoke-new.py --api --host 0.0.0.0 ``` Implements an API using FastAPI with Socket.io support for signaling. API documentation is available at `http://localhost:9090/docs` or `http://localhost:9090/redoc`. This includes OpenAPI schema for all available invocations, session interaction APIs, and image APIs. Socket.io signals are per-session, and can be subscribed to by session id. These aren't currently auto-documented, though the code for event emission is centralized in `/ldm/invoke/app/services/events.py`. A very simple test html and script are available at `http://localhost:9090/static/test.html` This demonstrates creating a session from a graph, invoking it, and receiving signals from Socket.io. ## What's left? * There are a number of features not currently covered by invocations. I kept the set of invocations small during core development in order to simplify refactoring as I went. Now that the invocation code has stabilized, I'd love some help filling those out! * There's no image metadata generated. It would be fairly straightforward (and would make good sense) to serialize either a session and node reference into an image, or the entire node into the image. There are a lot of questions to answer around source images, linked images, etc. though. This history is all stored in the session as well, and with complex sessions, the metadata in an image may lose its value. This needs some further discussion. * We need a list of features (both current and future) that would be difficult to implement without looping constructs so we can have a good conversation around it. I'm really hoping we can avoid needing looping/iteration in the graph execution, since it'll necessitate separating an execution of a graph into its own concept/system, and will further complicate the system. * The API likely needs further filling out to support the UI. I think using the new API for the current UI is possible, and potentially interesting, since it could work like the new/demo CLI in a "single operation at a time" workflow. I don't know how compatible that will be with our UI goals though. It would be nice to support only a single API though. * Deeper separation of systems. I intentionally tried to not touch Generate or other systems too much, but a lot could be gained by breaking those apart. Even breaking apart Args into two pieces (command line arguments and the parser for the current CLI) would make it easier to maintain. This is probably in the future though.	2023-02-26 12:25:41 +13:00
Kyle Schouviller	cd98d88fe7	[nodes] Removed InvokerServices, simplying service model	2023-02-24 20:11:28 -08:00
psychedelicious	281c788489	chore(ui): build frontend	2023-02-25 14:26:50 +11:00
psychedelicious	3858bef185	fix(ui): clamp symmetry steps to generation steps Also renamed the variables to `horizontalSymmetrySteps` as `TimePercentage` is not accurate.	2023-02-25 14:26:46 +11:00
Kyle Schouviller	34e3aa1f88	parent `9eed1919c2` author Kyle Schouviller <kyle0654@hotmail.com> 1669872800 -0800 committer Kyle Schouviller <kyle0654@hotmail.com> 1676240900 -0800 Adding base node architecture Fix type annotation errors Runs and generates, but breaks in saving session Fix default model value setting. Fix deprecation warning. Fixed node api Adding markdown docs Simplifying Generate construction in apps [nodes] A few minor changes (#2510) * Pin api-related requirements * Remove confusing extra CORS origins list * Adds response models for HTTP 200 [nodes] Adding graph_execution_state to soon replace session. Adding tests with pytest. Minor typing fixes [nodes] Fix some small output query hookups [node] Fixing some additional typing issues [nodes] Move and expand graph code. Add base item storage and sqlite implementation. Update startup to match new code [nodes] Add callbacks to item storage [nodes] Adding an InvocationContext object to use for invocations to provide easier extensibility [nodes] New execution model that handles iteration [nodes] Fixing the CLI [nodes] Adding a note to the CLI [nodes] Split processing thread into separate service [node] Add error message on node processing failure Removing old files and duplicated packages Adding python-multipart	2023-02-24 18:57:02 -08:00
psychedelicious	f9a1afd09c	fix(ui): fix #2802 vertical symmetry not working	2023-02-25 11:28:17 +11:00
psychedelicious	251e9c0294	fix(ui): add missing strings Fixes #2797 Fixes #2798	2023-02-25 11:27:47 +11:00
psychedelicious	d8bf2e3c10	fix(ui): fix translation typing, fix strings I had inadvertently un-safe-d our translation types when migrating to Weblate. This PR fixes that, and a number of translation string bugs that went unnoticed due to the lack of type safety,	2023-02-25 11:26:35 +11:00
Gabriel Mackievicz Telles	218f30b7d0	translationBot(ui): update translation (Portuguese (Brazil)) Currently translated at 91.8% (431 of 469 strings) Co-authored-by: Gabriel Mackievicz Telles <telles.gabriel@gmail.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/pt_BR/ Translation: InvokeAI/Web UI	2023-02-25 11:13:23 +11:00
Jeff Mahoney	da983c7773	translationBot(ui): added translation (Romanian) Co-authored-by: Jeff Mahoney <jbmahoney@gmail.com>	2023-02-25 11:13:23 +11:00
gallegonovato	7012e16c43	translationBot(ui): update translation (Spanish) Currently translated at 100.0% (469 of 469 strings) Co-authored-by: gallegonovato <fran-carro@hotmail.es> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/es/ Translation: InvokeAI/Web UI	2023-02-25 11:13:23 +11:00
psychedelicious	49ffb64ef3	ui: translations update from weblate (#2804 ) Translations update from [Hosted Weblate](https://hosted.weblate.org) for [InvokeAI/Web UI](https://hosted.weblate.org/projects/invokeai/web-ui/). Current translation status: ![Weblate translation status](https://hosted.weblate.org/widgets/invokeai/-/web-ui/horizontal-auto.svg)	2023-02-25 10:09:37 +11:00
Lincoln Stein	b1050abf7f	hotfix for broken merge function (#2801 ) Bump version up to accommodate a hotfix on v2.3.1 release. (model merge functionality was broken)	2023-02-24 15:33:54 -05:00
Lincoln Stein	210998081a	use right pep-440 standard version number v2.3.1.post1	2023-02-24 15:14:39 -05:00
Lincoln Stein	604acb9d91	use pep-440 standard version number	2023-02-24 15:07:54 -05:00
Jordan	ef822902d4	Merge branch 'main' into add_lora_support	2023-02-24 12:06:31 -08:00
Lincoln Stein	5beeb1a897	hotfix for broken merge function v2.3.1p1	2023-02-24 15:00:22 -05:00
Lincoln Stein	de6304b729	fixes crashes on merge in both WebUI and console (#2800 ) - an inadvertent change to the model manager broke the merging functions - corrected here - will be a hotfix	2023-02-24 14:58:06 -05:00
Lincoln Stein	d0be79c33d	fixes crashes on merge in both WebUI and console - an inadvertent change to the model manager broke the merging functions - corrected here - will be a hotfix	2023-02-24 14:54:23 -05:00
Gabriel Mackievicz Telles	ec14e2db35	translationBot(ui): update translation (Portuguese (Brazil)) Currently translated at 91.8% (431 of 469 strings) Co-authored-by: Gabriel Mackievicz Telles <telles.gabriel@gmail.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/pt_BR/ Translation: InvokeAI/Web UI	2023-02-24 17:54:54 +01:00
Jeff Mahoney	5725fcb3e0	translationBot(ui): added translation (Romanian) Co-authored-by: Jeff Mahoney <jbmahoney@gmail.com>	2023-02-24 17:54:54 +01:00
gallegonovato	1447b6df96	translationBot(ui): update translation (Spanish) Currently translated at 100.0% (469 of 469 strings) Co-authored-by: gallegonovato <fran-carro@hotmail.es> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/es/ Translation: InvokeAI/Web UI	2023-02-24 17:54:54 +01:00
Lincoln Stein	e700da23d8	Sync main with v2.3.1 (#2792 ) This PR will bring `main` up to date with released v2.3.1	2023-02-24 11:54:46 -05:00
Lincoln Stein	b4ed8bc47a	Merge branch 'main' into v2.3	2023-02-24 10:52:03 -05:00
Lincoln Stein	bd85e00530	Last PR needed for v2.3.1 (#2788 ) - Add curated set of starter models based on team discussion. The final list of starter models can be found in `invokeai/configs/INITIAL_MODELS.yaml` - To test model installation, I selected and installed all the models on the list. This led to my discovering that when there are no more starter models to display, the console front end crashes. So I made a fix to this in which the entire starter model selection is no longer shown. - Update model table in 050_INSTALL_MODELS.md - Add guide to dealing with low-memory situations - Version is now `v2.3.1` v2.3.1	2023-02-24 10:31:38 -05:00
Lincoln Stein	4e446130d8	Merge branch 'v2.3' into enhance/curated-2.3.1-models	2023-02-24 10:30:42 -05:00
Lincoln Stein	4c93b514bb	bump version to final 2.3.1	2023-02-24 10:04:41 -05:00
Lincoln Stein	d078941316	add low memory troubleshooting guide	2023-02-24 10:04:06 -05:00
Lincoln Stein	230d3a496d	document starter models - add new script `scripts/make_models_markdown_table.py` that parses INITIAL_MODELS.yaml and creates markdown table for the model installation documentation file - update 050_INSTALLING_MODELS.md with above table, and add a warning about additional license terms that apply to some of the models.	2023-02-24 09:33:07 -05:00
Jonathan	ec2890c19b	Run garbage collection to allow the CUDA cache to completely empty. (#2791 )	2023-02-24 08:48:54 -05:00
Jordan	036ca31282	Merge pull request #4 from damian0815/pr/2712 tweaks and small refactors	2023-02-24 03:49:41 -08:00
Damian Stewart	7dbe027b18	tweaks and small refactors	2023-02-24 12:46:57 +01:00
Jordan	523e44ccfe	simplify manager	2023-02-24 01:32:09 -07:00
Lincoln Stein	a540cc537f	add curated set of HuggingFace diffusers models for 2.3.1 release - Final list can be found in invokeai/configs/INITIAL_MODELS.yaml - After installing all the models, I discovered a bug in the file selection form that caused a crash when no remaining uninstalled models remained. So had to fix this.	2023-02-24 00:53:48 -05:00
Lincoln Stein	39c57aa358	fix generate backend to generate "accurate" intermediate images (#2787 ) The sample_to_image method in `ldm.invoke.generator.base` was still using ckpt-era code. As a result when the WebUI was set to show "accurate" intermediate images, there'd be a crash. This PR corrects the problem. - Closes #2784 - Closes #2775	2023-02-24 00:33:29 -05:00
Lincoln Stein	2d990c1f54	Merge branch 'v2.3' into bugfix/webui-accurate-intermediates	2023-02-23 22:07:18 -05:00
Lincoln Stein	7fb2da8741	fix generate backend to generate "accurate" intermediate images - Closes #2784 - Closes #2775	2023-02-23 22:03:28 -05:00
Lincoln Stein	c69fcb1c10	fix ckpt_convert module to work with dreambooth v2 models (#2776 ) - Discord member @marcus.llewellyn reported that some civitai 2.1-derived checkpoints were not converting properly (probably dreambooth-generated): https://discord.com/channels/1020123559063990373/1078386197589655582/1078387806122025070 - @blessedcoolant tracked this down to a missing key that was used to derive vector length of the CLIP model used by fetching the second dimension of the tensor at "cond_stage_model.model.text_projection". - On inspection, I found that the same second dimension can be recovered from key 'cond_stage_model.model.ln_final.bias', and use that instead. I hope this is correct; tested on multiple v1, v2 and inpainting models and they converted correctly. - While debugging this, I found and fixed several other issues: - model download script was not pre-downloading the OpenCLIP text_encoder or text_tokenizer. This is fixed. - got rid of legacy code in `ckpt_to_diffuser.py` and replaced with calls into `model_manager` - more consistent status reporting in the CLI.	2023-02-23 21:51:57 -05:00
Jordan	6a7948466e	Merge branch 'main' into add_lora_support	2023-02-23 18:33:52 -08:00
Lincoln Stein	0982548e1f	Merge branch 'v2.3' into bugfix/v2-model-conversion	2023-02-23 21:27:49 -05:00
Jordan	4ce8b1ba21	setup cross conditioning for lora	2023-02-23 19:27:45 -07:00
Jordan	68a3132d81	move legacy lora manager to its own file	2023-02-23 17:41:20 -07:00
Jordan	b69f9d4af1	initial setup of cross attention	2023-02-23 17:30:34 -07:00
Matthias Wild	11a29fdc4d	fix python 3.9 compatibility (#2780 ) without this change, the project can be installed on 3.9 but not used this also fixes the container images Maybe we should re-enable Python 3.9 checks which would have prevented this.	2023-02-24 00:49:25 +01:00

... 3 4 5 6 7 ...

3590 Commits