This allows tag invalidation to trigger refetches while mutations are still executing, resolving an issue in this situation:
- A long-running mutation starts.
- A tag is invalidated; for example, the user edits a board name, and the boards list query tag is invalidated.
- Because the long-running mutation is still in flight, the boards list query isn't refetched, and the board name isn't updated.
- The long-running mutation finishes.
- Finally, the boards list query fires and the board name is updated.
This is the "delayed" behaviour. The "immediately" behaviour has the fires requests from tag invalidation immediately, without waiting for all mutations to finish.
It may cause extra network requests and stale data if we are mutating a lot of things very quickly. I don't think it will be an issue in practice and the improved responsiveness will be a net benefit.
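This corresponds to RTK Query's `invalidationBehavior` option on `createApi` (assuming that is the mechanism in play). A minimal sketch, with illustrative endpoint names and URLs rather than the app's actual API slice:

```ts
import { createApi, fetchBaseQuery } from '@reduxjs/toolkit/query/react';

export const api = createApi({
  baseQuery: fetchBaseQuery({ baseUrl: '/api/v1/' }),
  tagTypes: ['BoardList'],
  // 'delayed' (the default) holds refetches triggered by tag invalidation until
  // every in-flight mutation has settled; 'immediately' fires them right away.
  invalidationBehavior: 'immediately',
  endpoints: (build) => ({
    listBoards: build.query<unknown, void>({
      query: () => 'boards/',
      providesTags: ['BoardList'],
    }),
    updateBoard: build.mutation<unknown, { board_id: string; board_name: string }>({
      query: ({ board_id, board_name }) => ({
        url: `boards/${board_id}`,
        method: 'PATCH',
        body: { board_name },
      }),
      invalidatesTags: ['BoardList'],
    }),
  }),
});
```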
Rely on WAL mode and the busy timeout.
Also changed:
- Remove extraneous rollbacks when we were only doing a `SELECT`
- Remove try/catch blocks that became redundant once those rollbacks were removed
This allows for read and write concurrency without using a global mutex. Operations may still fail if they wait longer than the busy timeout (5s) for the database lock.
If we get a database lock error after waiting 5s for an operation, we have a problem. So, I think it's actually better to use a busy timeout instead of a global mutex.
Alternatively, we could add a timeout to the global mutex.
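For illustration, the gist of the connection setup; this sketch uses the better-sqlite3 TypeScript driver and a hypothetical `boards` table rather than the app's actual storage layer, but the pragmas are the relevant part:

```ts
import Database from 'better-sqlite3';

// Each connection opts into WAL and sets a busy timeout. WAL lets readers run
// concurrently with a single writer; busy_timeout makes a connection wait (up
// to 5s here) for the write lock instead of failing immediately.
const openConnection = (dbPath: string) => {
  const db = new Database(dbPath);
  db.pragma('journal_mode = WAL');
  db.pragma('busy_timeout = 5000');
  return db;
};

const db = openConnection('app.db');
db.exec('CREATE TABLE IF NOT EXISTS boards (board_id TEXT PRIMARY KEY, board_name TEXT)');

// A plain SELECT needs no explicit transaction, so there is nothing to roll
// back and no try/catch wrapper just for that.
const boards = db.prepare('SELECT board_id, board_name FROM boards').all();

// Writes still use transactions; if the write lock is held by another
// connection for longer than the busy timeout, this throws a "database is
// locked" error.
const renameBoard = db.transaction((id: string, name: string) => {
  db.prepare('UPDATE boards SET board_name = ? WHERE board_id = ?').run(name, id);
});
renameBoard('some-board-id', 'New name');
```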
Fixes an issue where fields like control weight on ControlNet nodes and image on IP Adapter nodes didn't render.
These are "single or collection" fields. They accept a single input object, or collection. They are supposed to render the UI input for a single object.
In a7a71ca935 a performance optimisation for a hot code-path inadvertently broke this.
The determination of which UI component to render for a given field is done using a type guard function for the field's template. Previously, these type guards used a zod schema to parse the template, which was very slow, especially when the template was not the expected type.
The optimisation changed the type guards to check the field's type name (integer, image, etc.) and its cardinality directly, without any zod parsing.
This is much faster, but it subtly changed the behaviour by being a bit stricter: for some fields, it rejected the "single or collection" cardinality when it should have accepted it.
When these fields - like the aforementioned Control Weight and Image - were being rendered, none of the type guards passed and they rendered nothing.
The fix here updates the type guard functions to support multiple cardinalities. So now, when we go to render a "single or collection" field, we will render the "single" input component as it should be.
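A simplified sketch of the shape of the fix; the template shape, type names, and cardinality values here are illustrative, not the app's actual field template types:

```ts
// Illustrative template shape; the real field templates carry more information.
type Cardinality = 'SINGLE' | 'COLLECTION' | 'SINGLE_OR_COLLECTION';

interface FieldTemplate {
  type: { name: string; cardinality: Cardinality };
}

// Before: the guard only accepted the exact SINGLE cardinality, so
// SINGLE_OR_COLLECTION fields fell through every guard and rendered nothing.
const isSingleImageFieldBefore = (t: FieldTemplate): boolean =>
  t.type.name === 'ImageField' && t.type.cardinality === 'SINGLE';

// After: the guard accepts every cardinality that should get the "single"
// input component.
const SINGLE_LIKE: Cardinality[] = ['SINGLE', 'SINGLE_OR_COLLECTION'];

const isSingleImageField = (t: FieldTemplate): boolean =>
  t.type.name === 'ImageField' && SINGLE_LIKE.includes(t.type.cardinality);
```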
## Summary
This PR adds a `pytorch_cuda_alloc_conf` config flag to control the
torch memory allocator behavior.
- `pytorch_cuda_alloc_conf` defaults to `None`, preserving the current
behavior.
- The configuration options are explained here:
https://pytorch.org/docs/stable/notes/cuda.html#optimizing-memory-usage-with-pytorch-cuda-alloc-conf.
Tuning this configuration can reduce peak reserved VRAM and improve
performance.
- Setting `pytorch_cuda_alloc_conf: "backend:cudaMallocAsync"` in
`invokeai.yaml` is expected to work well on many systems. This is a good
first step for those looking to tune this config. (We may make this the
default in the future.)
- The optimal configuration seems to be dependent on a number of factors
such as device version, VRAM, CUDA kernel version, etc. For now, users
will have to experiment with this config to see if it hurts or helps on
their systems. In most cases, I expect it to help.
### Memory Tests
```
VAE decode memory usage comparison:
- SDXL, fp16, 1024x1024:
- `cudaMallocAsync`: allocated=2593 MB, reserved=3200 MB
- `native`: allocated=2595 MB, reserved=4418 MB
- SDXL, fp32, 1024x1024:
- `cudaMallocAsync`: allocated=3982 MB, reserved=5536 MB
- `native`: allocated=3982 MB, reserved=7276 MB
- SDXL, fp32, 1536x1536:
- `cudaMallocAsync`: allocated=8643 MB, reserved=12032 MB
- `native`: allocated=8643 MB, reserved=15900 MB
```
## Related Issues / Discussions
N/A
## QA Instructions
- [x] Performance tests with `pytorch_cuda_alloc_conf` unset.
- [x] Performance tests with `pytorch_cuda_alloc_conf:
"backend:cudaMallocAsync"`.
## Merge Plan
- [x] Merge #7668 first and change target branch to `main`
## Checklist
- [x] _The PR has a short but descriptive title, suitable for a
changelog_
- [x] _Tests added / updated (if applicable)_
- [x] _Documentation added / updated (if applicable)_
- [ ] _Updated `What's New` copy (if doing a release after this PR)_
## Summary
Prior to this PR, most of the app setup was being done in `api_app.py`
at import time. This PR cleans this up by:
- Splitting app setup into more modular functions
- Narrower responsibility for the `api_app.py` file - it just
initializes the `FastAPI` app
The main motivation for these changes is to make it easier to support an
upcoming torch configuration feature that requires more careful ordering
of app initialization steps.
## Related Issues / Discussions
N/A
## QA Instructions
- [x] Launch the app via invokeai-web.py and smoke test it.
- [ ] Launch the app via the installer and smoke test it.
- [x] Test that generate_openapi_schema.py produces the same result
before and after the change.
- [x] No regression in unit tests that directly interact with the app.
(test_images.py)
## Merge Plan
- [x] Check to see if there are any commercial implications to modifying
the app entrypoint.
## Checklist
- [x] _The PR has a short but descriptive title, suitable for a
changelog_
- [x] _Tests added / updated (if applicable)_
- [x] _Documentation added / updated (if applicable)_
- [ ] _Updated `What's New` copy (if doing a release after this PR)_
On the Canvas tab, when we made the network request to enqueue a batch, we were immediately resetting the request. This effectively disabled RTKQ's tracking of the request - including the loading state.
As a result, when you clicked the Invoke button on the Canvas tab, it didn't show a spinner, and it was not clear that anything was happening.
The solution is simple - just await the enqueue request before resetting the tracking, the same as we already do on the workflows and upscaling tabs.
I also added some extra logging messages for enqueuing, so we get the same JS console logs for each tab on success or failure.
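Roughly, the pattern looks like the sketch below; the import paths, endpoint name, and logger are assumptions standing in for the app's real modules:

```ts
import { queueApi } from 'services/api/endpoints/queue'; // assumed API slice module
import { logger } from 'app/logging/logger';              // assumed logger module
import type { AppDispatch } from 'app/store/store';       // assumed store types

const log = logger('queue');

type EnqueueArg = Parameters<typeof queueApi.endpoints.enqueueBatch.initiate>[0];

// Hypothetical helper showing the pattern: await the enqueue request, log the
// outcome, and only reset RTKQ's tracking afterwards, so the loading state (and
// the Invoke button's spinner) stays accurate while the request is in flight.
export const enqueueBatch = async (dispatch: AppDispatch, arg: EnqueueArg) => {
  const req = dispatch(queueApi.endpoints.enqueueBatch.initiate(arg));
  try {
    const enqueueResult = await req.unwrap();
    log.debug({ enqueueResult }, 'Batch enqueued');
    return enqueueResult;
  } catch (error) {
    log.error({ error }, 'Failed to enqueue batch');
    throw error;
  } finally {
    // Before the fix, reset() was called immediately after initiate(), which
    // discarded the request's tracking (and its loading state).
    req.reset();
  }
};
```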