InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI.git synced 2026-01-24 02:27:55 -05:00

Author	SHA1	Message	Date
Lincoln Stein	3dffa33097	Merge branch 'v2.3' into feat/lora-support-2.3	2023-04-05 21:59:54 -04:00
Lincoln Stein	d4d3441a52	save name of last model to disk whenever model changes - this allows invokeai to restore the last used model on startup, even after a crash or keyboard interrupt.	2023-04-02 15:46:39 -04:00
Lincoln Stein	e0bd30b98c	more elegant handling of lora context	2023-04-01 23:41:22 -04:00
Lincoln Stein	b632b35079	remove direct legacy checkpoint rendering capabilities	2023-04-01 17:08:30 -04:00
Lincoln Stein	c9372f919c	moved LoRA manager cleanup routines into a context	2023-04-01 16:49:23 -04:00
Lincoln Stein	879c80022e	preliminary LoRA support ready for testing Instructions: 1. Download LoRA .safetensors files of your choice and place in `INVOKEAIROOT/loras`. Unlike the draft version of this, the file names can contain underscores and alphanumerics. Names with arbitrary unicode characters are not supported. 2. Add `withLora(lora-file-basename,weight)` to your prompt. The weight is optional and will default to 1.0. A few examples, assuming that a LoRA file named `loras/sushi.safetensors` is present: ``` family sitting at dinner table eating sushi withLora(sushi,0.9) family sitting at dinner table eating sushi withLora(sushi, 0.75) family sitting at dinner table eating sushi withLora(sushi) ``` Multiple `withLora()` prompt fragments are allowed. The weight can be arbitrarily large, but the useful range is roughly 0.5 to 1.0. Higher weights make the LoRA's influence stronger. In my limited testing, I found it useful to reduce the CFG to avoid oversharpening. Also I got better results when running the LoRA on top of the model on which it was based during training. Don't try to load a SD 1.x-trained LoRA into a SD 2.x model, and vice versa. You will get a nasty stack trace. This needs to be cleaned up. 3. You can change the location of the `loras` directory by passing the `--lora_directory` option to `invokeai. Documentation can be found in docs/features/LORAS.md.	2023-03-31 00:03:16 -04:00
Lincoln Stein	ea5f6b9826	Merge branch 'release/2.3.3-rc3' into feat/lora-support-2.3	2023-03-30 22:02:37 -04:00
Lincoln Stein	249173faf5	remove extraneous warnings about overwriting trigger terms	2023-03-30 20:37:10 -04:00
Lincoln Stein	5a8d66ab02	merge lora support	2023-03-28 23:54:17 -04:00
Lincoln Stein	45aa770cd1	implemented multiprocessing across multiple GPUs	2023-03-05 01:52:28 -05:00
Jordan	9cf7e5f634	Merge branch 'main' into add_lora_support	2023-02-25 19:21:31 -08:00
Jordan	d9c46277ea	add peft setup (need to install huggingface/peft)	2023-02-25 20:21:20 -07:00
Kyle Schouviller	34e3aa1f88	parent `9eed1919c2` author Kyle Schouviller <kyle0654@hotmail.com> 1669872800 -0800 committer Kyle Schouviller <kyle0654@hotmail.com> 1676240900 -0800 Adding base node architecture Fix type annotation errors Runs and generates, but breaks in saving session Fix default model value setting. Fix deprecation warning. Fixed node api Adding markdown docs Simplifying Generate construction in apps [nodes] A few minor changes (#2510) * Pin api-related requirements * Remove confusing extra CORS origins list * Adds response models for HTTP 200 [nodes] Adding graph_execution_state to soon replace session. Adding tests with pytest. Minor typing fixes [nodes] Fix some small output query hookups [node] Fixing some additional typing issues [nodes] Move and expand graph code. Add base item storage and sqlite implementation. Update startup to match new code [nodes] Add callbacks to item storage [nodes] Adding an InvocationContext object to use for invocations to provide easier extensibility [nodes] New execution model that handles iteration [nodes] Fixing the CLI [nodes] Adding a note to the CLI [nodes] Split processing thread into separate service [node] Add error message on node processing failure Removing old files and duplicated packages Adding python-multipart	2023-02-24 18:57:02 -08:00
Jordan	ef822902d4	Merge branch 'main' into add_lora_support	2023-02-24 12:06:31 -08:00
Jonathan	ec2890c19b	Run garbage collection to allow the CUDA cache to completely empty. (#2791 )	2023-02-24 08:48:54 -05:00
Jordan	68a3132d81	move legacy lora manager to its own file	2023-02-23 17:41:20 -07:00
Jordan	6a1129ab64	switch all none diffusers stuff to legacy, and load through compel prompts	2023-02-23 16:48:33 -07:00
Jordan	d4083221a6	Merge branch 'main' into add_lora_support	2023-02-22 13:28:04 -08:00
Lincoln Stein	16aea1e869	Merge branch 'main' into install/refactor-configure-and-model-select	2023-02-22 14:22:52 -05:00
Jordan	5b4a241f5c	Merge branch 'main' into add_lora_support	2023-02-21 20:38:33 -08:00
Jordan	686f6ef8d6	Merge branch 'main' into add_lora_support	2023-02-21 18:35:11 -08:00
Lincoln Stein	4878c7a2d5	Merge branch 'main' into install/refactor-configure-and-model-select	2023-02-21 14:09:38 -05:00
Lincoln Stein	fff41a7349	merged with main	2023-02-21 12:20:59 -05:00
blessedcoolant	d5f524a156	Merge branch 'main' into bugfix/filename-embedding-fallback	2023-02-22 06:13:41 +13:00
Jonathan	3ab9d02883	Fixed embiggening crash due to clear_cuda_cache not being passed on and bad cuda stats initialization. (#2756 )	2023-02-22 06:12:24 +13:00
Lincoln Stein	9436f2e3d1	alphabetize trigger strings	2023-02-21 06:23:34 -05:00
Jordan	e2b6dfeeb9	Update generate.py	2023-02-20 21:33:20 -07:00
neecapp	3732af63e8	fix prompt	2023-02-20 23:06:05 -05:00
Jordan	6e730bd654	Merge branch 'main' into add_lora_support	2023-02-20 15:34:52 -08:00
Jordan	8f6e43d4a4	code cleanup	2023-02-20 14:06:58 -07:00
Lincoln Stein	1d9845557f	reduced verbosity of embed loading messages	2023-02-20 15:18:55 -05:00
Lincoln Stein	58be915446	Merge branch 'main' into install/refactor-configure-and-model-select	2023-02-20 14:48:41 -05:00
Jonathan	ca8d9fb885	Add symmetry to generation (#2675 ) Added symmetry to Invoke based on discussions with @damian0815. This can currently only be activated via the CLI with the `--h_symmetry_time_pct` and `--v_symmetry_time_pct` options. Those take values from 0.0-1.0, exclusive, indicating the percentage through generation at which symmetry is applied as a one-time operation. To have symmetry in either axis applied after the first step, use a very low value like 0.001.	2023-02-20 07:33:19 -05:00
Jordan	096e1d3a5d	start of rewrite for add / remove	2023-02-20 02:37:44 -07:00
Lincoln Stein	7beebc3659	resolved conflicts; ran black and isort	2023-02-19 19:48:01 -05:00
Lincoln Stein	5461318eda	clean up diagnostic messages	2023-02-19 19:38:29 -05:00
Kevin Turner	671c5943e4	Merge remote-tracking branch 'origin/main' into api/add-trigger-string-retrieval # Conflicts: # ldm/generate.py	2023-02-18 17:44:59 -08:00
Jordan	141be95c2c	initial setup of lora support	2023-02-18 05:29:04 -07:00
Kevin Turner	b8212e4dea	fix(diffusers_pipeline): ensure `cuda.get_mem_info` always gets a specific device index. Also tighten up the typing of `device` attributes in general.	2023-02-17 16:56:15 -08:00
Lincoln Stein	65a7432b5a	disable xformers if cuda not available	2023-02-16 22:20:30 -05:00
Lincoln Stein	2fa14200aa	Merge branch 'main' into api/add-trigger-string-retrieval	2023-02-16 22:12:39 -05:00
Kevin Turner	8a0d45ac5a	new OffloadingDevice loads one model at a time, on demand (#2596 ) * new OffloadingDevice loads one model at a time, on demand * fixup! new OffloadingDevice loads one model at a time, on demand * fix(prompt_to_embeddings): call the text encoder directly instead of its forward method allowing any associated hooks to run with it. * more attempts to get things on the right device from the offloader * more attempts to get things on the right device from the offloader * make offloading methods an explicit part of the pipeline interface * inlining some calls where device is only used once * ensure model group is ready after pipeline.to is called * fixup! Strategize slicing based on free [V]RAM (#2572) * doc(offloading): docstrings for offloading.ModelGroup * doc(offloading): docstrings for offloading-related pipeline methods * refactor(offloading): s/SimpleModelGroup/FullyLoadedModelGroup * refactor(offloading): s/HotSeatModelGroup/LazilyLoadedModelGroup to frame it is the same terms as "FullyLoadedModelGroup" --------- Co-authored-by: Damian Stewart <null@damianstewart.com>	2023-02-16 23:48:27 +00:00
Lincoln Stein	bc18a94d8c	add ability to retrieve current list of embedding trigger strings This PR adds a new attributer to ldm.generate, `embedding_trigger_strings`: ``` gen = Generate(...) strings = gen.embedding_trigger_strings strings = gen.embedding_trigger_strings() ``` The trigger strings will change when the model is updated to show only those strings which are compatible with the current model. Dynamically-downloaded triggers from the HF Concepts Library will only show up after they are used for the first time. However, the full list of concepts available for download can be retrieved programatically like this: ``` from ldm.invoke.concepts_lib import HuggingFAceConceptsLibrary concepts = HuggingFaceConceptsLibrary() trigger_strings = concepts.list_concepts() ```	2023-02-13 14:11:36 -05:00
Jonathan	9eed1919c2	Strategize slicing based on free [V]RAM (#2572 ) Strategize slicing based on free [V]RAM when not using xformers. Free [V]RAM is evaluated at every generation. When there's enough memory, the entire generation occurs without slicing. If there is not enough free memory, we use diffusers' sliced attention.	2023-02-12 18:24:15 +00:00
tyler	d3c850104b	pulling esrgan denoise strength through to the generate API.	2023-02-12 02:47:37 +13:00
tyler	c00155f6a4	pulling esrgan denoise strength through to the generate API.	2023-02-12 02:47:37 +13:00
Kent Keirsey	9826f80d7f	Initial Slider & Img2Img=1 Updates	2023-02-09 07:02:39 +11:00
Jonathan	28b40bebbe	Refactor CUDA cache clearing to add statistical reporting. (#2553 )	2023-02-06 12:53:30 -05:00
Jonathan	2432adb38f	In exception handlers, clear the torch CUDA cache (if we're using CUDA) to free up memory for other programs using the GPU and to reduce fragmentation. (#2549 )	2023-02-06 10:33:24 -05:00
blessedcoolant	bf4344be51	Beautify Usage Stats Log	2023-02-05 22:55:40 +13:00

1 2 3 4 5

241 Commits