Commit Graph

227 Commits

Author | SHA1 | Message | Date
Kent Keirsey
af58a75e97 Support PEFT Loras with Base_Model.model prefix (#8433)
* Support PEFT Loras with Base_Model.model prefix

* update tests

* ruff

* fix python complaints

* update keys

* format keys

* remove unneeded test
2025-08-18 09:14:46 -04:00
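
The commit above adds handling for PEFT-exported LoRAs whose state-dict keys carry a `base_model.model.` prefix. A minimal sketch of the general idea, stripping that wrapper prefix before the keys are matched against the base model (the function name and key layout are illustrative, not the repository's actual implementation):

```python
from typing import Dict

import torch

# Illustrative only: PEFT-exported LoRAs commonly prefix every key with
# "base_model.model.". Removing that prefix lets the remaining keys be
# matched against the base model's own module names.
_PEFT_PREFIX = "base_model.model."


def strip_peft_prefix(state_dict: Dict[str, torch.Tensor]) -> Dict[str, torch.Tensor]:
    """Return a copy of state_dict with the PEFT wrapper prefix removed."""
    return {
        (k[len(_PEFT_PREFIX):] if k.startswith(_PEFT_PREFIX) else k): v
        for k, v in state_dict.items()
    }
```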
psychedelicious
a8a07598c8 chore: ruff 2025-08-18 21:14:00 +10:00
psychedelicious
23206e22e8 tests: skip excessively flaky MPS-specific tests in CI 2025-08-18 21:14:00 +10:00
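A hedged sketch of how an MPS-specific test can be skipped in CI with pytest; the `CI` environment variable check follows the common GitHub Actions convention, and the exact markers used in the repository may differ:

```python
import os

import pytest
import torch


@pytest.mark.skipif(
    not torch.backends.mps.is_available(),
    reason="requires an MPS (Apple Silicon) device",
)
@pytest.mark.skipif(
    os.environ.get("CI") == "true",
    reason="excessively flaky on MPS runners in CI",
)
def test_mps_example():
    # Placeholder body; the real tests exercise MPS-specific behaviour.
    x = torch.ones(2, device="mps")
    assert x.sum().item() == 2.0
```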
Heathen711
8cef0f5bf5 Update supported cuda slot input. 2025-06-16 19:33:19 +10:00
Kevin Turner
50cf285efb fix: group aitoolkit lora layers 2025-06-16 19:08:11 +10:00
Kevin Turner
a214f4fff5 fix: group aitoolkit lora layers 2025-06-16 19:08:11 +10:00
Kevin Turner
2981591c36 test: add some aitoolkit lora tests 2025-06-16 19:08:11 +10:00
Kevin Turner
52a8ad1c18 chore: rename model.size to model.file_size
to disambiguate from RAM size or pixel size
2025-04-10 09:53:03 +10:00
Kevin Turner
98260a8efc test: add size field to test model configs 2025-04-10 09:53:03 +10:00
psychedelicious
aaa6211625 chore(backend): ruff C420 2025-03-28 18:28:32 -04:00
Billy
182580ff69 Imports 2025-03-26 12:55:10 +11:00
Billy
8e9d5c1187 Ruff formatting 2025-03-26 12:30:31 +11:00
Billy
99aac5870e Remove star imports 2025-03-26 12:27:00 +11:00
Ryan Dick
f1fde792ee Get FLUX Redux working: model loading and inference. 2025-03-06 10:31:17 +11:00
Ryan Dick
5357d6e08e Rename ConcatenatedLoRALayer to MergedLayerPatch. And other minor cleanup. 2025-01-28 14:51:35 +00:00
Ryan Dick
28514ba59a Update ConcatenatedLoRALayer to work with all sub-layer types. 2025-01-28 14:51:35 +00:00
Ryan Dick
206f261e45 Add utils for loading FLUX OneTrainer DoRA models. 2025-01-28 14:51:35 +00:00
Ryan Dick
dfa253e75b Add utils for working with Kohya LoRA keys. 2025-01-28 14:51:35 +00:00
Ryan Dick
faa4fa02c0 Expand unit tests to test for confusion between FLUX LoRA formats. 2025-01-28 14:51:35 +00:00
Ryan Dick
5bd6428fdd Add is_state_dict_likely_in_flux_onetrainer_format() util function. 2025-01-28 14:51:35 +00:00
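The format-detection utility named above is a heuristic over state-dict key names. A purely illustrative sketch of that style of check; the real prefixes and suffixes for the OneTrainer FLUX format live in the utility added by the commit, and the patterns below are placeholders:

```python
import re
from typing import Dict

import torch

# Placeholder pattern: a format check of this kind asserts that the keys
# follow the naming scheme a particular trainer emits.
_EXAMPLE_KEY_RE = re.compile(r"^lora_transformer_.*\.(lora_up|lora_down|alpha).*$")


def is_state_dict_likely_in_example_format(state_dict: Dict[str, torch.Tensor]) -> bool:
    """Return True if every key looks like it came from the example trainer."""
    if not state_dict:
        return False
    return all(_EXAMPLE_KEY_RE.match(k) for k in state_dict)
```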
Ryan Dick
8b4f411f7b Add a test state dict for the OneTrainer DoRA format. 2025-01-28 14:51:35 +00:00
Ryan Dick
e2f05d0800 Add unit tests for LoKR patch layers. The new tests trigger a bug when LoKR layers are applied to BnB-quantized layers (also impacts several other LoRA variant types). 2025-01-22 09:20:40 +11:00
Ryan Dick
36a3869af0 Add keep_ram_copy_of_weights config option. 2025-01-16 15:35:25 +00:00
Ryan Dick
c76d08d1fd Add keep_ram_copy option to CachedModelOnlyFullLoad. 2025-01-16 15:08:23 +00:00
Ryan Dick
04087c38ce Add keep_ram_copy option to CachedModelWithPartialLoad. 2025-01-16 14:51:44 +00:00
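The three `keep_ram_copy` commits above add an option to retain a CPU copy of the weights after a model is loaded onto the GPU, trading RAM for faster offload. A minimal sketch of the trade-off (class and method names are illustrative, not the cache's real API):

```python
import torch


class ExampleCachedModel:
    """Illustrative only: keep an optional CPU copy so offload is a cheap swap."""

    def __init__(self, model: torch.nn.Module, keep_ram_copy: bool = True):
        self._model = model
        # With keep_ram_copy=True the original CPU tensors stay alive, so
        # moving back off the GPU never has to copy VRAM -> RAM.
        self._cpu_state = (
            {k: v.clone() for k, v in model.state_dict().items()} if keep_ram_copy else None
        )

    def load_to(self, device: torch.device) -> None:
        self._model.to(device)

    def offload(self) -> None:
        if self._cpu_state is not None:
            # Reattach the retained CPU tensors directly (assign= requires PyTorch >= 2.1);
            # no VRAM -> RAM transfer is needed.
            self._model.load_state_dict(self._cpu_state, assign=True)
        else:
            # Without a RAM copy, every tensor must be copied back from the GPU.
            self._model.to("cpu")
```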
Ryan Dick
974b4671b1 Deprecate the ram and vram configs to make the migration to dynamic
memory limits smoother for users who had previously overridden these
values.
2025-01-07 16:45:29 +00:00
Ryan Dick
d7ab464176 Offload the current model when locking if it is already partially loaded and we have insufficient VRAM. 2025-01-07 02:53:44 +00:00
Ryan Dick
5eafe1ec7a Fix ModelCache execution device selection in unit tests. 2025-01-07 01:20:15 +00:00
Ryan Dick
a167632f09 Calculate model cache size limits dynamically based on the available RAM / VRAM. 2025-01-07 01:14:20 +00:00
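A sketch of the kind of dynamic sizing the commit above describes, deriving cache limits from the memory the machine actually has; the fractions are made-up placeholders, not the values the commit chose:

```python
import psutil
import torch


def example_cache_limits() -> tuple[int, int]:
    """Return illustrative (ram_limit_bytes, vram_limit_bytes) based on available memory."""
    # Leave headroom for the OS and other processes; 0.5 is an arbitrary placeholder.
    ram_limit = int(psutil.virtual_memory().total * 0.5)

    vram_limit = 0
    if torch.cuda.is_available():
        _, total_bytes = torch.cuda.mem_get_info()
        # Again, 0.9 is a placeholder fraction, not the project's actual policy.
        vram_limit = int(total_bytes * 0.9)

    return ram_limit, vram_limit
```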
Ryan Dick
402dd840a1 Add seed to flaky unit test. 2025-01-07 00:31:00 +00:00
Ryan Dick
d0bfa019be Add 'enable_partial_loading' config flag. 2025-01-07 00:31:00 +00:00
Ryan Dick
535e45cedf First pass at adding partial loading support to the ModelCache. 2025-01-07 00:30:58 +00:00
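Partial loading, as introduced above, means a model no longer has to fit entirely in VRAM: modules are moved to the GPU until a byte budget is exhausted and the remainder stays in RAM. A simplified sketch of that idea, not the ModelCache's real algorithm:

```python
import torch


def partially_load(model: torch.nn.Module, device: torch.device, vram_budget_bytes: int) -> int:
    """Move leaf modules to `device` until the budget is spent; return bytes moved."""
    used = 0
    for module in model.modules():
        # Only consider leaf modules that own parameters directly.
        if next(module.children(), None) is not None:
            continue
        size = sum(p.numel() * p.element_size() for p in module.parameters(recurse=False))
        if size == 0 or used + size > vram_budget_bytes:
            continue
        module.to(device)
        used += size
    return used
```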
Ryan Dick
9a0a226ce1 Fix bitsandbytes imports in unit tests on MacOS. 2024-12-30 10:41:48 -05:00
Ryan Dick
6fd9b0a274 Delete old sidecar wrapper implementation. This functionality has moved into the custom layers. 2024-12-29 17:33:08 +00:00
Ryan Dick
52fc5a64d4 Add a unit test for a LoRA patch applied to a quantized linear layer with weights streamed from CPU to GPU. 2024-12-29 17:14:55 +00:00
Ryan Dick
a8bef59699 First pass at making custom layer patches work with weights streamed from the CPU to the GPU. 2024-12-29 17:01:37 +00:00
Ryan Dick
6d49ee839c Switch the LayerPatcher to use 'custom modules' to manage layer patching. 2024-12-29 01:18:30 +00:00
Ryan Dick
2855bb6b41 Update BaseLayerPatch.get_parameters(...) to accept a dict of orig_parameters rather than orig_module. This will enable compatibility between patching and cpu->gpu streaming. 2024-12-28 21:12:53 +00:00
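The signature change described above means a patch computes its parameter deltas from a plain dict of original tensors rather than from the live module, which is what makes patching compatible with weights streamed from CPU to GPU. A toy LoRA-style patch written against that shape of interface (names and details are illustrative):

```python
from typing import Dict

import torch


class ExampleLoRAPatch:
    """Illustrative patch: produces weight deltas from original parameters only."""

    def __init__(self, up: torch.Tensor, down: torch.Tensor, scale: float = 1.0):
        self.up = up      # (out_features, rank)
        self.down = down  # (rank, in_features)
        self.scale = scale

    def get_parameters(
        self, orig_parameters: Dict[str, torch.Tensor], weight: float
    ) -> Dict[str, torch.Tensor]:
        # The delta is computed on whatever device/dtype the original weight
        # currently lives on, so streamed (partially loaded) weights still work.
        orig_weight = orig_parameters["weight"]
        delta = (self.up @ self.down).to(device=orig_weight.device, dtype=orig_weight.dtype)
        return {"weight": weight * self.scale * delta}
```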
Ryan Dick
918f541af8 Add unit test for a SetParameterLayer patch applied to a CustomFluxRMSNorm layer. 2024-12-28 20:44:48 +00:00
Ryan Dick
93e76b61d6 Add CustomFluxRMSNorm layer. 2024-12-28 20:33:38 +00:00
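RMSNorm itself is simple: normalize by the root mean square over the last dimension, then apply a learned scale. A reference-style sketch of the formula, not the CustomFluxRMSNorm implementation:

```python
import torch


def rms_norm(x: torch.Tensor, scale: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    """y = x / sqrt(mean(x^2) + eps) * scale, computed over the last dimension."""
    rms = torch.rsqrt(x.pow(2).mean(dim=-1, keepdim=True) + eps)
    return x * rms * scale
```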
Ryan Dick
f2981979f9 Get custom layer patches working with all quantized linear layer types. 2024-12-27 22:00:22 +00:00
Ryan Dick
ef970a1cdc Add support for FluxControlLoRALayer in CustomLinear layers and add a unit test for it. 2024-12-27 21:00:47 +00:00
Ryan Dick
5ee7405f97 Add more unit tests for custom module LoRA patching: multiple LoRAs and ConcatenatedLoRALayers. 2024-12-27 19:47:21 +00:00
Ryan Dick
e24e386a27 Add support for patches to CustomModuleMixin and add a single unit test (more to come). 2024-12-27 18:57:13 +00:00
Ryan Dick
b06d61e3c0 Improve custom layer wrap/unwrap logic. 2024-12-27 16:29:48 +00:00
Ryan Dick
7d6ab0ceb2 Add a CustomModuleMixin class with a flag for enabling/disabling autocasting (since it incurs some runtime speed overhead.) 2024-12-26 20:08:30 +00:00
Ryan Dick
9692a36dd6 Use a fixture to parameterize tests in test_all_custom_modules.py so that a fresh instance of the layer under test is initialized for each test. 2024-12-26 19:41:25 +00:00
Ryan Dick
b0b699a01f Add unit test to test that isinstance(...) behaves as expected with custom module types. 2024-12-26 18:45:56 +00:00
Ryan Dick
a8b2c4c3d2 Add inference tests for all custom module types (i.e. to test autocasting from cpu to device). 2024-12-26 18:33:46 +00:00
Ryan Dick
03944191db Split test_autocast_modules.py into separate test files to mirror the source file structure. 2024-12-24 22:29:11 +00:00