InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI.git synced 2026-04-23 03:00:31 -04:00

Author	SHA1	Message	Date
Alexander Eichhorn	b92c6ae633	feat(flux2): add FLUX.2 klein model support (#8768 ) * WIP: feat(flux2): add FLUX 2 Kontext model support - Add new invocation nodes for FLUX 2: - flux2_denoise: Denoising invocation for FLUX 2 - flux2_klein_model_loader: Model loader for Klein architecture - flux2_klein_text_encoder: Text encoder for Qwen3-based encoding - flux2_vae_decode: VAE decoder for FLUX 2 - Add backend support: - New flux2 module with denoise and sampling utilities - Extended model manager configs for FLUX 2 models - Updated model loaders for Klein architecture - Update frontend: - Extended graph builder for FLUX 2 support - Added FLUX 2 model types and configurations - Updated readiness checks and UI components * fix(flux2): correct VAE decode with proper BN denormalization FLUX.2 VAE uses Batch Normalization in the patchified latent space (128 channels). The decode must: 1. Patchify latents from (B, 32, H, W) to (B, 128, H/2, W/2) 2. Apply BN denormalization using running_mean/running_var 3. Unpatchify back to (B, 32, H, W) for VAE decode Also fixed image normalization from [-1, 1] to [0, 255]. This fixes washed-out colors in generated FLUX.2 Klein images. * feat(flux2): add FLUX.2 Klein model support with ComfyUI checkpoint compatibility - Add FLUX.2 transformer loader with BFL-to-diffusers weight conversion - Fix AdaLayerNorm scale-shift swap for final_layer.adaLN_modulation weights - Add VAE batch normalization handling for FLUX.2 latent normalization - Add Qwen3 text encoder loader with ComfyUI FP8 quantization support - Add frontend components for FLUX.2 Klein model selection - Update configs and schema for FLUX.2 model types * Chore Ruff * Fix Flux1 vae probing * Fix Windows Paths schema.ts * Add 4B und 9B klein to Starter Models. * feat(flux2): add non-commercial license indicator for FLUX.2 Klein 9B - Add isFlux2Klein9BMainModelConfig and isNonCommercialMainModelConfig functions - Update MainModelPicker and InitialStateMainModelPicker to show license icon - Update license tooltip text to include FLUX.2 Klein 9B * feat(flux2): add Klein/Qwen3 variant support and encoder filtering Backend: - Add klein_4b/klein_9b variants for FLUX.2 Klein models - Add qwen3_4b/qwen3_8b variants for Qwen3 encoder models - Validate encoder variant matches Klein model (4B↔4B, 9B↔8B) - Auto-detect Qwen3 variant from hidden_size during probing Frontend: - Show variant field for all model types in ModelView - Filter Qwen3 encoder dropdown to only show compatible variants - Update variant type definitions (zFlux2VariantType, zQwen3VariantType) - Remove unused exports (isFluxDevMainModelConfig, isFlux2Klein9BMainModelConfig) * Chore Ruff * feat(flux2): add Klein 9B Base (undistilled) variant support Distinguish between FLUX.2 Klein 9B (distilled) and Klein 9B Base (undistilled) models by checking guidance_embeds in diffusers config or guidance_in keys in safetensors. Klein 9B Base requires more steps but offers higher quality. * feat(flux2): improve diffusers compatibility and distilled model support Backend changes: - Update text encoder layers from [9,18,27] to (10,20,30) matching diffusers - Use apply_chat_template with system message instead of manual formatting - Change position IDs from ones to zeros to match diffusers implementation - Add get_schedule_flux2() with empirical mu computation for proper schedule shifting - Add txt_embed_scale parameter for Qwen3 embedding magnitude control - Add shift_schedule toggle for base (28+ steps) vs distilled (4 steps) models - Zero out guidance_embedder weights for Klein models without guidance_embeds UI changes: - Clear Klein VAE and Qwen3 encoder when switching away from flux2 base - Clear Qwen3 encoder when switching between different Klein model variants - Add toast notification informing user to select compatible encoder * feat(flux2): fix distilled model scheduling with proper dynamic shifting - Configure scheduler with FLUX.2 Klein parameters from scheduler_config.json (use_dynamic_shifting=True, shift=3.0, time_shift_type="exponential") - Pass mu parameter to scheduler.set_timesteps() for resolution-aware shifting - Remove manual shift_schedule parameter (scheduler handles this automatically) - Simplify get_schedule_flux2() to return linear sigmas only - Remove txt_embed_scale parameter (no longer needed) This matches the diffusers Flux2KleinPipeline behavior where the FlowMatchEulerDiscreteScheduler applies dynamic timestep shifting based on image resolution via the mu parameter. Fixes 4-step distilled Klein 9B model quality issues. * fix(ui): fix FLUX.1 graph building with posCondCollect node lookup The posCondCollect node was created with getPrefixedId() which generates a random suffix (e.g., 'pos_cond_collect:abc123'), but g.getNode() was called with the plain string 'pos_cond_collect', causing a node lookup failure. Fix by declaring posCondCollect as a module-scoped variable and referencing it directly instead of using g.getNode(). * Remove Flux2 Klein Base from Starter Models * Remove Logging * Add Default Values for Flux2 Klein and add variant as additional info to from_base * Add migrations for the z-image qwen3 encoder without a variant value * Add img2img, inpainting and outpainting support for FLUX.2 Klein - Add flux2_vae_encode invocation for encoding images to FLUX.2 latents - Integrate inpaint_extension into FLUX.2 denoise loop for proper mask handling - Apply BN normalization to init_latents and noise for consistency in inpainting - Use manual Euler stepping for img2img/inpaint to preserve exact timestep schedule - Add flux2_img2img, flux2_inpaint, flux2_outpaint generation modes - Expand starter models with FP8 variants, standalone transformers, and separate VAE/encoders - Fix outpainting to always use full denoising (0-1) since strength doesn't apply - Improve error messages in model loader with clear guidance for standalone models * Add GGUF quantized model support and Diffusers VAE loader for FLUX.2 Klein - Add Main_GGUF_Flux2_Config for GGUF-quantized FLUX.2 transformer models - Add VAE_Diffusers_Flux2_Config for FLUX.2 VAE in diffusers format - Add Flux2GGUFCheckpointModel loader with BFL-to-diffusers conversion - Add Flux2VAEDiffusersLoader for AutoencoderKLFlux2 - Add FLUX.2 Klein 4B/9B hardware requirements to documentation - Update starter model descriptions to clarify dependencies install together - Update frontend schema for new model configs * Fix FLUX.2 model detection and add FP8 weight dequantization support - Improve FLUX.2 variant detection for GGUF/checkpoint models (BFL format keys) - Fix guidance_embeds logic: distilled=False, undistilled=True - Add FP8 weight dequantization for ComfyUI-style quantized models - Prevent FLUX.2 models from being misidentified as FLUX.1 - Preserve user-editable fields (name, description, etc.) on model reidentify - Improve Qwen3Encoder detection by variant in starter models - Add defensive checks for tensor operations * Chore ruff format * Chore Typegen * Fix FLUX.2 Klein 9B model loading by detecting hidden_size from weights Previously num_attention_heads was hardcoded to 24, which is correct for Klein 4B but causes size mismatches when loading Klein 9B checkpoints. Now dynamically calculates num_attention_heads from the hidden_size dimension of context_embedder weights: - Klein 4B: hidden_size=3072 → num_attention_heads=24 - Klein 9B: hidden_size=4096 → num_attention_heads=32 Fixes both Checkpoint and GGUF loaders for FLUX.2 models. * Only clear Qwen3 encoder when FLUX.2 Klein variant changes Previously the encoder was cleared whenever switching between any Klein models, even if they had the same variant. Now compares the variant of the old and new model and only clears the encoder when switching between different variants (e.g., klein_4b to klein_9b). This allows users to switch between different Klein 9B models without having to re-select the Qwen3 encoder each time. * Add metadata recall support for FLUX.2 Klein parameters The scheduler, VAE model, and Qwen3 encoder model were not being recalled correctly for FLUX.2 Klein images. This adds dedicated metadata handlers for the Klein-specific parameters. * Fix FLUX.2 Klein denoising scaling and Z-Image VAE compatibility - Apply exponential denoising scaling (exponent 0.2) to FLUX.2 Klein, matching FLUX.1 behavior for more intuitive inpainting strength - Add isFlux1VAEModelConfig type guard to filter FLUX 1.0 VAEs only - Restrict Z-Image VAE selection to FLUX 1.0 VAEs, excluding FLUX.2 Klein 32-channel VAEs which are incompatible * chore pnpm fix * Add FLUX.2 Klein to starter bundles and documentation - Add FLUX.2 Klein hardware requirements to quick start guide - Create flux2_klein_bundle with GGUF Q4 model, VAE, and Qwen3 encoder - Add "What's New" entry announcing FLUX.2 Klein support * Add FLUX.2 Klein built-in reference image editing support FLUX.2 Klein has native multi-reference image editing without requiring a separate model (unlike FLUX.1 which needs a Kontext model). Backend changes: - Add Flux2RefImageExtension for encoding reference images with FLUX.2 VAE - Apply BN normalization to reference image latents for correct scaling - Use T-coordinate offset scale=10 like diffusers (T=10, 20, 30...) - Concatenate reference latents with generated image during denoising - Extract only generated portion in step callback for correct preview Frontend changes: - Add flux2_reference_image config type without model field - Hide model selector for FLUX.2 reference images (built-in support) - Add type guards to handle configs without model property - Update validators to skip model validation for FLUX.2 - Add 'flux2' to SUPPORTS_REF_IMAGES_BASE_MODELS * Chore windows path fix * Add reference image resizing for FLUX.2 Klein Resize large reference images to match BFL FLUX.2 sampling.py limits: - Single reference: max 2024² pixels (~4.1M) - Multiple references: max 1024² pixels (~1M) Uses same scaling approach as BFL's cap_pixels() function.	2026-01-26 23:21:37 -05:00
Lincoln Stein	56fd7bc7c4	docs(z-image) add Z-Image requirements and starter bundle (#8734 ) * docs(z-image) add minimum requirements for Z-Image and create Z-Image starter bundle * fix(model manager) add flux VAE to Z-Image bundle * docs(model manager) remove out-of-date model info link * chore: fix frontendchecks * chore: lint:prettier * docs(model manager): clarify minimum hardware for z-image turbo * (fix) add flux VAE to ZIT starter dependencies & tweak UI docs	2026-01-04 10:17:26 -05:00
Lincoln Stein	36e400dd5d	(chore) Update requirements to python 3.11-12 (#8657 ) * (chore) update requirements to python 3.11-12 * update uv.lock	2025-11-08 21:29:43 -05:00
psychedelicious	bb153b55d3	docs: update quick start	2025-08-21 21:26:09 +10:00
psychedelicious	93ef637d59	docs: update latest release links	2025-08-21 21:26:09 +10:00
Heathen711	1cdd4b5980	bugfix(docs) link syntax	2025-07-17 04:26:06 +00:00
Heathen711	c84f8465b8	bugfix(pyproject) Convert from dependency groups to extras and update docks to use UV's built in torch support	2025-07-17 03:58:26 +00:00
DustyShoe	4077ffe595	Fixed a typo	2025-06-30 15:44:23 +10:00
Jonathan	66f6571086	Update manual installation for v5.12.0	2025-05-22 09:00:58 -04:00
psychedelicious	361c6eed4b	docs: update manual install docs w/ correct pytorch indicies for v5.10.0 and later	2025-04-17 10:32:41 +10:00
psychedelicious	7da43be4b7	docs: fix incorrect filename	2025-04-07 10:57:32 +10:00
psychedelicious	8561e9e540	docs: remove legacy scripts documentation	2025-04-07 10:57:32 +10:00
Eugene Brodsky	6bb102f860	modify docs for python 3.12	2025-04-04 18:42:13 +11:00
Chantell	2b5da91beb	Update manual.md Removed a redundancy of package specifier on step 6.	2025-04-04 16:52:04 +11:00
psychedelicious	c194281f4d	docs: install troubleshooting	2025-02-08 10:40:04 +11:00
psychedelicious	0747a5f464	docs: add link to low vram in requirements	2025-02-07 12:14:23 +11:00
psychedelicious	6efd108481	docs: typo in manual docs install command Thanks to ShaneDK on discord for catching this.	2025-01-23 14:57:22 +11:00
psychedelicious	d9c099bd3a	docs: fix incorrect macOS launcher fix command	2025-01-09 11:26:59 +11:00
psychedelicious	03f7bdc9f9	docs: fix manual install rocm pypi indices	2025-01-07 17:17:40 -05:00
psychedelicious	157b92e0fd	docs: no need to specify version for dev env setup	2025-01-03 10:59:39 -05:00
psychedelicious	5e9227c052	docs: update manual install docs to mirror the launcher's install method	2025-01-03 14:27:45 +11:00
psychedelicious	de0043f443	docs: update download links for launcher	2024-12-23 13:23:14 +11:00
psychedelicious	9adcd2cc31	docs: update install-related docs	2024-12-20 17:01:34 +11:00
Kent Keirsey	91a4160e36	Update Installation Docs	2024-12-20 17:01:34 +11:00
nirmal0001	2b74263007	Update patchmatch.md add required Install dependencies for arch linux	2024-10-31 16:01:57 +11:00
psychedelicious	f3f88dba47	docs: clean up and update lots of stuff	2024-09-22 17:10:14 +03:00
psychedelicious	b533f389e5	docs: update links to python installers	2024-09-22 17:10:14 +03:00
psychedelicious	a890531acf	docs: update installer tip about updating	2024-09-22 17:10:14 +03:00
psychedelicious	4f774d2f47	docs: rename installation files	2024-09-22 17:10:14 +03:00
Eugene Brodsky	b672cc37a7	docs: overhaul Docker documentation, add to main README	2024-07-09 09:47:29 -04:00
psychedelicious	124d34a8cc	docs: add link for `--extra-index-url`	2024-05-19 00:56:31 +10:00
Shukri	e8387d7523	docs: add link to tool on pytorch website	2024-05-19 00:56:31 +10:00
Shukri	a5d08c981b	docs: fix typo in --root arg of invokeai-web	2024-05-19 00:56:31 +10:00
Shukri	811d0da0f0	docs: fix link to. install reqs	2024-05-19 00:56:31 +10:00
psychedelicious	3b1743b7c2	docs: fix install reqs link	2024-05-16 10:37:42 +10:00
gogurtenjoyer	63e62c5720	Update INSTALL_REQUIREMENTS.md - 'linux only' under AMD for SDXL. Moved 'Linux only.' back from under NVIDIA to under AMD for the SDXL hardware requirements.	2024-05-09 10:56:23 -04:00
Kent Keirsey	ab87511a03	Update INSTALLATION.md	2024-05-03 17:31:50 +10:00
Kent Keirsey	af868b0ea6	Update 010_INSTALL_AUTOMATED.md	2024-05-03 17:31:50 +10:00
psychedelicious	636ece323f	Update INSTALL_DEVELOPMENT.md	2024-04-14 15:24:00 +10:00
psychedelicious	38718d8c65	docs: update 020_INSTALL_MANUAL.md, remove conda section	2024-04-04 11:28:09 +11:00
psychedelicious	98ab387e2b	docs: update 020_INSTALL_MANUAL.md Redo the install the package section. It was inaccurate with respect to extra index URLs.	2024-04-04 10:54:23 +11:00
psychedelicious	a0ae2f37d7	docs: update 020_INSTALL_MANUAL.md Tidy verbiage around the invokeai root and how it is determined	2024-04-04 10:54:23 +11:00
psychedelicious	f887e030bb	docs: update 010_INSTALL_AUTOMATED.md Remove reference to the autodetect GPU device option.	2024-04-04 08:43:17 +11:00
psychedelicious	50951439bd	docs: fix broken link	2024-04-03 17:36:15 +11:00
psychedelicious	58fd8bb8a5	docs: update 050_INSTALLING_MODELS.md Fix a couple missing links, add blurb about how to use scan folder to replicate autoimport.	2024-03-28 12:35:41 +11:00
psychedelicious	ec1bafdef5	docs: update INSTALL_DEVELOPMENT.md - Add blurb about `scan_models_on_startup` - Add blurb about editable install	2024-03-28 12:35:41 +11:00
psychedelicious	49a647ad00	docs: remove most references to autoimport There's still a few references in `WEB.md` but this doc is very outdated and needs to be totally redone. It's hard to just remove the references without redoing a lot more. Will need to follow up revising this doc.	2024-03-28 12:35:41 +11:00
psychedelicious	237ac58dae	docs: merge INSTALL_TROUBLESHOOTING into FAQ These two docs had overlap and were kinda the same thing.	2024-03-27 18:59:55 +05:30
psychedelicious	9b9b7a7071	docs: add warning about speed to malloc docs	2024-03-27 08:45:54 +11:00
psychedelicious	6ce82a41d5	docs: update docs for malloc change	2024-03-27 08:45:54 +11:00

1 2 3 4 5 ...

301 Commits