InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI.git synced 2026-04-23 03:00:31 -04:00

Author	SHA1	Message	Date
CypherNaugh_0x	9deb545cc1	External models (Gemini Nano Banana & OpenAI GPT Image) (#8633 ) (#8884 ) * feat: initial external model support * feat: support reference images for external models * fix: sorting lint error * chore: hide Reidentify button for external models * review: enable auto-install/remove fro external models * feat: show external mode name during install * review: model descriptions * review: implemented review comments * review: added optional seed control for external models * chore: fix linter warning * review: save api keys to a seperate file * docs: updated external model docs * chore: fix linter errors * fix: sync configured external starter models on startup * feat(ui): add provider-specific external generation nodes * feat: expose external panel schemas in model configs * feat(ui): drive external panels from panel schema * docs: sync app config docstring order * feat: add gemini 3.1 flash image preview starter model * feat: update gemini image model limits * fix: resolve TypeScript errors and move external provider config to api_keys.yaml Add 'external', 'external_image_generator', and 'external_api' to Zod enum schemas (zBaseModelType, zModelType, zModelFormat) to match the generated OpenAPI types. Remove redundant union workarounds from component prop types and Record definitions. Fix type errors in ModelEdit (react-hook-form Control invariance), parsing.tsx (model identifier narrowing), buildExternalGraph (edge typing), and ModelSettings import/export buttons. Move external_gemini_base_url and external_openai_base_url into api_keys.yaml alongside the API keys so all external provider config lives in one dedicated file, separate from invokeai.yaml. * feat: add resolution presets and imageConfig support for Gemini 3 models Add combined resolution preset selector for external models that maps aspect ratio + image size to fixed dimensions. Gemini 3 Pro and 3.1 Flash now send imageConfig (aspectRatio + imageSize) via generationConfig instead of text-based aspect ratio hints used by Gemini 2.5 Flash. Backend: ExternalResolutionPreset model, resolution_presets capability field, image_size on ExternalGenerationRequest, and Gemini provider imageConfig logic. Frontend: ExternalSettingsAccordion with combo resolution select, dimension slider disabling for fixed-size models, and panel schema constraint wiring for Steps/Guidance/Seed controls. * Remove unused external model fields and add provider-specific parameters - Remove negative_prompt, steps, guidance, reference_image_weights, reference_image_modes from external model nodes (unused by any provider) - Remove supports_negative_prompt, supports_steps, supports_guidance from ExternalModelCapabilities - Add provider_options dict to ExternalGenerationRequest for provider-specific parameters - Add OpenAI-specific fields: quality, background, input_fidelity - Add Gemini-specific fields: temperature, thinking_level - Add new OpenAI starter models: GPT Image 1.5, GPT Image 1 Mini, DALL-E 3, DALL-E 2 - Fix OpenAI provider to use output_format (GPT Image) vs response_format (DALL-E) and send model ID in requests - Add fixed aspect ratio sizes for OpenAI models (bucketing) - Add ExternalProviderRateLimitError with retry logic for 429 responses - Add provider-specific UI components in ExternalSettingsAccordion - Simplify ParamSteps/ParamGuidance by removing dead external overrides - Update all backend and frontend tests * Chore Ruff check & format * Chore typegen * feat: full canvas workflow integration for external models - Add missing aspect ratios (4:5, 5:4, 8:1, 4:1, 1:4, 1:8) to type system for external model support - Sync canvas bbox when external model resolution preset is selected - Use params preset dimensions in buildExternalGraph to prevent "unsupported aspect ratio" errors - Lock all bbox controls (resize handles, aspect ratio select, width/height sliders, swap/optimal buttons) for external models with fixed dimension presets - Disable denoise strength slider for external models (not applicable) - Sync bbox aspect ratio changes back to paramsSlice for external models - Initialize bbox dimensions when switching to an external model * Chore typegen Linux seperator * feat: full canvas workflow integration for external models - Update buildExternalGraph test to include dimensions in mock params * Merge remote-tracking branch 'upstream/main' into external-models * Chore pnpm fix * add missing parameter * docs: add External Models guide with Gemini and OpenAI provider pages * fix(external-models): address PR review feedback - Gemini recall: write temperature, thinking_level, image_size to image metadata; wire external graph as metadata receiver; add recall handlers. - Canvas: gate regional guidance, inpaint mask, and control layer for external models. - Canvas: throw a clear error on outpainting for external models (was falling back to inpaint and hitting an API-side mask/image size mismatch). - Workflow editor: add ui_model_provider_id filter so OpenAI and Gemini nodes only list their own provider's models. - Workflow editor: silently drop seed when the selected model does not support it instead of raising a capability error. - Remove the legacy external_image_generation invocation and the graph-builder fallback; providers must register a dedicated node. - Regenerate schema.ts. - remove Gemini debug dumps to outputs/external_debug * fix(external-models): resolve TSC errors in metadata parsing and external graph - Export imageSizeChanged from paramsSlice (required by the new ImageSize recall handler). - Emit the external graph's metadata model entry via zModelIdentifierField since ExternalApiModelConfig is not part of the AnyModelConfig union. * chore: prettier format ModelIdentifierFieldInputComponent * fix: remove unsupported thinkingConfig from Gemini image models and restrict GPT Image models to txt2img * chore typegen * chore(docs): regenerate settings.json for external provider fields * fix(external): fix mask handling and mode support for external providers - Remove img2img and inpaint modes from Gemini models (Gemini has no bitmap mask or dedicated edit API; image editing works via reference images in the UI) - Fix DALL-E 2 inpainting: convert grayscale mask to RGBA with alpha channel transparency (OpenAI expects transparent=edit area) and convert init image to RGBA when mask is present * fix(external): update mode support and UI for external providers - Remove DALL-E 2 from starter models (deprecated, shutdown May 12 2026) - Enable img2img for GPT Image 1/1.5/1-mini (supports edits endpoint) - Set Gemini models to txt2img only (no mask/edit API; editing via ref images) - Hide mode/init_image/mask_image fields on Gemini node (not usable) - Hide mask_image field on OpenAI node (no model supports inpaint) * Chore typegen * fix(external): improve OpenAI node UX and disable cache by default - Hide OpenAI node's mode and init_image fields: OpenAI's API has no img2img/inpaint distinction (the edits endpoint is invoked automatically when reference images are provided). init_image is functionally identical to a reference image and was misleading users. - Default use_cache to False for external image generation nodes: external API calls are non-deterministic and incur usage costs. Cache hits returned stale image references that did not produce new gallery entries on repeat invokes. * fix(external): duplicate cached images on cache hit instead of skipping External image generation nodes use the standard invocation cache, but returning the cached output (with stale image_name references) on cache hits resulted in no new gallery entries — the Invoke button would spin indefinitely on repeat invokes with identical parameters. Override invoke_internal so that on cache hit, the cached images are loaded and re-saved as new gallery entries. The expensive API call is still skipped (cost saving), but the user sees a new image as expected. * Chore typegen + ruff * CHore ruff format * fix(external): restore OpenAI advanced settings on Remix recall Remix recall iterates through ImageMetadataHandlers but only Gemini's temperature handler was wired up — OpenAI's quality, background, and input_fidelity were stored in image metadata but never parsed back into the params slice. Add the three missing handlers so Remix restores these settings as expected. --------- Co-authored-by: Alexander Eichhorn <alex@eichhorn.dev> Co-authored-by: Alexander Eichhorn <alex@code-with.us> Co-authored-by: Lincoln Stein <lincoln.stein@gmail.com>	2026-04-20 17:13:26 +00:00
Lincoln Stein	3c17a569ce	Revert "Revert "New Documentation Fixes (#9061 )" (#9065 )" (#9066 ) This reverts commit `b513a3d3c6`.	2026-04-18 16:45:32 -04:00
Lincoln Stein	b513a3d3c6	Revert "New Documentation Fixes (#9061 )" (#9065 ) This reverts commit `7eaf1d5bd0`.	2026-04-17 17:32:23 -04:00
Josh Corbett	7eaf1d5bd0	New Documentation Fixes (#9061 ) * fix(docs): swap `docs-new` to `docs` also renames `docs` to `docs-old`, purely for preservation purposes. * feat(docs): clarify compel syntax availability --------- Co-authored-by: joshistoast <me@joshcorbett.com>	2026-04-17 08:10:10 -04:00
Josh Corbett	9643b1385f	Docs Overhaul (#8896 ) * feat(docs): new docs scaffold * feat(docs): update alternate launchers section * feat(docs): add contributor section * fix(docs): update description of lynxhub launcher mention * feat(docs): add more docs * feat(docs): setup index page * feat(docs): add more docs, rewrote a few pages * feat(docs): add todo * feat(docs): set up internationalization * fix(docs): admonition typo * feat(docs): add invoke styles * feat(docs): add more invoke styling, revamp splash page, remove theme switcher * fix(docs): expressive code sh styles without title * chore(docs): cleanup readme * chore(docs): add new github pages workflow * fix(docs): remove base path * chore(docs): add initial translations CI, powered by Crowdin * feat(docs): upgrade astro * feat(docs): enhance new contributor guide * feat(docs): various enhancements - improve homepage; - enhance some docs pages; - override some layout components; - enhance interactivity and qol styling; - create new download page + component; - add llms.txt; - remove unused logo component; * feat(docs): isolate new docs * style(docs): use md reference links over utility links * chore(docs): specify package manager * feat(docs): releases page * feat(docs): add page context menus * feat(docs): sort workflows sidebar items * fix(docs): relative links on homepage * feat(docs): add text tool and recall params api guides * feat(docs): fix faq links, create models concept page * chore(docs): set CI to new dir, update deployment url * feat(docs): generate settings and api json for pages - update deploy script - add api and settings component to render generated json - increase page content width * style(docs): remove relative path for component import * fix(docs): resolve tests by regenerating json * fix(docs): fixing the test for real this time - sorts openapi output map required field - missing `__name__` attributes - resolved components name keyerror * feat(docs): finish 'adding nodes' page * feat(docs): upgrade astro + starlight, add link tester * chore(docs): upgrade astro * feat(docs): add prompting guides * fix(docs): generated openapi * fix(docs): ci node version * fix(docs): invalid links * fix(docs): md aside formatting * feat(docs): reorder 'configuration' category * feat(docs): change contributor checklist to steps list * chore(docs): upgrade deps * feat(docs): splash page image styling * feat(docs): add gallery marquee to homepage * feat(docs): add splash page marquee gallery * feat(docs): remove openapi generation * fix(docs): regenerate settings json * fix(docs): json generation test --------- Co-authored-by: joshistoast <me@joshcorbett.com> Co-authored-by: Lincoln Stein <lincoln.stein@gmail.com>	2026-04-16 22:03:05 -04:00
Alexander Eichhorn	acd4157bdf	feat(ui): add canvas project save/load (.invk format) (#8917 ) * feat(ui): add canvas project save/load (.invk format) Add ZIP-based .invk file format to save and restore the entire canvas state including all layers, masks, reference images, generation parameters, LoRAs, and embedded image files. Images are deduplicated on load - only missing images are re-uploaded from the project file. - Always clear LoRAs on project load, even when project has none - Fix jszip dependency ordering in package.json - Add useAssertSingleton to SaveCanvasProjectDialog for consistency - Add concurrency limit (max 5) for image fetch/upload requests - Remove redundant deep-clone in remapCroppableImage (mutate in-place) - Default project name to "Canvas Project" instead of empty string * Chore pnpm fix	2026-04-14 01:01:57 +00:00
Valeri Che	441821ca03	Feat(canvas): Add Lasso Tool with Freehand and Polygon modes (#8908 ) * Feat(Canvas): Add Lasso tool with Freehand and Polygon modes * Refine Lasso modes behavior and optimisation. * Fix: Pettier * added docs/features/Lasso_tool.md * Fix: Removed restrictions mentioned in PR's conversation: 1. Disabled when there is no visible raster content 2. Lasso is blocked when all inpaint masks are globally hidden. --------- Co-authored-by: dunkeroni <dunkeroni@gmail.com>	2026-04-14 00:22:34 +00:00
DustyShoe	e9246c1899	Feature(UI): Add text tool to canvas (#8723 ) * Initial mashup of mentioned feature. Still need to resolve some quirks and kinks. * Clean text tool integration * Fixed text tool opions bar jumping and added more fonts * Touch up for cursor styling * Minor addition to doc file * Appeasing frontend checks * Prettier fix * knip fixes * Added safe zones to font selection and color picker to be clickable without commiting text. * Removed color probing on cursor and added dynamic font display for fallback, minor tweaks * Finally fixed the text shifting on commit * Cursor now represent actual input field size. Tidy up options UI * Some strikethrough and underline line tweaks * Replaced the focus retry loop with a callback‑ref based approach in in CanvasTextOverlay.tsx Renamed containerMetrics to textContainerData in CanvasTextOverlay.tsx Fixed mouse cursor disapearing during typing. * Added missing localistaion string * Moved canvas-text-tool.md to docs/contributing/frontend * ui: Improve functionality of the text toolbar Few things done with this commit. - The varying size of the font selector box has been fixed. The UI no longer shifts and moves with font change. - We no longer format the font size input to add px each time. Instead now just have a permanent px indicator. - The bug with the random text inputs on the slider value has also been fixed. - The font size value is only committed on blur keeping it consistent with other editing apps. - Fixed the spacing of the toolbar to make it look cleaner. - Font size now permits increments of 1. * Added autoselect text in font size on click allowing immediate imput * Improvement: Added uncommited layer state with CTRL-move and options to select line spacing. * Added rotation handle to rotate uncommiitted text layer. * Fix: Redirect user facing labels to use localization file + Add tool discription to docs * Fixed box padding. Disable tool swich when text input is active, added message on canvas for better UX. * Updated Text tool description * Updated Text tool description * Typo * Add draggable text-box border with improved cursor feedback and larger hit targets. Supress hotkeys on uncommitted text. * Lint * Fix(bug): text commit to link uploaded image assets instead of embedding full base64 --------- Co-authored-by: Lincoln Stein <lincoln.stein@gmail.com> Co-authored-by: blessedcoolant <54517381+blessedcoolant@users.noreply.github.com>	2026-02-20 01:43:32 +00:00
Lincoln Stein	b23f18734b	feat(model_manager): Add scan and delete of orphaned models (#8826 ) * Add script and UI to remove orphaned model files - This commit adds command-line and Web GUI functionality for identifying and optionally removing models in the models directory that are not referenced in the database. Co-authored-by: lstein <111189+lstein@users.noreply.github.com> * Add backend service and API routes for orphaned models sync Co-authored-by: lstein <111189+lstein@users.noreply.github.com> Add expandable file list to orphaned models dialog Co-authored-by: lstein <111189+lstein@users.noreply.github.com> * Fix cache invalidation after deleting orphaned models Co-authored-by: lstein <111189+lstein@users.noreply.github.com> * (bugfix) improve status messages * docs(backend): add info on the orphaned model detection/removal feature * Update docs/features/orphaned_model_removal.md --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: lstein <111189+lstein@users.noreply.github.com> Co-authored-by: dunkeroni <dunkeroni@gmail.com>	2026-02-06 22:32:10 +00:00
Alexander Eichhorn	a2e109b3c2	feat(ui): improve hotkey customization UX with interactive controls and validation (#8649 ) * feat: remove the ModelFooter in the ModelView and add the Delete Model Button from the Footer into the View * forget to run pnpm fix * chore(ui): reorder the model view buttons * Initial plan * Add customizable hotkeys infrastructure with UI Co-authored-by: dunkeroni <3298737+dunkeroni@users.noreply.github.com> * Fix ESLint issues in HotkeyEditor component Co-authored-by: dunkeroni <3298737+dunkeroni@users.noreply.github.com> * Fix knip unused export warning Co-authored-by: dunkeroni <3298737+dunkeroni@users.noreply.github.com> * Add tests for hotkeys slice Co-authored-by: dunkeroni <3298737+dunkeroni@users.noreply.github.com> * Fix tests to actually call reducer and add documentation Co-authored-by: dunkeroni <3298737+dunkeroni@users.noreply.github.com> * docs: add comprehensive hotkeys system documentation - Created new HOTKEYS.md technical documentation for developers explaining architecture, data flow, and implementation details - Added user-facing hotkeys.md guide with features overview and usage instructions - Removed old CUSTOMIZABLE_HOTKEYS.md in favor of new split documentation - Expanded documentation with detailed sections on: - State management and persistence - Component architecture and responsibilities - Developer integration * Behavior changed to hotkey press instead of input + checking for allready used hotkeys --------- Co-authored-by: blessedcoolant <54517381+blessedcoolant@users.noreply.github.com> Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: dunkeroni <3298737+dunkeroni@users.noreply.github.com> Co-authored-by: Lincoln Stein <lincoln.stein@gmail.com>	2025-11-16 14:35:37 +00:00
Ryan Dick	4c86a7ecbf	Update Low-VRAM docs guidance around max_cache_vram_gb.	2025-02-28 17:18:57 -05:00
Ryan Dick	3af7fc26fa	Update low-vram docs with info abhout .	2025-02-28 21:39:09 +00:00
Ryan Dick	66bc225bd3	Add a troubleshooting instructions for the Windows page file issue to the Low-VRAM docs.	2025-01-20 08:58:41 +11:00
Ryan Dick	ce57c4ed2e	Update the Low-VRAM docs.	2025-01-16 23:46:07 +00:00
psychedelicious	fc8cf224ca	docs: typo	2025-01-09 11:20:05 +11:00
psychedelicious	3e1ed18a1f	Update docs/features/low-vram.md Co-authored-by: Ryan Dick <ryanjdick3@gmail.com>	2025-01-09 11:20:05 +11:00
psychedelicious	9a84c85486	docs: add section about disabling the sysmem fallback	2025-01-09 11:20:05 +11:00
psychedelicious	b15dd00840	docs: add docs for low vram mode	2025-01-09 11:20:05 +11:00
psychedelicious	f3f88dba47	docs: clean up and update lots of stuff	2024-09-22 17:10:14 +03:00
psychedelicious	0dcb4dbc54	docs: remove ancient prompts doc	2024-09-22 17:10:14 +03:00
psychedelicious	9c1749920e	docs: tidying	2024-09-22 17:10:14 +03:00
psychedelicious	b6190651ad	docs: remove ancient TI docs	2024-09-22 17:10:14 +03:00
psychedelicious	9f6ba48c57	docs: remove ancient loras docs	2024-09-22 17:10:14 +03:00
psychedelicious	d08a145811	docs: remove ancient controlnet docs	2024-09-22 17:10:14 +03:00
psychedelicious	2703c9ff0c	docs: remove ancient NSFW/watermark docs	2024-09-22 17:10:14 +03:00
psychedelicious	e1305e1e54	docs: update configuration docs layout	2024-09-22 17:10:14 +03:00
psychedelicious	be96fc0157	docs: remove ancient utilities docs	2024-09-22 17:10:14 +03:00
psychedelicious	ea6d08ac23	docs: remove ancient model merging docs	2024-09-22 17:10:14 +03:00
psychedelicious	6a1aa54fba	docs: remove ancient logging docs	2024-09-22 17:10:14 +03:00
psychedelicious	08df30377e	docs: fix incorrect info in database.md	2024-09-22 17:10:14 +03:00
psychedelicious	5ab50b1193	docs: remove ancient "image management" (?) docs	2024-09-22 17:10:14 +03:00
psychedelicious	8173987a10	docs: remove ancient ui docs	2024-09-22 17:10:14 +03:00
omahs	0653f3ad87	fix typo	2024-09-19 05:40:54 +03:00
omahs	fb0d6b9387	fix typos	2024-09-19 05:40:54 +03:00
omahs	6c03cb4f7b	fix typo	2024-09-19 05:40:54 +03:00
omahs	f72a038689	fix typo	2024-09-19 05:40:54 +03:00
omahs	958fa569f7	fix typo	2024-09-19 05:40:54 +03:00
omahs	58e8887b48	fix typos	2024-09-19 05:40:54 +03:00
Shukri	7badaab17d	docs: fix link to invoke ai models site	2024-05-20 20:48:42 -07:00
Kent Keirsey	960eae8255	Update TRAINING.md	2024-05-03 17:30:42 +10:00
psychedelicious	3595beac1e	docs: remove references to config script in CONFIGURATION.md	2024-04-25 17:49:32 -04:00
sarashinai	fbfa29c2ef	Update GALLERY.md	2024-04-14 16:46:31 +10:00
sarashinai	9ee7b951eb	Update GALLERY.md	2024-04-14 16:46:31 +10:00
sarashinai	29dd1bb35b	Update GALLERY.md	2024-04-14 16:46:31 +10:00
sarashinai	68d8a2497e	Update GALLERY.md	2024-04-14 16:46:31 +10:00
sarashinai	4b171fa696	Creation of GALLERY.md and related images First draft of the walkthrough of the Gallery right-hand panel	2024-04-14 16:46:31 +10:00
sarashinai	d0beb45431	Create GALLERY.md	2024-04-14 16:46:31 +10:00
sarashinai	e724781a80	Update WEB.md Correct stated location of Gallery panel.	2024-04-14 16:46:31 +10:00
Ryan Dick	54327ec4a7	Remove documentation references to prompt-to-prompt cross-attention control.	2024-04-09 10:57:02 -04:00
psychedelicious	49a647ad00	docs: remove most references to autoimport There's still a few references in `WEB.md` but this doc is very outdated and needs to be totally redone. It's hard to just remove the references without redoing a lot more. Will need to follow up revising this doc.	2024-03-28 12:35:41 +11:00

1 2 3 4 5 ...

327 Commits