AutoGPT

mirror of https://github.com/Significant-Gravitas/AutoGPT.git synced 2026-04-30 03:00:41 -04:00

Author	SHA1	Message	Date
Bentlybro	cb8cf81be7	feat(platform): Add LLM registry public read API Implements public GET endpoints for querying LLM models and providers - Part 3 of 6 in the incremental registry rollout. Endpoints: - GET /api/llm/models - List all models (filterable by enabled_only) - GET /api/llm/providers - List providers with their models Design: - Uses in-memory registry from PR 2 (no DB queries) - Fast reads from cache populated at startup - Grouped by provider for easy UI rendering Response models: - LlmModel - model info with capabilities, costs, creator - LlmProvider - provider with nested models - LlmModelsResponse - list + total count - LlmProvidersResponse - grouped by provider Authentication: - Requires user auth (requires_user dependency) - Public within authenticated sessions Integration: - Registered in rest_api.py at /api prefix - Tagged with v2 + llm for OpenAPI grouping What's NOT included (later PRs): - Admin write API (PR 4) - Block integration (PR 5) - Redis cache (PR 6) Lines: ~180 total Files: 4 (3 new, 1 modified) Review time: < 10 minutes	2026-04-13 15:49:46 +01:00
Bentlybro	ef30c1ed76	fix(backend/llm-registry): guard subscription task lifecycle, top-level imports, tighten error match	2026-04-13 15:46:42 +01:00
Bentlybro	b5f63c13a4	test(backend/llm-registry): comprehensive registry + notifications tests registry_test.py (+8 tests): - clear_registry_cache, get_model (found/not found), get_all_models, get_enabled_models, get_all_model_slugs_for_validation - refresh_llm_registry error re-raise notifications_test.py (new, 9 tests): - publish: happy path and Redis error swallowed - subscribe: valid message triggers on_refresh, non-message types ignored, wrong channel ignored, None (timeout) handled, multiple messages, CancelledError stops loop, connection error triggers reconnect	2026-04-13 15:44:11 +01:00
Bentlybro	c5dfe3333d	feat(backend/llm-registry): add Redis-backed cache and cross-process pub/sub sync - Wrap DB fetch with @cached(shared_cache=True) so results are stored in Redis automatically — other workers skip the DB on warm cache - Add notifications.py with publish/subscribe helpers using llm_registry:refresh pub/sub channel for cross-process invalidation - clear_registry_cache() invalidates the shared Redis entry before a forced DB refresh (called by admin mutations) - rest_api.py: start a background subscription task so every worker reloads its in-process cache when another worker refreshes the registry	2026-04-13 15:44:11 +01:00
Bentlybro	696b273afc	fix(registry): switch to Pydantic models, add typed capabilities, add unit tests - Replace frozen dataclasses with Pydantic BaseModel(frozen=True) for true immutability - Add typed boolean fields for model capabilities (supports_tools, etc.) - Add comprehensive unit tests for registry module - Addresses Majdyz review feedback on PR #12359	2026-04-13 15:44:11 +01:00
Bentlybro	732365cd8f	fix(registry): address Majdyz review - extract helper, fix schema prefix, return copies, remove re-export	2026-04-13 15:44:10 +01:00
Bentlybro	7e85371ce5	style: fix trailing whitespace in registry.py	2026-04-13 15:43:27 +01:00
Bentlybro	3f964c8aba	fix(startup): handle missing AgentNode table in migrate_llm_models Tests fail with 'relation "platform.AgentNode" does not exist' because migrate_llm_models() runs during startup and queries a table that doesn't exist in fresh test databases. This is an existing bug in the codebase - the function has no error handling. Wrap the call in try/except to gracefully handle test environments where the AgentNode table hasn't been created yet.	2026-04-13 15:43:27 +01:00
Bentlybro	ad1f489c5c	refactor: address CodeRabbit/Majdyz review feedback - Fix ModelMetadata duplicate type collision by importing from blocks.llm - Remove _json_to_dict helper, use dict() inline - Add warning when Provider relation is missing (data corruption indicator) - Optimize get_default_model_slug with next() (single sort pass) - Optimize _build_schema_options to use list comprehension - Move llm_registry import to top-level in rest_api.py - Ensure max_output_tokens falls back to context_window when null All critical and quick-win issues addressed.	2026-04-13 15:43:27 +01:00
Bentlybro	9c77a2207f	fix: address Sentry/CodeRabbit critical and major issues CRITICAL FIX - ModelMetadata instantiation: - Removed non-existent 'supports_vision' argument - Added required fields: display_name, provider_name, creator_name, price_tier - Handle nullable DB fields (Creator, priceTier, maxOutputTokens) safely - Fallback: creator_name='Unknown' if no Creator, price_tier=1 if invalid MAJOR FIX - Preserve pricing unit: - Added 'unit' field to RegistryModelCost dataclass - Prevents RUN vs TOKENS ambiguity in cached costs - Convert Prisma enum to string when building cost objects MAJOR FIX - Deterministic default model: - Sort recommended models by display_name before selection - Prevents non-deterministic results when multiple models are recommended - Ensures consistent default across refreshes STARTUP IMPROVEMENT: - Added comment: graceful fallback OK for now (no blocks use registry yet) - Will be stricter in PR #5 when block integration lands - Added success log message for registry refresh Fixes identified by Sentry (critical TypeError) and CodeRabbit review.	2026-04-13 15:43:27 +01:00
Bentlybro	081fa9f2db	feat(platform): Add LLM registry core - DB layer + in-memory cache Implements the registry core for dynamic LLM model management: DB Layer: - Fetch models with provider, costs, and creator relations - Prisma query with includes for related data - Convert DB records to typed dataclasses In-memory Cache: - Global dict for fast model lookups - Atomic cache refresh with lock protection - Schema options generation for UI dropdowns Public API: - get_model(slug) - lookup by slug - get_all_models() - all models (including disabled) - get_enabled_models() - enabled models only - get_schema_options() - UI dropdown data - get_default_model_slug() - recommended or first enabled - refresh_llm_registry() - manual refresh trigger Integration: - Refresh at API startup (before block init) - Graceful fallback if registry unavailable - Enables blocks to consume registry data Models: - RegistryModel - full model with metadata - RegistryModelCost - pricing configuration - RegistryModelCreator - model creator info - ModelMetadata - context window, capabilities Next PRs: - PR #3: Public read API (GET endpoints) - PR #4: Admin write API (POST/PATCH/DELETE) - PR #5: Block integration (update LLM block) - PR #6: Redis cache (solve thundering herd) Lines: ~230 (registry.py ~210, __init__.py ~30, model.py from draft) Files: 4 (3 new, 1 modified)	2026-04-13 15:43:27 +01:00
Bentlybro	8b6dea2496	fix(backend): remove f-string from migrate_llm_models SQL query	2026-04-13 15:42:22 +01:00
Bentlybro	5e15213846	fix(schema): rename migration dirs to 14-digit Prisma timestamps, add onUpdate: Cascade - Rename 20260310_ → 20260310120000_ (schema) and 20260310130000_ (seed) - Add onUpdate: Cascade to LlmModelMigration FK relations	2026-04-04 20:47:49 +00:00
Bentlybro	f0cc4ae573	Seed LLM model creators and link models Update migration to seed LLM model creators and associate them with models. Adds INSERTs for LlmModelCreator, updates the file header comment, introduces a creator_ids CTE, and extends the LlmModel INSERT to include creatorId (joining on creator name). Existing provider seeding and model cost logic remain unchanged; ON CONFLICT behavior preserved.	2026-03-25 13:57:41 +00:00
Bently	e0282b00db	Merge branch 'dev' into feat/llm-registry-schema	2026-03-23 13:43:15 +00:00
Bentlybro	9a9c36b806	Update LLM registry seeds and conflict clause Add and rename model slugs and costs in the LLM registry seed migration (e.g. rename 'o3' -> 'o3-2025-04-16', add 'gpt-5.2-2025-12-11', Anthropic 'claude-opus-4-6'/'claude-sonnet-4-6', multiple Google Gemini and Mistralai OpenRouter entries, and other provider models). Also tighten the ON CONFLICT upsert semantics so conflicts are ignored only when "credentialId" IS NULL, preventing silent skips for credentialed entries. These changes seed new models and ensure correct conflict handling during migration.	2026-03-23 13:20:48 +00:00
Zamil Majdy	e86ac21c43	feat(platform): add workflow import from other tools (n8n, Make.com, Zapier) (#12440 ) ## Summary - Enable one-click import of workflows from other platforms (n8n, Make.com, Zapier, etc.) into AutoGPT via CoPilot - No backend endpoint — import is entirely client-side: the dialog reads the file or fetches the n8n template URL, uploads the JSON to the workspace via `uploadFileDirect`, stores the file reference in `sessionStorage`, and redirects to CoPilot with `autosubmit=true` - CoPilot receives the workflow JSON as a proper file attachment and uses the existing agent-generator pipeline to convert it - Library dialog redesigned: 2 tabs — "AutoGPT agent" (upload exported agent JSON) and "Another platform" (file upload + optional n8n URL) ## How it works 1. User uploads a workflow JSON (or pastes an n8n template URL) 2. Frontend fetches/reads the JSON and uploads it to the user's workspace via the existing file upload API 3. User is redirected to `/copilot?source=import&autosubmit=true` 4. CoPilot picks up the file from `sessionStorage` and sends it as a `FileUIPart` attachment with a prompt to recreate the workflow as an AutoGPT agent ## Test plan - [x] Manual test: import a real n8n workflow JSON via the dialog - [x] Manual test: paste an n8n template URL and verify it fetches + converts - [x] Manual test: import Make.com / Zapier workflow export JSON - [x] Repeated imports don't cause 409 conflicts (filenames use `crypto.randomUUID()`) - [x] E2E: Import dialog has 2 tabs (AutoGPT agent + Another platform) - [x] E2E: n8n quick-start template buttons present - [x] E2E: n8n URL input enables Import button on valid URL - [x] E2E: Workspace upload API returns file_id	2026-03-23 13:03:02 +00:00
Bentlybro	d5381625cd	Add timestamps to LLM registry seed inserts Update migration.sql to include createdAt and updatedAt columns/values for LlmProvider, LlmModel, and LlmModelCost seed inserts. Uses CURRENT_TIMESTAMP for both timestamp fields and adjusts the INSERT SELECT ordering for models to match the added columns. This ensures the seed data satisfies schemas that require timestamps and provides consistent created/updated metadata.	2026-03-23 12:59:30 +00:00
Bentlybro	f6ae3d6593	Update migration.sql	2026-03-23 12:43:06 +00:00
Lluis Agusti	94224be841	Merge remote-tracking branch 'origin/master' into dev	2026-03-23 20:42:32 +08:00
Bentlybro	0fb1b854df	add llm's via migration	2026-03-23 12:37:29 +00:00
Otto	da4bdc7ab9	fix(backend+frontend): reduce Sentry noise from user-caused errors (#12513 ) Requested by @majdyz User-caused errors (no payment method, webhook agent invocation, missing credentials, bad API keys) were hitting Sentry via `logger.exception()` in the `ValueError` handler, creating noise that obscures real bugs. Additionally, a frontend crash on the copilot page (BUILDER-71J) needed fixing. Changes: Backend — rest_api.py - Set `log_error=False` for the `ValueError` exception handler (line 278), consistent with how `FolderValidationError` and `NotFoundError` are already handled. User-caused 400 errors no longer trigger `logger.exception()` → Sentry. Backend — executor/manager.py - Downgrade `ExecutionManager` input validation skip errors from `error` to `warning` level. Missing credentials is expected user behavior, not an internal error. Backend — blocks/llm.py - Sanitize unpaired surrogates in LLM prompt content before sending to provider APIs. Prevents `UnicodeEncodeError: surrogates not allowed` when httpx encodes the JSON body (AUTOGPT-SERVER-8AX). Frontend — package.json - Upgrade `ai` SDK from `6.0.59` to `6.0.134` to fix BUILDER-71J (`TypeError: undefined is not an object (evaluating 'this.activeResponse.state')` on /copilot page). This is a known issue in the Vercel AI SDK fixed in later patch versions. Sentry issues addressed: - `No payment method found` (ValueError → 400) - `This agent is triggered by an external event (webhook)` (ValueError → 400) - `Node input updated with non-existent credentials` (ValueError → 400) - `[ExecutionManager] Skip execution, input validation error: missing input {credentials}` - `UnicodeEncodeError: surrogates not allowed` (AUTOGPT-SERVER-8AX) - `TypeError: activeResponse.state` (BUILDER-71J) Resolves SECRT-2166 --- Co-authored-by: Zamil Majdy (@majdyz) <zamil.majdy@agpt.co> --------- Co-authored-by: Zamil Majdy (@majdyz) <zamil.majdy@agpt.co>	2026-03-23 12:22:49 +00:00
Zamil Majdy	7176cecf25	perf(copilot): reduce tool schema token cost by 34% (#12398 ) ## Summary Reduce CoPilot per-turn token overhead by systematically trimming tool descriptions, parameter schemas, and system prompt content. All 35 MCP tool schemas are passed on every SDK call — this PR reduces their size. ### Strategy 1. Tool descriptions: Trimmed verbose multi-sentence explanations to concise single-sentence summaries while preserving meaning 2. Parameter schemas: Shortened parameter descriptions to essential info, removed some `default` values (handled in code) 3. System prompt: Condensed `_SHARED_TOOL_NOTES` and storage supplement template in `prompting.py` 4. Cross-tool references: Removed duplicate workflow hints (e.g. "call find_block before run_block" appeared in BOTH tools — kept only in the dependent tool). Critical cross-tool references retained (e.g. `continue_run_block` in `run_block`, `fix_agent_graph` in `validate_agent`, `get_doc_page` in `search_docs`, `web_fetch` preference in `browser_navigate`) ### Token Impact \| Metric \| Before \| After \| Reduction \| \|--------\|--------\|-------\|-----------\| \| System Prompt \| ~865 tokens \| ~497 tokens \| 43% \| \| Tool Schemas \| ~9,744 tokens \| ~6,470 tokens \| 34% \| \| Grand Total \| ~10,609 tokens \| ~6,967 tokens \| 34% \| Saves ~3,642 tokens per conversation turn. ### Key Decisions - Mostly description changes: Tool logic, parameters, and types unchanged. However, some schema-level `default` fields were removed (e.g. `save` in `customize_agent`) — these are machine-readable metadata, not just prose, and may affect LLM behavior. - Quality preserved: All descriptions still convey what the tool does and essential usage patterns - Cross-references trimmed carefully: Kept prerequisite hints in the dependent tool (run_block mentions find_block) but removed the reverse (find_block no longer mentions run_block). Critical cross-tool guidance retained where removal would degrade model behavior. - `run_time` description fixed: Added missing supported values (today, last 30 days, ISO datetime) per review feedback ### Future Optimization The SDK passes all 35 tools on every call. The MCP protocol's `list_tools()` handler supports dynamic tool registration — a follow-up PR could implement lazy tool loading (register core tools + a discovery meta-tool) to further reduce per-turn token cost. ### Changes - Trimmed descriptions across 25 tool files - Condensed `_SHARED_TOOL_NOTES` and `_build_storage_supplement` in `prompting.py` - Fixed `run_time` schema description in `agent_output.py` ### Checklist #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] All 273 copilot tests pass locally - [x] All 35 tools load and produce valid schemas - [x] Before/after token dumps compared - [x] Formatting passes (`poetry run format`) - [x] CI green	2026-03-23 08:27:24 +00:00
Zamil Majdy	f35210761c	feat(devops): add /pr-test skill + subscription mode auto-provisioning (#12507 ) ## Summary - Adds `/pr-test` skill for automated E2E testing of PRs using docker compose, agent-browser, and API calls - Covers full environment setup (copy .env, configure copilot auth, ARM64 Docker fix) - Includes browser UI testing, direct API testing, screenshot capture, and test report generation - Has `--fix` mode for auto-fixing bugs found during testing (similar to `/pr-address`) - Screenshot uploads use GitHub Git API (blobs → tree → commit → ref) — no local git operations, safe for worktrees - Subscription mode improvements: - Extract subscription auth logic to `sdk/subscription.py` — uses SDK's bundled CLI binary instead of requiring `npm install -g @anthropic-ai/claude-code` - Auto-provision `~/.claude/.credentials.json` from `CLAUDE_CODE_OAUTH_TOKEN` env var on container startup — no `claude login` needed in Docker - Add `scripts/refresh_claude_token.sh` — cross-platform helper (macOS/Linux/Windows) to extract OAuth tokens from host and update `backend/.env` ## Test plan - [x] Validated skill on multiple PRs (#12482, #12483, #12499, #12500, #12501, #12440, #12472) — all test scenarios passed - [x] Confirmed screenshot upload via GitHub Git API renders correctly on all 7 PRs - [x] Verified subscription mode E2E in Docker: `refresh_claude_token.sh` → `docker compose up` → copilot chat responds correctly with no API keys (pure OAuth subscription) - [x] Verified auto-provisioning of credentials file inside container from `CLAUDE_CODE_OAUTH_TOKEN` env var - [x] Confirmed bundled CLI detection (`claude_agent_sdk._bundled/claude`) works without system-installed `claude` - [x] `poetry run pytest backend/copilot/sdk/service_test.py` — 24/24 tests pass	2026-03-23 15:29:00 +07:00
Zamil Majdy	1ebcf85669	fix(platform): resolve 5 production Sentry alerts (#12496 ) ## Summary Fixes 5 high-priority Sentry alerts from production: - AUTOGPT-SERVER-8AM: Fix `TypeError: TypedDict does not support instance and class checks` — `_value_satisfies_type` in `type.py` now handles TypedDict classes that don't support `isinstance()` checks - AUTOGPT-SERVER-8AN: Fix `ValueError: No payment method found` triggering Sentry error — catch the expected ValueError in the auto-top-up endpoint and return HTTP 422 instead - BUILDER-7F5: Fix `Upload failed (409): File already exists` — add `overwrite` query param to workspace upload endpoint and set it to `true` from the frontend direct-upload - BUILDER-7F0: Fix `LaTeX-incompatible input` KaTeX warnings flooding Sentry — set `strict: false` on rehype-katex plugin to suppress warnings for unrecognized Unicode characters - AUTOGPT-SERVER-89N: Fix `Tool execution with manager failed: validation error for dict[str,list[any]]` — make RPC return type validation resilient (log warning instead of crash) and downgrade SmartDecisionMaker tool execution errors to warnings ## Test plan - [ ] Verify TypedDict type coercion works for GithubMultiFileCommitBlock inputs - [ ] Verify auto-top-up without payment method returns 422, not 500 - [ ] Verify file re-upload in copilot succeeds (overwrites instead of 409) - [ ] Verify LaTeX rendering with Unicode characters doesn't produce console warnings - [ ] Verify SmartDecisionMaker tool execution failures are logged at warning level	2026-03-23 08:05:08 +00:00
Otto	ab7c38bda7	fix(frontend): detect closed OAuth popup and allow dismissing waiting modal (#12443 ) Requested by @kcze When a user closes the OAuth sign-in popup without completing authentication, the 'Waiting on sign-in process' modal was stuck open with no way to dismiss it, forcing a page refresh. Two bugs caused this: 1. `oauth-popup.ts` had no detection for the popup being closed by the user. The promise would hang until the 5-minute timeout. 2. The modal's cancel button aborted a disconnected `AbortController` instead of the actual OAuth flow's abort function, so clicking cancel/close did nothing. ### Changes - Add `popup.closed` polling (500ms) in `openOAuthPopup()` that rejects the promise when the user closes the auth window - Add reject-on-abort so the cancel button properly terminates the flow - Replace the disconnected `oAuthPopupController` with a direct `cancelOAuthFlow()` function that calls the real abort ref - Handle popup-closed and user-canceled as silent cancellations (no error toast) ### Testing Tested manually ✅ - [x] Start OAuth flow → close popup window → modal dismisses automatically ✅ - [x] Start OAuth flow → click cancel on modal → popup closes, modal dismisses ✅ - [x] Complete OAuth flow normally → works as before ✅ Resolves SECRT-2054 --- Co-authored-by: Krzysztof Czerwinski (@kcze) <krzysztof.czerwinski@agpt.co> --------- Co-authored-by: Krzysztof Czerwinski <kpczerwinski@gmail.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 14:41:09 +00:00
Ubbe	0f67e45d05	hotfix(marketplace): adjust card height overflow (#12497 ) ## Summary ### Before <img width="500" height="501" alt="Screenshot 2026-03-20 at 21 50 31" src="https://github.com/user-attachments/assets/6154cffb-6772-4c3d-a703-527c8ca0daff" /> ### After <img width="500" height="581" alt="Screenshot 2026-03-20 at 21 33 12" src="https://github.com/user-attachments/assets/2f9bd69d-30c5-4d06-ad1e-ed76b184afe5" /> ### Other minor fixes - minor spacing adjustments in creator/search pages when empty and between sections ### Summary - Increase StoreCard height from 25rem to 26.5rem to prevent content overflow - Replace manual tooltip-based title truncation with `OverflowText` component in StoreCard - Adjust carousel indicator positioning and hide it on md+ when exactly 3 featured agents are shown ## Test plan - [x] Verify marketplace cards display without text overflow - [x] Verify featured section carousel indicators behave correctly - [x] Check responsive behavior at common breakpoints 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 22:03:28 +08:00
Ubbe	b9ce37600e	refactor(frontend/marketplace): move download below Add to library with contextual text (#12486 ) ## Summary <img width="1487" height="670" alt="Screenshot 2026-03-20 at 00 52 58" src="https://github.com/user-attachments/assets/f09de2a0-3c5b-4bce-b6f4-8a853f6792cf" /> - Move the download button from inline next to "Add to library" to a separate line below it - Add contextual text: "Want to use this agent locally? Download here" - Style the "Download here" as a violet ghost button link with the download icon ## Test plan - [ ] Visit a marketplace agent page - [ ] Verify "Add to library" button renders in its row - [ ] Verify "Want to use this agent locally? Download here" appears below it - [ ] Click "Download here" and confirm the agent downloads correctly 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 13:13:59 +00:00
Otto	3921deaef1	fix(frontend): truncate marketplace card description to 2 lines (#12494 ) Reduces `line-clamp` from 3 to 2 on the marketplace `StoreCard` description to prevent text from overlapping with the absolutely-positioned run count and +Add button at the bottom of the card. Resolves SECRT-2156. --- Co-authored-by: Abhimanyu Yadav (@Abhi1992002) <122007096+Abhi1992002@users.noreply.github.com>	2026-03-20 09:10:21 +00:00
Nicholas Tindle	f01f668674	fix(backend): support Responses API in SmartDecisionMakerBlock (#12489 ) ## Summary - Fixes SmartDecisionMakerBlock conversation management to work with OpenAI's Responses API, which was introduced in #12099 (commit `1240f38`) - The migration to `responses.create` updated the outbound LLM call but missed the conversation history serialization — the `raw_response` is now the entire `Response` object (not a `ChatCompletionMessage`), and tool calls/results use `function_call` / `function_call_output` types instead of role-based messages - This caused a 400 error on the second LLM call in agent mode: `"Invalid value: ''. Supported values are: 'assistant', 'system', 'developer', and 'user'."` ### Changes `smart_decision_maker.py` — 6 functions updated: \| Function \| Fix \| \|---\|---\| \| `_convert_raw_response_to_dict` \| Detects Responses API `Response` objects, extracts output items as a list \| \| `_get_tool_requests` \| Recognizes `type: "function_call"` items \| \| `_get_tool_responses` \| Recognizes `type: "function_call_output"` items \| \| `_create_tool_response` \| New `responses_api` kwarg produces `function_call_output` format \| \| `_update_conversation` \| Handles list return from `_convert_raw_response_to_dict` \| \| Non-agent mode path \| Same list handling for traditional execution \| `test_smart_decision_maker_responses_api.py` — 61 tests covering: - Every branch of all 6 affected helper functions - Chat Completions, Anthropic, and Responses API formats - End-to-end agent mode and traditional mode conversation validity ## Test plan - [x] 61 new unit tests all pass - [x] 11 existing SmartDecisionMakerBlock tests still pass (no regressions) - [x] All pre-commit hooks pass (ruff, black, isort, pyright) - [ ] CI integration tests 🤖 Generated with [Claude Code](https://claude.com/claude-code) <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Medium Risk > Updates core LLM invocation and agent conversation/tool-call bookkeeping to match OpenAI’s Responses API, which can affect tool execution loops and prompt serialization across providers. Risk is mitigated by extensive new unit tests, but regressions could surface in production agent-mode flows or token/usage accounting. > > Overview > Migrates OpenAI calls from Chat Completions to the Responses API end-to-end, including tool schema conversion, output parsing, reasoning/text extraction, and updated token usage fields in `LLMResponse`. > > Fixes SmartDecisionMakerBlock conversation/tool handling for Responses API by treating `raw_response` as a Response object (splitting it into `output` items for replay), recognizing `function_call`/`function_call_output` entries, and emitting tool outputs in the correct Responses format to prevent invalid follow-up prompts. > > Also adjusts prompt compaction/token estimation to understand Responses API tool items, changes `get_execution_outputs_by_node_exec_id` to return list-valued `CompletedBlockOutput`, removes `gpt-3.5-turbo` from model/cost/docs lists, and adds focused unit tests plus a lightweight `conftest.py` to run these tests without the full server stack. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit `ff292efd3d`. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY --> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Otto <otto@agpt.co> Co-authored-by: Krzysztof Czerwinski <kpczerwinski@gmail.com> autogpt-platform-beta-v0.6.52	2026-03-20 03:23:52 +00:00
Otto	f7a3491f91	docs(platform): add TDD guidance to CLAUDE.md files (#12491 ) Requested by @majdyz Adds TDD (test-driven development) guidance to CLAUDE.md files so Claude Code follows a test-first workflow when fixing bugs or adding features. Changes: - Parent `CLAUDE.md`: Cross-cutting TDD workflow — write a failing `xfail` test, implement the fix, remove the marker - Backend `CLAUDE.md`: Concrete pytest example with `@pytest.mark.xfail` pattern - Frontend `CLAUDE.md`: Note about using Playwright `.fixme` annotation for bug-fix tests The workflow is: write a failing test first → confirm it fails for the right reason → implement → confirm it passes. This ensures every bug fix is covered by a test that would have caught the regression. --- Co-authored-by: Zamil Majdy (@majdyz) <zamil.majdy@agpt.co>	2026-03-20 02:13:16 +00:00
Nicholas Tindle	cbff3b53d3	Revert "feat(backend): migrate OpenAI provider to Responses API" (#12490 ) Reverts Significant-Gravitas/AutoGPT#12099 <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Medium Risk > Reverts the OpenAI integration in `llm_call` from the Responses API back to `chat.completions`, which can change tool-calling, JSON-mode behavior, and token accounting across core AI blocks. The change is localized but touches the primary LLM execution path and associated tests/docs. > > Overview > Reverts the OpenAI path in `backend/blocks/llm.py` from the Responses API back to `chat.completions`, including updating JSON-mode (`response_format`), tool handling, and usage extraction to match the Chat Completions response shape. > > Removes the now-unused `backend/util/openai_responses.py` helpers and their unit tests, updates LLM tests to mock `chat.completions.create`, and adds `gpt-3.5-turbo` to the supported model list, cost config, and LLM docs. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit `7d6226d10e`. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY -->	2026-03-20 01:51:56 +00:00
Reinier van der Leer	5b9a4c52c9	revert(platform): Revert invite system (#12485 ) ## Summary Reverts the invite system PRs due to security gaps identified during review: - The move from Supabase-native `allowed_users` gating to application-level gating allows orphaned Supabase auth accounts (valid JWT without a platform `User`) - The auth middleware never verifies `User` existence, so orphaned users get 500s instead of clean 403s - OAuth/Google SSO signup completely bypasses the invite gate - The DB trigger that atomically created `User` + `Profile` on signup was dropped in favor of a client-initiated API call, introducing a failure window ### Reverted PRs - Reverts #12347 — Foundation: InvitedUser model, invite-gated signup, admin UI - Reverts #12374 — Tally enrichment: personalized prompts from form submissions - Reverts #12451 — Pre-check: POST /auth/check-invite endpoint - Reverts #12452 (collateral) — Themed prompt categories / SuggestionThemes UI. This PR built on top of #12374's `suggested_prompts` backend field and `/chat/suggested-prompts` endpoint, so it cannot remain without #12374. The copilot empty session falls back to hardcoded default prompts. ### Migration Includes a new migration (`20260319120000_revert_invite_system`) that: - Drops the `InvitedUser` table and its enums (`InvitedUserStatus`, `TallyComputationStatus`) - Restores the `add_user_and_profile_to_platform()` trigger on `auth.users` - Backfills `User` + `Profile` rows for any auth accounts created during the invite-gate window ### What's NOT reverted - The `generate_username()` function (never dropped, still used by backfill migration) - The old `add_user_to_platform()` function (superseded by `add_user_and_profile_to_platform()`) - PR #12471 (admin UX improvements) — was never merged, no action needed ## Test plan - [x] Verify migration: `InvitedUser` table dropped, enums dropped, trigger restored - [x] Verify backfill: no orphaned auth users, no users without Profile - [x] Verify existing users can still log in (email + OAuth) - [x] Verify CoPilot chat page loads with default prompts - [ ] Verify new user signup creates `User` + `Profile` via the restored trigger - [ ] Verify admin `/admin/users` page loads without crashing - [ ] Run backend tests: `poetry run test` 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Zamil Majdy <zamil.majdy@agpt.co>	2026-03-19 17:15:30 +00:00
Otto	0ce1c90b55	fix(frontend): rename "CoPilot" to "AutoPilot" on credits page (#12481 ) Requested by @kcze Renames "CoPilot" → "AutoPilot" on the credits/usage limits page: - Heading: "CoPilot Usage Limits" → "AutoPilot Usage Limits" - Button: "Open CoPilot" → "Open AutoPilot" - Comment updated to match --- Co-authored-by: Zamil Majdy (@majdyz) <zamil.majdy@agpt.co> Co-authored-by: Zamil Majdy (@majdyz) <zamil.majdy@agpt.co>	2026-03-19 15:25:21 +00:00
Ubbe	d4c6eb9adc	fix(frontend): collapse navbar text to icons below 1280px (#12484 ) ## Summary <img width="400" height="339" alt="Screenshot 2026-03-19 at 22 53 23" src="https://github.com/user-attachments/assets/2fa76b8f-424d-4764-90ac-b7a331f5f610" /> <img width="600" height="595" alt="Screenshot 2026-03-19 at 22 53 31" src="https://github.com/user-attachments/assets/23f51cc7-b01e-4d83-97ba-2c43683877db" /> <img width="800" height="523" alt="Screenshot 2026-03-19 at 22 53 36" src="https://github.com/user-attachments/assets/1e447b9a-1cca-428c-bccd-1730f1670b8e" /> Now that we have the `Give feedback` button on the Navigation bar, collpase some of the links below `1280px` so there is more space and they don't collide with each other... - Collapse navbar link text to icon-only below 1280px (`xl` breakpoint) to prevent crowding - Wallet button shows only the wallet icon below 1280px instead of "Earn credits" text - Feedback button shows only the chat icon below 1280px instead of "Give Feedback" text - Added `whitespace-nowrap` to feedback button to prevent wrapping ## Changes - `NavbarLink.tsx`: `lg:block` → `xl:block` for link text - `Wallet.tsx`: `md:hidden`/`md:inline-block` → `xl:hidden`/`xl:inline-block` - `FeedbackButton.tsx`: wrap text in `hidden xl:inline` span, add `whitespace-nowrap` ## Test plan - [ ] Resize browser between 1024px–1280px and verify navbar shows only icons - [ ] At 1280px+ verify full text labels appear for links, wallet, and feedback - [ ] Verify mobile navbar still works correctly below `md` breakpoint 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 15:10:27 +00:00
Ubbe	1bb91b53b7	fix(frontend/marketplace): comprehensive marketplace UI redesign (#12462 ) ## Summary <img width="600" height="964" alt="Screenshot_2026-03-19_at_00 07 52" src="https://github.com/user-attachments/assets/95c0430a-26a3-499b-8f6a-25b9715d3012" /> <img width="600" height="968" alt="Screenshot_2026-03-19_at_00 08 01" src="https://github.com/user-attachments/assets/d440c3b0-c247-4f13-bf82-a51ff2e50902" /> <img width="600" height="939" alt="Screenshot_2026-03-19_at_00 08 14" src="https://github.com/user-attachments/assets/f19be759-e102-4a95-9474-64f18bce60cf" />" <img width="600" height="953" alt="Screenshot_2026-03-19_at_00 08 24" src="https://github.com/user-attachments/assets/ba4fa644-3958-45e2-89e9-a6a4448c63c5" /> - Re-style and re-skin the Marketplace pages to look more "professional" ... - Move the `Give feedback` button to the header ## Test plan - [x] Verify marketplace page search bar matches Form text field styling - [x] Verify agent cards have padding and subtle border - [x] Verify hover/focus states work correctly - [x] Check responsive behavior at different breakpoints 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 22:28:01 +08:00
Bentlybro	64a011664a	fix(schema): address Majdyz review feedback - Add FK constraints on LlmModelMigration (sourceModelSlug, targetModelSlug → LlmModel.slug) - Remove unused @@index([credentialProvider]) on LlmModelCost - Remove redundant @@index([isReverted]) on LlmModelMigration (covered by composite) - Add documentation for credentialProvider field explaining its purpose - Add reverse relation fields to LlmModel (SourceMigrations, TargetMigrations) Fixes data integrity: typos in migration slugs now caught at DB level.	2026-03-19 11:01:09 +00:00
Bentlybro	1db7c048d9	fix: isort import order	2026-03-19 11:01:09 +00:00
Bentlybro	4c5627c966	fix: use execute_raw_with_schema for proper multi-schema support Per Sentry feedback: db.execute_raw ignores connection string's ?schema= parameter and defaults to 'public' schema. This breaks in multi-schema setups. Changes: - Import execute_raw_with_schema from .db - Use {schema_prefix} placeholder in query - Call execute_raw_with_schema instead of db.execute_raw This matches the pattern used in fix_llm_provider_credentials and other schema-aware migrations. Works in both CI (public schema) and local (platform schema from connection string).	2026-03-19 11:01:09 +00:00
Bentlybro	d97d137a51	fix: remove hardcoded schema prefix from migrate_llm_models query The raw SQL query in migrate_llm_models() hardcoded platform."AgentNode" which fails in CI where tables are in 'public' schema (not 'platform'). This code exists in dev but only runs when LLM registry has data. With our new schema, the migration tries to run at startup and fails in CI. Changed: UPDATE platform."AgentNode" -> UPDATE "AgentNode" Matches pattern of all other migrations - let connection string's default schema handle routing.	2026-03-19 11:01:09 +00:00
Bentlybro	ded9e293ff	fix: remove CREATE SCHEMA to match CI environment CI uses schema "public" as default (not "platform"), so creating a platform schema then tables without prefix puts tables in public but Prisma looks in platform. Existing migrations don't create schema - they rely on connection string's default. Remove CREATE SCHEMA IF NOT EXISTS to match.	2026-03-19 11:01:09 +00:00
Bentlybro	83d504bed2	fix: remove schema prefix from migration SQL to match existing pattern CI failing with 'relation "platform.AgentNode" does not exist' because Prisma generates queries differently when tables are created with explicit schema prefixes. Existing AutoGPT migrations use: CREATE TABLE "AgentNode" (...) Not: CREATE TABLE "platform"."AgentNode" (...) The connection string's ?schema=platform handles schema selection, so explicit prefixes aren't needed and cause compatibility issues. Changes: - Remove all "platform". prefixes from: * CREATE TYPE statements * CREATE TABLE statements * CREATE INDEX statements * ALTER TABLE statements * REFERENCES clauses in foreign keys Now matches existing migration pattern exactly.	2026-03-19 11:01:09 +00:00
Bentlybro	a5f1ffb35b	fix: add partial unique indexes for data integrity Per CodeRabbit feedback - fix 2 actual bugs: 1. Prevent multiple active migrations per source model - Add partial unique index: UNIQUE (sourceModelSlug) WHERE isReverted = false - Prevents ambiguous routing when resolving migrations 2. Allow both default and credential-specific costs - Remove @@unique([llmModelId, credentialProvider, unit]) - Add 2 partial unique indexes: * UNIQUE (llmModelId, provider, unit) WHERE credentialId IS NULL (defaults) * UNIQUE (llmModelId, provider, credentialId, unit) WHERE credentialId IS NOT NULL (overrides) - Enables provider-level default costs + per-credential overrides Schema comments document that these constraints exist in migration SQL.	2026-03-19 11:01:09 +00:00
Bentlybro	97c6516a14	fix: remove multiSchema - follow existing AutoGPT pattern Remove unnecessary multiSchema configuration that broke existing models. AutoGPT uses connection string's ?schema=platform parameter as default, not Prisma's multiSchema feature. Existing models (User, AgentGraph, etc.) have no @@schema() directives and work fine. Changes: - Remove schemas = ["platform", "public"] from datasource - Remove "multiSchema" from previewFeatures - Remove all @@schema() directives from LLM models and enum Migration SQL already creates tables in platform schema explicitly (CREATE TABLE "platform"."LlmProvider" etc.) which is correct. This matches the existing pattern used throughout the codebase.	2026-03-19 11:01:09 +00:00
Bentlybro	876dde8bc7	fix: address CodeRabbit design feedback Per CodeRabbit review: 1. Safety: Change capability defaults false → safer for partial seeding - supportsTools: true → false - supportsJsonOutput: true → false - Prevents partially-seeded rows from being assumed capable 2. Clarity: Rename supportsParallelTool → supportsParallelToolCalls - More explicit about what the field represents 3. Performance: Remove redundant indexes - Drop @@index([llmModelId]) - covered by unique constraint - Drop @@index([sourceModelSlug]) - covered by composite index - Reduces write overhead and storage 4. Documentation: Acknowledge customCreditCost limitation - It's unit-agnostic (doesn't distinguish RUN vs TOKENS) - Noted as TODO for follow-up PR with proper unit-aware override Schema + migration both updated to match.	2026-03-19 11:01:09 +00:00
Bentlybro	0bfdd74b25	fix: add @@schema("platform") to LlmCostUnit enum Sentry caught this - enums also need @@schema directive with multiSchema enabled. Without it, Prisma looks for enum in public schema but it's created in platform.	2026-03-19 11:01:09 +00:00
Bentlybro	a7d2f81b18	feat: add database CHECK constraints for data integrity Per CodeRabbit feedback - enforce numeric domain rules at DB level: Migration: - priceTier: CHECK (priceTier BETWEEN 1 AND 3) - creditCost: CHECK (creditCost >= 0) - nodeCount: CHECK (nodeCount >= 0) - customCreditCost: CHECK (customCreditCost IS NULL OR customCreditCost >= 0) Schema comments: - Document constraints inline for developer visibility Prevents invalid data (negative costs, out-of-range tiers) from entering the database, matching backend/blocks/llm.py contract.	2026-03-19 11:01:09 +00:00
Bentlybro	3699eaa556	fix: use @@schema() instead of @@map() for platform schema + create schema in migration Critical fixes from PR review: 1. Replace @@map("platform.ModelName") with @@schema("platform") - Sentry correctly identified: Prisma was looking for literal table "platform.LlmProvider" with dot - Proper syntax: enable multiSchema feature + use @@schema directive 2. Create platform schema in migration - CI failed: schema "platform" does not exist - Add CREATE SCHEMA IF NOT EXISTS at start of migration Schema changes: - datasource: add schemas = ["platform", "public"] - generator: add "multiSchema" to previewFeatures - All 5 models: @@map() → @@schema("platform") Migration changes: - Add CREATE SCHEMA IF NOT EXISTS "platform" before enum creation Fixes CI failure and Sentry-identified bug.	2026-03-19 11:01:09 +00:00
Bentlybro	21adf9e0fb	feat(platform): Add LLM registry database schema Add Prisma schema and migration for dynamic LLM model registry: Schema additions: - LlmProvider: Registry of LLM providers (OpenAI, Anthropic, etc.) - LlmModel: Individual models with capabilities and metadata - LlmModelCost: Per-model pricing configuration - LlmModelCreator: Model creators/trainers (OpenAI, Meta, etc.) - LlmModelMigration: Track model migrations and reverts - LlmCostUnit enum: RUN vs TOKENS pricing units Key features: - Model-specific capabilities (tools, JSON, reasoning, parallel calls) - Flexible creator/provider separation (e.g., Meta model via Hugging Face) - Migration tracking with custom pricing overrides - Indexes for performance on common queries Part 1 of incremental LLM registry implementation. Refs: Draft PR #11699	2026-03-19 11:01:08 +00:00
Ubbe	a5f9c43a41	feat(platform): replace suggestion pills with themed prompt categories (#12452 ) ## Summary https://github.com/user-attachments/assets/13da6d36-5f35-429b-a6cf-e18316bb8709 Replaces the flat list of suggestion pills in the CoPilot empty session with themed prompt categories (Learn, Create, Automate, Organize), each shown as a popover with contextual prompts. - Backend: Changes `suggested_prompts` from a flat `list[str]` to a themed `dict[str, list[str]]` keyed by category. Updates Tally extraction LLM prompt to generate prompts per theme, and the `/suggested-prompts` API to return grouped themes. Legacy `list[str]` rows are preserved under a `"General"` key for backward compatibility. - Frontend: Replaces inline pill buttons with a `SuggestionThemes` popover component. Each theme button (with icon) opens a dropdown of 5 relevant prompts. Falls back to hardcoded defaults when the API has no personalized prompts. Normalizes partial API responses by padding missing themes with defaults. Legacy `"General"` prompts are distributed round-robin across themes so existing users keep their personalized suggestions. ### Changes 🏗️ - `backend/data/understanding.py`: `suggested_prompts` field changed from `list[str]` to `dict[str, list[str]]`; legacy list rows preserved under `"General"` key; list items validated as strings - `backend/data/tally.py`: LLM prompt updated to generate themed prompts; validation now per-theme with blank-string rejection - `backend/api/features/chat/routes.py`: New `SuggestedTheme` model; endpoint returns `themes[]` - `frontend/copilot/components/EmptySession/EmptySession.tsx`: Uses generated API types directly (no cast) - `frontend/copilot/components/EmptySession/helpers.ts`: `DEFAULT_THEMES` replaces `DEFAULT_QUICK_ACTIONS`; `getSuggestionThemes` normalizes partial API responses and distributes legacy `"General"` prompts across themes - `frontend/copilot/components/EmptySession/components/SuggestionThemes/`: New popover component with theme icons and loading states ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Verify themed suggestion buttons render on CoPilot empty session - [x] Click each theme button and confirm popover opens with prompts - [x] Click a prompt and confirm it sends the message - [x] Verify fallback to default themes when API returns no custom prompts - [x] Verify legacy users' personalized prompts are preserved and visible 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-19 18:46:12 +08:00

1 2 3 4 5 ...

8158 Commits