AutoGPT

mirror of https://github.com/Significant-Gravitas/AutoGPT.git synced 2026-02-17 02:03:00 -05:00

Author	SHA1	Message	Date
Reinier van der Leer	8cb5753aa3	don't pass Prisma models through DatabaseManager	2026-02-16 21:31:38 +01:00
Reinier van der Leer	639e4d6fd3	make messages consistent	2026-02-16 17:34:54 +01:00
Reinier van der Leer	8fe47bd999	add workspace storage shutdown to `CoPilotExecutor.cleanup()`	2026-02-16 16:49:45 +01:00
Reinier van der Leer	8dff12b2d2	fix direct DB usage in copilot/service.py	2026-02-16 15:28:09 +01:00
Reinier van der Leer	17b4914140	Merge branch 'dev' into pwuts/open-2995-refactor-move-copilot-ai-generation-tool-execution-to	2026-02-16 14:50:04 +01:00
Reinier van der Leer	9d4dcbd9e0	fix(backend/docker): Make `server` last (= default) build stage Without specifying an explicit build target it would build the `migrate` stage because it is the last stage in the Dockerfile. This caused deployment failures. - Follow-up to #12124 and `074be7ae`	2026-02-16 14:49:30 +01:00
Reinier van der Leer	1901861cbc	Merge branch 'dev' into pwuts/open-2995-refactor-move-copilot-ai-generation-tool-execution-to	2026-02-16 14:26:37 +01:00
Reinier van der Leer	074be7aea6	fix(backend/docker): Update run commands to match deployment - Follow-up to #12124 Changes: - Update `run` commands for all backend services in `docker-compose.platform.yml` to match the deployment commands used in production - Add trigger on `docker-compose(.platform)?.yml` changes to the Frontend CI workflow	2026-02-16 14:23:29 +01:00
Otto	39d28b24fc	ci(backend): Upgrade RabbitMQ from 3.12 (EOL) to 4.1.4 (#12118) ## Summary Upgrades RabbitMQ from the end-of-life `rabbitmq:3.12-management` to `rabbitmq:4.1.4`, aligning CI, local dev, and e2e testing with production. ## Changes ### CI Workflow (`.github/workflows/platform-backend-ci.yml`) - Image: `rabbitmq:3.12-management` → `rabbitmq:4.1.4` - Port: Removed 15672 (management UI), not used - Health check: Added to prevent flaky tests from race conditions during startup ### Docker Compose (`docker-compose.platform.yml`, `docker-compose.test.yaml`) - Image: `rabbitmq:management` → `rabbitmq:4.1.4` - Port: Removed 15672 (management UI), not used ## Why - RabbitMQ 3.12 is EOL - We don't use the management interface, so the `-management` variant is unnecessary - CI and local dev/e2e should match production (4.1.4) ## Testing CI validates that backend tests pass against RabbitMQ 4.1.4 on Python 3.11, 3.12, and 3.13. --- Closes SECRT-1703	2026-02-16 12:45:39 +00:00
Reinier van der Leer	bf79a7748a	fix(backend/build): Update stale Poetry usage in Dockerfile (#12124) [SECRT-2006: Dev deployment failing: poetry not found in container PATH](https://linear.app/autogpt/issue/SECRT-2006) - Follow-up to #12090 ### Changes 🏗️ - Remove now-broken Poetry path config values - Remove usage of now-broken `poetry run` in container run command - Add trigger on `backend/Dockerfile` changes to Frontend CI workflow ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - If it works, CI will pass	2026-02-16 13:54:20 +01:00
Otto	649d4ab7f5	feat(chat): Add delete chat session endpoint and UI (#12112 ) ## Summary Adds the ability to delete chat sessions from the CoPilot interface. ## Changes ### Backend - Add `DELETE /api/chat/sessions/{session_id}` endpoint in `routes.py` - Returns 204 on success, 404 if not found or not owned by user - Reuses existing `delete_chat_session` function from `model.py` ### Frontend - Add delete button (trash icon) that appears on hover for each chat session - Add confirmation dialog before deletion using existing `DeleteConfirmDialog` component - Refresh session list after successful delete - Clear current session selection if the deleted session was active - Update OpenAPI spec with new endpoint ## Testing 1. Hover over a chat session in sidebar → trash icon appears 2. Click trash icon → confirmation dialog 3. Confirm deletion → session removed, list refreshes 4. If deleted session was active, selection is cleared ## Screenshots Delete button appears on hover, confirmation dialog on click. ## Related Issues Closes SECRT-1928 <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Adds the ability to delete chat sessions from the CoPilot interface — a new `DELETE /api/chat/sessions/{session_id}` backend endpoint and a corresponding delete button with confirmation dialog in the `ChatSidebar` frontend component. - Backend route (`routes.py`): Clean implementation reusing the existing `delete_chat_session` model function with proper auth guards and 204/404 responses. No issues. - Frontend (`ChatSidebar.tsx`): Adds hover-visible trash icon per session, confirmation dialog, mutation with cache invalidation, and active session clearing on delete. However, it uses a `__legacy__` component (`DeleteConfirmDialog`) which violates the project's style guide — new code should use the modern design system components. 
Error handling only logs to console without user-facing feedback (project convention is to use toast notifications for mutation errors). `isDeleting` is destructured but unused. - OpenAPI spec updated correctly. - Unrelated file included: `notes/plan-SECRT-1959-graph-edge-desync.md` is a planning document for a different ticket and should be removed from this PR. The `notes/` directory is newly introduced and both plan files should be reconsidered for inclusion. </details> <details><summary><h3>Confidence Score: 3/5</h3></summary> - Functionally correct but has style guide violations and includes unrelated files that should be addressed before merge. - The core feature implementation (backend DELETE endpoint and frontend mutation logic) is sound and follows existing patterns. Score is lowered because: (1) the frontend uses a legacy component explicitly prohibited by the project's style guide, (2) mutation errors are not surfaced to the user, and (3) the PR includes an unrelated planning document for a different ticket. - Pay close attention to `ChatSidebar.tsx` for the legacy component import and error handling, and `notes/plan-SECRT-1959-graph-edge-desync.md` which should be removed. 
</details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant User participant ChatSidebar as ChatSidebar (Frontend) participant ReactQuery as React Query participant API as DELETE /api/chat/sessions/{id} participant Model as model.delete_chat_session participant DB as db.delete_chat_session (Prisma) participant Redis as Redis Cache User->>ChatSidebar: Click trash icon on session ChatSidebar->>ChatSidebar: Show DeleteConfirmDialog User->>ChatSidebar: Confirm deletion ChatSidebar->>ReactQuery: deleteSession({ sessionId }) ReactQuery->>API: DELETE /api/chat/sessions/{session_id} API->>Model: delete_chat_session(session_id, user_id) Model->>DB: delete_many(where: {id, userId}) DB-->>Model: bool (deleted count > 0) Model->>Redis: Delete session cache key Model->>Model: Clean up session lock Model-->>API: True API-->>ReactQuery: 204 No Content ReactQuery->>ChatSidebar: onSuccess callback ChatSidebar->>ReactQuery: invalidateQueries(sessions list) ChatSidebar->>ChatSidebar: Clear sessionId if deleted was active ``` </details> <sub>Last reviewed commit: 44a92c6</sub> <!-- greptile_other_comments_section --> <details><summary><h4>Context used (3)</h4></summary> - Context from `dashboard` - autogpt_platform/frontend/CLAUDE.md ([source](https://app.greptile.com/review/custom-context?memory=39861924-d320-41ba-a1a7-a8bff44f780a)) - Context from `dashboard` - autogpt_platform/frontend/CONTRIBUTING.md ([source](https://app.greptile.com/review/custom-context?memory=cc4f1b17-cb5c-4b63-b218-c772b48e20ee)) - Context from `dashboard` - autogpt_platform/CLAUDE.md ([source](https://app.greptile.com/review/custom-context?memory=6e9dc5dc-8942-47df-8677-e60062ec8c3a)) </details> <!-- /greptile_comment --> --------- Co-authored-by: Zamil Majdy <zamil.majdy@agpt.co>	2026-02-16 12:19:18 +00:00
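The ownership rule described in 649d4ab7f5 (204 on success, 404 when the session is missing or belongs to another user) can be illustrated with a minimal, framework-free sketch. The in-memory `store`, the `ChatSession` dataclass, and the `handle_delete` helper are assumptions made for illustration; the real endpoint lives in `routes.py` and goes through Prisma and Redis.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class ChatSession:
    # Hypothetical stand-in for the persisted chat session record.
    id: str
    user_id: str


def delete_chat_session(store: Dict[str, ChatSession], session_id: str, user_id: str) -> bool:
    """Delete the session only if it exists and is owned by user_id."""
    session = store.get(session_id)
    if session is None or session.user_id != user_id:
        return False
    del store[session_id]
    return True


def handle_delete(store: Dict[str, ChatSession], session_id: str, user_id: str) -> int:
    """Return the HTTP status code the endpoint would respond with."""
    # A missing session and a session owned by someone else both map to 404,
    # so the response does not reveal whether another user's session exists.
    return 204 if delete_chat_session(store, session_id, user_id) else 404
```

Returning the same 404 for "not found" and "not owned" is the design choice worth noting: a 403 would confirm that the session ID is valid.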
Reinier van der Leer	65e020d32b	Merge branch 'dev' into pwuts/open-2995-refactor-move-copilot-ai-generation-tool-execution-to	2026-02-16 12:17:54 +01:00
Ubbe	223df9d3da	feat(frontend): improve create/edit copilot UX (#12117 ) ## Changes 🏗️ Make the UX nicer when running long tasks in Copilot, like creating an agent, editing it or running a task. ## Checklist 📋 ### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Run locally and play the game! <!-- greptile_comment --> <details><summary><h3>Greptile Summary</h3></summary> This PR replaces the static progress bar and idle wait screens with an interactive mini-game across the Create, Edit, and Run Agent copilot tools. The existing mini-game (a simple runner with projectile-dodge boss encounters) is significantly overhauled into a two-mode game: a runner mode with animated tree obstacles and a duel mode featuring a melee boss fight with attack, guard, and movement mechanics. Sprite-based rendering replaces the previous shape-drawing approach. - Create/Edit/Run Agent UX: All three tool views now show the mini-game with contextual overlays during long-running operations, replacing the progress bar in EditAgent and adding the game to RunAgent - Game mechanics overhaul: Boss encounters changed from projectile-dodging to melee duel with attack (Z), block (X), movement (arrows), and jump (Space) controls - Sprite rendering: Added 9 sprite sheet assets for characters, trees, and boss animations with fallback to shape rendering if images fail to load - UI overlays: Added React-managed overlay states for idle, boss-intro, boss-defeated, and game-over screens with continue/retry buttons - Minor issues found: Unused `isRunActive` variable in `MiniGame.tsx`, unreachable "leaving" boss phase in `useMiniGame.ts`, and a missing `expanded` property in `getAccordionMeta` return type annotation in `EditAgent.tsx` - Unused asset: `archer-shoot.png` is included in the PR but never imported or referenced in any code </details> <details><summary><h3>Confidence Score: 
4/5</h3></summary> - This PR is safe to merge — it only affects the copilot mini-game UX with no backend or data model changes. - The changes are entirely frontend/cosmetic, scoped to the copilot tools' waiting UX. The mini-game logic is self-contained in a canvas-based hook and doesn't affect any application state, API calls, or routing. The issues found are minor (unused variable, dead code, type annotation gap, unused asset) and don't impact runtime behavior. - `useMiniGame.ts` has the most complex logic changes (boss AI, death animations, sprite rendering) and contains unreachable dead code in the "leaving" phase handler. `EditAgent.tsx` has a return type annotation that doesn't include `expanded`. </details> <details><summary><h3>Flowchart</h3></summary> ```mermaid flowchart TD A[Game Idle] -->\|"Start button"\| B[Run Mode] B -->\|"Jump over trees"\| C{Score >= Threshold?} C -->\|No\| B C -->\|"Yes, obstacles clear"\| D[Boss Intro Overlay] D -->\|"Continue button"\| E[Duel Mode] E -->\|"Attack Z / Guard X / Move ←→"\| F{Boss HP <= 0?} F -->\|No\| G{Player hit & not guarding?} G -->\|No\| E G -->\|Yes\| H[Player Death Animation] H --> I[Game Over Overlay] I -->\|"Retry button"\| B F -->\|Yes\| J[Boss Death Animation] J --> K[Boss Defeated Overlay] K -->\|"Continue button"\| L[Reset Boss & Resume Run] L --> B ``` </details> <sub>Last reviewed commit: ad80e24</sub> <!-- greptile_other_comments_section --> <!-- /greptile_comment -->	2026-02-16 10:53:08 +00:00
Ubbe	187ab04745	refactor(frontend): remove OldAgentLibraryView and NEW_AGENT_RUNS flag (#12088 ) ## Summary - Removes the deprecated `OldAgentLibraryView` directory (13 files, ~2200 lines deleted) - Removes the `NEW_AGENT_RUNS` feature flag from the `Flag` enum and defaults - Removes the legacy agent library page at `library/legacy/[id]` - Moves shared `CronScheduler` components to `src/components/contextual/CronScheduler/` - Moves `agent-run-draft-view` and `agent-status-chip` to `legacy-builder/` (co-located with their only consumer) - Updates all import paths in consuming files (`AgentInfoStep`, `SaveControl`, `RunnerInputUI`, `useRunGraph`) ## Test plan - [x] `pnpm format` passes - [x] `pnpm types` passes (no TypeScript errors) - [x] No remaining references to `OldAgentLibraryView`, `NEW_AGENT_RUNS`, or `new-agent-runs` in the codebase - [x] Verify `RunnerInputUI` dialog still works in the legacy builder - [x] Verify `AgentInfoStep` cron scheduling works in the publish modal - [x] Verify `SaveControl` cron scheduling works in the legacy builder 🤖 Generated with [Claude Code](https://claude.com/claude-code) <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> This PR removes deprecated code from the legacy agent library view system and consolidates the codebase to use the new agent runs implementation exclusively. The refactor successfully removes ~2200 lines of code across 13 deleted files while properly relocating shared components. 
Key changes: - Removed the entire `OldAgentLibraryView` directory and its 13 component files - Removed the `NEW_AGENT_RUNS` feature flag from the `Flag` enum and defaults - Deleted the legacy agent library page route at `library/legacy/[id]` - Moved `CronScheduler` components to `src/components/contextual/CronScheduler/` for shared use across the application - Moved `agent-run-draft-view` and `agent-status-chip` to `legacy-builder/` directory, co-locating them with their only consumer - Updated `useRunGraph.ts` to import `GraphExecutionMeta` from the generated API models instead of the deleted custom type definition - Updated all import paths in consuming components (`AgentInfoStep`, `SaveControl`, `RunnerInputUI`) Technical notes: - The new import path for `GraphExecutionMeta` (`@/app/api/__generated__/models/graphExecutionMeta`) will be generated when running `pnpm generate:api` from the OpenAPI spec - All references to the old code have been cleanly removed from the codebase - The refactor maintains proper separation of concerns by moving shared components to contextual locations </details> <details><summary><h3>Confidence Score: 4/5</h3></summary> - This PR is safe to merge with minimal risk, pending manual verification of the UI components mentioned in the test plan - The refactor is well-structured and all code changes are correct. The score of 4 (rather than 5) reflects that the PR author has marked three manual testing items as incomplete in the test plan: verifying `RunnerInputUI` dialog, `AgentInfoStep` cron scheduling, and `SaveControl` cron scheduling. While the code changes are sound, these UI components should be manually tested before merging to ensure the moved components work correctly in their new locations. - No files require special attention. The author should complete the manual testing checklist items for `RunnerInputUI`, `AgentInfoStep`, and `SaveControl` as noted in the test plan. 
</details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant Dev as Developer participant FE as Frontend Build participant API as Backend API participant Gen as Generated Types Note over Dev,Gen: Refactor: Remove OldAgentLibraryView & NEW_AGENT_RUNS flag Dev->>FE: Delete OldAgentLibraryView (13 files, ~2200 lines) Dev->>FE: Remove NEW_AGENT_RUNS from Flag enum Dev->>FE: Delete library/legacy/[id]/page.tsx Dev->>FE: Move CronScheduler → src/components/contextual/ Dev->>FE: Move agent-run-draft-view → legacy-builder/ Dev->>FE: Move agent-status-chip → legacy-builder/ Dev->>FE: Update RunnerInputUI import path Dev->>FE: Update SaveControl import path Dev->>FE: Update AgentInfoStep import path Dev->>FE: Update useRunGraph.ts FE->>Gen: Import GraphExecutionMeta from generated models Note over Gen: Type available after pnpm generate:api Gen-->>API: Uses OpenAPI spec schema API-->>FE: Type-safe GraphExecutionMeta model ``` </details> <!-- greptile_other_comments_section --> <!-- /greptile_comment --> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-16 18:29:59 +08:00
Abhimanyu Yadav	e2d3c8a217	fix(frontend): Prevent node drag when selecting text in object editor key input (#11955) ## Summary - Add `nodrag` class to the key name input wrapper in `WrapIfAdditionalTemplate.tsx` - This prevents the node from being dragged when users try to select text in the key name input field - Follows the same pattern used by other input components like `TextWidget.tsx` ## Test plan - [x] Open the new builder - [x] Add a custom node with an Object input field - [x] Try to select text in the key name input by clicking and dragging - [x] Verify that text selection works without moving the block Co-authored-by: Claude <noreply@anthropic.com>	2026-02-16 06:59:33 +00:00
Eve	647c8ed8d4	feat(backend/blocks): enhance list concatenation with advanced operations (#12105 ) ## Summary Enhances the existing `ConcatenateListsBlock` and adds five new companion blocks for comprehensive list manipulation, addressing issue #11139 ("Implement block to concatenate lists"). ### Changes - Enhanced `ConcatenateListsBlock` with optional deduplication (`deduplicate`) and None-value filtering (`remove_none`), plus an output `length` field - New `FlattenListBlock`: Recursively flattens nested list structures with configurable `max_depth` - New `InterleaveListsBlock`: Round-robin interleaving of elements from multiple lists - New `ZipListsBlock`: Zips corresponding elements from multiple lists with support for padding to longest or truncating to shortest - New `ListDifferenceBlock`: Computes set difference between two lists (regular or symmetric) - New `ListIntersectionBlock`: Finds common elements between two lists, preserving order ### Helper Utilities Extracted reusable helper functions for validation, flattening, deduplication, interleaving, chunking, and statistics computation to support the blocks and enable future reuse. 
### Test Coverage Comprehensive test suite with 188 test functions across 29 test classes covering: - Built-in block test harness validation for all 6 blocks - Manual edge-case tests for each block (empty inputs, large lists, mixed types, nested structures) - Internal method tests for all block classes - Unit tests for all helper utility functions Closes #11139 ## Test plan - [x] All files pass Python syntax validation (`ast.parse`) - [x] Built-in `test_input`/`test_output` tests defined for all blocks - [x] Manual tests cover edge cases: empty lists, large lists, mixed types, nested structures, deduplication, None removal - [x] Helper function tests validate all utility functions independently - [x] All block IDs are valid UUID4 - [x] Block categories set to `BlockCategory.BASIC` for consistency with existing list blocks <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Enhanced `ConcatenateListsBlock` with deduplication and None-filtering options, and added five new list manipulation blocks (`FlattenListBlock`, `InterleaveListsBlock`, `ZipListsBlock`, `ListDifferenceBlock`, `ListIntersectionBlock`) with comprehensive helper functions and test coverage. Key Changes: - Enhanced `ConcatenateListsBlock` with `deduplicate` and `remove_none` options, plus `length` output field - Added `FlattenListBlock` for recursively flattening nested lists with configurable `max_depth` - Added `InterleaveListsBlock` for round-robin element interleaving - Added `ZipListsBlock` with support for padding/truncation - Added `ListDifferenceBlock` and `ListIntersectionBlock` for set operations - Extracted 12 reusable helper functions for validation, flattening, deduplication, etc. 
- Comprehensive test suite with 188 test functions covering edge cases Minor Issues: - Helper function `_deduplicate_list` has redundant logic in the `else` branch that duplicates the `if` branch - Three helper functions (`_filter_empty_collections`, `_compute_list_statistics`, `_chunk_list`) are defined but unused - consider removing unless planned for future use - The `_make_hashable` function uses `hash(repr(item))` for unhashable types, which correctly treats structurally identical dicts/lists as duplicates </details> <details><summary><h3>Confidence Score: 4/5</h3></summary> - Safe to merge with minor style improvements recommended - The implementation is well-structured with comprehensive test coverage (188 tests), proper error handling, and follows existing block patterns. All blocks use valid UUID4 IDs and correct categories. The helper functions provide good code reuse. The minor issues are purely stylistic (redundant code, unused helpers) and don't affect functionality or safety. - No files require special attention - both files are well-tested and follow project conventions </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant User participant Block as List Block participant Helper as Helper Functions participant Output User->>Block: Input (lists/parameters) Block->>Helper: _validate_all_lists() Helper-->>Block: validation result alt validation fails Block->>Output: error message else validation succeeds Block->>Helper: _concatenate_lists_simple() / _flatten_nested_list() / etc. 
Helper-->>Block: processed result opt deduplicate enabled Block->>Helper: _deduplicate_list() Helper-->>Block: deduplicated result end opt remove_none enabled Block->>Helper: _filter_none_values() Helper-->>Block: filtered result end Block->>Output: result + length end Output-->>User: Block outputs ``` </details> <sub>Last reviewed commit: a6d5445</sub> <!-- greptile_other_comments_section --> <!-- /greptile_comment --> --------- Co-authored-by: Otto <otto@agpt.co>	2026-02-16 05:39:53 +00:00
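The list operations named in 647c8ed8d4 (depth-limited flattening, round-robin interleaving, zipping with padding or truncation, and deduplication of unhashable items) can be sketched in plain Python. This is a sketch under stated assumptions, not the code from the PR: the function names, the use of `None` to mean "unlimited depth", and the tuple-based key for unhashable items are all illustrative choices.

```python
from itertools import zip_longest
from typing import Any, List, Optional

_MISSING = object()  # sentinel so that real None values survive interleaving


def flatten(items: List[Any], max_depth: Optional[int] = None) -> List[Any]:
    """Flatten nested lists; max_depth=None (an assumption) means no limit."""
    result: List[Any] = []
    for item in items:
        if isinstance(item, list) and (max_depth is None or max_depth > 0):
            next_depth = None if max_depth is None else max_depth - 1
            result.extend(flatten(item, next_depth))
        else:
            result.append(item)
    return result


def interleave(*lists: List[Any]) -> List[Any]:
    """Take one element from each list in turn, skipping exhausted lists."""
    out: List[Any] = []
    for group in zip_longest(*lists, fillvalue=_MISSING):
        out.extend(x for x in group if x is not _MISSING)
    return out


def zip_lists(*lists: List[Any], pad: bool = False, fill: Any = None) -> List[List[Any]]:
    """Zip to the longest list (padding with fill) or truncate to the shortest."""
    rows = zip_longest(*lists, fillvalue=fill) if pad else zip(*lists)
    return [list(row) for row in rows]


def _key(item: Any) -> Any:
    """Hashable key; unhashable items fall back to their repr."""
    try:
        hash(item)
        return item
    except TypeError:
        # The tuple wrapper keeps the list [1] distinct from the string "[1]".
        return ("unhashable", repr(item))


def deduplicate(items: List[Any]) -> List[Any]:
    """Drop later duplicates while preserving first-seen order."""
    seen = set()
    out: List[Any] = []
    for item in items:
        key = _key(item)
        if key not in seen:
            seen.add(key)
            out.append(item)
    return out
```

For example, `flatten([1, [2, [3, [4]]]], max_depth=1)` leaves the inner `[3, [4]]` intact, while the default flattens everything. The repr-based key treats structurally identical dicts and lists as duplicates, in the spirit of the `hash(repr(item))` approach mentioned in the review summary.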
Zamil Majdy	27d94e395c	feat(backend/sdk): enable WebSearch, block WebFetch, consolidate tool constants (#12108 ) ## Summary - Enable Claude Agent SDK built-in WebSearch tool (Brave Search via Anthropic API) for the CoPilot SDK agent - Explicitly block WebFetch via `SDK_DISALLOWED_TOOLS`. The agent uses the SSRF-protected `mcp__copilot__web_fetch` MCP tool instead - Consolidate all tool security constants (`BLOCKED_TOOLS`, `WORKSPACE_SCOPED_TOOLS`, `DANGEROUS_PATTERNS`, `SDK_DISALLOWED_TOOLS`) into `tool_adapter.py` as a single source of truth — previously scattered across `tool_adapter.py`, `security_hooks.py`, and inline in `service.py` ## Changes - `tool_adapter.py`: Add `WebSearch` to `_SDK_BUILTIN_TOOLS`, add `SDK_DISALLOWED_TOOLS`, move security constants here - `security_hooks.py`: Import constants from `tool_adapter.py` instead of defining locally - `service.py`: Use `SDK_DISALLOWED_TOOLS` instead of inline `["Bash"]` ## Test plan - [x] All 21 security hooks tests pass - [x] Ruff lint clean - [x] All pre-commit hooks pass - [ ] Verify WebSearch works in CoPilot chat (manual test) <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Consolidates tool security constants into `tool_adapter.py` as single source of truth, enables WebSearch (Brave via Anthropic API), and explicitly blocks WebFetch to prevent SSRF attacks. The change improves security by ensuring the agent uses the SSRF-protected `mcp__copilot__web_fetch` tool instead of the built-in WebFetch which can access internal networks like `localhost:8006`. </details> <details><summary><h3>Confidence Score: 5/5</h3></summary> - This PR is safe to merge with minimal risk - The changes improve security by blocking WebFetch (SSRF risk) while enabling safe WebSearch. The consolidation of constants into a single source of truth improves maintainability. 
All existing tests pass (21 security hooks tests), and the refactoring is straightforward with no behavioral changes to existing security logic. The only suggestions are minor improvements: adding a test for WebFetch blocking and considering a lowercase alias for consistency. - No files require special attention </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant Agent as SDK Agent participant Hooks as Security Hooks participant TA as tool_adapter.py participant MCP as MCP Tools Note over TA: SDK_DISALLOWED_TOOLS = ["Bash", "WebFetch"] Note over TA: _SDK_BUILTIN_TOOLS includes WebSearch Agent->>Hooks: Request WebSearch (Brave API) Hooks->>TA: Check BLOCKED_TOOLS TA-->>Hooks: Not blocked Hooks-->>Agent: Allowed ✓ Agent->>Agent: Execute via Anthropic API Agent->>Hooks: Request WebFetch (SSRF risk) Hooks->>TA: Check BLOCKED_TOOLS Note over TA: WebFetch in SDK_DISALLOWED_TOOLS TA-->>Hooks: Blocked Hooks-->>Agent: Denied ✗ Note over Agent: Use mcp__copilot__web_fetch instead Agent->>Hooks: Request mcp__copilot__web_fetch Hooks->>MCP: Validate (MCP tool, not SDK builtin) MCP-->>Hooks: Has SSRF protection Hooks-->>Agent: Allowed ✓ Agent->>MCP: Execute with SSRF checks ``` </details> <sub>Last reviewed commit: 2d9975f</sub> <!-- greptile_other_comments_section --> <!-- /greptile_comment -->	2026-02-15 06:51:25 +00:00
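The gating logic in the sequence diagram for 27d94e395c can be sketched as a single allow/deny check. `SDK_DISALLOWED_TOOLS` and the `mcp__copilot__` prefix come from the commit message; `SDK_BUILTIN_TOOLS` here contains only `WebSearch` because that is the only built-in the message names, and `is_tool_allowed` is a hypothetical helper, not a function from `tool_adapter.py`.

```python
# Values taken from the commit message's diagram note.
SDK_DISALLOWED_TOOLS = ["Bash", "WebFetch"]

# Assumption: only WebSearch is listed, since it is the only built-in named above.
SDK_BUILTIN_TOOLS = {"WebSearch"}

# Prefix seen in `mcp__copilot__web_fetch`.
MCP_TOOL_PREFIX = "mcp__copilot__"


def is_tool_allowed(tool_name: str) -> bool:
    """Deny-list first, then allow CoPilot MCP tools and known built-ins."""
    if tool_name in SDK_DISALLOWED_TOOLS:
        return False
    if tool_name.startswith(MCP_TOOL_PREFIX):
        return True
    # Anything not explicitly known is denied (allowlist behaviour).
    return tool_name in SDK_BUILTIN_TOOLS
```

Checking the deny-list before anything else means a blocked name can never be re-enabled by accidentally adding it to the built-in set.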
Reinier van der Leer	87189e23b5	fix duplicate message appending in copilot service and sdk	2026-02-14 13:06:57 +01:00
Reinier van der Leer	35a37257a6	address thread safety comment	2026-02-13 22:57:52 +01:00
Reinier van der Leer	ee45f56310	fix tests	2026-02-13 21:35:38 +01:00
Reinier van der Leer	bfd04dcf04	address comments	2026-02-13 21:17:32 +01:00
DEEVEN SERU	b8f5c208d0	Handle errors in Jina ExtractWebsiteContentBlock (#12048) ## Summary - catch Jina reader client/server errors in ExtractWebsiteContentBlock and surface a clear error output keyed to the user URL - guard empty responses to return an explicit error instead of yielding blank content - add regression tests covering the happy path and HTTP client failures via a monkeypatched fetch ## Testing - not run (pytest unavailable in this environment) --------- Co-authored-by: Nicholas Tindle <nicktindle@outlook.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co>	2026-02-13 19:15:09 +00:00
Reinier van der Leer	608db31508	address minor comments	2026-02-13 18:40:44 +01:00
Reinier van der Leer	62db72500e	Merge commit '9ddcaa884c86ca2bed0735ef37b2a711f3f20755' into pwuts/open-2995-refactor-move-copilot-ai-generation-tool-execution-to	2026-02-13 18:26:56 +01:00
Reinier van der Leer	c9efc3f51c	Merge branch 'dev' into pwuts/open-2995-refactor-move-copilot-ai-generation-tool-execution-to	2026-02-13 18:05:19 +01:00
Zamil Majdy	9ddcaa884c	Merge remote-tracking branch 'origin/dev' into pwuts/open-2995-refactor-move-copilot-ai-generation-tool-execution-to	2026-02-13 20:47:24 +04:00
Zamil Majdy	b3173ed91f	Merge branch 'dev' and integrate SDK into copilot microservice - Resolve merge conflicts from merged SDK changes (PR #12103) - Move sdk/ files from api/features/chat/sdk/ to copilot/sdk/ - Fix all imports to use backend.copilot.* paths - Move new tools (bash_exec, sandbox, web_fetch, feature_requests, check_operation_status) to copilot/tools/ with updated imports - Add append_and_save_message to model.py (adapted to chat_db() pattern) - Wire SDK service into copilot executor processor with feature flag - Add track_user_message to routes.py stream handler	2026-02-13 20:24:36 +04:00
Zamil Majdy	f9f358c526	feat(mcp): Add MCP tool block with OAuth, tool discovery, and standard credential integration (#12011 ) ## Summary <img width="1000" alt="image" src="https://github.com/user-attachments/assets/18e8ef34-d222-453c-8b0a-1b25ef8cf806" /> <img width="250" alt="image" src="https://github.com/user-attachments/assets/ba97556c-09c5-4f76-9f4e-49a2e8e57468" /> <img width="250" alt="image" src="https://github.com/user-attachments/assets/68f7804a-fe74-442d-9849-39a229c052cf" /> <img width="250" alt="image" src="https://github.com/user-attachments/assets/700690ba-f9fe-4726-8871-3bfbab586001" /> Full-stack MCP (Model Context Protocol) tool block integration that allows users to connect to any MCP server, discover available tools, authenticate via OAuth, and execute tools — all through the standard AutoGPT credential system. ### Backend - MCPToolBlock (`blocks/mcp/block.py`): New block using `CredentialsMetaInput` pattern with optional credentials (`default={}`), supporting both authenticated (OAuth) and public MCP servers. Includes auto-lookup fallback for backward compatibility. 
- MCP Client (`blocks/mcp/client.py`): HTTP transport with JSON-RPC 2.0, tool discovery, tool execution with robust error handling (type-checked error fields, non-JSON response handling) - MCP OAuth Handler (`blocks/mcp/oauth.py`): RFC 8414 discovery, dynamic per-server OAuth with PKCE, token storage and refresh via `raise_for_status=True` - MCP API Routes (`api/features/mcp/routes.py`): `discover-tools`, `oauth/login`, `oauth/callback` endpoints with credential cleanup, defensive OAuth metadata validation - Credential system integration: - `CredentialsMetaInput` model_validator normalizes legacy `"ProviderName.MCP"` format from Python 3.13's `str(StrEnum)` change - `CredentialsFieldInfo.combine()` supports URL-based credential discrimination (each MCP server gets its own credential entry) - `aggregate_credentials_inputs` checks block schema defaults for credential optionality - Executor normalizes credential data for both Pydantic and JSON schema validation paths - Chat credential matching handles MCP server URL filtering - `provider_matches()` helper used consistently for Python 3.13 StrEnum compatibility - Pre-run validation: `_validate_graph_get_errors` now calls `get_missing_input()` for custom block-level validation (MCP tool arguments) - Security: HTML tag stripping loop to prevent XSS bypass, SSRF protection (removed trusted_origins) ### Frontend - MCPToolDialog (`MCPToolDialog.tsx`): Full tool discovery UI — enter server URL, authenticate if needed, browse tools, select tool and configure - OAuth popup (`oauth-popup.ts`): Shared utility supporting cross-origin MCP OAuth flows with BroadcastChannel + localStorage fallback - Credential integration: MCP-specific OAuth flow in `useCredentialsInput`, server URL filtering in `useCredentials`, MCP callback page - CredentialsSelect: Auto-selects first available credential instead of defaulting to "None", credentials listed before "None" in dropdown - Node rendering: Dynamic tool input schema rendering on MCP 
nodes, proper handling in both legacy and new flow editors - Block title persistence: `customized_name` set at block creation for both MCP and Agent blocks — no fallback logic needed, titles survive save/load reliably - Stable credential ordering: Removed `sortByUnsetFirst` that caused credential inputs to jump when selected ### Tests (~2060 lines) - Unit tests: block, client, tool execution - Integration tests: mock MCP server with auth - OAuth flow tests - API endpoint tests - Credential combining/optionality tests - E2e tests (skipped in CI, run manually) ## Key Design Decisions 1. Optional credentials via `default={}`: MCP servers can be public (no auth) or private (OAuth). The `credentials` field has `default={}` making it optional at the schema level, so public servers work without prompting for credentials. 2. URL-based credential discrimination: Each MCP server URL gets its own credential entry in the "Run agent" form (via `discriminator="server_url"`), so agents using multiple MCP servers prompt for each independently. 3. Model-level normalization: Python 3.13 changed `str(StrEnum)` to return `"ClassName.MEMBER"`. Rather than scattering fixes across the codebase, a Pydantic `model_validator(mode="before")` on `CredentialsMetaInput` handles normalization centrally, and `provider_matches()` handles lookups. 4. Credential auto-select: `CredentialsSelect` component defaults to the first available credential and notifies the parent state, ensuring credentials are pre-filled in the "Run agent" dialog without requiring manual selection. 5. customized_name for block titles: Both MCP and Agent blocks set `customized_name` in metadata at creation time. This eliminates convoluted runtime fallback logic (`agent_name`, hostname extraction) — the title is persisted once and read directly. 
## Test plan - [x] Unit/integration tests pass (68 MCP + 11 graph = 79 tests) - [x] Manual: MCP block with public server (DeepWiki) — no credentials needed, tools discovered and executable - [x] Manual: MCP block with OAuth server (Linear, Sentry) — OAuth flow prompts correctly - [x] Manual: "Run agent" form shows correct credential requirements per MCP server - [x] Manual: Credential auto-selects when exactly one matches, pre-selects first when multiple exist - [x] Manual: Credential ordering stays stable when selecting/deselecting - [x] Manual: MCP block title persists after save and refresh - [x] Manual: Agent block title persists after save and refresh (via customized_name) - [ ] Manual: Shared agent with MCP block prompts new user for credentials --------- Co-authored-by: Otto <otto@agpt.co> Co-authored-by: Ubbe <hi@ubbe.dev>	2026-02-13 16:17:03 +00:00
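Key Design Decision 3 in the commit above describes normalizing legacy `"ProviderName.MCP"` strings centrally. The following is only a minimal, stdlib-only sketch of that idea, not the repository's validator: the `ProviderName` stand-in enum, `normalize_provider`, and `provider_matches_sketch` are hypothetical names introduced for illustration, and the real code applies the normalization inside a Pydantic `model_validator(mode="before")`.

```python
from enum import Enum


class ProviderName(str, Enum):
    """Stand-in enum for illustration; the real enum lives in the backend."""

    MCP = "mcp"
    GITHUB = "github"


def normalize_provider(raw: str) -> str:
    """Map a legacy "ProviderName.MCP" string to the plain value "mcp".

    Strings that do not carry the legacy prefix, or that name an unknown
    member, are returned unchanged so later validation can reject them.
    """
    prefix = f"{ProviderName.__name__}."
    if raw.startswith(prefix):
        member = raw[len(prefix):]
        if member in ProviderName.__members__:
            return ProviderName[member].value
    return raw


def provider_matches_sketch(stored: str, expected: ProviderName) -> bool:
    """Compare a stored provider string against an enum member.

    Hypothetical helper: it normalizes first so both spellings match.
    """
    return normalize_provider(stored) == expected.value
```

Normalizing in one place means callers can compare providers without caring which spelling was persisted.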
Zamil Majdy	52b3aebf71	feat(backend/sdk): Claude Agent SDK integration for CoPilot (#12103 ) ## Summary Full integration of the Claude Agent SDK to replace the existing one-turn OpenAI-compatible CoPilot implementation with a multi-turn, tool-using AI agent. ### What changed Core SDK Integration (`chat/sdk/` — new module) - `service.py`: Main orchestrator — spawns Claude Code CLI as a subprocess per user message, streams responses back via SSE. Handles conversation history compression, session lifecycle, and error recovery. - `response_adapter.py`: Translates Claude Agent SDK events (text deltas, tool use, errors, result messages) into the existing CoPilot `StreamEvent` protocol so the frontend works unchanged. - `tool_adapter.py`: Bridges CoPilot's MCP tools (find_block, run_block, create_agent, etc.) into the SDK's tool format. Handles schema conversion and result serialization. - `security_hooks.py`: Pre/Post tool-use hooks that enforce a strict allowlist of tools, block path traversal, sandbox file operations to per-session workspace directories, cap sub-agent spawning, and prevent the model from accessing unauthorized system resources. - `transcript.py`: JSONL transcript I/O utilities for the stateless `--resume` feature (see below). Stateless Multi-Turn Resume (new) - Instead of compressing conversation history via LLM on every turn (lossy and expensive), we capture Claude Code's native JSONL session transcript via a Stop hook callback, persist it in the DB (`ChatSession.sdkTranscript`), and restore it on the next turn via `--resume <file>`. - This preserves full tool call/result context across turns with zero token overhead for history. - Feature-flagged via `CLAUDE_AGENT_USE_RESUME` (default: off). - DB migration: `ALTER TABLE "ChatSession" ADD COLUMN "sdkTranscript" TEXT`. 
Sandboxed Tool Execution (`chat/tools/`) - `bash_exec.py`: Sandboxed bash execution using bubblewrap (`bwrap`) with read-only root filesystem, per-session writable workspace, resource limits (CPU, memory, file size), and network isolation. - `sandbox.py`: Shared bubblewrap sandbox infrastructure — generates `bwrap` command lines with configurable mounts, environment, and resource constraints. - `web_fetch.py`: URL fetching tool with domain allowlist, size limits, and content-type filtering. - `check_operation_status.py`: Polling tool for long-running operations (agent creation, block execution) so the SDK doesn't block waiting. - `find_block.py` / `run_block.py`: Enhanced with category filtering, optimized response size (removed raw JSON schemas), and better error handling. Security - Path traversal prevention: session IDs sanitized, all file ops confined to workspace dirs, symlink resolution. - Tool allowlist enforcement via SDK hooks — model cannot call arbitrary tools. - Built-in `Bash` tool blocked via `disallowed_tools` to prevent bypassing sandboxed `bash_exec`. - Sub-agent (`Task`) spawning capped at configurable limit (default: 10). - CodeQL-clean path sanitization patterns. Streaming & Reconnection - SSE stream registry backed by Redis Streams for crash-resilient reconnection. - Long-running operation tracking with TTL-based cleanup. - Atomic message append to prevent race conditions on concurrent writes. 
Configuration (`config.py`) - `use_claude_agent_sdk` — master toggle (default: on) - `claude_agent_model` — model override for SDK path - `claude_agent_max_buffer_size` — JSON parsing buffer (10MB) - `claude_agent_max_subtasks` — sub-agent cap (10) - `claude_agent_use_resume` — transcript-based resume (default: off) - `thinking_enabled` — extended thinking for Claude models Tests - `sdk/response_adapter_test.py` — 366 lines covering all event translation paths - `sdk/security_hooks_test.py` — 165 lines covering tool blocking, path traversal, subtask limits - `chat/model_test.py` — 214 lines covering session model serialization - `chat/service_test.py` — Integration tests including multi-turn resume keyword recall - `tools/find_block_test.py` / `run_block_test.py` — Extended with new tool behavior tests ## Test plan - [x] Unit tests pass (`sdk/response_adapter_test.py`, `security_hooks_test.py`, `model_test.py`) - [x] Integration test: multi-turn keyword recall via `--resume` (`service_test.py::test_sdk_resume_multi_turn`) - [x] Manual E2E: CoPilot chat sessions with tool calls, bash execution, and multi-turn context - [x] Pre-commit hooks pass (ruff, isort, black, pyright, flake8) - [ ] Staging deployment with `claude_agent_use_resume=false` initially - [ ] Enable resume in staging, verify transcript capture and recall <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> This PR replaces the existing OpenAI-compatible CoPilot with a full Claude Agent SDK integration, introducing multi-turn conversations, stateless resume via JSONL transcripts, and sandboxed tool execution. 
Key changes: - SDK integration (`chat/sdk/`): spawns Claude Code CLI subprocess per message, translates events to frontend protocol, bridges MCP tools - Stateless resume: captures JSONL transcripts via Stop hook, persists in `ChatSession.sdkTranscript`, restores with `--resume` (feature-flagged, default off) - Sandboxed execution: bubblewrap sandbox for bash commands with filesystem whitelist, network isolation, resource limits - Security hooks: tool allowlist enforcement, path traversal prevention, workspace-scoped file operations, sub-agent spawn limits - Long-running operations: delegates `create_agent`/`edit_agent` to existing stream_registry infrastructure for SSE reconnection - Feature flag: `CHAT_USE_CLAUDE_AGENT_SDK` with LaunchDarkly support, defaults to enabled Security issues found: - Path traversal validation has logic errors in `security_hooks.py:82` (tilde expansion order) and `service.py:266` (redundant `..` check) - Config validator always prefers env var over explicit `False` value (`config.py:162`) - Race condition in `routes.py:323` — message persisted before task registration, could duplicate on retry - Resource limits in sandbox may fail silently (`sandbox.py:109`) Test coverage is strong with 366 lines for response adapter, 165 for security hooks, and integration tests for multi-turn resume. </details> <details><summary><h3>Confidence Score: 3/5</h3></summary> - This PR is generally safe but has critical security issues in path validation that must be fixed before merge - Score reflects strong architecture and test coverage offset by real security vulnerabilities: the tilde expansion bug in `security_hooks.py` could allow sandbox escape, the race condition could cause message duplication, and the silent ulimit failures could bypass resource limits. The bubblewrap sandbox and allowlist enforcement are well-designed, but the path validation bugs need fixing. The transcript resume feature is properly feature-flagged. 
Overall the implementation is solid but the security issues prevent a higher score. - Pay close attention to `backend/api/features/chat/sdk/security_hooks.py` (path traversal vulnerability), `backend/api/features/chat/routes.py` (race condition), `backend/api/features/chat/tools/sandbox.py` (silent resource limit failures), and `backend/api/features/chat/sdk/service.py` (redundant security check) </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant Frontend participant Routes as routes.py participant SDKService as sdk/service.py participant ClaudeSDK as Claude Agent SDK CLI participant SecurityHooks as security_hooks.py participant ToolAdapter as tool_adapter.py participant CoPilotTools as tools/* participant Sandbox as sandbox.py (bwrap) participant DB as Database participant Redis as stream_registry Frontend->>Routes: POST /chat (user message) Routes->>SDKService: stream_chat_completion_sdk() SDKService->>DB: get_chat_session() DB-->>SDKService: session + messages alt Resume enabled AND transcript exists SDKService->>SDKService: validate_transcript() SDKService->>SDKService: write_transcript_to_tempfile() Note over SDKService: Pass --resume to SDK else No resume SDKService->>SDKService: _compress_conversation_history() Note over SDKService: Inject history into user message end SDKService->>SecurityHooks: create_security_hooks() SDKService->>ToolAdapter: create_copilot_mcp_server() SDKService->>ClaudeSDK: spawn subprocess with MCP server loop Streaming Conversation ClaudeSDK->>SDKService: AssistantMessage (text/tool_use) SDKService->>Frontend: StreamTextDelta / StreamToolInputAvailable alt Tool Call ClaudeSDK->>SecurityHooks: PreToolUse hook SecurityHooks->>SecurityHooks: validate path, check allowlist alt Tool blocked SecurityHooks-->>ClaudeSDK: deny else Tool allowed SecurityHooks-->>ClaudeSDK: allow ClaudeSDK->>ToolAdapter: call MCP tool alt Long-running tool (create_agent, edit_agent) ToolAdapter->>Redis: 
register task ToolAdapter->>DB: save OperationPendingResponse ToolAdapter->>ToolAdapter: spawn background task ToolAdapter-->>ClaudeSDK: OperationStartedResponse else Regular tool (find_block, bash_exec) ToolAdapter->>CoPilotTools: execute() alt bash_exec CoPilotTools->>Sandbox: run_sandboxed() Sandbox->>Sandbox: build bwrap command Note over Sandbox: Network isolation,<br/>filesystem whitelist,<br/>resource limits Sandbox-->>CoPilotTools: stdout, stderr, exit_code end CoPilotTools-->>ToolAdapter: result ToolAdapter->>ToolAdapter: stash full output ToolAdapter-->>ClaudeSDK: MCP response end SecurityHooks->>SecurityHooks: PostToolUse hook (log) end end ClaudeSDK->>SDKService: UserMessage (ToolResultBlock) SDKService->>ToolAdapter: pop_pending_tool_output() SDKService->>Frontend: StreamToolOutputAvailable end ClaudeSDK->>SecurityHooks: Stop hook SecurityHooks->>SDKService: transcript_path callback SDKService->>SDKService: read_transcript_file() SDKService->>DB: save transcript to session.sdkTranscript ClaudeSDK->>SDKService: ResultMessage (success) SDKService->>Frontend: StreamFinish SDKService->>DB: upsert_chat_session() ``` </details> <sub>Last reviewed commit: 28c1121</sub> <!-- greptile_other_comments_section --> <!-- /greptile_comment --> --------- Co-authored-by: Swifty <craigswift13@gmail.com>	2026-02-13 15:49:03 +00:00
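The Security section of the commit above says file operations are confined to per-session workspace directories with symlink resolution, and the review flags the order of tilde expansion as a source of bugs. A minimal sketch of one way to order those steps follows; `is_within_workspace` is a hypothetical helper, not the code in `security_hooks.py`, and it assumes relative paths are meant to be interpreted relative to the workspace root.

```python
import os
from pathlib import Path


def is_within_workspace(candidate: str, workspace: str) -> bool:
    """Return True only if `candidate` resolves to a path inside `workspace`.

    Order matters: expand "~" first, then resolve ".." segments and
    symlinks, and only then compare against the resolved workspace root.
    """
    expanded = os.path.expanduser(candidate)
    root = Path(workspace).resolve()
    target = Path(expanded)
    if not target.is_absolute():
        # Assumption: relative paths are relative to the workspace root.
        target = root / target
    resolved = target.resolve()
    return resolved == root or root in resolved.parents
```

Comparing resolved paths, rather than scanning the raw string for `..`, avoids relying on a separate textual check that can drift out of sync with what the filesystem actually does.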
Bently	c2368f15ff	fix(blocks): disable PrintToConsoleBlock (#12100) ## Summary Disables the Print to Console block as requested by Nick Tindle. ## Changes - Added `disabled=True` to PrintToConsoleBlock in `basic.py` ## Testing - Block will no longer appear in the platform UI - Existing graphs using this block should be checked (block ID: `f3b1c1b2-4c4f-4f0d-8d2f-4c4f0d8d2f4c`) Closes OPEN-3000 <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Added `disabled=True` parameter to `PrintToConsoleBlock` in `basic.py` per Nick Tindle's request (OPEN-3000). - Block follows the same disabling pattern used by other blocks in the codebase (e.g., `BlockInstallationBlock`, video blocks, Ayrshare blocks) - Block will no longer appear in the platform UI for new graph creation - Existing graphs using this block (ID: `f3b1c1b2-4c4f-4f0d-8d2f-4c4f0d8d2f4c`) will need to be checked for compatibility - Comment properly documents the reason for disabling </details> <details><summary><h3>Confidence Score: 5/5</h3></summary> - This PR is safe to merge with minimal risk - Single-line change that adds a well-documented flag following existing patterns used throughout the codebase. The change is non-destructive and only affects UI visibility of the block for new graphs. - No files require special attention </details> <sub>Last reviewed commit: 759003b</sub> <!-- greptile_other_comments_section --> <!-- /greptile_comment -->	2026-02-13 15:20:23 +00:00
Swifty	5035b69c79	feat(platform): add feature request tools for CoPilot chat (#12102 ) Users can now search for existing feature requests and submit new ones directly through the CoPilot chat interface. Requests are tracked in Linear with customer need attribution. ### Changes 🏗️ Backend: - Added `SearchFeatureRequestsTool` and `CreateFeatureRequestTool` to the CoPilot chat tools registry - Integrated with Linear GraphQL API for searching issues in the feature requests project, creating new issues, upserting customers, and attaching customer needs - Added `linear_api_key` secret to settings for system-level Linear API access - Added response models (`FeatureRequestSearchResponse`, `FeatureRequestCreatedResponse`, `FeatureRequestInfo`) to the tools models Frontend: - Added `SearchFeatureRequestsTool` and `CreateFeatureRequestTool` UI components with full streaming state handling (input-streaming, input-available, output-available, output-error) - Added helper utilities for output parsing, type guards, animation text, and icon rendering - Wired tools into `ChatMessagesContainer` for rendering in the chat - Added styleguide examples covering all tool states ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Verified search returns matching feature requests from Linear - [x] Verified creating a new feature request creates an issue and customer need in Linear - [x] Verified adding a need to an existing issue works via `existing_issue_id` - [x] Verified error states render correctly in the UI - [x] Verified styleguide page renders all tool states #### For configuration changes: - [x] `.env.default` is updated or already compatible with my changes - [x] I have included a list of my configuration changes in the PR description (under Changes) New secret: `LINEAR_API_KEY` — required for system-level Linear API operations (defaults to 
empty string). <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Adds feature request search and creation tools to CoPilot chat, integrating with Linear's GraphQL API to track user feedback. Users can now search existing feature requests and submit new ones (or add their need to existing issues) directly through conversation. Key changes: - Backend: `SearchFeatureRequestsTool` and `CreateFeatureRequestTool` with Linear API integration via system-level `LINEAR_API_KEY` - Frontend: React components with streaming state handling and accordion UI for search results and creation confirmations - Models: Added `FeatureRequestSearchResponse` and `FeatureRequestCreatedResponse` to response types - Customer need tracking: Upserts customers in Linear and attaches needs to issues for better feedback attribution Issues found: - Missing `LINEAR_API_KEY` entry in `.env.default` (required per PR description checklist) - Hardcoded project/team IDs reduce maintainability - Global singleton pattern could cause issues in async contexts - Using `user_id` as customer name reduces readability in Linear </details> <details><summary><h3>Confidence Score: 4/5</h3></summary> - Safe to merge with minor configuration fix required - The implementation is well-structured with proper error handling, type safety, and follows existing patterns in the codebase. The missing `.env.default` entry is a straightforward configuration issue that must be fixed before deployment but doesn't affect code quality. The other findings are style improvements that don't impact functionality. 
- Verify that `LINEAR_API_KEY` is added to `.env.default` before merging </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant User participant CoPilot UI participant LLM participant FeatureRequestTool participant LinearClient participant Linear API User->>CoPilot UI: Request feature via chat CoPilot UI->>LLM: Send user message LLM->>FeatureRequestTool: search_feature_requests(query) FeatureRequestTool->>LinearClient: query(SEARCH_ISSUES_QUERY) LinearClient->>Linear API: POST /graphql (search) Linear API-->>LinearClient: searchIssues.nodes[] LinearClient-->>FeatureRequestTool: Feature request data FeatureRequestTool-->>LLM: FeatureRequestSearchResponse alt No existing requests found LLM->>FeatureRequestTool: create_feature_request(title, description) FeatureRequestTool->>LinearClient: mutate(CUSTOMER_UPSERT_MUTATION) LinearClient->>Linear API: POST /graphql (upsert customer) Linear API-->>LinearClient: customer {id, name} LinearClient-->>FeatureRequestTool: Customer data FeatureRequestTool->>LinearClient: mutate(ISSUE_CREATE_MUTATION) LinearClient->>Linear API: POST /graphql (create issue) Linear API-->>LinearClient: issue {id, identifier, url} LinearClient-->>FeatureRequestTool: Issue data FeatureRequestTool->>LinearClient: mutate(CUSTOMER_NEED_CREATE_MUTATION) LinearClient->>Linear API: POST /graphql (attach need) Linear API-->>LinearClient: need {id, issue} LinearClient-->>FeatureRequestTool: Need data FeatureRequestTool-->>LLM: FeatureRequestCreatedResponse else Existing request found LLM->>FeatureRequestTool: create_feature_request(title, description, existing_issue_id) FeatureRequestTool->>LinearClient: mutate(CUSTOMER_UPSERT_MUTATION) LinearClient->>Linear API: POST /graphql (upsert customer) Linear API-->>LinearClient: customer {id} LinearClient-->>FeatureRequestTool: Customer data FeatureRequestTool->>LinearClient: mutate(CUSTOMER_NEED_CREATE_MUTATION) LinearClient->>Linear API: POST /graphql (attach need 
to existing) Linear API-->>LinearClient: need {id, issue} LinearClient-->>FeatureRequestTool: Need data FeatureRequestTool-->>LLM: FeatureRequestCreatedResponse end LLM-->>CoPilot UI: Tool response + continuation CoPilot UI-->>User: Display result with accordion UI ``` </details> <sub>Last reviewed commit: af2e093</sub> <!-- greptile_other_comments_section --> <!-- /greptile_comment -->	2026-02-13 15:27:00 +01:00
Reinier van der Leer	648eb9638a	fix bodged merge	2026-02-13 14:59:01 +01:00
Reinier van der Leer	74477bbbf3	Merge branch 'dev' into pwuts/open-2995-refactor-move-copilot-ai-generation-tool-execution-to	2026-02-13 14:39:02 +01:00
Otto	dfa517300b	debug(copilot): Add detailed API error logging (#11942) ## Summary Adds comprehensive error logging for OpenRouter/OpenAI API errors to help diagnose issues like provider routing failures, context length exceeded, rate limits, etc. ## Background While investigating [SECRT-1859](https://linear.app/autogpt/issue/SECRT-1859), we found that when OpenRouter returns errors, the actual error details weren't being captured or logged. Langfuse traces showed `provider_name: 'unknown'` and `completion: null` without any insight into WHY all providers rejected the request. ## Changes - Add `_extract_api_error_details()` to extract rich information from API errors including: - Status code and request ID - Response body (contains OpenRouter's actual error message) - OpenRouter-specific headers (provider, model) - Rate limit headers - Add `_log_api_error()` helper that logs errors with context: - Session ID for correlation - Message count (helps identify context length issues) - Model being used - Retry count - Update error handling in `_stream_chat_chunks()` and `_generate_llm_continuation()` to use new logging - Extract provider's error message from response body for better user feedback ## Example log output ``` API error: { 'error_type': 'APIStatusError', 'error_message': 'Provider returned error', 'status_code': 400, 'request_id': 'req_xxx', 'response_body': {'error': {'message': 'context_length_exceeded', 'type': 'invalid_request_error'}}, 'openrouter_provider': 'unknown', 'session_id': '44fbb803-...', 'message_count': 52, 'model': 'anthropic/claude-opus-4.5', 'retry_count': 0 } ``` ## Testing - [ ] Verified code passes linting (black, isort, ruff) - [ ] Error details are properly extracted from different error types ## Refs - Linear: SECRT-1859 - Thread: https://discord.com/channels/1126875755960336515/1467066151002571034 --------- Co-authored-by: Reinier van der Leer <pwuts@agpt.co>	2026-02-13 13:15:17 +00:00
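The error-logging commit above describes pulling the status code, request ID, and response body out of API errors and then surfacing the provider's own message. A rough sketch of that pattern is shown here under stated assumptions: the function names are hypothetical (they are not the commit's `_extract_api_error_details()`), and the attribute names `status_code`, `request_id`, and `body` are assumed rather than taken from any particular SDK.

```python
from typing import Any, Optional


def extract_api_error_details_sketch(error: Exception) -> dict[str, Any]:
    """Collect whatever diagnostic attributes the exception exposes."""
    details: dict[str, Any] = {
        "error_type": type(error).__name__,
        "error_message": str(error),
    }
    # Attribute names are assumptions; real SDK error classes may differ.
    for attr in ("status_code", "request_id"):
        value = getattr(error, attr, None)
        if value is not None:
            details[attr] = value
    body = getattr(error, "body", None)
    if body is not None:
        details["response_body"] = body
    return details


def provider_error_message(details: dict[str, Any]) -> Optional[str]:
    """Pull the nested provider message out of a response body, if present."""
    body = details.get("response_body")
    if isinstance(body, dict):
        error = body.get("error")
        if isinstance(error, dict):
            message = error.get("message")
            if isinstance(message, str):
                return message
    return None
```

Using `getattr` with a default keeps the helper safe for exceptions that carry none of these attributes, such as plain network errors.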
Reinier van der Leer	cabda535ea	Merge branch 'dev' into pwuts/open-2995-refactor-move-copilot-ai-generation-tool-execution-to	2026-02-13 13:06:39 +01:00
Reinier van der Leer	43b25b5e2f	ci(frontend): Speed up E2E test job (#12090) The frontend `e2e_test` doesn't have a working build cache setup, causing very slow builds and therefore slow test jobs. These changes reduce total test runtime from ~12 minutes to ~5 minutes. ### Changes 🏗️ - Inject build cache config into docker compose config; let `buildx bake` use GHA cache directly - Add `docker-ci-fix-compose-build-cache.py` script - Optimize `backend/Dockerfile` + root `.dockerignore` - Replace broken DIY pnpm store caching with `actions/setup-node` built-in cache management - Add caching for test seed data created in DB ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - CI	2026-02-13 11:09:41 +01:00
Swifty	ab0b537cc7	refactor(backend): optimize find_block response size by removing raw JSON schemas (#12020) ### Changes 🏗️ The `find_block` AutoPilot tool was returning ~90K characters per response (10 blocks). The bloat came from including full JSON Schema objects (`input_schema`, `output_schema`) with all nested `$defs`, `anyOf`, and type definitions for every block. What changed: - `BlockInfoSummary` model: Removed `input_schema` (raw JSON Schema), `output_schema` (raw JSON Schema), and `categories`. Added `output_fields` (compact field-level summaries matching the existing `required_inputs` format). - `BlockListResponse` model: Removed `usage_hint` (info now in `message`). - `FindBlockTool._execute()`: Now extracts compact `output_fields` from output schema properties instead of including the entire raw schema. Credentials handling is unchanged. - Test: Added `test_response_size_average_chars_per_block` with realistic block schemas (HTTP, Email, Claude Code) to measure and assert response size stays under 2K chars/block. - `CLAUDE.md`: Clarified `dev` vs `master` branching strategy. Result: Average response size reduced from ~9,000 to ~1,300 chars per block (~85% reduction). This directly reduces LLM token consumption, latency, and API costs for AutoPilot interactions. ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Verified models import and serialize correctly - [x] Verified response size: 3,970 chars for 3 realistic blocks (avg 1,323/block) - [x] Lint (`ruff check`) and type check (`pyright`) pass on changed files - [x] Frontend compatibility preserved: `blocks[].name` and `count` fields retained for `block_list` handler --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Toran Bruce Richards <toran.richards@gmail.com>	2026-02-13 11:08:51 +01:00
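The refactor above replaces raw JSON Schema objects with compact `output_fields` extracted from the schema's properties. As an illustration only, the sketch below shows how such a summary could be derived; `summarize_output_fields` is a hypothetical name, and the `name`/`type`/`description` shape is an assumption, since the commit does not spell out the `required_inputs` format it matches.

```python
from typing import Any


def summarize_output_fields(schema: dict[str, Any]) -> list[dict[str, str]]:
    """Reduce a JSON Schema's properties to one small record per field.

    Nested `$defs`, `anyOf` branches, and other schema detail are dropped;
    a property without a plain "type" is reported as "any".
    """
    fields = []
    for name, prop in schema.get("properties", {}).items():
        fields.append(
            {
                "name": name,
                "type": str(prop.get("type", "any")),
                "description": str(prop.get("description", "")),
            }
        )
    return fields
```

Because each field collapses to three short strings, the per-block payload grows with the number of fields rather than with the depth of the schema.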
dependabot[bot]	9a8c6ad609	chore(libs/deps): bump the production-dependencies group across 1 directory with 4 updates (#12056 ) Bumps the production-dependencies group with 4 updates in the /autogpt_platform/autogpt_libs directory: [cryptography](https://github.com/pyca/cryptography), [fastapi](https://github.com/fastapi/fastapi), [launchdarkly-server-sdk](https://github.com/launchdarkly/python-server-sdk) and [supabase](https://github.com/supabase/supabase-py). Updates `cryptography` from 46.0.4 to 46.0.5 <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst">cryptography's changelog</a>.</em></p> <blockquote> <p>46.0.5 - 2026-02-10</p> <pre><code> * An attacker could create a malicious public key that reveals portions of your private key when using certain uncommon elliptic curves (binary curves). This version now includes additional security checks to prevent this attack. This issue only affects binary elliptic curves, which are rarely used in real-world applications. Credit to XlabAI Team of Tencent Xuanwu Lab and Atuin Automated Vulnerability Discovery Engine for reporting the issue. CVE-2026-26007 * Support for ``SECT`` binary elliptic curves is deprecated and will be removed in the next release. <p>.. 
v46-0-4:<br /> </code></pre></p> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href="`06e120e682`"><code>06e120e</code></a> bump version for 46.0.5 release (<a href="https://redirect.github.com/pyca/cryptography/issues/14289">#14289</a>)</li> <li><a href="`0eebb9dbb6`"><code>0eebb9d</code></a> EC check key on cofactor > 1 (<a href="https://redirect.github.com/pyca/cryptography/issues/14287">#14287</a>)</li> <li><a href="`bedf6e186b`"><code>bedf6e1</code></a> fix openssl version on 46 branch (<a href="https://redirect.github.com/pyca/cryptography/issues/14220">#14220</a>)</li> <li>See full diff in <a href="https://github.com/pyca/cryptography/compare/46.0.4...46.0.5">compare view</a></li> </ul> </details> <br /> Updates `fastapi` from 0.128.0 to 0.128.7 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/fastapi/fastapi/releases">fastapi's releases</a>.</em></p> <blockquote> <h2>0.128.7</h2> <h3>Features</h3> <ul> <li>✨ Show a clear error on attempt to include router into itself. PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14258">#14258</a> by <a href="https://github.com/JavierSanchezCastro"><code>@JavierSanchezCastro</code></a>.</li> <li>✨ Replace <code>dict</code> by <code>Mapping</code> on <code>HTTPException.headers</code>. PR <a href="https://redirect.github.com/fastapi/fastapi/pull/12997">#12997</a> by <a href="https://github.com/rijenkii"><code>@rijenkii</code></a>.</li> </ul> <h3>Refactors</h3> <ul> <li>♻️ Simplify reading files in memory, do it sequentially instead of (fake) parallel. PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14884">#14884</a> by <a href="https://github.com/tiangolo"><code>@tiangolo</code></a>.</li> </ul> <h3>Docs</h3> <ul> <li>📝 Use <code>dfn</code> tag for definitions instead of <code>abbr</code> in docs. 
PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14744">#14744</a> by <a href="https://github.com/YuriiMotov"><code>@YuriiMotov</code></a>.</li> </ul> <h3>Internal</h3> <ul> <li>✅ Tweak comment in test to reference PR. PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14885">#14885</a> by <a href="https://github.com/tiangolo"><code>@tiangolo</code></a>.</li> <li>🔧 Update LLM-prompt for <code>abbr</code> and <code>dfn</code> tags. PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14747">#14747</a> by <a href="https://github.com/YuriiMotov"><code>@YuriiMotov</code></a>.</li> <li>✅ Test order for the submitted byte Files. PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14828">#14828</a> by <a href="https://github.com/valentinDruzhinin"><code>@valentinDruzhinin</code></a>.</li> <li>🔧 Configure <code>test</code> workflow to run tests with <code>inline-snapshot=review</code>. PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14876">#14876</a> by <a href="https://github.com/YuriiMotov"><code>@YuriiMotov</code></a>.</li> </ul> <h2>0.128.6</h2> <h3>Fixes</h3> <ul> <li>🐛 Fix <code>on_startup</code> and <code>on_shutdown</code> parameters of <code>APIRouter</code>. PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14873">#14873</a> by <a href="https://github.com/YuriiMotov"><code>@YuriiMotov</code></a>.</li> </ul> <h3>Translations</h3> <ul> <li>🌐 Update translations for zh (update-outdated). PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14843">#14843</a> by <a href="https://github.com/tiangolo"><code>@tiangolo</code></a>.</li> </ul> <h3>Internal</h3> <ul> <li>✅ Fix parameterized tests with snapshots. PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14875">#14875</a> by <a href="https://github.com/YuriiMotov"><code>@YuriiMotov</code></a>.</li> </ul> <h2>0.128.5</h2> <h3>Refactors</h3> <ul> <li>♻️ Refactor and simplify Pydantic v2 (and v1) compatibility internal utils. 
PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14862">#14862</a> by <a href="https://github.com/tiangolo"><code>@tiangolo</code></a>.</li> </ul> <h3>Internal</h3> <ul> <li>✅ Add inline snapshot tests for OpenAPI before changes from Pydantic v2. PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14864">#14864</a> by <a href="https://github.com/tiangolo"><code>@tiangolo</code></a>.</li> </ul> <h2>0.128.4</h2> <h3>Refactors</h3> <ul> <li>♻️ Refactor internals, simplify Pydantic v2/v1 utils, <code>create_model_field</code>, better types for <code>lenient_issubclass</code>. PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14860">#14860</a> by <a href="https://github.com/tiangolo"><code>@tiangolo</code></a>.</li> <li>♻️ Simplify internals, remove Pydantic v1 only logic, no longer needed. PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14857">#14857</a> by <a href="https://github.com/tiangolo"><code>@tiangolo</code></a>.</li> <li>♻️ Refactor internals, cleanup unneeded Pydantic v1 specific logic. PR <a href="https://redirect.github.com/fastapi/fastapi/pull/14856">#14856</a> by <a href="https://github.com/tiangolo"><code>@tiangolo</code></a>.</li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... 
(truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`8f82c94de0`"><code>8f82c94</code></a> 🔖 Release version 0.128.7</li> <li><a href="`5bb3423205`"><code>5bb3423</code></a> 📝 Update release notes</li> <li><a href="`6ce5e3e961`"><code>6ce5e3e</code></a> ✅ Tweak comment in test to reference PR (<a href="https://redirect.github.com/fastapi/fastapi/issues/14885">#14885</a>)</li> <li><a href="`65da3dde12`"><code>65da3dd</code></a> 📝 Update release notes</li> <li><a href="`81f82fd955`"><code>81f82fd</code></a> 🔧 Update LLM-prompt for <code>abbr</code> and <code>dfn</code> tags (<a href="https://redirect.github.com/fastapi/fastapi/issues/14747">#14747</a>)</li> <li><a href="`ff721017df`"><code>ff72101</code></a> 📝 Update release notes</li> <li><a href="`ca76a4eba9`"><code>ca76a4e</code></a> 📝 Use <code>dfn</code> tag for definitions instead of <code>abbr</code> in docs (<a href="https://redirect.github.com/fastapi/fastapi/issues/14744">#14744</a>)</li> <li><a href="`1133a4594d`"><code>1133a45</code></a> 📝 Update release notes</li> <li><a href="`38f965985e`"><code>38f9659</code></a> ✅ Test order for the submitted byte Files (<a href="https://redirect.github.com/fastapi/fastapi/issues/14828">#14828</a>)</li> <li><a href="`3f1cc8f8f5`"><code>3f1cc8f</code></a> 📝 Update release notes</li> <li>Additional commits viewable in <a href="https://github.com/fastapi/fastapi/compare/0.128.0...0.128.7">compare view</a></li> </ul> </details> <br /> Updates `launchdarkly-server-sdk` from 9.14.1 to 9.15.0 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/launchdarkly/python-server-sdk/releases">launchdarkly-server-sdk's releases</a>.</em></p> <blockquote> <h2>v9.15.0</h2> <h2><a href="https://github.com/launchdarkly/python-server-sdk/compare/9.14.1...9.15.0">9.15.0</a> (2026-02-10)</h2> <h3>Features</h3> <ul> <li>Drop support for python 3.9 (<a 
href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/393">#393</a>) (<a href="`5b761bd306`">5b761bd</a>)</li> <li>Update ChangeSet to always require a Selector (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/405">#405</a>) (<a href="`5dc4f81688`">5dc4f81</a>)</li> </ul> <h3>Bug Fixes</h3> <ul> <li>Add context manager for clearer, safer locks (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/396">#396</a>) (<a href="`beca0fa498`">beca0fa</a>)</li> <li>Address potential race condition in FeatureStore update_availability (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/391">#391</a>) (<a href="`31cf4875c3`">31cf487</a>)</li> <li>Allow modifying fdv2 data source options independent of main config (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/403">#403</a>) (<a href="`d78079e7f3`">d78079e</a>)</li> <li>Mark copy_with_new_sdk_key method as deprecated (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/353">#353</a>) (<a href="`e471ccc3d5`">e471ccc</a>)</li> <li>Prevent immediate polling on recoverable error (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/399">#399</a>) (<a href="`da565a2dce`">da565a2</a>)</li> <li>Redis store is considered initialized when <code>$inited</code> key is written (<a href="`e99a27d48f`">e99a27d</a>)</li> <li>Stop FeatureStoreClientWrapper poller on close (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/397">#397</a>) (<a href="`468afdfef3`">468afdf</a>)</li> <li>Update DataSystemConfig to accept list of synchronizers (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/404">#404</a>) (<a href="`c73ad14090`">c73ad14</a>)</li> <li>Update reason documentation with inExperiment value (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/401">#401</a>) (<a href="`cbfc3dd887`">cbfc3dd</a>)</li> 
<li>Update Redis to write missing <code>$inited</code> key (<a href="`e99a27d48f`">e99a27d</a>)</li> </ul> <hr /> <p>This PR was generated with <a href="https://github.com/googleapis/release-please">Release Please</a>. See <a href="https://github.com/googleapis/release-please#release-please">documentation</a>.</p> <!-- raw HTML omitted --> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/launchdarkly/python-server-sdk/blob/main/CHANGELOG.md">launchdarkly-server-sdk's changelog</a>.</em></p> <blockquote> <h2><a href="https://github.com/launchdarkly/python-server-sdk/compare/9.14.1...9.15.0">9.15.0</a> (2026-02-10)</h2> <h3>⚠ BREAKING CHANGES</h3> <p><strong>Note:</strong> The following breaking changes apply only to FDv2 (Flag Delivery v2) early access features, which are not subject to semantic versioning and may change without a major version bump.</p> <ul> <li>Update ChangeSet to always require a Selector (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/405">#405</a>) (<a href="`5dc4f81688`">5dc4f81</a>) <ul> <li>The <code>ChangeSetBuilder.finish()</code> method now requires a <code>Selector</code> parameter.</li> </ul> </li> <li>Update DataSystemConfig to accept list of synchronizers (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/404">#404</a>) (<a href="`c73ad14090`">c73ad14</a>) <ul> <li>The <code>DataSystemConfig.synchronizers</code> field now accepts a list of synchronizers, and the <code>ConfigBuilder.synchronizers()</code> method accepts variadic arguments.</li> </ul> </li> </ul> <h3>Features</h3> <ul> <li>Drop support for python 3.9 (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/393">#393</a>) (<a href="`5b761bd306`">5b761bd</a>)</li> </ul> <h3>Bug Fixes</h3> <ul> <li>Add context manager for clearer, safer locks (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/396">#396</a>) (<a 
href="`beca0fa498`">beca0fa</a>)</li> <li>Address potential race condition in FeatureStore update_availability (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/391">#391</a>) (<a href="`31cf4875c3`">31cf487</a>)</li> <li>Allow modifying fdv2 data source options independent of main config (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/403">#403</a>) (<a href="`d78079e7f3`">d78079e</a>)</li> <li>Mark copy_with_new_sdk_key method as deprecated (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/353">#353</a>) (<a href="`e471ccc3d5`">e471ccc</a>)</li> <li>Prevent immediate polling on recoverable error (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/399">#399</a>) (<a href="`da565a2dce`">da565a2</a>)</li> <li>Redis store is considered initialized when <code>$inited</code> key is written (<a href="`e99a27d48f`">e99a27d</a>)</li> <li>Stop FeatureStoreClientWrapper poller on close (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/397">#397</a>) (<a href="`468afdfef3`">468afdf</a>)</li> <li>Update reason documentation with inExperiment value (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/401">#401</a>) (<a href="`cbfc3dd887`">cbfc3dd</a>)</li> <li>Update Redis to write missing <code>$inited</code> key (<a href="`e99a27d48f`">e99a27d</a>)</li> </ul> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href="`e542f737a6`"><code>e542f73</code></a> chore(main): release 9.15.0 (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/394">#394</a>)</li> <li><a href="`e471ccc3d5`"><code>e471ccc</code></a> fix: Mark copy_with_new_sdk_key method as deprecated (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/353">#353</a>)</li> <li><a href="`5dc4f81688`"><code>5dc4f81</code></a> feat: Update ChangeSet to always require a Selector (<a 
href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/405">#405</a>)</li> <li><a href="`f20fffeb1e`"><code>f20fffe</code></a> chore: Remove dead code, clarify names, other cleanup (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/398">#398</a>)</li> <li><a href="`c73ad14090`"><code>c73ad14</code></a> fix: Update DataSystemConfig to accept list of synchronizers (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/404">#404</a>)</li> <li><a href="`d78079e7f3`"><code>d78079e</code></a> fix: Allow modifying fdv2 data source options independent of main config (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/403">#403</a>)</li> <li><a href="`e99a27d48f`"><code>e99a27d</code></a> chore: Support persistent data store verification in contract tests (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/402">#402</a>)</li> <li><a href="`cbfc3dd887`"><code>cbfc3dd</code></a> fix: Update reason documentation with inExperiment value (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/401">#401</a>)</li> <li><a href="`5a1adbb2de`"><code>5a1adbb</code></a> chore: Update sdk_metadata features (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/400">#400</a>)</li> <li><a href="`da565a2dce`"><code>da565a2</code></a> fix: Prevent immediate polling on recoverable error (<a href="https://redirect.github.com/launchdarkly/python-server-sdk/issues/399">#399</a>)</li> <li>Additional commits viewable in <a href="https://github.com/launchdarkly/python-server-sdk/compare/9.14.1...9.15.0">compare view</a></li> </ul> </details> <br /> Updates `supabase` from 2.27.2 to 2.28.0 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/supabase/supabase-py/releases">supabase's releases</a>.</em></p> <blockquote> <h2>v2.28.0</h2> <h2><a 
href="https://github.com/supabase/supabase-py/compare/v2.27.3...v2.28.0">2.28.0</a> (2026-02-10)</h2> <h3>Features</h3> <ul> <li><strong>storage:</strong> add list_v2 method to file_api client (<a href="https://redirect.github.com/supabase/supabase-py/issues/1377">#1377</a>) (<a href="`259f4ad42d`">259f4ad</a>)</li> </ul> <h3>Bug Fixes</h3> <ul> <li><strong>auth:</strong> add missing is_sso_user, deleted_at, banned_until to User model (<a href="https://redirect.github.com/supabase/supabase-py/issues/1375">#1375</a>) (<a href="`7f84a62996`">7f84a62</a>)</li> <li><strong>realtime:</strong> ensure remove_channel removes channel from channels dict (<a href="https://redirect.github.com/supabase/supabase-py/issues/1373">#1373</a>) (<a href="`0923314039`">0923314</a>)</li> <li><strong>realtime:</strong> use pop with default in _handle_message to prevent KeyError (<a href="https://redirect.github.com/supabase/supabase-py/issues/1388">#1388</a>) (<a href="`baea26f7ce`">baea26f</a>)</li> <li><strong>storage3:</strong> replace print() with warnings.warn() for trailing slash notice (<a href="https://redirect.github.com/supabase/supabase-py/issues/1380">#1380</a>) (<a href="`50b099fa06`">50b099f</a>)</li> </ul> <h2>v2.27.3</h2> <h2><a href="https://github.com/supabase/supabase-py/compare/v2.27.2...v2.27.3">2.27.3</a> (2026-02-03)</h2> <h3>Bug Fixes</h3> <ul> <li>deprecate python 3.9 in all packages (<a href="https://redirect.github.com/supabase/supabase-py/issues/1365">#1365</a>) (<a href="`cc72ed75d4`">cc72ed7</a>)</li> <li>ensure storage_url has trailing slash to prevent warning (<a href="https://redirect.github.com/supabase/supabase-py/issues/1367">#1367</a>) (<a href="`4267ff1345`">4267ff1</a>)</li> </ul> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/supabase/supabase-py/blob/main/CHANGELOG.md">supabase's changelog</a>.</em></p> <blockquote> <h2><a 
href="https://github.com/supabase/supabase-py/compare/v2.27.3...v2.28.0">2.28.0</a> (2026-02-10)</h2> <h3>Features</h3> <ul> <li><strong>storage:</strong> add list_v2 method to file_api client (<a href="https://redirect.github.com/supabase/supabase-py/issues/1377">#1377</a>) (<a href="`259f4ad42d`">259f4ad</a>)</li> </ul> <h3>Bug Fixes</h3> <ul> <li><strong>auth:</strong> add missing is_sso_user, deleted_at, banned_until to User model (<a href="https://redirect.github.com/supabase/supabase-py/issues/1375">#1375</a>) (<a href="`7f84a62996`">7f84a62</a>)</li> <li><strong>realtime:</strong> ensure remove_channel removes channel from channels dict (<a href="https://redirect.github.com/supabase/supabase-py/issues/1373">#1373</a>) (<a href="`0923314039`">0923314</a>)</li> <li><strong>realtime:</strong> use pop with default in _handle_message to prevent KeyError (<a href="https://redirect.github.com/supabase/supabase-py/issues/1388">#1388</a>) (<a href="`baea26f7ce`">baea26f</a>)</li> <li><strong>storage3:</strong> replace print() with warnings.warn() for trailing slash notice (<a href="https://redirect.github.com/supabase/supabase-py/issues/1380">#1380</a>) (<a href="`50b099fa06`">50b099f</a>)</li> </ul> <h2><a href="https://github.com/supabase/supabase-py/compare/v2.27.2...v2.27.3">2.27.3</a> (2026-02-03)</h2> <h3>Bug Fixes</h3> <ul> <li>deprecate python 3.9 in all packages (<a href="https://redirect.github.com/supabase/supabase-py/issues/1365">#1365</a>) (<a href="`cc72ed75d4`">cc72ed7</a>)</li> <li>ensure storage_url has trailing slash to prevent warning (<a href="https://redirect.github.com/supabase/supabase-py/issues/1367">#1367</a>) (<a href="`4267ff1345`">4267ff1</a>)</li> </ul> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href="`59e338400b`"><code>59e3384</code></a> chore(main): release 2.28.0 (<a href="https://redirect.github.com/supabase/supabase-py/issues/1378">#1378</a>)</li> <li><a href="`baea26f7ce`"><code>baea26f</code></a> 
fix(realtime): use pop with default in _handle_message to prevent KeyError (#...</li> <li><a href="`259f4ad42d`"><code>259f4ad</code></a> feat(storage): add list_v2 method to file_api client (<a href="https://redirect.github.com/supabase/supabase-py/issues/1377">#1377</a>)</li> <li><a href="`50b099fa06`"><code>50b099f</code></a> fix(storage3): replace print() with warnings.warn() for trailing slash notice...</li> <li><a href="`0923314039`"><code>0923314</code></a> fix(realtime): ensure remove_channel removes channel from channels dict (<a href="https://redirect.github.com/supabase/supabase-py/issues/1373">#1373</a>)</li> <li><a href="`7f84a62996`"><code>7f84a62</code></a> fix(auth): add missing is_sso_user, deleted_at, banned_until to User model (#...</li> <li><a href="`57dd6e2195`"><code>57dd6e2</code></a> chore(deps): bump the uv group across 1 directory with 3 updates (<a href="https://redirect.github.com/supabase/supabase-py/issues/1369">#1369</a>)</li> <li><a href="`c357def670`"><code>c357def</code></a> chore(main): release 2.27.3 (<a href="https://redirect.github.com/supabase/supabase-py/issues/1368">#1368</a>)</li> <li><a href="`4267ff1345`"><code>4267ff1</code></a> fix: ensure storage_url has trailing slash to prevent warning (<a href="https://redirect.github.com/supabase/supabase-py/issues/1367">#1367</a>)</li> <li><a href="`cc72ed75d4`"><code>cc72ed7</code></a> fix: deprecate python 3.9 in all packages (<a href="https://redirect.github.com/supabase/supabase-py/issues/1365">#1365</a>)</li> <li>Additional commits viewable in <a href="https://github.com/supabase/supabase-py/compare/v2.27.2...v2.28.0">compare view</a></li> </ul> </details> <br /> Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. 
<!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Dependency update bumps 4 packages in the production-dependencies group, including a critical security patch for `cryptography` (CVE-2026-26007) that prevents malicious public key attacks on binary elliptic curves. The update also includes bug fixes for `fastapi`, `launchdarkly-server-sdk`, and `supabase`. 
- cryptography 46.0.4 → 46.0.5: patches CVE-2026-26007, deprecates SECT* binary curves - fastapi 0.128.0 → 0.128.7: bug fixes, improved error handling, relaxed Starlette constraint - launchdarkly-server-sdk 9.14.1 → 9.15.0: drops Python 3.9 support (requires >=3.10), fixes race conditions - supabase 2.27.2/2.27.3 → 2.28.0: realtime fixes, new User model fields The lock files correctly resolve all dependencies. Python 3.10+ requirement is already enforced in both packages. However, backend's `pyproject.toml` still specifies `launchdarkly-server-sdk = "^9.14.1"` while the lock file uses 9.15.0 (pulled from autogpt_libs dependency), creating a minor version constraint inconsistency. </details> <details><summary><h3>Confidence Score: 4/5</h3></summary> - This PR is safe to merge with one minor style suggestion - Automated dependency update with critical security patch for cryptography. All updates are backwards-compatible within semver constraints. Lock files correctly resolve all dependencies. Python 3.10+ is already enforced. Only minor issue is version constraint inconsistency in backend's pyproject.toml for launchdarkly-server-sdk, which doesn't affect functionality but should be aligned for clarity. - autogpt_platform/backend/pyproject.toml needs launchdarkly-server-sdk version constraint updated to ^9.15.0 </details> <!-- greptile_other_comments_section --> <!-- /greptile_comment --> --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Otto <otto@agpt.co>	2026-02-13 09:10:11 +00:00
Ubbe	e8c50b96d1	fix(frontend): improve CoPilot chat table styling (#12094 ) ## Summary - Remove left and right borders from tables rendered in CoPilot chat - Increase cell padding (py-3 → py-3.5) for better spacing between text and lines - Applies to both Streamdown (main chat) and MarkdownRenderer (tool outputs) Design feedback from Olivia to make tables "breathe" more. ## Test plan - [ ] Open CoPilot chat and trigger a response containing a table - [ ] Verify tables no longer have left/right borders - [ ] Verify increased spacing between rows - [ ] Check both light and dark modes 🤖 Generated with [Claude Code](https://claude.com/claude-code) <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Improved CoPilot chat table styling by removing left and right borders and increasing vertical padding from `py-3` to `py-3.5`. Changes apply to both: - Streamdown-rendered tables (via CSS selector in `globals.css`) - MarkdownRenderer tables (via Tailwind classes) The changes make tables "breathe" more per design feedback from Olivia. Issue Found: - The CSS padding value in `globals.css:192` is `0.625rem` (`py-2.5`) but should be `0.875rem` (`py-3.5`) to match the PR description and the MarkdownRenderer implementation. </details> <details><summary><h3>Confidence Score: 2/5</h3></summary> - This PR has a logical error that will cause inconsistent table styling between Streamdown and MarkdownRenderer tables - The implementation has an inconsistency where the CSS file uses `py-2.5` padding while the PR description and MarkdownRenderer use `py-3.5`. This will result in different table padding between the two rendering systems, contradicting the goal of consistent styling improvements. 
- Pay close attention to `autogpt_platform/frontend/src/app/globals.css` - the padding value needs to be corrected to match the intended design </details> <!-- greptile_other_comments_section --> <!-- /greptile_comment --> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>	2026-02-13 09:38:59 +08:00
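The padding mismatch flagged in the review above comes down to simple arithmetic on Tailwind's default spacing scale, where one spacing unit is 0.25rem. The following is a minimal sketch of that check; the function names are hypothetical and are not taken from the AutoGPT codebase.

```typescript
// Tailwind's default spacing scale: one unit equals 0.25rem.
const REM_PER_SPACING_UNIT = 0.25;

// Convert a spacing utility suffix (the "3.5" in py-3.5) to rem.
function spacingToRem(units: number): number {
  return units * REM_PER_SPACING_UNIT;
}

// Hypothetical helper: does a raw CSS padding value (in rem) match a utility?
function paddingMatchesUtility(cssRem: number, utilityUnits: number): boolean {
  return Math.abs(cssRem - spacingToRem(utilityUnits)) < 1e-9;
}

// The value reported for globals.css (0.625rem) corresponds to py-2.5,
// not the py-3.5 (0.875rem) named in the PR description.
const reportedCssRem = 0.625;
const matchesIntended = paddingMatchesUtility(reportedCssRem, 3.5);
const matchesOldValue = paddingMatchesUtility(reportedCssRem, 2.5);
```

Under this scale, `py-3.5` resolves to 0.875rem, which is the value the reviewer asks for in the CSS file.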
Ubbe	30e854569a	feat(frontend): add exact timestamp tooltip on run timestamps (#12087 ) Resolves OPEN-2693: Make exact timestamp of runs accessible through UI. The NewAgentLibraryView shows relative timestamps ("2 days ago") for runs and schedules, but unlike the OldAgentLibraryView it didn't show the exact timestamp on hover. This PR adds a native `title` tooltip so users can see the full date/time by hovering. ### Changes 🏗️ - Added `descriptionTitle` prop to `SidebarItemCard` that renders as a `title` attribute on the description text - `TaskListItem` now passes the exact `run.started_at` timestamp via `descriptionTitle` - `ScheduleListItem` now passes the exact `schedule.next_run_time` timestamp via `descriptionTitle` ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [ ] Open an agent in the library view - [ ] Hover over a run's relative timestamp (e.g. "2 days ago") and confirm the full date/time tooltip appears - [ ] Hover over a schedule's relative timestamp and confirm the full date/time tooltip appears 🤖 Generated with [Claude Code](https://claude.com/claude-code) <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Added native tooltip functionality to show exact timestamps in the library view. The implementation adds a `descriptionTitle` prop to `SidebarItemCard` that renders as a `title` attribute on the description text. This allows users to hover over relative timestamps (e.g., "2 days ago") to see the full date/time. 
Changes: - Added optional `descriptionTitle` prop to `SidebarItemCard` component (SidebarItemCard.tsx:10) - `TaskListItem` passes `run.started_at` as the tooltip value (TaskListItem.tsx:84-86) - `ScheduleListItem` passes `schedule.next_run_time` as the tooltip value (ScheduleListItem.tsx:32) - Unrelated fix included: Sentry configuration updated to suppress cross-origin stylesheet errors (instrumentation-client.ts:25-28) Note: The PR includes two separate commits - the main timestamp tooltip feature and a Sentry error suppression fix. The PR description only documents the timestamp feature. </details> <details><summary><h3>Confidence Score: 5/5</h3></summary> - This PR is safe to merge with minimal risk - The changes are straightforward and limited in scope - adding an optional prop that forwards a native HTML attribute for tooltip functionality. The Text component already supports forwarding arbitrary HTML attributes through its spread operator (...rest), ensuring the `title` attribute works correctly. Both the timestamp tooltip feature and the Sentry configuration fix are low-risk improvements with no breaking changes. 
- No files require special attention </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant User participant TaskListItem participant ScheduleListItem participant SidebarItemCard participant Text participant Browser User->>TaskListItem: Hover over run timestamp TaskListItem->>SidebarItemCard: Pass descriptionTitle (run.started_at) SidebarItemCard->>Text: Render with title attribute Text->>Browser: Forward title attribute to DOM Browser->>User: Display native tooltip with exact timestamp User->>ScheduleListItem: Hover over schedule timestamp ScheduleListItem->>SidebarItemCard: Pass descriptionTitle (schedule.next_run_time) SidebarItemCard->>Text: Render with title attribute Text->>Browser: Forward title attribute to DOM Browser->>User: Display native tooltip with exact timestamp ``` </details> <!-- greptile_other_comments_section --> <!-- /greptile_comment --> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 09:38:16 +08:00
Ubbe	301d7cbada	fix(frontend): suppress cross-origin stylesheet security error (#12086 ) ## Summary - Adds `ignoreErrors` to the Sentry client configuration (`instrumentation-client.ts`) to filter out `SecurityError: CSSStyleSheet.cssRules getter: Not allowed to access cross-origin stylesheet` errors - These errors are caused by Sentry Replay (rrweb) attempting to serialize DOM snapshots that include cross-origin stylesheets (from browser extensions or CDN-loaded CSS) - This was reported via Sentry on production, occurring on any page when logged in ## Changes - `frontend/instrumentation-client.ts`: Added `ignoreErrors: [/Not allowed to access cross-origin stylesheet/]` to `Sentry.init()` config ## Test plan - [ ] Verify the error no longer appears in Sentry after deployment - [ ] Verify Sentry Replay still works correctly for other errors - [ ] Verify no regressions in error tracking (other errors should still be captured) 🤖 Generated with [Claude Code](https://claude.com/claude-code) <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Adds error filtering to Sentry client configuration to suppress cross-origin stylesheet security errors that occur when Sentry Replay (rrweb) attempts to serialize DOM snapshots containing stylesheets from browser extensions or CDN-loaded CSS. This prevents noise in Sentry error logs without affecting the capture of legitimate errors. </details> <details><summary><h3>Confidence Score: 5/5</h3></summary> - This PR is safe to merge with minimal risk - The change adds a simple error filter to suppress benign cross-origin stylesheet errors that are caused by Sentry Replay itself. 
The regex pattern is specific and only affects client-side error reporting, with no impact on application functionality or legitimate error capture - No files require special attention </details> <!-- greptile_other_comments_section --> <!-- /greptile_comment --> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 09:37:54 +08:00
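The filtering behaviour described for commit `301d7cbada` can be illustrated without the Sentry SDK itself. This is a sketch only: `isIgnored` is a hypothetical stand-in that approximates how an `ignoreErrors` list is matched (substring match for strings, `test()` for regular expressions), and it is not the SDK's actual implementation. Only the regular expression is taken from the commit message.

```typescript
type IgnorePattern = string | RegExp;

// Pattern quoted in the commit message for `Sentry.init()`.
const ignoreErrors: IgnorePattern[] = [
  /Not allowed to access cross-origin stylesheet/,
];

// Hypothetical stand-in for the SDK's matching step (an assumption):
// strings match as substrings, regular expressions via test().
function isIgnored(message: string, patterns: IgnorePattern[]): boolean {
  return patterns.some((pattern) =>
    typeof pattern === "string" ? message.includes(pattern) : pattern.test(message),
  );
}

const replayError =
  "SecurityError: CSSStyleSheet.cssRules getter: Not allowed to access cross-origin stylesheet";
const unrelatedError = "TypeError: Cannot read properties of undefined";

const replayErrorDropped = isIgnored(replayError, ignoreErrors);
const unrelatedErrorDropped = isIgnored(unrelatedError, ignoreErrors);
```

Because the pattern names the exact message fragment, unrelated errors are not matched and would still be reported.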
Reinier van der Leer	746a36822d	Merge branch 'dev' into pwuts/open-2995-refactor-move-copilot-ai-generation-tool-execution-to	2026-02-13 00:47:00 +01:00
Ubbe	d95aef7665	fix(copilot): stream timeout, long-running tool polling, and CreateAgent UI refresh (#12070) Agent generation completes on the backend but the UI does not update/refresh to show the result. ### Changes 🏗️ - Stream start timeout (12s): If the backend doesn't begin streaming within 12 seconds of submitting a message, the stream is aborted and a destructive toast is shown to the user. - Long-running tool polling: Added `useLongRunningToolPolling` hook that polls the session endpoint every 1.5s while a tool output is in an operating state (`operation_started` / `operation_pending` / `operation_in_progress`). When the backend completes, messages are refreshed so the UI reflects the final result. - CreateAgent UI improvements: Replaced the orbit loader / progress bar with a mini-game, added expanded accordion for saved agents, and improved the saved-agent card with image, icons, and links that open in new tabs. - Backend tweaks: Added `image_url` to `CreateAgentToolOutput`, minor model/service updates for the dummy agent generator. ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Send a message and verify the stream starts within 12s or a toast appears - [x] Trigger agent creation and verify the UI updates when the backend completes - [x] Verify the saved-agent card renders correctly with image, links, and icons --------- Co-authored-by: Otto <otto@agpt.co> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-12 20:06:40 +00:00
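The polling rule described for commit `d95aef7665` (keep polling every 1.5s while any tool output is in an operating state) can be sketched as a pure decision function. The three state names and the interval come from the commit message; the function names and the idea of passing the states as a plain string array are assumptions for illustration, not the actual `useLongRunningToolPolling` implementation.

```typescript
// Interval and state names as given in the commit message.
const POLL_INTERVAL_MS = 1500;
const OPERATING_STATES: ReadonlySet<string> = new Set([
  "operation_started",
  "operation_pending",
  "operation_in_progress",
]);

// Hypothetical helper: is a single tool output still running?
function isOperating(state: string): boolean {
  return OPERATING_STATES.has(state);
}

// Hypothetical helper: returns the delay before the next poll, or null
// when no tool output is operating and polling can stop.
function nextPollDelay(toolOutputStates: string[]): number | null {
  return toolOutputStates.some(isOperating) ? POLL_INTERVAL_MS : null;
}

const stillRunning = nextPollDelay(["operation_in_progress", "finished"]);
const allDone = nextPollDelay(["finished"]);
```

In a real hook, a `null` result would be the point at which the timer is cleared and the messages are refreshed once so the UI shows the final result.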
Nicholas Tindle	cb166dd6fb	feat(blocks): Store sandbox files to workspace (#12073 ) Store files created by sandbox blocks (Claude Code, Code Executor) to the user's workspace for persistence across runs. ### Changes 🏗️ - New `sandbox_files.py` utility (`backend/util/sandbox_files.py`) - Shared module for extracting files from E2B sandboxes - Stores files to workspace via `store_media_file()` (includes virus scanning, size limits) - Returns `SandboxFileOutput` with path, content, and `workspace_ref` - Claude Code block (`backend/blocks/claude_code.py`) - Added `workspace_ref` field to `FileOutput` schema - Replaced inline `_extract_files()` with shared utility - Files from working directory now stored to workspace automatically - Code Executor block (`backend/blocks/code_executor.py`) - Added `files` output field to `ExecuteCodeBlock.Output` - Creates `/output` directory in sandbox before execution - Extracts all files (text + binary) from `/output` after execution - Updated `execute_code()` to support file extraction with `extract_files` param ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Create agent with Claude Code block, have it create a file, verify `workspace_ref` in output - [x] Create agent with Code Executor block, write file to `/output`, verify `workspace_ref` in output - [x] Verify files persist in workspace after sandbox disposal - [x] Verify binary files (images, etc.) 
work correctly in Code Executor - [x] Verify existing graphs using `content` field still work (backward compat) #### For configuration changes: - [x] `.env.default` is updated or already compatible with my changes - [x] `docker-compose.yml` is updated or already compatible with my changes - [x] I have included a list of my configuration changes in the PR description (under Changes) No configuration changes required - this is purely additive backend code. --- Related: Closes SECRT-1931 <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Medium Risk > Adds automatic extraction and workspace storage of sandbox-written files (including binaries for code execution), which can affect output payload size, performance, and file-handling edge cases. > > Overview > Sandbox blocks now persist generated files to workspace. A new shared utility (`backend/util/sandbox_files.py`) extracts files from an E2B sandbox (scoped by a start timestamp) and stores them via `store_media_file`, returning `SandboxFileOutput` with `workspace_ref`. > > `ClaudeCodeBlock` replaces its inline file-scraping logic with this utility and updates the `files` output schema to include `workspace_ref`. > > `ExecuteCodeBlock` adds a `files` output and extends the executor mixin to optionally extract/store files (text + binary) when an `execution_context` is provided; related mocks/tests and docs are updated accordingly. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit `343854c0cf`. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY --> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-12 15:56:59 +00:00
Swifty	3d31f62bf1	Revert "added feature request tooling" This reverts commit `b8b6c9de23`.	2026-02-12 16:39:24 +01:00
Swifty	b8b6c9de23	added feature request tooling	2026-02-12 16:38:17 +01:00
Reinier van der Leer	2a46d3fbf4	address more comments	2026-02-12 15:57:35 +01:00
Reinier van der Leer	ab25516a46	fix _consume_run check	2026-02-12 15:23:18 +01:00
Reinier van der Leer	6e2f595c7d	address comments	2026-02-12 15:10:11 +01:00
Reinier van der Leer	e523eb62b5	fix lint	2026-02-12 14:30:13 +01:00


1970 Commits