AutoGPT

mirror of https://github.com/Significant-Gravitas/AutoGPT.git synced 2026-02-19 02:54:28 -05:00

Author	SHA1	Message	Date
abhi1992002	a100676c89	refactor(backend): clean up imports in rest_api.py and manager.py - Removed unused import of asyncio in manager.py. - Reorganized imports in rest_api.py to improve clarity by moving exception imports to the top and removing redundant lines.	2026-02-19 10:55:52 +05:30
abhi1992002	42dc37ab5e	refactor(backend): rewrite _get_descendant_folder_ids to use single query Replace N recursive DB queries with a single fetch of all user folders and an in-memory tree walk, matching the approach used by _is_descendant_of. Also removes the unnecessary transaction wrapper from the caller since a single query doesn't need it.	2026-02-19 10:54:36 +05:30
abhi1992002	ec31fed2b7	refactor(backend): extract LIBRARY_FOLDER_INCLUDE constant to includes.py Move the repeated folder include dict (LibraryAgents + Children with isDeleted filter) into a shared constant in includes.py, replacing all 4 inline occurrences in db.py.	2026-02-19 10:54:36 +05:30
abhi1992002	d1540128dc	refactor(backend): replace raw existence checks with get_folder/get_library_agent helpers - create_folder: remove parent check entirely (FK constraint handles it) - update_folder: use get_folder for userId authorization - move_folder: use get_folder for self-check, remove parent check (_is_descendant_of already validates via user folder fetch) - delete_folder: use get_folder for userId authorization - move_agent_to_folder: remove agent check (get_library_agent at end handles it), use get_folder for folder authorization - bulk_move_agents_to_folder: use get_folder for folder authorization	2026-02-19 10:54:36 +05:30
abhi1992002	c6223707d1	refactor(backend): inline _check_circular_reference into move_folder Remove redundant wrapper that just delegates to _is_descendant_of.	2026-02-19 10:54:36 +05:30
abhi1992002	661446ab75	refactor(backend): rename include_counts to include_relations in folder helpers Prisma Python doesn't support _count in includes, so we fetch full relations and derive counts via len(). Rename the parameter to accurately reflect what it does.	2026-02-19 10:54:36 +05:30
abhi1992002	346c9a175b	refactor(backend): use get_library_agent_by_graph_id helper in add_store_agent_to_library Replace raw prisma query with existing helper for the non-deleted path.	2026-02-19 10:54:36 +05:30
Abhimanyu Yadav	48bfc400b2	Merge branch 'dev' into abhi/folder-inside-library	2026-02-19 09:28:34 +05:30
Otto	dc77e7b6e6	feat(frontend): Replace advanced switch with chevron on builder nodes (#12152 ) ## Summary Replaces the "Advanced" switch/toggle on builder nodes with a chevron control, matching the UX pattern used for the outputs section. Resolves [OPEN-3006](https://linear.app/autogpt/issue/OPEN-3006/replace-advanced-switch-with-chevron-on-builder-nodes) Before <img width="443" height="348" alt="Screenshot 2026-02-17 at 9 01 31 pm" src="https://github.com/user-attachments/assets/40e98669-3136-4e53-8d46-df18ea32c4d7" /> After <img width="443" height="348" alt="Screenshot 2026-02-17 at 9 00 21 pm" src="https://github.com/user-attachments/assets/0836e3ac-1d0a-43d7-9392-c9d5741b32b6" /> ## Changes - `NodeAdvancedToggle.tsx` — Replaced switch component with a chevron expand/collapse toggle ## Testing Tested and verified by @kpczerwinski <!-- greptile_comment --> <details><summary><h3>Greptile Summary</h3></summary> Replaces the `Switch` toggle for the "Advanced" section on builder nodes with a chevron (`CaretDownIcon`) expand/collapse control, matching the existing UX pattern used in `OutputHandler.tsx`. The change is clean and consistent with the codebase. - Swapped `Switch` component for a ghost `Button` + `CaretDownIcon` with a `rotate-180` transition for visual feedback - Pattern closely mirrors the output section toggle in `OutputHandler.tsx` (lines 120-136) - Removed the top border separator and rounded bottom corners from the container, adjusting the visual spacing - Toggle logic correctly inverts the `showAdvanced` boolean state - Uses Phosphor Icons and design system components per project conventions </details> <details><summary><h3>Confidence Score: 5/5</h3></summary> - This PR is safe to merge — it is a small, focused UI change with no logic or security concerns. - Single file changed with a straightforward UI component swap. The new implementation follows an established pattern already in use in OutputHandler.tsx. Toggle logic is correct and all conventions (Phosphor Icons, design system components, Tailwind styling) are followed. - No files require special attention. </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant User participant NodeAdvancedToggle participant nodeStore User->>NodeAdvancedToggle: Click chevron button NodeAdvancedToggle->>nodeStore: setShowAdvanced(nodeId, !showAdvanced) nodeStore-->>NodeAdvancedToggle: Updated showAdvanced state NodeAdvancedToggle->>NodeAdvancedToggle: Rotate CaretDownIcon (0° ↔ 180°) Note over NodeAdvancedToggle: Advanced fields shown/hidden via FormCreator ``` </details> <sub>Last reviewed commit: ad66080</sub> <!-- greptile_other_comments_section --> Context used: - Context from `dashboard` - autogpt_platform/frontend/CLAUDE.md ([source](https://app.greptile.com/review/custom-context?memory=39861924-d320-41ba-a1a7-a8bff44f780a)) - Context from `dashboard` - autogpt_platform/frontend/src/app/(platform)/build/components/FlowEditor/ARCHITECTURE_FLOW_EDITOR.md ([source](https://app.greptile.com/review/custom-context?memory=0c5511fe-9aeb-4cf1-bbe9-798f2093b748)) <!-- /greptile_comment --> --------- Co-authored-by: Krzysztof Czerwinski <kpczerwinski@gmail.com> Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Ubbe <0ubbe@users.noreply.github.com> Co-authored-by: Ubbe <hi@ubbe.dev>	2026-02-18 15:34:02 +00:00
Otto	ba75cc28b5	fix(copilot): Remove description from feature request search, add PII prevention (#12155 ) Two targeted changes to the CoPilot feature request tools: 1. Remove description from search results — The `search_feature_requests` tool no longer returns issue descriptions. Only the title is needed for duplicate detection, reducing unnecessary data exposure. 2. Prevent PII in created issues — Updated the `create_feature_request` tool description and parameter descriptions to explicitly instruct the LLM to never include personally identifiable information (names, emails, company names, etc.) in Linear issue titles and descriptions. Resolves [SECRT-2010](https://linear.app/autogpt/issue/SECRT-2010)	2026-02-18 14:36:12 +01:00
Otto	15bcdae4e8	fix(backend/copilot): Clean up GCSWorkspaceStorage per worker (#12153 ) The copilot executor runs each worker in its own thread with a dedicated event loop (`asyncio.new_event_loop()`). `aiohttp.ClientSession` is bound to the event loop where it was created — using it from a different loop causes `asyncio.timeout()` to fail with: ``` RuntimeError: Timeout context manager should be used inside a task ``` This was the root cause of transcript upload failures tracked in SECRT-2009 and [Sentry #7272473694](https://significant-gravitas.sentry.io/issues/7272473694/). ### Fix One `GCSWorkspaceStorage` instance per event loop instead of a single shared global. - `get_workspace_storage()` now returns a per-loop GCS instance (keyed by `id(asyncio.get_running_loop())`). Local storage remains shared since it has no async I/O. - `shutdown_workspace_storage()` closes the instance for the current loop only, so `session.close()` always runs on the loop that created the session. - `CoPilotProcessor.cleanup()` shuts down workspace storage on the worker's own loop, then stops the loop. - Manager cleanup submits `cleanup_worker` to each thread pool worker before shutting down the executor — replacing the old approach of creating a temporary event loop that couldn't close cross-loop sessions. ### Changes \| File \| Change \| \|------\|--------\| \| `util/workspace_storage.py` \| `GCSWorkspaceStorage` back to simple single-session class; `get_workspace_storage()` returns per-loop GCS instance; `shutdown_workspace_storage()` scoped to current loop \| \| `copilot/executor/processor.py` \| Added `CoPilotProcessor.cleanup()` and `cleanup_worker()` \| \| `copilot/executor/manager.py` \| Calls `cleanup_worker` on each thread pool worker during shutdown \| Fixes SECRT-2009 --------- Co-authored-by: Reinier van der Leer <pwuts@agpt.co>	2026-02-18 11:17:39 +00:00
Otto	e9ba7e51db	fix(copilot): Route workspace through db_accessors, fix transcript upload (#12148 ) ## Summary Fixes two bugs in the copilot executor: ### SECRT-2008: WorkspaceManager bypasses db_accessors `backend/util/workspace.py` imported 6 workspace functions directly from `backend/data/workspace.py`, which call `prisma()` directly. In the copilot executor (no Prisma connection), these fail. Fix: Replace direct imports with `workspace_db()` from `db_accessors`, routing through the database_manager HTTP client when Prisma is unavailable. Also: - Register all 6 workspace functions in `DatabaseManager` and `DatabaseManagerAsyncClient` - Add `UniqueViolationError` to the service `EXCEPTION_MAPPING` so it's properly re-raised over HTTP (needed for race-condition retry logic) ### SECRT-2009: Transcript upload asyncio.timeout error `asyncio.create_task()` at line 696 of `service.py` creates an orphaned background task in the executor's thread event loop. `gcloud-aio-storage`'s `asyncio.timeout()` fails in this context. Fix: Replace `create_task` with direct `await`. The upload runs after streaming completes (all chunks already yielded), so no user-facing latency impact. The function already has internal try/except error handling.	2026-02-17 22:24:19 +00:00
Reinier van der Leer	d23248f065	feat(backend/copilot): Copilot Executor Microservice (#12057 ) Uncouple Copilot task execution from the REST API server. This should help performance and scalability, and allows task execution to continue regardless of the state of the user's connection. - Resolves #12023 ### Changes 🏗️ - Add `backend.copilot.executor`->`CoPilotExecutor` (setup similar to `backend.executor`->`ExecutionManager`). This executor service uses RabbitMQ-based task distribution, and sticks with the existing Redis Streams setup for task output. It uses a cluster lock mechanism to ensure a task is only executed by one pod, and the `DatabaseManager` for pooled DB access. - Add `backend.data.db_accessors` for automatic choice of direct/proxied DB access Chat requests now flow: API → RabbitMQ → CoPilot Executor → Redis Streams → SSE Client. This enables horizontal scaling of chat processing and isolates long-running LLM operations from the API service. - Move non-API Copilot stuff into `backend.copilot` (from `backend.api.features.chat`) - Updated import paths for all usages - Move `backend.executor.database` to `backend.data.db_manager` and add methods for copilot executor - Updated import paths for all usages - Make `backend.copilot.db` RPC-compatible (-> DB ops return ~~Prisma~~ Pydantic models) - Make `backend.data.workspace` RPC-compatible - Make `backend.data.graphs.get_store_listed_graphs` RPC-compatible DX: - Add `copilot_executor` service to Docker setup Config: - Add `Config.num_copilot_workers` (default 5) and `Config.copilot_executor_port` (default 8008) - Remove unused `Config.agent_server_port` > [!WARNING] > This change adds a new microservice to the system, with entrypoint `backend.copilot.executor`. > The `docker compose` setup has been updated, but if you run the Platform on something else, you'll have to update your deployment config to include this new service. > > When running locally, the `CoPilotExecutor` uses port 8008 by default. ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Copilot works - [x] Processes messages when triggered - [x] Can use its tools #### For configuration changes: - [x] `.env.default` is updated or already compatible with my changes - [x] `docker-compose.yml` is updated or already compatible with my changes - [x] I have included a list of my configuration changes in the PR description (under Changes) --------- Co-authored-by: Zamil Majdy <zamil.majdy@agpt.co>	2026-02-17 16:15:28 +00:00
Bently	905373a712	fix(frontend): use singleton Shiki highlighter for code syntax highlighting (#12144 ) ## Summary Addresses SENTRY-1051: Shiki warning about multiple highlighter instances. ## Problem The `@streamdown/code` package creates a new Shiki highlighter for each language encountered. When users view AI chat responses with code blocks in multiple languages (JavaScript, Python, JSON, YAML, etc.), this creates 10+ highlighter instances, triggering Shiki's warning: > "10 instances have been created. Shiki is supposed to be used as a singleton, consider refactoring your code to cache your highlighter instance" This causes memory bloat and performance degradation. ## Solution Introduced a custom code highlighting plugin that properly implements the singleton pattern: ### New files: - `src/lib/shiki-highlighter.ts` - Singleton highlighter management - `src/lib/streamdown-code-plugin.ts` - Drop-in replacement for `@streamdown/code` ### Key features: - Single shared highlighter - One instance serves all code blocks - Preloaded common languages - JS, TS, Python, JSON, Bash, YAML, etc. - Lazy loading - Additional languages loaded on demand - Result caching - Avoids re-highlighting identical code blocks ### Changes: - Added `shiki` as direct dependency - Updated `message.tsx` to use the new plugin ## Testing - [ ] Verify code blocks render correctly in AI chat - [ ] Confirm no Shiki singleton warnings in console - [ ] Test with multiple languages in same conversation ## Related - Linear: SENTRY-1051 - Sentry: Multiple Shiki instances warning <!-- greptile_comment --> <details><summary><h3>Greptile Summary</h3></summary> Replaced `@streamdown/code` with a custom singleton-based Shiki highlighter implementation to resolve memory bloat from creating multiple highlighter instances per language. The new implementation creates a single shared highlighter with preloaded common languages (JS, TS, Python, JSON, etc.) and lazy-loads additional languages on demand. Results are cached to avoid re-highlighting identical code blocks. Key changes: - Added `shiki` v3.21.0 as a direct dependency - Created `shiki-highlighter.ts` with singleton pattern and language management utilities - Created `streamdown-code-plugin.ts` as a drop-in replacement for `@streamdown/code` - Updated `message.tsx` to import from the new plugin instead of `@streamdown/code` The implementation follows React best practices with async highlighting and callback-based notifications. The cache key uses code length + prefix/suffix for efficient lookups on large code blocks. </details> <details><summary><h3>Confidence Score: 4/5</h3></summary> - Safe to merge with minor considerations for edge cases - The implementation is solid with proper singleton pattern, caching, and async handling. The code is well-structured and addresses the stated problem. However, there's a subtle potential race condition in the callback handling where multiple concurrent requests for the same cache key could trigger duplicate highlight operations before the first completes. The cache key generation using prefix/suffix could theoretically cause false cache hits for large files with identical prefixes and suffixes. Despite these edge cases, the implementation should work correctly for the vast majority of use cases. - No files require special attention </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant UI as Streamdown Component participant Plugin as Custom Code Plugin participant Cache as Token Cache participant Singleton as Shiki Highlighter (Singleton) participant Callbacks as Pending Callbacks UI->>Plugin: highlight(code, lang) Plugin->>Cache: Check cache key alt Cache hit Cache-->>Plugin: Return cached result Plugin-->>UI: Return highlighted tokens else Cache miss Plugin->>Callbacks: Register callback Plugin->>Singleton: Get highlighter instance alt First call Singleton->>Singleton: Create highlighter with preloaded languages end Singleton-->>Plugin: Return highlighter alt Language not loaded Plugin->>Singleton: Load language dynamically end Plugin->>Singleton: codeToTokens(code, lang, themes) Singleton-->>Plugin: Return tokens Plugin->>Cache: Store result Plugin->>Callbacks: Notify all waiting callbacks Callbacks-->>UI: Async callback with result end ``` </details> <sub>Last reviewed commit: 96c793b</sub> <!-- greptile_other_comments_section --> <!-- /greptile_comment -->	2026-02-17 12:15:53 +00:00
Otto	ee9d39bc0f	refactor(copilot): Replace legacy delete dialog with molecules/Dialog (#12136 ) ## Summary Updates the session delete confirmation in CoPilot to use the new `Dialog` component from `molecules/Dialog` instead of the legacy `DeleteConfirmDialog`. ## Changes - ChatSidebar: Use Dialog component for delete confirmation (desktop) - CopilotPage: Use Dialog component for delete confirmation (mobile) ## Behavior - Dialog stays open during deletion with loading state on button - Cancel button disabled while delete is in progress - Delete button shows loading spinner during deletion - Dialog only closes on successful delete or when cancel is clicked (if not deleting) ## Screenshots Dialog uses the same styling as other molecules/Dialog instances in the app ## Requested by @0ubbe <!-- greptile_comment --> <details><summary><h3>Greptile Summary</h3></summary> Replaces the legacy `DeleteConfirmDialog` component with the new `molecules/Dialog` component for session delete confirmations in both desktop (ChatSidebar) and mobile (CopilotPage) views. The new implementation maintains the same behavior: dialog stays open during deletion with a loading state on the delete button and disabled cancel button, closing only on successful deletion or cancel click. </details> <details><summary><h3>Confidence Score: 5/5</h3></summary> - This PR is safe to merge with minimal risk - This is a straightforward component replacement that maintains the same behavior and UX. The Dialog component API is properly used with controlled state, the loading states are correctly implemented, and both mobile and desktop views are handled consistently. The changes are well-tested patterns used elsewhere in the codebase. - No files require special attention </details> <details><summary><h3>Flowchart</h3></summary> ```mermaid flowchart TD A[User clicks delete button] --> B{isMobile?} B -->\|Yes\| C[CopilotPage Dialog] B -->\|No\| D[ChatSidebar Dialog] C --> E[Set sessionToDelete state] D --> E E --> F[Dialog opens with controlled.isOpen] F --> G{User action?} G -->\|Cancel\| H{isDeleting?} H -->\|No\| I[handleCancelDelete: setSessionToDelete null] H -->\|Yes\| J[Cancel button disabled] G -->\|Confirm Delete\| K[handleConfirmDelete called] K --> L[deleteSession mutation] L --> M[isDeleting = true] M --> N[Button shows loading spinner] M --> O[Cancel button disabled] L --> P{Mutation result?} P -->\|Success\| Q[Invalidate sessions query] Q --> R[Clear sessionId if current] R --> S[setSessionToDelete null] S --> T[Dialog closes] P -->\|Error\| U[Show toast error] U --> V[setSessionToDelete null] V --> W[Dialog closes] ``` </details> <sub>Last reviewed commit: 275950c</sub> <!-- greptile_other_comments_section --> <!-- /greptile_comment --> --------- Co-authored-by: Lluis Agusti <hi@llu.lu> Co-authored-by: Ubbe <hi@ubbe.dev>	2026-02-17 19:12:27 +07:00
Swifty	05aaf7a85e	fix(backend): Rename LINEAR_API_KEY to COPILOT_LINEAR_API_KEY to prevent global access (#12143 ) The `LINEAR_API_KEY` environment variable name is too generic — it matches the key name used by integrations/blocks, meaning that if set globally, it could inadvertently grant all users access to Linear through the blocks system rather than restricting it to the copilot feature-request tool. This renames the setting to `COPILOT_LINEAR_API_KEY` to make it clear this key is scoped exclusively to the copilot's feature-request functionality, preventing it from being picked up as a general-purpose Linear credential. ### Changes 🏗️ - Renamed `linear_api_key` → `copilot_linear_api_key` in `Secrets` settings model (`backend/util/settings.py`) - Updated all references in the copilot feature-request tool (`backend/api/features/chat/tools/feature_requests.py`) ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Verified the rename is consistent across all references (settings + feature_requests tool) - [x] No other files reference the old `linear_api_key` setting name #### For configuration changes: - [x] `.env.default` is updated or already compatible with my changes - [x] I have included a list of my configuration changes in the PR description (under Changes) > Note: The env var changes from `LINEAR_API_KEY` to `COPILOT_LINEAR_API_KEY`. Any deployment using the old name will need to update accordingly. <!-- greptile_comment --> <details><summary><h3>Greptile Summary</h3></summary> Renamed `LINEAR_API_KEY` to `COPILOT_LINEAR_API_KEY` in settings and the copilot feature-request tool to prevent unintended access through Linear blocks. Key changes: - Updated `Secrets.linear_api_key` → `Secrets.copilot_linear_api_key` in `backend/util/settings.py` - Updated all references in `backend/api/features/chat/tools/feature_requests.py` - The rename prevents the copilot Linear key from being picked up by the Linear blocks integration (which uses `LINEAR_API_KEY` via `ProviderBuilder` in `backend/blocks/linear/_config.py`) Issues found: - `.env.default` still references `LINEAR_API_KEY` instead of `COPILOT_LINEAR_API_KEY` - Frontend styleguide has a hardcoded error message with the old variable name </details> <details><summary><h3>Confidence Score: 3/5</h3></summary> - Generally safe but requires fixing `.env.default` before deployment - The code changes are correct and achieve the intended security improvement by preventing scope leakage. However, the PR is incomplete - `.env.default` wasn't updated (critical for deployment) and a frontend error message reference was missed. These issues will cause configuration problems for anyone deploying with the new variable name. - Check `autogpt_platform/backend/.env.default` and `autogpt_platform/frontend/src/app/(platform)/copilot/styleguide/page.tsx` - both need updates to match the renamed variable </details> <details><summary><h3>Flowchart</h3></summary> ```mermaid flowchart TD A[".env file<br/>COPILOT_LINEAR_API_KEY"] --> B["Secrets model<br/>copilot_linear_api_key"] B --> C["feature_requests.py<br/>_get_linear_config()"] C --> D["Creates APIKeyCredentials<br/>for copilot feature requests"] E[".env file<br/>LINEAR_API_KEY"] --> F["ProviderBuilder<br/>in blocks/linear/_config.py"] F --> G["Linear blocks integration<br/>for user workflows"] style A fill:#90EE90 style B fill:#90EE90 style C fill:#90EE90 style D fill:#90EE90 style E fill:#FFD700 style F fill:#FFD700 style G fill:#FFD700 ``` </details> <sub>Last reviewed commit: 86dc57a</sub> <!-- greptile_other_comments_section --> <!-- /greptile_comment -->	2026-02-17 11:16:43 +01:00
abhi1992002	728d82739d	refactor(api): improve formatting of exception handler registration - Reformatted the registration of the `FolderAlreadyExistsError` exception handler for better readability by adding line breaks. - This change enhances the clarity of the code structure in the REST API error handling section.	2026-02-17 12:32:35 +05:30
abhi1992002	347e450728	fix(api): update OpenAPI description for folder ID handling - Modified the description for the `folder_id` field in the OpenAPI specification to clarify that using "None" will move the agent to the root folder instead of an empty string. - This change improves the accuracy and clarity of the API documentation regarding folder operations.	2026-02-17 12:18:15 +05:30
abhi1992002	2a4bb22d99	fix(api): ensure field updates are preserved when updating library agent - Modified the `update_library_agent` function to apply any provided field updates before fetching the current agent, preventing loss of data during the update process. - This change enhances the integrity of the library agent update functionality by ensuring all relevant fields are retained.	2026-02-17 12:10:48 +05:30
abhi1992002	ee14e54e80	feat(api): add FolderAlreadyExistsError exception handling - Introduced `FolderAlreadyExistsError` to handle cases where a folder with the same name already exists, improving error management in folder operations. - Updated the REST API to include this new exception in the error handling mechanism, providing clearer responses for folder-related requests. - Refactored import statements for better organization and clarity in the codebase. - Enhanced the `list_library_agents` and `list_favorite_library_agents` functions by simplifying the error handling logic and improving readability.	2026-02-17 12:03:53 +05:30
abhi1992002	d60d79efb6	fix(api): simplify validation error response for circular references - Updated the OpenAPI specification to modify the description of the 400 response for validation errors related to circular references, removing the mention of "depth exceeded" for clarity. - This change enhances the accuracy of the API documentation regarding validation error responses.	2026-02-17 10:52:50 +05:30
abhi1992002	0b16c4da2c	refactor(api): optimize folder ancestry check in database - Updated the `is_descendant_of` function to fetch all user folders in a single query, reducing database round-trips and improving performance. - Replaced the previous method of querying each folder's parent ID with an in-memory parent map, enhancing efficiency in checking folder ancestry. - These changes streamline the folder validation process and improve overall responsiveness of the API.	2026-02-17 10:36:38 +05:30
Abhimanyu Yadav	0952d72fba	Merge branch 'dev' into abhi/folder-inside-library	2026-02-17 10:30:52 +05:30
abhi1992002	cd2e281193	refactor(api): clean up imports and add spacing for readability - Removed duplicate import of `FolderValidationError` in `rest_api.py` to streamline the code. - Added a blank line in `db.py` before the `list_folders` function to enhance code readability. - These changes improve the organization and clarity of the API codebase.	2026-02-17 10:30:09 +05:30
abhi1992002	792e141183	refactor(library): remove unnecessary indexes from LibraryFolder model and migration - Removed the `@@index` directives for `userId` and `parentId` from the `LibraryFolder` model in `schema.prisma` to streamline the database schema. - Updated the migration SQL file to reflect these changes, ensuring a cleaner migration process. - These modifications enhance the efficiency of the database structure by eliminating redundant indexing.	2026-02-17 10:28:43 +05:30
abhi1992002	090975374c	fix(library): enhance error handling and streamline folder operations - Added `FolderValidationError` exception handling in the REST API to improve error responses for folder-related operations. - Refactored several functions in the library routes to remove redundant try-except blocks, simplifying the code and improving readability. - Updated the `LibraryAgent` model to directly use `folderId` and `Folder.name`, enhancing data consistency. - These changes improve the overall robustness and maintainability of the library management system.	2026-02-17 10:18:38 +05:30
abhi1992002	383e60fba5	fix(library): merge settings when updating library agent - Updated the `update_library_agent` function to merge existing and new settings when provided, ensuring that only unset fields are updated. - Removed the `get_folder_depth` function and related depth validation logic to streamline folder operations, as it was deemed unnecessary for current functionality. - These changes enhance the integrity of agent settings management and simplify folder validation processes.	2026-02-17 10:07:32 +05:30
Reinier van der Leer	9d4dcbd9e0	fix(backend/docker): Make `server` last (= default) build stage Without specifying an explicit build target it would build the `migrate` stage because it is the last stage in the Dockerfile. This caused deployment failures. - Follow-up to #12124 and `074be7ae`	2026-02-16 14:49:30 +01:00
Reinier van der Leer	074be7aea6	fix(backend/docker): Update run commands to match deployment - Follow-up to #12124 Changes: - Update `run` commands for all backend services in `docker-compose.platform.yml` to match the deployment commands used in production - Add trigger on `docker-compose(.platform)?.yml` changes to the Frontend CI workflow	2026-02-16 14:23:29 +01:00
Otto	39d28b24fc	ci(backend): Upgrade RabbitMQ from 3.12 (EOL) to 4.1.4 (#12118 ) ## Summary Upgrades RabbitMQ from the end-of-life `rabbitmq:3.12-management` to `rabbitmq:4.1.4`, aligning CI, local dev, and e2e testing with production. ## Changes ### CI Workflow (`.github/workflows/platform-backend-ci.yml`) - Image: `rabbitmq:3.12-management` → `rabbitmq:4.1.4` - Port: Removed 15672 (management UI) — not used - Health check: Added to prevent flaky tests from race conditions during startup ### Docker Compose (`docker-compose.platform.yml`, `docker-compose.test.yaml`) - Image: `rabbitmq:management` → `rabbitmq:4.1.4` - Port: Removed 15672 (management UI) — not used ## Why - RabbitMQ 3.12 is EOL - We don't use the management interface, so `-management` variant is unnecessary - CI and local dev/e2e should match production (4.1.4) ## Testing CI validates that backend tests pass against RabbitMQ 4.1.4 on Python 3.11, 3.12, and 3.13. --- Closes SECRT-1703	2026-02-16 12:45:39 +00:00
Reinier van der Leer	bf79a7748a	fix(backend/build): Update stale Poetry usage in Dockerfile (#12124 ) [SECRT-2006: Dev deployment failing: poetry not found in container PATH](https://linear.app/autogpt/issue/SECRT-2006) - Follow-up to #12090 ### Changes 🏗️ - Remove now-broken Poetry path config values - Remove usage of now-broken `poetry run` in container run command - Add trigger on `backend/Dockerfile` changes to Frontend CI workflow ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - If it works, CI will pass	2026-02-16 13:54:20 +01:00
Otto	649d4ab7f5	feat(chat): Add delete chat session endpoint and UI (#12112 ) ## Summary Adds the ability to delete chat sessions from the CoPilot interface. ## Changes ### Backend - Add `DELETE /api/chat/sessions/{session_id}` endpoint in `routes.py` - Returns 204 on success, 404 if not found or not owned by user - Reuses existing `delete_chat_session` function from `model.py` ### Frontend - Add delete button (trash icon) that appears on hover for each chat session - Add confirmation dialog before deletion using existing `DeleteConfirmDialog` component - Refresh session list after successful delete - Clear current session selection if the deleted session was active - Update OpenAPI spec with new endpoint ## Testing 1. Hover over a chat session in sidebar → trash icon appears 2. Click trash icon → confirmation dialog 3. Confirm deletion → session removed, list refreshes 4. If deleted session was active, selection is cleared ## Screenshots Delete button appears on hover, confirmation dialog on click. ## Related Issues Closes SECRT-1928 <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Adds the ability to delete chat sessions from the CoPilot interface — a new `DELETE /api/chat/sessions/{session_id}` backend endpoint and a corresponding delete button with confirmation dialog in the `ChatSidebar` frontend component. - Backend route (`routes.py`): Clean implementation reusing the existing `delete_chat_session` model function with proper auth guards and 204/404 responses. No issues. - Frontend (`ChatSidebar.tsx`): Adds hover-visible trash icon per session, confirmation dialog, mutation with cache invalidation, and active session clearing on delete. However, it uses a `__legacy__` component (`DeleteConfirmDialog`) which violates the project's style guide — new code should use the modern design system components. Error handling only logs to console without user-facing feedback (project convention is to use toast notifications for mutation errors). `isDeleting` is destructured but unused. - OpenAPI spec updated correctly. - Unrelated file included: `notes/plan-SECRT-1959-graph-edge-desync.md` is a planning document for a different ticket and should be removed from this PR. The `notes/` directory is newly introduced and both plan files should be reconsidered for inclusion. </details> <details><summary><h3>Confidence Score: 3/5</h3></summary> - Functionally correct but has style guide violations and includes unrelated files that should be addressed before merge. - The core feature implementation (backend DELETE endpoint and frontend mutation logic) is sound and follows existing patterns. Score is lowered because: (1) the frontend uses a legacy component explicitly prohibited by the project's style guide, (2) mutation errors are not surfaced to the user, and (3) the PR includes an unrelated planning document for a different ticket. - Pay close attention to `ChatSidebar.tsx` for the legacy component import and error handling, and `notes/plan-SECRT-1959-graph-edge-desync.md` which should be removed. </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant User participant ChatSidebar as ChatSidebar (Frontend) participant ReactQuery as React Query participant API as DELETE /api/chat/sessions/{id} participant Model as model.delete_chat_session participant DB as db.delete_chat_session (Prisma) participant Redis as Redis Cache User->>ChatSidebar: Click trash icon on session ChatSidebar->>ChatSidebar: Show DeleteConfirmDialog User->>ChatSidebar: Confirm deletion ChatSidebar->>ReactQuery: deleteSession({ sessionId }) ReactQuery->>API: DELETE /api/chat/sessions/{session_id} API->>Model: delete_chat_session(session_id, user_id) Model->>DB: delete_many(where: {id, userId}) DB-->>Model: bool (deleted count > 0) Model->>Redis: Delete session cache key Model->>Model: Clean up session lock Model-->>API: True API-->>ReactQuery: 204 No Content ReactQuery->>ChatSidebar: onSuccess callback ChatSidebar->>ReactQuery: invalidateQueries(sessions list) ChatSidebar->>ChatSidebar: Clear sessionId if deleted was active ``` </details> <sub>Last reviewed commit: 44a92c6</sub> <!-- greptile_other_comments_section --> <details><summary><h4>Context used (3)</h4></summary> - Context from `dashboard` - autogpt_platform/frontend/CLAUDE.md ([source](https://app.greptile.com/review/custom-context?memory=39861924-d320-41ba-a1a7-a8bff44f780a)) - Context from `dashboard` - autogpt_platform/frontend/CONTRIBUTING.md ([source](https://app.greptile.com/review/custom-context?memory=cc4f1b17-cb5c-4b63-b218-c772b48e20ee)) - Context from `dashboard` - autogpt_platform/CLAUDE.md ([source](https://app.greptile.com/review/custom-context?memory=6e9dc5dc-8942-47df-8677-e60062ec8c3a)) </details> <!-- /greptile_comment --> --------- Co-authored-by: Zamil Majdy <zamil.majdy@agpt.co>	2026-02-16 12:19:18 +00:00
Ubbe	223df9d3da	feat(frontend): improve create/edit copilot UX (#12117 ) ## Changes 🏗️ Make the UX nicer when running long tasks in Copilot, like creating an agent, editing it or running a task. ## Checklist 📋 ### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Run locally and play the game! <!-- greptile_comment --> <details><summary><h3>Greptile Summary</h3></summary> This PR replaces the static progress bar and idle wait screens with an interactive mini-game across the Create, Edit, and Run Agent copilot tools. The existing mini-game (a simple runner with projectile-dodge boss encounters) is significantly overhauled into a two-mode game: a runner mode with animated tree obstacles and a duel mode featuring a melee boss fight with attack, guard, and movement mechanics. Sprite-based rendering replaces the previous shape-drawing approach. - Create/Edit/Run Agent UX: All three tool views now show the mini-game with contextual overlays during long-running operations, replacing the progress bar in EditAgent and adding the game to RunAgent - Game mechanics overhaul: Boss encounters changed from projectile-dodging to melee duel with attack (Z), block (X), movement (arrows), and jump (Space) controls - Sprite rendering: Added 9 sprite sheet assets for characters, trees, and boss animations with fallback to shape rendering if images fail to load - UI overlays: Added React-managed overlay states for idle, boss-intro, boss-defeated, and game-over screens with continue/retry buttons - Minor issues found: Unused `isRunActive` variable in `MiniGame.tsx`, unreachable "leaving" boss phase in `useMiniGame.ts`, and a missing `expanded` property in `getAccordionMeta` return type annotation in `EditAgent.tsx` - Unused asset: `archer-shoot.png` is included in the PR but never imported or referenced in any code </details> <details><summary><h3>Confidence Score: 4/5</h3></summary> - This PR is safe to merge — it only affects the copilot mini-game UX with no backend or data model changes. - The changes are entirely frontend/cosmetic, scoped to the copilot tools' waiting UX. The mini-game logic is self-contained in a canvas-based hook and doesn't affect any application state, API calls, or routing. The issues found are minor (unused variable, dead code, type annotation gap, unused asset) and don't impact runtime behavior. - `useMiniGame.ts` has the most complex logic changes (boss AI, death animations, sprite rendering) and contains unreachable dead code in the "leaving" phase handler. `EditAgent.tsx` has a return type annotation that doesn't include `expanded`. </details> <details><summary><h3>Flowchart</h3></summary> ```mermaid flowchart TD A[Game Idle] -->\|"Start button"\| B[Run Mode] B -->\|"Jump over trees"\| C{Score >= Threshold?} C -->\|No\| B C -->\|"Yes, obstacles clear"\| D[Boss Intro Overlay] D -->\|"Continue button"\| E[Duel Mode] E -->\|"Attack Z / Guard X / Move ←→"\| F{Boss HP <= 0?} F -->\|No\| G{Player hit & not guarding?} G -->\|No\| E G -->\|Yes\| H[Player Death Animation] H --> I[Game Over Overlay] I -->\|"Retry button"\| B F -->\|Yes\| J[Boss Death Animation] J --> K[Boss Defeated Overlay] K -->\|"Continue button"\| L[Reset Boss & Resume Run] L --> B ``` </details> <sub>Last reviewed commit: ad80e24</sub> <!-- greptile_other_comments_section --> <!-- /greptile_comment -->	2026-02-16 10:53:08 +00:00
Ubbe	187ab04745	refactor(frontend): remove OldAgentLibraryView and NEW_AGENT_RUNS flag (#12088 ) ## Summary - Removes the deprecated `OldAgentLibraryView` directory (13 files, ~2200 lines deleted) - Removes the `NEW_AGENT_RUNS` feature flag from the `Flag` enum and defaults - Removes the legacy agent library page at `library/legacy/[id]` - Moves shared `CronScheduler` components to `src/components/contextual/CronScheduler/` - Moves `agent-run-draft-view` and `agent-status-chip` to `legacy-builder/` (co-located with their only consumer) - Updates all import paths in consuming files (`AgentInfoStep`, `SaveControl`, `RunnerInputUI`, `useRunGraph`) ## Test plan - [x] `pnpm format` passes - [x] `pnpm types` passes (no TypeScript errors) - [x] No remaining references to `OldAgentLibraryView`, `NEW_AGENT_RUNS`, or `new-agent-runs` in the codebase - [x] Verify `RunnerInputUI` dialog still works in the legacy builder - [x] Verify `AgentInfoStep` cron scheduling works in the publish modal - [x] Verify `SaveControl` cron scheduling works in the legacy builder 🤖 Generated with [Claude Code](https://claude.com/claude-code) <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> This PR removes deprecated code from the legacy agent library view system and consolidates the codebase to use the new agent runs implementation exclusively. The refactor successfully removes ~2200 lines of code across 13 deleted files while properly relocating shared components. Key changes: - Removed the entire `OldAgentLibraryView` directory and its 13 component files - Removed the `NEW_AGENT_RUNS` feature flag from the `Flag` enum and defaults - Deleted the legacy agent library page route at `library/legacy/[id]` - Moved `CronScheduler` components to `src/components/contextual/CronScheduler/` for shared use across the application - Moved `agent-run-draft-view` and `agent-status-chip` to `legacy-builder/` directory, co-locating them with their only consumer - Updated `useRunGraph.ts` to import `GraphExecutionMeta` from the generated API models instead of the deleted custom type definition - Updated all import paths in consuming components (`AgentInfoStep`, `SaveControl`, `RunnerInputUI`) Technical notes: - The new import path for `GraphExecutionMeta` (`@/app/api/__generated__/models/graphExecutionMeta`) will be generated when running `pnpm generate:api` from the OpenAPI spec - All references to the old code have been cleanly removed from the codebase - The refactor maintains proper separation of concerns by moving shared components to contextual locations </details> <details><summary><h3>Confidence Score: 4/5</h3></summary> - This PR is safe to merge with minimal risk, pending manual verification of the UI components mentioned in the test plan - The refactor is well-structured and all code changes are correct. The score of 4 (rather than 5) reflects that the PR author has marked three manual testing items as incomplete in the test plan: verifying `RunnerInputUI` dialog, `AgentInfoStep` cron scheduling, and `SaveControl` cron scheduling. While the code changes are sound, these UI components should be manually tested before merging to ensure the moved components work correctly in their new locations. - No files require special attention. The author should complete the manual testing checklist items for `RunnerInputUI`, `AgentInfoStep`, and `SaveControl` as noted in the test plan. </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant Dev as Developer participant FE as Frontend Build participant API as Backend API participant Gen as Generated Types Note over Dev,Gen: Refactor: Remove OldAgentLibraryView & NEW_AGENT_RUNS flag Dev->>FE: Delete OldAgentLibraryView (13 files, ~2200 lines) Dev->>FE: Remove NEW_AGENT_RUNS from Flag enum Dev->>FE: Delete library/legacy/[id]/page.tsx Dev->>FE: Move CronScheduler → src/components/contextual/ Dev->>FE: Move agent-run-draft-view → legacy-builder/ Dev->>FE: Move agent-status-chip → legacy-builder/ Dev->>FE: Update RunnerInputUI import path Dev->>FE: Update SaveControl import path Dev->>FE: Update AgentInfoStep import path Dev->>FE: Update useRunGraph.ts FE->>Gen: Import GraphExecutionMeta from generated models Note over Gen: Type available after pnpm generate:api Gen-->>API: Uses OpenAPI spec schema API-->>FE: Type-safe GraphExecutionMeta model ``` </details> <!-- greptile_other_comments_section --> <!-- /greptile_comment --> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-16 18:29:59 +08:00
Abhimanyu Yadav	91f7c7b0db	Merge branch 'dev' into abhi/folder-inside-library	2026-02-16 13:55:30 +05:30
abhi1992002	01f18acba8	fix(library): validate folder existence when updating agent folder ID - Enhanced the `update_library_agent` function to verify that the specified folder ID belongs to the user and is not deleted before updating the agent's folder ID. - This change improves error handling by raising a `NotFoundError` if the folder is not found, ensuring better data integrity and user feedback during agent updates.	2026-02-16 13:51:27 +05:30
abhi1992002	09f74594ab	fix(tests): update test_get_library_agents_success to include folder_id and include_root_only parameters - Added `folder_id` and `include_root_only` parameters to the `test_get_library_agents_success` function to ensure comprehensive testing of library agent retrieval functionality. - This change enhances the test coverage for scenarios involving folder-specific agent queries.	2026-02-16 13:39:23 +05:30
abhi1992002	85b24468e8	feat(library): add folder_id and folder_name to agent snapshots - Introduced `folder_id` and `folder_name` fields in agent snapshots to support folder organization. - This enhancement improves the structure of agent data, facilitating better management and retrieval of agents within folders.	2026-02-16 13:25:38 +05:30
abhi1992002	f4848a43af	feat(library): improve folder and agent management with code enhancements - Refactored `delete_folder` function to include an asynchronous cleanup process for affected agents, ensuring proper resource management during folder deletions. - Added `FolderValidationError` exception to enhance error handling in folder operations. - Improved logging for database errors in the `update_library_agent` function, providing clearer feedback during agent updates. - Enhanced UI components for better readability and structure, including adjustments to the `LibraryAgentCard` and `FavoritesSection`. These changes enhance the functionality and reliability of folder and agent management, improving user experience and error handling in the library interface.	2026-02-16 13:10:02 +05:30
Abhimanyu Yadav	e2d3c8a217	fix(frontend): Prevent node drag when selecting text in object editor key input (#11955 ) ## Summary - Add `nodrag` class to the key name input wrapper in `WrapIfAdditionalTemplate.tsx` - This prevents the node from being dragged when users try to select text in the key name input field - Follows the same pattern used by other input components like `TextWidget.tsx` ## Test plan - [x] Open the new builder - [x] Add a custom node with an Object input field - [x] Try to select text in the key name input by clicking and dragging - [x] Verify that text selection works without moving the block Co-authored-by: Claude <noreply@anthropic.com>	2026-02-16 06:59:33 +00:00
Eve	647c8ed8d4	feat(backend/blocks): enhance list concatenation with advanced operations (#12105 ) ## Summary Enhances the existing `ConcatenateListsBlock` and adds five new companion blocks for comprehensive list manipulation, addressing issue #11139 ("Implement block to concatenate lists"). ### Changes - Enhanced `ConcatenateListsBlock` with optional deduplication (`deduplicate`) and None-value filtering (`remove_none`), plus an output `length` field - New `FlattenListBlock`: Recursively flattens nested list structures with configurable `max_depth` - New `InterleaveListsBlock`: Round-robin interleaving of elements from multiple lists - New `ZipListsBlock`: Zips corresponding elements from multiple lists with support for padding to longest or truncating to shortest - New `ListDifferenceBlock`: Computes set difference between two lists (regular or symmetric) - New `ListIntersectionBlock`: Finds common elements between two lists, preserving order ### Helper Utilities Extracted reusable helper functions for validation, flattening, deduplication, interleaving, chunking, and statistics computation to support the blocks and enable future reuse. ### Test Coverage Comprehensive test suite with 188 test functions across 29 test classes covering: - Built-in block test harness validation for all 6 blocks - Manual edge-case tests for each block (empty inputs, large lists, mixed types, nested structures) - Internal method tests for all block classes - Unit tests for all helper utility functions Closes #11139 ## Test plan - [x] All files pass Python syntax validation (`ast.parse`) - [x] Built-in `test_input`/`test_output` tests defined for all blocks - [x] Manual tests cover edge cases: empty lists, large lists, mixed types, nested structures, deduplication, None removal - [x] Helper function tests validate all utility functions independently - [x] All block IDs are valid UUID4 - [x] Block categories set to `BlockCategory.BASIC` for consistency with existing list blocks <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Enhanced `ConcatenateListsBlock` with deduplication and None-filtering options, and added five new list manipulation blocks (`FlattenListBlock`, `InterleaveListsBlock`, `ZipListsBlock`, `ListDifferenceBlock`, `ListIntersectionBlock`) with comprehensive helper functions and test coverage. Key Changes: - Enhanced `ConcatenateListsBlock` with `deduplicate` and `remove_none` options, plus `length` output field - Added `FlattenListBlock` for recursively flattening nested lists with configurable `max_depth` - Added `InterleaveListsBlock` for round-robin element interleaving - Added `ZipListsBlock` with support for padding/truncation - Added `ListDifferenceBlock` and `ListIntersectionBlock` for set operations - Extracted 12 reusable helper functions for validation, flattening, deduplication, etc. - Comprehensive test suite with 188 test functions covering edge cases Minor Issues: - Helper function `_deduplicate_list` has redundant logic in the `else` branch that duplicates the `if` branch - Three helper functions (`_filter_empty_collections`, `_compute_list_statistics`, `_chunk_list`) are defined but unused - consider removing unless planned for future use - The `_make_hashable` function uses `hash(repr(item))` for unhashable types, which correctly treats structurally identical dicts/lists as duplicates </details> <details><summary><h3>Confidence Score: 4/5</h3></summary> - Safe to merge with minor style improvements recommended - The implementation is well-structured with comprehensive test coverage (188 tests), proper error handling, and follows existing block patterns. All blocks use valid UUID4 IDs and correct categories. The helper functions provide good code reuse. The minor issues are purely stylistic (redundant code, unused helpers) and don't affect functionality or safety. - No files require special attention - both files are well-tested and follow project conventions </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant User participant Block as List Block participant Helper as Helper Functions participant Output User->>Block: Input (lists/parameters) Block->>Helper: _validate_all_lists() Helper-->>Block: validation result alt validation fails Block->>Output: error message else validation succeeds Block->>Helper: _concatenate_lists_simple() / _flatten_nested_list() / etc. Helper-->>Block: processed result opt deduplicate enabled Block->>Helper: _deduplicate_list() Helper-->>Block: deduplicated result end opt remove_none enabled Block->>Helper: _filter_none_values() Helper-->>Block: filtered result end Block->>Output: result + length end Output-->>User: Block outputs ``` </details> <sub>Last reviewed commit: a6d5445</sub> <!-- greptile_other_comments_section --> <sub>(2/5) Greptile learns from your feedback when you react with thumbs up/down!</sub> <!-- /greptile_comment --> --------- Co-authored-by: Otto <otto@agpt.co>	2026-02-16 05:39:53 +00:00
Zamil Majdy	27d94e395c	feat(backend/sdk): enable WebSearch, block WebFetch, consolidate tool constants (#12108 ) ## Summary - Enable Claude Agent SDK built-in WebSearch tool (Brave Search via Anthropic API) for the CoPilot SDK agent - Explicitly block WebFetch via `SDK_DISALLOWED_TOOLS`. The agent uses the SSRF-protected `mcp__copilot__web_fetch` MCP tool instead - Consolidate all tool security constants (`BLOCKED_TOOLS`, `WORKSPACE_SCOPED_TOOLS`, `DANGEROUS_PATTERNS`, `SDK_DISALLOWED_TOOLS`) into `tool_adapter.py` as a single source of truth — previously scattered across `tool_adapter.py`, `security_hooks.py`, and inline in `service.py` ## Changes - `tool_adapter.py`: Add `WebSearch` to `_SDK_BUILTIN_TOOLS`, add `SDK_DISALLOWED_TOOLS`, move security constants here - `security_hooks.py`: Import constants from `tool_adapter.py` instead of defining locally - `service.py`: Use `SDK_DISALLOWED_TOOLS` instead of inline `["Bash"]` ## Test plan - [x] All 21 security hooks tests pass - [x] Ruff lint clean - [x] All pre-commit hooks pass - [ ] Verify WebSearch works in CoPilot chat (manual test) <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Consolidates tool security constants into `tool_adapter.py` as single source of truth, enables WebSearch (Brave via Anthropic API), and explicitly blocks WebFetch to prevent SSRF attacks. The change improves security by ensuring the agent uses the SSRF-protected `mcp__copilot__web_fetch` tool instead of the built-in WebFetch which can access internal networks like `localhost:8006`. </details> <details><summary><h3>Confidence Score: 5/5</h3></summary> - This PR is safe to merge with minimal risk - The changes improve security by blocking WebFetch (SSRF risk) while enabling safe WebSearch. The consolidation of constants into a single source of truth improves maintainability. All existing tests pass (21 security hooks tests), and the refactoring is straightforward with no behavioral changes to existing security logic. The only suggestions are minor improvements: adding a test for WebFetch blocking and considering a lowercase alias for consistency. - No files require special attention </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant Agent as SDK Agent participant Hooks as Security Hooks participant TA as tool_adapter.py participant MCP as MCP Tools Note over TA: SDK_DISALLOWED_TOOLS = ["Bash", "WebFetch"] Note over TA: _SDK_BUILTIN_TOOLS includes WebSearch Agent->>Hooks: Request WebSearch (Brave API) Hooks->>TA: Check BLOCKED_TOOLS TA-->>Hooks: Not blocked Hooks-->>Agent: Allowed ✓ Agent->>Agent: Execute via Anthropic API Agent->>Hooks: Request WebFetch (SSRF risk) Hooks->>TA: Check BLOCKED_TOOLS Note over TA: WebFetch in SDK_DISALLOWED_TOOLS TA-->>Hooks: Blocked Hooks-->>Agent: Denied ✗ Note over Agent: Use mcp__copilot__web_fetch instead Agent->>Hooks: Request mcp__copilot__web_fetch Hooks->>MCP: Validate (MCP tool, not SDK builtin) MCP-->>Hooks: Has SSRF protection Hooks-->>Agent: Allowed ✓ Agent->>MCP: Execute with SSRF checks ``` </details> <sub>Last reviewed commit: 2d9975f</sub> <!-- greptile_other_comments_section --> <!-- /greptile_comment -->	2026-02-15 06:51:25 +00:00
DEEVEN SERU	b8f5c208d0	Handle errors in Jina ExtractWebsiteContentBlock (#12048 ) ## Summary - catch Jina reader client/server errors in ExtractWebsiteContentBlock and surface a clear error output keyed to the user URL - guard empty responses to return an explicit error instead of yielding blank content - add regression tests covering the happy path and HTTP client failures via a monkeypatched fetch ## Testing - not run (pytest unavailable in this environment) --------- Co-authored-by: Nicholas Tindle <nicktindle@outlook.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co>	2026-02-13 19:15:09 +00:00
Otto	ca216dfd7f	ci(docs-claude-review): Update comments instead of creating new ones (#12106 ) ## Changes 🏗️ This PR updates the Claude Block Docs Review CI workflow to update existing comments instead of creating new ones on each push. ### What's Changed: 1. Concurrency group - Prevents race conditions if the workflow runs twice simultaneously 2. Comment cleanup step - Deletes any previous Claude review comment before posting a new one 3. Marker instruction - Instructs Claude to include a `<!-- CLAUDE_DOCS_REVIEW -->` marker in its comment for identification ### Why: Previously, every PR push would create a new review comment, cluttering the PR with multiple comments. Now only the most recent review is shown. ### Testing: 1. Create a PR that triggers this workflow (modify a file in `docs/integrations/` or `autogpt_platform/backend/backend/blocks/`) 2. Verify first run creates comment with marker 3. Push another commit 4. Verify old comment is deleted and new comment is created (not accumulated) Requested by @Bentlybro --- ## Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [ ] I have made a test plan - [ ] I have tested my changes according to the test plan (will be tested on merge) #### For configuration changes: - [x] `.env.default` is updated or already compatible with my changes - [x] `docker-compose.yml` is updated or already compatible with my changes - [x] I have included a list of my configuration changes in the PR description (under Changes) <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Added concurrency control and comment deduplication to prevent multiple Claude review comments from accumulating on PRs. The workflow now deletes previous review comments (identified by `<!-- CLAUDE_DOCS_REVIEW -->` marker) before posting new ones, and uses concurrency groups to prevent race conditions. </details> <details><summary><h3>Confidence Score: 5/5</h3></summary> - This PR is safe to merge with minimal risk - The changes are well-contained, follow GitHub Actions best practices, and use built-in GitHub APIs safely. The concurrency control prevents race conditions, and the comment cleanup logic uses proper filtering with `head -1` to handle edge cases. The HTML comment marker approach is standard and reliable. - No files require special attention </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant GH as GitHub PR Event participant WF as Workflow participant API as GitHub API participant Claude as Claude Action GH->>WF: PR opened/synchronized WF->>WF: Check concurrency group Note over WF: Cancel any in-progress runs<br/>for same PR number WF->>API: Query PR comments API-->>WF: Return all comments WF->>WF: Filter for CLAUDE_DOCS_REVIEW marker alt Previous comment exists WF->>API: DELETE comment by ID API-->>WF: Comment deleted else No previous comment WF->>WF: Skip deletion end WF->>Claude: Run code review Claude->>API: POST new comment with marker API-->>Claude: Comment created ``` </details> <sub>Last reviewed commit: fb1b436</sub> <!-- greptile_other_comments_section --> <!-- /greptile_comment -->	2026-02-13 16:46:23 +00:00
Zamil Majdy	f9f358c526	feat(mcp): Add MCP tool block with OAuth, tool discovery, and standard credential integration (#12011 ) ## Summary <img width="1000" alt="image" src="https://github.com/user-attachments/assets/18e8ef34-d222-453c-8b0a-1b25ef8cf806" /> <img width="250" alt="image" src="https://github.com/user-attachments/assets/ba97556c-09c5-4f76-9f4e-49a2e8e57468" /> <img width="250" alt="image" src="https://github.com/user-attachments/assets/68f7804a-fe74-442d-9849-39a229c052cf" /> <img width="250" alt="image" src="https://github.com/user-attachments/assets/700690ba-f9fe-4726-8871-3bfbab586001" /> Full-stack MCP (Model Context Protocol) tool block integration that allows users to connect to any MCP server, discover available tools, authenticate via OAuth, and execute tools — all through the standard AutoGPT credential system. ### Backend - MCPToolBlock (`blocks/mcp/block.py`): New block using `CredentialsMetaInput` pattern with optional credentials (`default={}`), supporting both authenticated (OAuth) and public MCP servers. Includes auto-lookup fallback for backward compatibility. - MCP Client (`blocks/mcp/client.py`): HTTP transport with JSON-RPC 2.0, tool discovery, tool execution with robust error handling (type-checked error fields, non-JSON response handling) - MCP OAuth Handler (`blocks/mcp/oauth.py`): RFC 8414 discovery, dynamic per-server OAuth with PKCE, token storage and refresh via `raise_for_status=True` - MCP API Routes (`api/features/mcp/routes.py`): `discover-tools`, `oauth/login`, `oauth/callback` endpoints with credential cleanup, defensive OAuth metadata validation - Credential system integration: - `CredentialsMetaInput` model_validator normalizes legacy `"ProviderName.MCP"` format from Python 3.13's `str(StrEnum)` change - `CredentialsFieldInfo.combine()` supports URL-based credential discrimination (each MCP server gets its own credential entry) - `aggregate_credentials_inputs` checks block schema defaults for credential optionality - Executor normalizes credential data for both Pydantic and JSON schema validation paths - Chat credential matching handles MCP server URL filtering - `provider_matches()` helper used consistently for Python 3.13 StrEnum compatibility - Pre-run validation: `_validate_graph_get_errors` now calls `get_missing_input()` for custom block-level validation (MCP tool arguments) - Security: HTML tag stripping loop to prevent XSS bypass, SSRF protection (removed trusted_origins) ### Frontend - MCPToolDialog (`MCPToolDialog.tsx`): Full tool discovery UI — enter server URL, authenticate if needed, browse tools, select tool and configure - OAuth popup (`oauth-popup.ts`): Shared utility supporting cross-origin MCP OAuth flows with BroadcastChannel + localStorage fallback - Credential integration: MCP-specific OAuth flow in `useCredentialsInput`, server URL filtering in `useCredentials`, MCP callback page - CredentialsSelect: Auto-selects first available credential instead of defaulting to "None", credentials listed before "None" in dropdown - Node rendering: Dynamic tool input schema rendering on MCP nodes, proper handling in both legacy and new flow editors - Block title persistence: `customized_name` set at block creation for both MCP and Agent blocks — no fallback logic needed, titles survive save/load reliably - Stable credential ordering: Removed `sortByUnsetFirst` that caused credential inputs to jump when selected ### Tests (~2060 lines) - Unit tests: block, client, tool execution - Integration tests: mock MCP server with auth - OAuth flow tests - API endpoint tests - Credential combining/optionality tests - E2e tests (skipped in CI, run manually) ## Key Design Decisions 1. Optional credentials via `default={}`: MCP servers can be public (no auth) or private (OAuth). The `credentials` field has `default={}` making it optional at the schema level, so public servers work without prompting for credentials. 2. URL-based credential discrimination: Each MCP server URL gets its own credential entry in the "Run agent" form (via `discriminator="server_url"`), so agents using multiple MCP servers prompt for each independently. 3. Model-level normalization: Python 3.13 changed `str(StrEnum)` to return `"ClassName.MEMBER"`. Rather than scattering fixes across the codebase, a Pydantic `model_validator(mode="before")` on `CredentialsMetaInput` handles normalization centrally, and `provider_matches()` handles lookups. 4. Credential auto-select: `CredentialsSelect` component defaults to the first available credential and notifies the parent state, ensuring credentials are pre-filled in the "Run agent" dialog without requiring manual selection. 5. customized_name for block titles: Both MCP and Agent blocks set `customized_name` in metadata at creation time. This eliminates convoluted runtime fallback logic (`agent_name`, hostname extraction) — the title is persisted once and read directly. ## Test plan - [x] Unit/integration tests pass (68 MCP + 11 graph = 79 tests) - [x] Manual: MCP block with public server (DeepWiki) — no credentials needed, tools discovered and executable - [x] Manual: MCP block with OAuth server (Linear, Sentry) — OAuth flow prompts correctly - [x] Manual: "Run agent" form shows correct credential requirements per MCP server - [x] Manual: Credential auto-selects when exactly one matches, pre-selects first when multiple exist - [x] Manual: Credential ordering stays stable when selecting/deselecting - [x] Manual: MCP block title persists after save and refresh - [x] Manual: Agent block title persists after save and refresh (via customized_name) - [ ] Manual: Shared agent with MCP block prompts new user for credentials --------- Co-authored-by: Otto <otto@agpt.co> Co-authored-by: Ubbe <hi@ubbe.dev>	2026-02-13 16:17:03 +00:00
abhi1992002	94bd91388f	feat(library): enhance folder and agent management with error handling improvements - Added support for updating the folder ID in the `update_library_agent` function, allowing agents to be moved to different folders. - Implemented cleanup of schedules and webhooks for affected agents during folder deletion, ensuring proper resource management. - Improved error handling in various folder-related API endpoints, standardizing error messages to provide clearer feedback to users. - Updated the `LibraryFolderEditDialog` to handle API errors more effectively, enhancing user experience during folder operations. These changes improve the functionality and reliability of folder and agent management within the library, providing users with a smoother experience when organizing their agents.	2026-02-13 21:23:52 +05:30
Zamil Majdy	52b3aebf71	feat(backend/sdk): Claude Agent SDK integration for CoPilot (#12103 ) ## Summary Full integration of the Claude Agent SDK to replace the existing one-turn OpenAI-compatible CoPilot implementation with a multi-turn, tool-using AI agent. ### What changed Core SDK Integration (`chat/sdk/` — new module) - `service.py`: Main orchestrator — spawns Claude Code CLI as a subprocess per user message, streams responses back via SSE. Handles conversation history compression, session lifecycle, and error recovery. - `response_adapter.py`: Translates Claude Agent SDK events (text deltas, tool use, errors, result messages) into the existing CoPilot `StreamEvent` protocol so the frontend works unchanged. - `tool_adapter.py`: Bridges CoPilot's MCP tools (find_block, run_block, create_agent, etc.) into the SDK's tool format. Handles schema conversion and result serialization. - `security_hooks.py`: Pre/Post tool-use hooks that enforce a strict allowlist of tools, block path traversal, sandbox file operations to per-session workspace directories, cap sub-agent spawning, and prevent the model from accessing unauthorized system resources. - `transcript.py`: JSONL transcript I/O utilities for the stateless `--resume` feature (see below). Stateless Multi-Turn Resume (new) - Instead of compressing conversation history via LLM on every turn (lossy and expensive), we capture Claude Code's native JSONL session transcript via a Stop hook callback, persist it in the DB (`ChatSession.sdkTranscript`), and restore it on the next turn via `--resume <file>`. - This preserves full tool call/result context across turns with zero token overhead for history. - Feature-flagged via `CLAUDE_AGENT_USE_RESUME` (default: off). - DB migration: `ALTER TABLE "ChatSession" ADD COLUMN "sdkTranscript" TEXT`. Sandboxed Tool Execution (`chat/tools/`) - `bash_exec.py`: Sandboxed bash execution using bubblewrap (`bwrap`) with read-only root filesystem, per-session writable workspace, resource limits (CPU, memory, file size), and network isolation. - `sandbox.py`: Shared bubblewrap sandbox infrastructure — generates `bwrap` command lines with configurable mounts, environment, and resource constraints. - `web_fetch.py`: URL fetching tool with domain allowlist, size limits, and content-type filtering. - `check_operation_status.py`: Polling tool for long-running operations (agent creation, block execution) so the SDK doesn't block waiting. - `find_block.py` / `run_block.py`: Enhanced with category filtering, optimized response size (removed raw JSON schemas), and better error handling. Security - Path traversal prevention: session IDs sanitized, all file ops confined to workspace dirs, symlink resolution. - Tool allowlist enforcement via SDK hooks — model cannot call arbitrary tools. - Built-in `Bash` tool blocked via `disallowed_tools` to prevent bypassing sandboxed `bash_exec`. - Sub-agent (`Task`) spawning capped at configurable limit (default: 10). - CodeQL-clean path sanitization patterns. Streaming & Reconnection - SSE stream registry backed by Redis Streams for crash-resilient reconnection. - Long-running operation tracking with TTL-based cleanup. - Atomic message append to prevent race conditions on concurrent writes. Configuration (`config.py`) - `use_claude_agent_sdk` — master toggle (default: on) - `claude_agent_model` — model override for SDK path - `claude_agent_max_buffer_size` — JSON parsing buffer (10MB) - `claude_agent_max_subtasks` — sub-agent cap (10) - `claude_agent_use_resume` — transcript-based resume (default: off) - `thinking_enabled` — extended thinking for Claude models Tests - `sdk/response_adapter_test.py` — 366 lines covering all event translation paths - `sdk/security_hooks_test.py` — 165 lines covering tool blocking, path traversal, subtask limits - `chat/model_test.py` — 214 lines covering session model serialization - `chat/service_test.py` — Integration tests including multi-turn resume keyword recall - `tools/find_block_test.py` / `run_block_test.py` — Extended with new tool behavior tests ## Test plan - [x] Unit tests pass (`sdk/response_adapter_test.py`, `security_hooks_test.py`, `model_test.py`) - [x] Integration test: multi-turn keyword recall via `--resume` (`service_test.py::test_sdk_resume_multi_turn`) - [x] Manual E2E: CoPilot chat sessions with tool calls, bash execution, and multi-turn context - [x] Pre-commit hooks pass (ruff, isort, black, pyright, flake8) - [ ] Staging deployment with `claude_agent_use_resume=false` initially - [ ] Enable resume in staging, verify transcript capture and recall <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> This PR replaces the existing OpenAI-compatible CoPilot with a full Claude Agent SDK integration, introducing multi-turn conversations, stateless resume via JSONL transcripts, and sandboxed tool execution. Key changes: - SDK integration (`chat/sdk/`): spawns Claude Code CLI subprocess per message, translates events to frontend protocol, bridges MCP tools - Stateless resume: captures JSONL transcripts via Stop hook, persists in `ChatSession.sdkTranscript`, restores with `--resume` (feature-flagged, default off) - Sandboxed execution: bubblewrap sandbox for bash commands with filesystem whitelist, network isolation, resource limits - Security hooks: tool allowlist enforcement, path traversal prevention, workspace-scoped file operations, sub-agent spawn limits - Long-running operations: delegates `create_agent`/`edit_agent` to existing stream_registry infrastructure for SSE reconnection - Feature flag: `CHAT_USE_CLAUDE_AGENT_SDK` with LaunchDarkly support, defaults to enabled Security issues found: - Path traversal validation has logic errors in `security_hooks.py:82` (tilde expansion order) and `service.py:266` (redundant `..` check) - Config validator always prefers env var over explicit `False` value (`config.py:162`) - Race condition in `routes.py:323` — message persisted before task registration, could duplicate on retry - Resource limits in sandbox may fail silently (`sandbox.py:109`) Test coverage is strong with 366 lines for response adapter, 165 for security hooks, and integration tests for multi-turn resume. </details> <details><summary><h3>Confidence Score: 3/5</h3></summary> - This PR is generally safe but has critical security issues in path validation that must be fixed before merge - Score reflects strong architecture and test coverage offset by real security vulnerabilities: the tilde expansion bug in `security_hooks.py` could allow sandbox escape, the race condition could cause message duplication, and the silent ulimit failures could bypass resource limits. The bubblewrap sandbox and allowlist enforcement are well-designed, but the path validation bugs need fixing. The transcript resume feature is properly feature-flagged. Overall the implementation is solid but the security issues prevent a higher score. - Pay close attention to `backend/api/features/chat/sdk/security_hooks.py` (path traversal vulnerability), `backend/api/features/chat/routes.py` (race condition), `backend/api/features/chat/tools/sandbox.py` (silent resource limit failures), and `backend/api/features/chat/sdk/service.py` (redundant security check) </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant Frontend participant Routes as routes.py participant SDKService as sdk/service.py participant ClaudeSDK as Claude Agent SDK CLI participant SecurityHooks as security_hooks.py participant ToolAdapter as tool_adapter.py participant CoPilotTools as tools/* participant Sandbox as sandbox.py (bwrap) participant DB as Database participant Redis as stream_registry Frontend->>Routes: POST /chat (user message) Routes->>SDKService: stream_chat_completion_sdk() SDKService->>DB: get_chat_session() DB-->>SDKService: session + messages alt Resume enabled AND transcript exists SDKService->>SDKService: validate_transcript() SDKService->>SDKService: write_transcript_to_tempfile() Note over SDKService: Pass --resume to SDK else No resume SDKService->>SDKService: _compress_conversation_history() Note over SDKService: Inject history into user message end SDKService->>SecurityHooks: create_security_hooks() SDKService->>ToolAdapter: create_copilot_mcp_server() SDKService->>ClaudeSDK: spawn subprocess with MCP server loop Streaming Conversation ClaudeSDK->>SDKService: AssistantMessage (text/tool_use) SDKService->>Frontend: StreamTextDelta / StreamToolInputAvailable alt Tool Call ClaudeSDK->>SecurityHooks: PreToolUse hook SecurityHooks->>SecurityHooks: validate path, check allowlist alt Tool blocked SecurityHooks-->>ClaudeSDK: deny else Tool allowed SecurityHooks-->>ClaudeSDK: allow ClaudeSDK->>ToolAdapter: call MCP tool alt Long-running tool (create_agent, edit_agent) ToolAdapter->>Redis: register task ToolAdapter->>DB: save OperationPendingResponse ToolAdapter->>ToolAdapter: spawn background task ToolAdapter-->>ClaudeSDK: OperationStartedResponse else Regular tool (find_block, bash_exec) ToolAdapter->>CoPilotTools: execute() alt bash_exec CoPilotTools->>Sandbox: run_sandboxed() Sandbox->>Sandbox: build bwrap command Note over Sandbox: Network isolation,<br/>filesystem whitelist,<br/>resource limits Sandbox-->>CoPilotTools: stdout, stderr, exit_code end CoPilotTools-->>ToolAdapter: result ToolAdapter->>ToolAdapter: stash full output ToolAdapter-->>ClaudeSDK: MCP response end SecurityHooks->>SecurityHooks: PostToolUse hook (log) end end ClaudeSDK->>SDKService: UserMessage (ToolResultBlock) SDKService->>ToolAdapter: pop_pending_tool_output() SDKService->>Frontend: StreamToolOutputAvailable end ClaudeSDK->>SecurityHooks: Stop hook SecurityHooks->>SDKService: transcript_path callback SDKService->>SDKService: read_transcript_file() SDKService->>DB: save transcript to session.sdkTranscript ClaudeSDK->>SDKService: ResultMessage (success) SDKService->>Frontend: StreamFinish SDKService->>DB: upsert_chat_session() ``` </details> <sub>Last reviewed commit: 28c1121</sub> <!-- greptile_other_comments_section --> <!-- /greptile_comment --> --------- Co-authored-by: Swifty <craigswift13@gmail.com>	2026-02-13 15:49:03 +00:00
Otto	965b7d3e04	dx: Add PR overlap detection & alert (#12104 ) ## Summary Adds an automated workflow that detects potential merge conflicts between open PRs, helping contributors coordinate proactively. Example output: [See comment on PR #12057](https://github.com/Significant-Gravitas/AutoGPT/pull/12057#issuecomment-3897330632) ## How it works 1. Triggered on PR events — runs when a PR is opened, pushed to, or reopened 2. Compares against all open PRs targeting the same base branch 3. Detects overlaps at multiple levels: - File overlap (same files modified) - Line overlap (same line ranges modified) - Actual merge conflicts (attempts real merges) 4. Posts a comment on the PR with findings ## Features - Full file paths with common prefix extraction for readability - Conflict size (number of conflict regions + lines affected) - Conflict types (content, added, deleted, modified/deleted, etc.) - Last-updated timestamps for each PR - Risk categorization (conflict, medium, low) - Ignores noise files (openapi.json, lock files) - Updates existing comment on subsequent pushes (no spam) - Filters out PRs older than 14 days - Clone-once optimization for fast merge testing (~48s for 19 PRs) ## Files - `.github/scripts/detect_overlaps.py` — main detection script - `.github/workflows/pr-overlap-check.yml` — workflow definition	2026-02-13 15:45:10 +00:00
abhi1992002	d7d571f1be	feat(library): enhance folder color handling and animation in Library components - Exported `resolveColor` function from `FolderIcon` to improve color resolution for folders. - Introduced `folderCardStyles` to define background and border styles for each folder color. - Updated `LibraryFolder` component to utilize the new color resolution and styling, enhancing visual consistency. - Refined animation handling in `LibraryAgentList` by removing unnecessary properties for smoother transitions. These changes improve the visual representation of folders and enhance the overall user experience in the library interface.	2026-02-13 21:08:16 +05:30
abhi1992002	784c025938	feat(library): refine folder filtering and enhance animation handling in LibraryAgentList - Updated `list_library_agents` function to improve folder filtering logic, ensuring it only applies when not searching. - Enhanced animation handling in `LibraryAgentList` by implementing explicit initial and animate states for items, improving visibility during dynamic updates. - Adjusted transition timings for smoother animations, particularly when items are added or removed. These changes enhance the user experience by providing clearer folder management and more responsive animations in the library interface.	2026-02-13 20:55:44 +05:30

1 2 3 4 5 ...

7956 Commits