AutoGPT

mirror of https://github.com/Significant-Gravitas/AutoGPT.git synced 2026-02-13 08:14:58 -05:00

Author	SHA1	Message	Date
Zamil Majdy	52c8a25531	fix(chat/sdk): fix transcript validation and type captured_transcript properly - Replace dict[str,str] with CapturedTranscript dataclass for type safety - Fix validate_transcript requiring >=3 lines — after stripping metadata, a valid 1-turn conversation is just user+assistant (2 lines) - Apply CodeQL autofix: internalize max_len in _sanitize_id, add fallback	2026-02-13 16:32:06 +04:00
Zamil Majdy	d0f0c32e70	fix(chat/sdk): validate cwd against sandbox prefix to fix CodeQL alert CodeQL traces session_id → cwd → os.makedirs/open as uncontrolled path. Add realpath + startswith check against /tmp/copilot- prefix directly in write_transcript_to_tempfile so CodeQL recognizes the sanitization. Also resolve the prefix with realpath for macOS where /tmp → /private/tmp.	2026-02-13 15:49:30 +04:00
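The containment check described in d0f0c32e70 could look roughly like the sketch below. Only `realpath`, the `startswith` comparison, and the `/tmp/copilot-` prefix come from the commit message; the function name `is_inside_sandbox` and the constant `SANDBOX_PREFIX` are hypothetical.

```python
import os

# Prefix taken from the commit message; the constant name is an assumption.
SANDBOX_PREFIX = "/tmp/copilot-"


def is_inside_sandbox(cwd: str, prefix: str = SANDBOX_PREFIX) -> bool:
    """Return True if cwd, with symlinks and '..' resolved, starts with the sandbox prefix."""
    # Resolve the prefix too, so the check still works where /tmp is a symlink
    # (for example /tmp -> /private/tmp on macOS).
    resolved_prefix = os.path.realpath(prefix)
    resolved_cwd = os.path.realpath(cwd)
    return resolved_cwd.startswith(resolved_prefix)
```

Because both sides go through `realpath`, a path such as `/tmp/copilot-abc/../../etc` resolves to `/etc` before the comparison and is rejected.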
Zamil Majdy	8dfd0a77a0	fix(chat/sdk): sanitize IDs in transcript paths to fix CodeQL alert Add _sanitize_id() that strips non-hex characters from session/user IDs before using them in file paths. Also add realpath containment check in write_transcript_to_tempfile as defence-in-depth.	2026-02-13 15:44:29 +04:00
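A minimal sketch of the ID sanitization described in 8dfd0a77a0, assuming it is a plain character filter. The commit message only says that non-hex characters are stripped; the name `sanitize_id`, the `max_len` default, and the `fallback` value are hypothetical, and this sketch also drops hyphens, which the real helper may or may not do.

```python
import re

# Anything that is not a hexadecimal digit is removed.
_NON_HEX = re.compile(r"[^0-9a-fA-F]")


def sanitize_id(raw_id: str, max_len: int = 64, fallback: str = "unknown") -> str:
    """Strip non-hex characters so the ID is safe to use as a path component."""
    cleaned = _NON_HEX.sub("", raw_id)[:max_len]
    # An ID with no hex characters at all would otherwise become an empty path segment.
    return cleaned or fallback
```

With this filter, path separators and dots can never reach the file system layer, so traversal sequences such as `../` are neutralized before any `realpath` check runs.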
Zamil Majdy	4bfd6c8870	fix(chat/sdk): address additional PR review feedback - transcript: compare bytes-to-bytes in size guard (not str vs bytes) - service: move user message preview from INFO to DEBUG level (PII)	2026-02-13 15:41:45 +04:00
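The bytes-to-bytes comparison mentioned in 4bfd6c8870 matters because `len()` of a `str` counts code points, not stored bytes. A sketch of such a size guard, assuming the new transcript arrives as a string and the stored one as bytes (the function name `should_upload` and its signature are hypothetical):

```python
from typing import Optional


def should_upload(new_transcript: str, existing: Optional[bytes]) -> bool:
    """Allow the upload unless it would overwrite a larger stored transcript."""
    if existing is None:
        return True
    # Encode first so both sides are measured in bytes.
    new_size = len(new_transcript.encode("utf-8"))
    return new_size >= len(existing)
```

For example, `"é"` is one character but two UTF-8 bytes, so comparing `len("é")` against a two-byte stored object would wrongly treat the new transcript as smaller.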
Zamil Majdy	1918828405	fix(chat/sdk): flatten transcript storage path, remove duplicate session_id Before: chat-transcripts/{user_id}/{session_id}/{session_id}.jsonl After: chat-transcripts/{user_id}/{session_id}.jsonl	2026-02-13 15:39:43 +04:00
Zamil Majdy	9c855b501b	fix(chat/sdk): address PR review feedback on security and robustness - security_hooks: use realpath instead of normpath to resolve symlinks - security_hooks: check tool-results as path segment, not substring - response_adapter: emit StreamFinish for unknown ResultMessage subtypes - tool_adapter: delete file after read (prevent accumulation in pods) - check_operation_status: guard against None.strip() from LLM null args - service: remove redundant ".." check (realpath already resolves)	2026-02-13 15:37:22 +04:00
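The difference between a substring check and a path-segment check for `tool-results` (9c855b501b) can be illustrated as follows. This is a sketch under the assumption of POSIX-style paths; the helper name `has_path_segment` is hypothetical.

```python
from pathlib import PurePosixPath


def has_path_segment(path: str, segment: str = "tool-results") -> bool:
    """Return True only if `segment` is a whole component of `path`."""
    # .parts splits on separators, so "my-tool-results-backup" does not match.
    return segment in PurePosixPath(path).parts
```

A substring test (`"tool-results" in path`) would also accept directory names that merely contain the text, which is the loophole the segment check closes.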
Zamil Majdy	5c9d0577c0	poetry lock	2026-02-13 15:34:01 +04:00
Zamil Majdy	a79bd88e7c	feat(chat/sdk): move transcript storage from DB column to bucket Replace the sdkTranscript TEXT column with WorkspaceStorageBackend (GCS/local) for persisting Claude Code JSONL transcripts. This removes the implicit 512KB cap that caused --resume to degrade after a few tool-heavy turns (JSONL is append-only and never shrinks). Key changes: - Strip progress/metadata entries before storing (~30% size reduction) with parentUuid reparenting for orphaned children - Upload in background (asyncio.create_task) to avoid blocking SSE - Size-based conflict guard: never overwrite a larger (newer) transcript - Validate stripped content before upload - Log warning when falling back to compression approach - Enable claude_agent_use_resume by default - Remove sdkTranscript column from schema, model, and DB layer - Storage path: chat-transcripts/{user_id}/{session_id}/{session_id}.jsonl	2026-02-13 15:26:53 +04:00
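Stripping entries from a transcript while reparenting orphaned children, as a79bd88e7c describes, might be sketched like this. Only `parentUuid` is named in the commit message; the `type` and `uuid` fields and the set of stripped types are assumptions made for illustration, not the actual Claude Code transcript schema.

```python
import json

# Hypothetical set of entry types to drop before storage.
STRIPPED_TYPES = {"progress", "metadata"}


def strip_entries(jsonl: str, stripped_types=STRIPPED_TYPES) -> str:
    """Drop entries of the given types and re-link children to the nearest kept ancestor."""
    entries = [json.loads(line) for line in jsonl.splitlines() if line.strip()]

    # Remember each removed entry's own parent so children can skip over it.
    removed_parent = {
        e["uuid"]: e.get("parentUuid")
        for e in entries
        if e.get("type") in stripped_types and "uuid" in e
    }

    kept = []
    for entry in entries:
        if entry.get("type") in stripped_types:
            continue
        parent = entry.get("parentUuid")
        seen = set()
        # Walk up through removed ancestors; `seen` guards against cycles.
        while parent in removed_parent and parent not in seen:
            seen.add(parent)
            parent = removed_parent[parent]
        if "parentUuid" in entry:
            entry = {**entry, "parentUuid": parent}
        kept.append(entry)

    return "\n".join(json.dumps(e) for e in kept)
```

Without the reparenting step, an assistant entry whose parent was a stripped progress entry would point at a UUID that no longer exists in the stored file.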
Zamil Majdy	28c1121a8f	fix(chat/sdk): block built-in Bash via disallowed_tools and resolve merge conflicts - Add disallowed_tools=["Bash"] to SDK options so the model never tries the built-in Bash tool (previously it tried Bash, got blocked by the security hook, then fell back to bash_exec — wasting a turn) - Resolve merge conflicts in tools/models.py (keep both HEAD additions and incoming BlockDetails/BlockDetailsResponse) - Fix pyright error in find_block.py (pass categories to BlockInfoSummary)	2026-02-13 14:44:42 +04:00
Zamil Majdy	cb3839198c	conflict resolve	2026-02-13 14:35:41 +04:00
Zamil Majdy	80804986b0	Merge branch 'dev' of github.com:Significant-Gravitas/AutoGPT into feat/copilot-claude-code-continue-session	2026-02-13 14:30:09 +04:00
Reinier van der Leer	43b25b5e2f	ci(frontend): Speed up E2E test job (#12090) The frontend `e2e_test` doesn't have a working build cache setup, causing really slow builds = slow test jobs. These changes reduce total test runtime from ~12 minutes to ~5 minutes. ### Changes 🏗️ - Inject build cache config into docker compose config; let `buildx bake` use GHA cache directly - Add `docker-ci-fix-compose-build-cache.py` script - Optimize `backend/Dockerfile` + root `.dockerignore` - Replace broken DIY pnpm store caching with `actions/setup-node` built-in cache management - Add caching for test seed data created in DB ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - CI	2026-02-13 11:09:41 +01:00
Swifty	ab0b537cc7	refactor(backend): optimize find_block response size by removing raw JSON schemas (#12020) ### Changes 🏗️ The `find_block` AutoPilot tool was returning ~90K characters per response (10 blocks). The bloat came from including full JSON Schema objects (`input_schema`, `output_schema`) with all nested `$defs`, `anyOf`, and type definitions for every block. What changed: - `BlockInfoSummary` model: Removed `input_schema` (raw JSON Schema), `output_schema` (raw JSON Schema), and `categories`. Added `output_fields` (compact field-level summaries matching the existing `required_inputs` format). - `BlockListResponse` model: Removed `usage_hint` (info now in `message`). - `FindBlockTool._execute()`: Now extracts compact `output_fields` from output schema properties instead of including the entire raw schema. Credentials handling is unchanged. - Test: Added `test_response_size_average_chars_per_block` with realistic block schemas (HTTP, Email, Claude Code) to measure and assert response size stays under 2K chars/block. - `CLAUDE.md`: Clarified `dev` vs `master` branching strategy. Result: Average response size reduced from ~9,000 to ~1,300 chars per block (~85% reduction). This directly reduces LLM token consumption, latency, and API costs for AutoPilot interactions. ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Verified models import and serialize correctly - [x] Verified response size: 3,970 chars for 3 realistic blocks (avg 1,323/block) - [x] Lint (`ruff check`) and type check (`pyright`) pass on changed files - [x] Frontend compatibility preserved: `blocks[].name` and `count` fields retained for `block_list` handler --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Toran Bruce Richards <toran.richards@gmail.com>	2026-02-13 11:08:51 +01:00
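The idea behind ab0b537cc7, replacing a raw JSON Schema with compact field-level summaries, can be sketched as below. The exact shape of `output_fields` and `required_inputs` is not shown in the commit message, so the `name`/`type`/`description` keys and the function name `compact_fields` are assumptions.

```python
def compact_fields(schema: dict) -> list:
    """Reduce a JSON Schema object to one small summary per top-level property."""
    fields = []
    for name, spec in schema.get("properties", {}).items():
        fields.append(
            {
                "name": name,
                # Nested details such as $defs and anyOf are intentionally discarded.
                "type": spec.get("type", "any"),
                "description": spec.get("description", ""),
            }
        )
    return fields
```

The summary keeps what a model needs to wire blocks together (field names and basic types) while dropping the nested definitions that dominated the response size.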
Zamil Majdy	b915e67a9b	feat(chat/sdk): add stateless multi-turn resume via JSONL transcripts Capture Claude Code CLI session transcripts via the Stop hook and persist them in the DB. On subsequent turns, write the transcript to a temp file and pass --resume so the CLI restores full conversation context without lossy history compression. Key changes: - transcript.py: read/write/validate JSONL transcript utilities - security_hooks: register Stop hook to capture transcript_path - service.py: resume strategy with fallback to compression - schema.prisma: add sdkTranscript column to ChatSession - Feature flag: CLAUDE_AGENT_USE_RESUME (default off)	2026-02-13 13:48:04 +04:00
dependabot[bot]	9a8c6ad609	chore(libs/deps): bump the production-dependencies group across 1 directory with 4 updates (#12056) Bumps the production-dependencies group with 4 updates in the /autogpt_platform/autogpt_libs directory: [cryptography](https://github.com/pyca/cryptography), [fastapi](https://github.com/fastapi/fastapi), [launchdarkly-server-sdk](https://github.com/launchdarkly/python-server-sdk) and [supabase](https://github.com/supabase/supabase-py).

Updates `cryptography` from 46.0.4 to 46.0.5

Changelog (sourced from [cryptography's changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst)), 46.0.5 - 2026-02-10:
- An attacker could create a malicious public key that reveals portions of your private key when using certain uncommon elliptic curves (binary curves). This version now includes additional security checks to prevent this attack. This issue only affects binary elliptic curves, which are rarely used in real-world applications. Credit to XlabAI Team of Tencent Xuanwu Lab and Atuin Automated Vulnerability Discovery Engine for reporting the issue. CVE-2026-26007
- Support for `SECT` binary elliptic curves is deprecated and will be removed in the next release.

Commits:
- `06e120e` bump version for 46.0.5 release (#14289)
- `0eebb9d` EC check key on cofactor > 1 (#14287)
- `bedf6e1` fix openssl version on 46 branch (#14220)
- See full diff in [compare view](https://github.com/pyca/cryptography/compare/46.0.4...46.0.5)

Updates `fastapi` from 0.128.0 to 0.128.7

Release notes (sourced from [fastapi's releases](https://github.com/fastapi/fastapi/releases)):

0.128.7
- Features:
  - ✨ Show a clear error on attempt to include router into itself. PR #14258 by @JavierSanchezCastro.
  - ✨ Replace `dict` by `Mapping` on `HTTPException.headers`. PR #12997 by @rijenkii.
- Refactors:
  - ♻️ Simplify reading files in memory, do it sequentially instead of (fake) parallel. PR #14884 by @tiangolo.
- Docs:
  - 📝 Use `dfn` tag for definitions instead of `abbr` in docs. PR #14744 by @YuriiMotov.
- Internal:
  - ✅ Tweak comment in test to reference PR. PR #14885 by @tiangolo.
  - 🔧 Update LLM-prompt for `abbr` and `dfn` tags. PR #14747 by @YuriiMotov.
  - ✅ Test order for the submitted byte Files. PR #14828 by @valentinDruzhinin.
  - 🔧 Configure `test` workflow to run tests with `inline-snapshot=review`. PR #14876 by @YuriiMotov.

0.128.6
- Fixes:
  - 🐛 Fix `on_startup` and `on_shutdown` parameters of `APIRouter`. PR #14873 by @YuriiMotov.
- Translations:
  - 🌐 Update translations for zh (update-outdated). PR #14843 by @tiangolo.
- Internal:
  - ✅ Fix parameterized tests with snapshots. PR #14875 by @YuriiMotov.

0.128.5
- Refactors:
  - ♻️ Refactor and simplify Pydantic v2 (and v1) compatibility internal utils. PR #14862 by @tiangolo.
- Internal:
  - ✅ Add inline snapshot tests for OpenAPI before changes from Pydantic v2. PR #14864 by @tiangolo.

0.128.4
- Refactors:
  - ♻️ Refactor internals, simplify Pydantic v2/v1 utils, `create_model_field`, better types for `lenient_issubclass`. PR #14860 by @tiangolo.
  - ♻️ Simplify internals, remove Pydantic v1 only logic, no longer needed. PR #14857 by @tiangolo.
  - ♻️ Refactor internals, cleanup unneeded Pydantic v1 specific logic. PR #14856 by @tiangolo.

... (truncated)

Commits:
- `8f82c94` 🔖 Release version 0.128.7
- `5bb3423` 📝 Update release notes
- `6ce5e3e` ✅ Tweak comment in test to reference PR (#14885)
- `65da3dd` 📝 Update release notes
- `81f82fd` 🔧 Update LLM-prompt for `abbr` and `dfn` tags (#14747)
- `ff72101` 📝 Update release notes
- `ca76a4e` 📝 Use `dfn` tag for definitions instead of `abbr` in docs (#14744)
- `1133a45` 📝 Update release notes
- `38f9659` ✅ Test order for the submitted byte Files (#14828)
- `3f1cc8f` 📝 Update release notes
- Additional commits viewable in [compare view](https://github.com/fastapi/fastapi/compare/0.128.0...0.128.7)

Updates `launchdarkly-server-sdk` from 9.14.1 to 9.15.0

Changelog (sourced from [launchdarkly-server-sdk's changelog](https://github.com/launchdarkly/python-server-sdk/blob/main/CHANGELOG.md) and [releases](https://github.com/launchdarkly/python-server-sdk/releases)), [9.15.0](https://github.com/launchdarkly/python-server-sdk/compare/9.14.1...9.15.0) (2026-02-10):

- ⚠ BREAKING CHANGES. Note: The following breaking changes apply only to FDv2 (Flag Delivery v2) early access features, which are not subject to semantic versioning and may change without a major version bump.
  - Update ChangeSet to always require a Selector (#405) (5dc4f81). The `ChangeSetBuilder.finish()` method now requires a `Selector` parameter.
  - Update DataSystemConfig to accept list of synchronizers (#404) (c73ad14). The `DataSystemConfig.synchronizers` field now accepts a list of synchronizers, and the `ConfigBuilder.synchronizers()` method accepts variadic arguments.
- Features:
  - Drop support for python 3.9 (#393) (5b761bd)
- Bug Fixes:
  - Add context manager for clearer, safer locks (#396) (beca0fa)
  - Address potential race condition in FeatureStore update_availability (#391) (31cf487)
  - Allow modifying fdv2 data source options independent of main config (#403) (d78079e)
  - Mark copy_with_new_sdk_key method as deprecated (#353) (e471ccc)
  - Prevent immediate polling on recoverable error (#399) (da565a2)
  - Redis store is considered initialized when `$inited` key is written (e99a27d)
  - Stop FeatureStoreClientWrapper poller on close (#397) (468afdf)
  - Update reason documentation with inExperiment value (#401) (cbfc3dd)
  - Update Redis to write missing `$inited` key (e99a27d)

This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please).

Commits:
- `e542f73` chore(main): release 9.15.0 (#394)
- `e471ccc` fix: Mark copy_with_new_sdk_key method as deprecated (#353)
- `5dc4f81` feat: Update ChangeSet to always require a Selector (#405)
- `f20fffe` chore: Remove dead code, clarify names, other cleanup (#398)
- `c73ad14` fix: Update DataSystemConfig to accept list of synchronizers (#404)
- `d78079e` fix: Allow modifying fdv2 data source options independent of main config (#403)
- `e99a27d` chore: Support persistent data store verification in contract tests (#402)
- `cbfc3dd` fix: Update reason documentation with inExperiment value (#401)
- `5a1adbb` chore: Update sdk_metadata features (#400)
- `da565a2` fix: Prevent immediate polling on recoverable error (#399)
- Additional commits viewable in [compare view](https://github.com/launchdarkly/python-server-sdk/compare/9.14.1...9.15.0)

Updates `supabase` from 2.27.2 to 2.28.0

Release notes (sourced from [supabase's releases](https://github.com/supabase/supabase-py/releases) and [changelog](https://github.com/supabase/supabase-py/blob/main/CHANGELOG.md)):

[2.28.0](https://github.com/supabase/supabase-py/compare/v2.27.3...v2.28.0) (2026-02-10)
- Features:
  - storage: add list_v2 method to file_api client (#1377) (259f4ad)
- Bug Fixes:
  - auth: add missing is_sso_user, deleted_at, banned_until to User model (#1375) (7f84a62)
  - realtime: ensure remove_channel removes channel from channels dict (#1373) (0923314)
  - realtime: use pop with default in _handle_message to prevent KeyError (#1388) (baea26f)
  - storage3: replace print() with warnings.warn() for trailing slash notice (#1380) (50b099f)

[2.27.3](https://github.com/supabase/supabase-py/compare/v2.27.2...v2.27.3) (2026-02-03)
- Bug Fixes:
  - deprecate python 3.9 in all packages (#1365) (cc72ed7)
  - ensure storage_url has trailing slash to prevent warning (#1367) (4267ff1)

Commits:
- `59e3384` chore(main): release 2.28.0 (#1378)
- `baea26f` fix(realtime): use pop with default in _handle_message to prevent KeyError (#...
- `259f4ad` feat(storage): add list_v2 method to file_api client (#1377)
- `50b099f` fix(storage3): replace print() with warnings.warn() for trailing slash notice...
- `0923314` fix(realtime): ensure remove_channel removes channel from channels dict (#1373)
- `7f84a62` fix(auth): add missing is_sso_user, deleted_at, banned_until to User model (#...
- `57dd6e2` chore(deps): bump the uv group across 1 directory with 3 updates (#1369)
- `c357def` chore(main): release 2.27.3 (#1368)
- `4267ff1` fix: ensure storage_url has trailing slash to prevent warning (#1367)
- `cc72ed7` fix: deprecate python 3.9 in all packages (#1365)
- Additional commits viewable in [compare view](https://github.com/supabase/supabase-py/compare/v2.27.2...v2.28.0)

Greptile Overview

Greptile Summary: Dependency update bumps 4 packages in the production-dependencies group, including a critical security patch for `cryptography` (CVE-2026-26007) that prevents malicious public key attacks on binary elliptic curves. The update also includes bug fixes for `fastapi`, `launchdarkly-server-sdk`, and `supabase`.
- cryptography 46.0.4 → 46.0.5: patches CVE-2026-26007, deprecates SECT* binary curves
- fastapi 0.128.0 → 0.128.7: bug fixes, improved error handling, relaxed Starlette constraint
- launchdarkly-server-sdk 9.14.1 → 9.15.0: drops Python 3.9 support (requires >=3.10), fixes race conditions
- supabase 2.27.2/2.27.3 → 2.28.0: realtime fixes, new User model fields

The lock files correctly resolve all dependencies. Python 3.10+ requirement is already enforced in both packages. However, backend's `pyproject.toml` still specifies `launchdarkly-server-sdk = "^9.14.1"` while the lock file uses 9.15.0 (pulled from autogpt_libs dependency), creating a minor version constraint inconsistency.

Confidence Score: 4/5
- This PR is safe to merge with one minor style suggestion
- Automated dependency update with critical security patch for cryptography. All updates are backwards-compatible within semver constraints. Lock files correctly resolve all dependencies. Python 3.10+ is already enforced. Only minor issue is version constraint inconsistency in backend's pyproject.toml for launchdarkly-server-sdk, which doesn't affect functionality but should be aligned for clarity.
- autogpt_platform/backend/pyproject.toml needs launchdarkly-server-sdk version constraint updated to ^9.15.0

--------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Otto <otto@agpt.co>	2026-02-13 09:10:11 +00:00
Zamil Majdy	32e9dda30d	fix(chat/sdk): resolve relative paths in security hooks and unify workspace access The security hook's path validation blocked SDK Read/Write tools because it didn't resolve relative paths against sdk_cwd. Since the SDK sets cwd, Claude naturally uses relative paths like "test.txt" which failed the absolute path prefix check. Now relative paths are joined with sdk_cwd before validation, and denial messages include the allowed workspace path. Also clarifies the workspace model: SDK Read/Write + bash_exec share the same ephemeral session directory, while workspace_file tools provide persistent cloud storage across sessions.	2026-02-13 10:40:41 +04:00
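The relative-path fix described in `32e9dda30d` can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the function names `resolve_tool_path` and `is_path_allowed` and the wording of the denial message are assumptions, and only `sdk_cwd` comes from the commit message.

```python
import os


def resolve_tool_path(path: str, sdk_cwd: str) -> str:
    """Join relative paths with sdk_cwd, then resolve symlinks and '..' segments."""
    if not os.path.isabs(path):
        path = os.path.join(sdk_cwd, path)
    return os.path.realpath(path)


def is_path_allowed(path: str, sdk_cwd: str) -> tuple[bool, str]:
    """Return (allowed, denial_message); the message names the allowed workspace."""
    root = os.path.realpath(sdk_cwd)
    resolved = resolve_tool_path(path, sdk_cwd)
    # Compare against root + separator so "/tmp/ws-evil" does not pass for "/tmp/ws".
    if resolved == root or resolved.startswith(root + os.sep):
        return True, ""
    return False, f"Access denied: {path!r} is outside the allowed workspace {root}"
```

With this shape, a bare `test.txt` is validated as `<sdk_cwd>/test.txt`, while `../other/secret` and absolute paths outside the workspace are rejected.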
Ubbe	e8c50b96d1	fix(frontend): improve CoPilot chat table styling (#12094) ## Summary - Remove left and right borders from tables rendered in CoPilot chat - Increase cell padding (py-3 → py-3.5) for better spacing between text and lines - Applies to both Streamdown (main chat) and MarkdownRenderer (tool outputs) Design feedback from Olivia to make tables "breathe" more. ## Test plan - [ ] Open CoPilot chat and trigger a response containing a table - [ ] Verify tables no longer have left/right borders - [ ] Verify increased spacing between rows - [ ] Check both light and dark modes 🤖 Generated with [Claude Code](https://claude.com/claude-code) <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Improved CoPilot chat table styling by removing left and right borders and increasing vertical padding from `py-3` to `py-3.5`. Changes apply to both: - Streamdown-rendered tables (via CSS selector in `globals.css`) - MarkdownRenderer tables (via Tailwind classes) The changes make tables "breathe" more per design feedback from Olivia. Issue Found: - The CSS padding value in `globals.css:192` is `0.625rem` (`py-2.5`) but should be `0.875rem` (`py-3.5`) to match the PR description and the MarkdownRenderer implementation. </details> <details><summary><h3>Confidence Score: 2/5</h3></summary> - This PR has a logical error that will cause inconsistent table styling between Streamdown and MarkdownRenderer tables - The implementation has an inconsistency where the CSS file uses `py-2.5` padding while the PR description and MarkdownRenderer use `py-3.5`. This will result in different table padding between the two rendering systems, contradicting the goal of consistent styling improvements. 
- Pay close attention to `autogpt_platform/frontend/src/app/globals.css` - the padding value needs to be corrected to match the intended design </details> <!-- greptile_other_comments_section --> <!-- /greptile_comment --> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>	2026-02-13 09:38:59 +08:00
Ubbe	30e854569a	feat(frontend): add exact timestamp tooltip on run timestamps (#12087) Resolves OPEN-2693: Make exact timestamp of runs accessible through UI. The NewAgentLibraryView shows relative timestamps ("2 days ago") for runs and schedules, but unlike the OldAgentLibraryView it didn't show the exact timestamp on hover. This PR adds a native `title` tooltip so users can see the full date/time by hovering. ### Changes 🏗️ - Added `descriptionTitle` prop to `SidebarItemCard` that renders as a `title` attribute on the description text - `TaskListItem` now passes the exact `run.started_at` timestamp via `descriptionTitle` - `ScheduleListItem` now passes the exact `schedule.next_run_time` timestamp via `descriptionTitle` ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [ ] Open an agent in the library view - [ ] Hover over a run's relative timestamp (e.g. "2 days ago") and confirm the full date/time tooltip appears - [ ] Hover over a schedule's relative timestamp and confirm the full date/time tooltip appears 🤖 Generated with [Claude Code](https://claude.com/claude-code) <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Added native tooltip functionality to show exact timestamps in the library view. The implementation adds a `descriptionTitle` prop to `SidebarItemCard` that renders as a `title` attribute on the description text. This allows users to hover over relative timestamps (e.g., "2 days ago") to see the full date/time. 
Changes: - Added optional `descriptionTitle` prop to `SidebarItemCard` component (SidebarItemCard.tsx:10) - `TaskListItem` passes `run.started_at` as the tooltip value (TaskListItem.tsx:84-86) - `ScheduleListItem` passes `schedule.next_run_time` as the tooltip value (ScheduleListItem.tsx:32) - Unrelated fix included: Sentry configuration updated to suppress cross-origin stylesheet errors (instrumentation-client.ts:25-28) Note: The PR includes two separate commits - the main timestamp tooltip feature and a Sentry error suppression fix. The PR description only documents the timestamp feature. </details> <details><summary><h3>Confidence Score: 5/5</h3></summary> - This PR is safe to merge with minimal risk - The changes are straightforward and limited in scope - adding an optional prop that forwards a native HTML attribute for tooltip functionality. The Text component already supports forwarding arbitrary HTML attributes through its spread operator (...rest), ensuring the `title` attribute works correctly. Both the timestamp tooltip feature and the Sentry configuration fix are low-risk improvements with no breaking changes. 
- No files require special attention </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant User participant TaskListItem participant ScheduleListItem participant SidebarItemCard participant Text participant Browser User->>TaskListItem: Hover over run timestamp TaskListItem->>SidebarItemCard: Pass descriptionTitle (run.started_at) SidebarItemCard->>Text: Render with title attribute Text->>Browser: Forward title attribute to DOM Browser->>User: Display native tooltip with exact timestamp User->>ScheduleListItem: Hover over schedule timestamp ScheduleListItem->>SidebarItemCard: Pass descriptionTitle (schedule.next_run_time) SidebarItemCard->>Text: Render with title attribute Text->>Browser: Forward title attribute to DOM Browser->>User: Display native tooltip with exact timestamp ``` </details> <!-- greptile_other_comments_section --> <!-- /greptile_comment --> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 09:38:16 +08:00
Ubbe	301d7cbada	fix(frontend): suppress cross-origin stylesheet security error (#12086) ## Summary - Adds `ignoreErrors` to the Sentry client configuration (`instrumentation-client.ts`) to filter out `SecurityError: CSSStyleSheet.cssRules getter: Not allowed to access cross-origin stylesheet` errors - These errors are caused by Sentry Replay (rrweb) attempting to serialize DOM snapshots that include cross-origin stylesheets (from browser extensions or CDN-loaded CSS) - This was reported via Sentry on production, occurring on any page when logged in ## Changes - `frontend/instrumentation-client.ts`: Added `ignoreErrors: [/Not allowed to access cross-origin stylesheet/]` to `Sentry.init()` config ## Test plan - [ ] Verify the error no longer appears in Sentry after deployment - [ ] Verify Sentry Replay still works correctly for other errors - [ ] Verify no regressions in error tracking (other errors should still be captured) 🤖 Generated with [Claude Code](https://claude.com/claude-code) <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Adds error filtering to Sentry client configuration to suppress cross-origin stylesheet security errors that occur when Sentry Replay (rrweb) attempts to serialize DOM snapshots containing stylesheets from browser extensions or CDN-loaded CSS. This prevents noise in Sentry error logs without affecting the capture of legitimate errors. </details> <details><summary><h3>Confidence Score: 5/5</h3></summary> - This PR is safe to merge with minimal risk - The change adds a simple error filter to suppress benign cross-origin stylesheet errors that are caused by Sentry Replay itself. 
The regex pattern is specific and only affects client-side error reporting, with no impact on application functionality or legitimate error capture - No files require special attention </details> <!-- greptile_other_comments_section --> <!-- /greptile_comment --> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 09:37:54 +08:00
Ubbe	d95aef7665	fix(copilot): stream timeout, long-running tool polling, and CreateAgent UI refresh (#12070) Agent generation completes on the backend but the UI does not update/refresh to show the result. ### Changes 🏗️ - Stream start timeout (12s): If the backend doesn't begin streaming within 12 seconds of submitting a message, the stream is aborted and a destructive toast is shown to the user. - Long-running tool polling: Added `useLongRunningToolPolling` hook that polls the session endpoint every 1.5s while a tool output is in an operating state (`operation_started` / `operation_pending` / `operation_in_progress`). When the backend completes, messages are refreshed so the UI reflects the final result. - CreateAgent UI improvements: Replaced the orbit loader / progress bar with a mini-game, added expanded accordion for saved agents, and improved the saved-agent card with image, icons, and links that open in new tabs. - Backend tweaks: Added `image_url` to `CreateAgentToolOutput`, minor model/service updates for the dummy agent generator. ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Send a message and verify the stream starts within 12s or a toast appears - [x] Trigger agent creation and verify the UI updates when the backend completes - [x] Verify the saved-agent card renders correctly with image, links, and icons --------- Co-authored-by: Otto <otto@agpt.co> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-12 20:06:40 +00:00
Zamil Majdy	cb45e7957b	feat: fix openapi.json	2026-02-12 23:39:47 +04:00
Zamil Majdy	f1d02fb8f3	fix(chat/sdk): move cwd setup inside try block to ensure cleanup Move _make_sdk_cwd() and os.makedirs() inside the try block so the finally cleanup always runs, preventing /tmp dir leaks if setup fails.	2026-02-12 23:32:26 +04:00
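The cleanup pattern from `f1d02fb8f3` might look something like the sketch below. It assumes a hypothetical helper `run_in_session_dir` and takes the directory factory as a parameter (standing in for `_make_sdk_cwd()`); the point is only that setup happens inside `try`, so `finally` runs even when setup itself fails.

```python
import os
import shutil


def run_in_session_dir(make_cwd, work):
    """Create a session directory, run work(cwd), and always clean up."""
    cwd = None  # defined before try so the finally block can test it safely
    try:
        cwd = make_cwd()                 # may raise
        os.makedirs(cwd, exist_ok=True)  # may raise
        return work(cwd)
    finally:
        # Runs on success, on failure in work(), and on failure during setup.
        if cwd is not None:
            shutil.rmtree(cwd, ignore_errors=True)
```

If `os.makedirs()` created part of the tree before failing, the `finally` block still removes whatever exists, which is the leak the commit message describes.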
Zamil Majdy	47de6b6420	feat(chat): add check_operation_status tool for long-running ops Lets the CoPilot agent query whether a create_agent/edit_agent operation is still running, completed, or failed. Accepts operation_id or task_id from a previous operation_started response and looks up the task status in Redis via stream_registry.	2026-02-12 23:30:51 +04:00
Zamil Majdy	62cd2eea89	fix(chat/sandbox): use --symlink for compat paths on Debian 13 On Debian 13 (trixie), /bin, /lib, /sbin, /lib64 are symlinks to /usr/*. bwrap --ro-bind cannot create a symlink as a mount target inside the sandbox, causing "execvp: No such file or directory" because the ELF dynamic linker at /lib64/ld-linux-x86-64.so.2 is unreachable. Detect symlinks at runtime with os.path.islink() and use bwrap --symlink instead of --ro-bind. Falls back to --ro-bind on older distros where these are real directories.	2026-02-12 22:55:29 +04:00
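A rough sketch of the runtime detection described in `62cd2eea89`, assuming a hypothetical helper `compat_mount_args`. The filesystem probes are passed in as parameters purely so the logic can be exercised without a real merged-`/usr` system; `--symlink SRC DEST` and `--ro-bind SRC DEST` are the bwrap options named in the commit message.

```python
import os

COMPAT_PATHS = ("/bin", "/lib", "/lib64", "/sbin")


def compat_mount_args(paths=COMPAT_PATHS, islink=os.path.islink,
                      readlink=os.readlink, exists=os.path.exists):
    """Build bwrap arguments for the compatibility paths."""
    args = []
    for path in paths:
        if islink(path):
            # Recreate the symlink inside the sandbox (target first, then location).
            args += ["--symlink", readlink(path), path]
        elif exists(path):
            # Real directory on older distros: mount it read-only as before.
            args += ["--ro-bind", path, path]
        # Paths that do not exist at all are skipped.
    return args
```

On a merged-`/usr` system this yields entries such as `--symlink usr/bin /bin`, which relies on `/usr` itself being mounted in the sandbox.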
Zamil Majdy	ae61ec692e	Merge branch 'dev' into feat/copitlot-claude-code	2026-02-12 22:27:50 +04:00
Zamil Majdy	9296bd8736	fix(chat/sandbox): fix bwrap inside Docker containers Three fixes for bubblewrap sandbox: - Fix --tmpdir (invalid) to --tmpfs (correct bwrap option) - Add --unshare-user so bwrap can create namespaces inside unprivileged Docker containers (no CAP_SYS_ADMIN needed) - Reorder mounts: --tmpfs /tmp first, then --bind workspace on top, so the workspace directory is visible through the fresh tmpfs	2026-02-12 22:22:39 +04:00
Zamil Majdy	308113c03d	fix(chat/sdk): remove obsolete Bash allowlist tests The SDK built-in Bash tool is now unconditionally blocked (bash_exec MCP tool with bubblewrap is used instead). Remove tests that expected safe Bash commands to be allowed and replace with a single test that verifies Bash is always denied.	2026-02-12 22:19:30 +04:00
Zamil Majdy	51abf13254	feat(chat): use LaunchDarkly flag for copilot SDK rollout Replace static CHAT_USE_CLAUDE_AGENT_SDK env var with a LaunchDarkly feature flag (copilot-sdk) for per-user rollout control. The env var value serves as the default when LD is not configured or the flag doesn't exist yet.	2026-02-12 22:02:28 +04:00
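The fallback behaviour in `51abf13254` could be sketched as below. No LaunchDarkly SDK call is shown: `evaluate_flag` is a hypothetical callable standing in for whatever client wrapper the backend uses, and the set of strings treated as "true" is an assumption. Only the env var name and the flag key `copilot-sdk` come from the commit message.

```python
import os

TRUTHY = {"1", "true", "yes", "on"}  # assumed parsing of the env var


def env_default(name="CHAT_USE_CLAUDE_AGENT_SDK", environ=os.environ) -> bool:
    """Read the static env var that used to control the rollout."""
    return environ.get(name, "").strip().lower() in TRUTHY


def use_copilot_sdk(user_id, evaluate_flag=None, environ=os.environ) -> bool:
    """Per-user decision; the env var value is the default passed to the flag."""
    default = env_default(environ=environ)
    if evaluate_flag is None:
        # Feature-flag service not configured: fall back to the env var.
        return default
    return bool(evaluate_flag("copilot-sdk", user_id, default))
```

Passing the env value as the flag default means a missing flag and an unconfigured client both behave like the old static setting.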
Zamil Majdy	54b03d3a29	fix(frontend): remove python_exec from openapi.json ResponseType enum The python_exec tool was removed from the backend but the generated openapi.json still referenced the enum value.	2026-02-12 21:55:25 +04:00
Zamil Majdy	239dff5ebd	feat(chat/sandbox): add resource limits to bubblewrap sandbox Add ulimit-based resource caps inside the bwrap sandbox to prevent fork bombs and resource exhaustion: - max 64 processes (stops fork bombs) - 512 MB virtual memory - 50 MB max file size - 256 open file descriptors Limits are applied via `sh -c 'ulimit ...; exec "$@"'` wrapper inside the sandbox, so they're inherited by all child processes.	2026-02-12 21:47:49 +04:00
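The `sh -c 'ulimit ...; exec "$@"'` wrapper from `239dff5ebd` can be illustrated with a small argv builder. The helper name `wrap_with_ulimits` is hypothetical, and the example deliberately uses only `ulimit -n` (open file descriptors): the flags and units for process, memory, and file-size limits differ between shells, so they are left to the caller rather than guessed here.

```python
def wrap_with_ulimits(command, limits):
    """Prefix a command with a shell wrapper that applies ulimit settings.

    limits is a list of (flag, value) pairs, e.g. [("-n", 256)].
    Values must be trusted integers; they are interpolated into the script.
    """
    script = "; ".join(f"ulimit {flag} {int(value)}" for flag, value in limits)
    script += '; exec "$@"'
    # "_" fills $0, so the real command arrives in "$@" and replaces the shell.
    return ["sh", "-c", script, "_", *command]


argv = wrap_with_ulimits(["python3", "-c", "print('hi')"], [("-n", 256)])
```

Because the wrapper ends in `exec`, the limits apply to the target process itself and are inherited by anything it spawns.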
Zamil Majdy	1dd53db21c	feat(chat/sandbox): bubblewrap sandbox for bash_exec, remove python_exec - Replace `--ro-bind / /` with whitelist-only filesystem: only /usr, /etc, /bin, /lib, /sbin mounted read-only. /app, /root, /home, /opt, /var are completely invisible inside the sandbox. - Add `--clearenv` to wipe all inherited env vars (API keys, DB passwords). Only safe vars (PATH, HOME=workspace, LANG) are explicitly set. - Remove python_exec tool — bash_exec can run `python3 -c` or heredocs with identical bubblewrap protection, reducing attack surface. - Remove all fallback security code (import hooks, blocked modules, network command lists). Tools now hard-require bubblewrap — disabled on platforms without bwrap. - Clean up security_hooks.py: remove ~200 lines of dead bash validation code, add Bash to BLOCKED_TOOLS as defence-in-depth. - Wire up long-running tool callback in SDK service for create_agent/edit_agent delegation to Redis Streams background infrastructure.	2026-02-12 21:44:40 +04:00
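As an illustration of the whitelist-only filesystem and `--clearenv` setup in `1dd53db21c`, a bwrap argument list could be assembled roughly like this. The function name `sandbox_base_args` and the concrete `PATH` and `LANG` values are assumptions; the mounted directories and the use of the workspace as `HOME` follow the commit message.

```python
READ_ONLY_DIRS = ("/usr", "/etc", "/bin", "/lib", "/sbin")


def sandbox_base_args(workspace: str) -> list[str]:
    """Build a bwrap prefix with an empty environment and whitelisted mounts."""
    args = [
        "bwrap",
        "--clearenv",                        # drop every inherited variable
        "--setenv", "PATH", "/usr/bin:/bin", # assumed value
        "--setenv", "HOME", workspace,
        "--setenv", "LANG", "C.UTF-8",       # assumed value
    ]
    for path in READ_ONLY_DIRS:
        args += ["--ro-bind", path, path]
    # Only the session workspace is writable; nothing else is mounted.
    args += ["--bind", workspace, workspace, "--chdir", workspace]
    return args
```

Since the sandbox starts from an empty root rather than `--ro-bind / /`, directories such as `/app` or `/root` are absent simply because they are never listed.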
Zamil Majdy	06c16ee2fe	fix(chat/sdk): non-blocking long-running tools, tighten security - Long-running tools (create_agent) now run in background and return immediately with an operation_id. Add check_operation MCP tool for polling results. Prevents 3+ min blocking and survives page refresh. - Fix CodeQL path traversal alert: use normpath+startswith sanitizer in _make_sdk_cwd() instead of assert. - Tighten _read_file_handler: restrict from ~/.claude/ to only ~/.claude/projects/**/tool-results/ (sentry review feedback). - Fix bash redirect bypass: strip quoted strings before checking for unquoted > operator, catches `echo hello>file` (sentry review).	2026-02-12 20:39:33 +04:00
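The redirect check described in `06c16ee2fe` (strip quoted strings, then look for an unquoted `>`) can be sketched as follows. The helper name `has_unquoted_redirect` is hypothetical, and this simplified version does not handle backslash-escaped `\>` outside quotes or heredocs.

```python
import re

# Single-quoted strings, or double-quoted strings with backslash escapes.
_QUOTED = re.compile(r"'[^']*'|\"(?:\\.|[^\"\\])*\"")


def has_unquoted_redirect(command: str) -> bool:
    """Return True if '>' appears outside of quoted strings."""
    stripped = _QUOTED.sub("", command)
    return ">" in stripped
```

Checking for the bare character rather than a space-delimited token is what catches `echo hello>file`, while `echo "a > b"` passes because the quoted part is removed first.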
Zamil Majdy	8d2a649ee5	refactor(chat/sdk): remove Langfuse tracing — OpenRouter handles observability Delete tracing.py (~408 lines) and all TracedSession/hook references from the SDK path. OpenRouter already provides token usage, cost tracking, and request logging, making manual Langfuse integration redundant. This also fixes the broken 'Langfuse' object has no attribute 'trace' warning on every request.	2026-02-12 20:24:27 +04:00
Nicholas Tindle	cb166dd6fb	feat(blocks): Store sandbox files to workspace (#12073) Store files created by sandbox blocks (Claude Code, Code Executor) to the user's workspace for persistence across runs. ### Changes 🏗️ - New `sandbox_files.py` utility (`backend/util/sandbox_files.py`) - Shared module for extracting files from E2B sandboxes - Stores files to workspace via `store_media_file()` (includes virus scanning, size limits) - Returns `SandboxFileOutput` with path, content, and `workspace_ref` - Claude Code block (`backend/blocks/claude_code.py`) - Added `workspace_ref` field to `FileOutput` schema - Replaced inline `_extract_files()` with shared utility - Files from working directory now stored to workspace automatically - Code Executor block (`backend/blocks/code_executor.py`) - Added `files` output field to `ExecuteCodeBlock.Output` - Creates `/output` directory in sandbox before execution - Extracts all files (text + binary) from `/output` after execution - Updated `execute_code()` to support file extraction with `extract_files` param ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Create agent with Claude Code block, have it create a file, verify `workspace_ref` in output - [x] Create agent with Code Executor block, write file to `/output`, verify `workspace_ref` in output - [x] Verify files persist in workspace after sandbox disposal - [x] Verify binary files (images, etc.) 
work correctly in Code Executor - [x] Verify existing graphs using `content` field still work (backward compat) #### For configuration changes: - [x] `.env.default` is updated or already compatible with my changes - [x] `docker-compose.yml` is updated or already compatible with my changes - [x] I have included a list of my configuration changes in the PR description (under Changes) No configuration changes required - this is purely additive backend code. --- Related: Closes SECRT-1931 <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Medium Risk > Adds automatic extraction and workspace storage of sandbox-written files (including binaries for code execution), which can affect output payload size, performance, and file-handling edge cases. > > Overview > Sandbox blocks now persist generated files to workspace. A new shared utility (`backend/util/sandbox_files.py`) extracts files from an E2B sandbox (scoped by a start timestamp) and stores them via `store_media_file`, returning `SandboxFileOutput` with `workspace_ref`. > > `ClaudeCodeBlock` replaces its inline file-scraping logic with this utility and updates the `files` output schema to include `workspace_ref`. > > `ExecuteCodeBlock` adds a `files` output and extends the executor mixin to optionally extract/store files (text + binary) when an `execution_context` is provided; related mocks/tests and docs are updated accordingly. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit `343854c0cf`. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY --> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-12 15:56:59 +00:00
Zamil Majdy	9589474709	Merge branch 'dev' into feat/copitlot-claude-code	2026-02-12 19:40:32 +04:00
Swifty	3d31f62bf1	Revert "added feature request tooling" This reverts commit `b8b6c9de23`.	2026-02-12 16:39:24 +01:00
Swifty	b8b6c9de23	added feature request tooling	2026-02-12 16:38:17 +01:00
Zamil Majdy	749a78723a	refactor(chat/sdk): deduplicate code and remove anthropic fallback - Extract shared `make_session_path()` into sandbox.py (single source of truth for workspace path sanitization), replace duplicate in service.py - Delete anthropic_fallback.py (~360 lines) — redundant third code path; routes.py already falls back to non-SDK service - Remove dead `traced_session()`, `get_tool_definitions()`, `get_tool_handlers()`, `_current_tool_call_id` ContextVar - Fix hardcoded model in tracing — pass actual resolved model - Fix inconsistent model name splitting in anthropic fallback	2026-02-12 19:26:29 +04:00
Zamil Majdy	bec2e1ddee	fix(chat/tools): sanitize session_id in sandbox workspace path Align with SDK's _make_sdk_cwd() to prevent path traversal and ensure python_exec/bash_exec share the same workspace as SDK file tools.	2026-02-12 19:08:47 +04:00
Zamil Majdy	ec1ab06e0d	chore(chat): bump default max_subtasks from 3 to 10	2026-02-12 19:07:42 +04:00
Zamil Majdy	f31cb49557	feat(chat/tools): add sandboxed python_exec, bash_exec, web_fetch tools and enable Task - Add sandbox.py with network-isolated execution via unshare --net (Linux) and import/command blocklist fallback (macOS dev) - Add python_exec tool: runs Python in subprocess with no network, workspace-scoped - Add bash_exec tool: full Bash scripting with no network, workspace-scoped - Add web_fetch tool: SSRF-protected URL fetching via backend Requests utility - Remove SDK built-in Bash from allowlist (replaced by sandboxed bash_exec) - Enable SDK built-in Task (sub-agents) with per-session rate limit (default 3) - Add claude_agent_max_subtasks config field	2026-02-12 19:07:19 +04:00
Zamil Majdy	fd28c386f4	Merge branch 'dev' into feat/copitlot-claude-code	2026-02-12 18:50:11 +04:00
Zamil Majdy	3bea584659	feat(chat/sdk): route SDK through OpenRouter with observability (#12084) ## Summary - Routes Claude Agent SDK API calls through OpenRouter via `ANTHROPIC_BASE_URL` / `ANTHROPIC_AUTH_TOKEN` env vars, enabling per-call token and cost tracking on the OpenRouter dashboard - Adds `sdk_model` and `sdk_max_budget_usd` config fields for SDK-specific model selection and budget control - Emits `StreamUsage` from SDK `ResultMessage` so the frontend receives token counts, and persists usage to `session.usage` - Fixes Langfuse tracing to use the configured model name instead of a hardcoded default - Updates Anthropic fallback to use `config.api_key` / `config.base_url` (OpenRouter routing) instead of raw `ANTHROPIC_API_KEY` env var ## Test plan - [ ] Deploy and send a CoPilot message, then verify the API call appears on the OpenRouter dashboard - [ ] Check Langfuse trace shows correct model name (e.g. `claude-opus-4.6` not hardcoded `claude-sonnet-4-20250514`) - [ ] Verify frontend receives `StreamUsage` with `promptTokens` / `completionTokens` values - [ ] Set `CHAT_SDK_MAX_BUDGET_USD` and verify budget is respected - [ ] Test fallback path (without `claude-agent-sdk` installed) still works via OpenRouter <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Routes Claude Agent SDK API calls through OpenRouter for enhanced observability and cost tracking. The PR enables per-call token tracking on the OpenRouter dashboard by configuring the SDK to use `ANTHROPIC_BASE_URL` and `ANTHROPIC_AUTH_TOKEN` environment variables derived from the chat configuration. 
Key changes: - Added `sdk_model` and `sdk_max_budget_usd` configuration fields for SDK-specific control - Implemented automatic model name resolution that strips OpenRouter provider prefixes - Updated SDK client initialization to route through OpenRouter with proper environment variables - Emits `StreamUsage` events from SDK `ResultMessage` for frontend token visibility - Persists usage data to `session.usage` for historical tracking - Fixed Langfuse tracing to use the configured model name instead of hardcoded defaults - Updated fallback path to use OpenRouter routing instead of direct Anthropic API </details> <details><summary><h3>Confidence Score: 4/5</h3></summary> - Safe to merge with minor observations - the implementation is solid and the changes are well-structured - The code quality is high with proper error handling, clear separation of concerns, and good defensive coding practices. The changes integrate cleanly with existing patterns. Minor observations include missing validation for sdk_max_budget_usd and a potential edge case in model name resolution, but these don't block merging - No files require special attention - all changes follow existing patterns and maintain consistency </details> <details><summary><h3>Sequence Diagram</h3></summary> ```mermaid sequenceDiagram participant Frontend participant Backend participant SDK as Claude Agent SDK participant OpenRouter participant Anthropic participant Langfuse Frontend->>Backend: POST /chat/completions Backend->>Backend: Load config (api_key, base_url) Backend->>Backend: Resolve SDK model (strip OpenRouter prefix) Backend->>Backend: Build SDK env vars (ANTHROPIC_BASE_URL, ANTHROPIC_AUTH_TOKEN) Backend->>Langfuse: Initialize TracedSession with model name Backend->>SDK: ClaudeSDKClient(model, env, max_budget_usd) SDK->>SDK: Use ANTHROPIC_BASE_URL from env SDK->>OpenRouter: POST /messages (via configured base_url) OpenRouter->>Anthropic: Forward request with routing Anthropic-->>OpenRouter: Stream 
response chunks OpenRouter-->>SDK: Stream response with usage data loop For each SDK message SDK-->>Backend: AssistantMessage/UserMessage/ResultMessage Backend->>Langfuse: log_sdk_message() Backend->>Backend: SDKResponseAdapter.convert_message() Backend->>Backend: Extract usage from ResultMessage Backend->>Backend: Persist Usage to session.usage Backend-->>Frontend: StreamUsage(promptTokens, completionTokens) Backend-->>Frontend: StreamTextDelta/StreamToolInput/etc end Backend->>Langfuse: Log final generation with model name Backend->>Backend: Save session with usage data Backend-->>Frontend: StreamFinish ``` </details> <!-- greptile_other_comments_section --> <!-- /greptile_comment -->	2026-02-12 21:47:39 +07:00
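The model name resolution mentioned in `3bea584659` (stripping the OpenRouter provider prefix) could be as small as the sketch below. The function name `resolve_sdk_model` and the `anthropic/` prefix in the example are assumptions; the bare name `claude-opus-4.6` is the one quoted in the test plan.

```python
def resolve_sdk_model(model: str) -> str:
    """Drop a leading 'provider/' segment, if any, from a model identifier."""
    # rsplit keeps the part after the last slash; names without a slash pass through.
    return model.rsplit("/", 1)[-1]


sdk_model = resolve_sdk_model("anthropic/claude-opus-4.6")
```

The Greptile summary notes a potential edge case in this resolution without saying which; the sketch makes no attempt to cover it.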
Abhimanyu Yadav	4f6055f494	refactor(frontend): remove default expiration date from API key credentials form (#12092) ### Changes 🏗️ Removed the default expiration date for API keys in the credentials modal. Previously, API keys were set to expire the next day by default, but now the expiration date field starts empty, allowing users to explicitly choose whether they want to set an expiration date. ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Open the API key credentials modal and verify the expiration date field is empty by default - [x] Test creating an API key with and without an expiration date - [x] Verify both scenarios work correctly <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Removed the default expiration date for API key credentials in the credentials modal. Previously, API keys were automatically set to expire the next day at midnight. Now the expiration date field starts empty, allowing users to explicitly choose whether to set an expiration. - Removed `getDefaultExpirationDate()` helper function that calculated tomorrow's date - Changed default `expiresAt` value from calculated date to empty string - Backend already supports optional expiration (`expires_at?: number`), so no backend changes needed - Form submission correctly handles empty expiration by passing `undefined` to the API </details> <details><summary><h3>Confidence Score: 5/5</h3></summary> - This PR is safe to merge with minimal risk - The changes are straightforward and well-contained. The refactor removes a helper function and changes a default value. The backend API already supports optional expiration dates, and the form submission logic correctly handles empty values by passing undefined. The change improves UX by not forcing a default expiration date on users. 
- No files require special attention </details> <!-- greptile_other_comments_section --> <!-- /greptile_comment -->	2026-02-12 12:57:06 +00:00
Otto	695a185fa1	fix(frontend): remove fixed min-height from CoPilot message container (#12091) ## Summary Removes the `min-h-screen` class from `ConversationContent` in ChatMessagesContainer, which was causing fixed height layout issues in the CoPilot chat interface. ## Changes - Removed `min-h-screen` from ConversationContent className ## Linear Fixes [SECRT-1944](https://linear.app/autogpt/issue/SECRT-1944) <!-- greptile_comment --> <h2>Greptile Overview</h2> <details><summary><h3>Greptile Summary</h3></summary> Removes the `min-h-screen` (100vh) class from `ConversationContent` that was causing the chat message container to enforce a minimum viewport height. The parent container already handles height constraints with `h-full min-h-0` and flexbox layout, so the fixed minimum height was creating layout conflicts. The component now properly grows within its flex container using `flex-1`. </details> <details><summary><h3>Confidence Score: 5/5</h3></summary> - This PR is safe to merge with minimal risk - The change removes a single problematic CSS class that was causing fixed height layout issues. The parent container already handles height constraints properly with flexbox, and removing min-h-screen allows the component to size correctly within its flex parent. This is a targeted, low-risk bug fix with no logic changes. - No files require special attention </details> <!-- greptile_other_comments_section --> <!-- /greptile_comment -->	2026-02-12 12:46:29 +00:00
Reinier van der Leer	113e87a23c	refactor(backend): Reduce circular imports (#12068) I'm getting circular import issues because there is a lot of cross-importing between `backend.data`, `backend.blocks`, and other modules. This change reduces block-related cross-imports and thus risk of breaking circular imports. ### Changes 🏗️ - Strip down `backend.data.block` - Move `Block` base class and related class/enum defs to `backend.blocks._base` - Move `is_block_auth_configured` to `backend.blocks._utils` - Move `get_blocks()`, `get_io_block_ids()` etc. to `backend.blocks` (`__init__.py`) - Update imports everywhere - Remove unused and poorly typed `Block.create()` - Change usages from `block_cls.create()` to `block_cls()` - Improve typing of `load_all_blocks` and `get_blocks` - Move cross-import of `backend.api.features.library.model` from `backend/data/__init__.py` to `backend/data/integrations.py` - Remove deprecated attribute `NodeModel.webhook` - Re-generate OpenAPI spec and fix frontend usage - Eliminate module-level `backend.blocks` import from `blocks/agent.py` - Eliminate module-level `backend.data.execution` and `backend.executor.manager` imports from `blocks/helpers/review.py` - Replace `BlockInput` with `GraphInput` for graph inputs ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - CI static type-checking + tests should be sufficient for this	2026-02-12 12:07:49 +00:00
Abhimanyu Yadav	d09f1532a4	feat(frontend): replace legacy builder with new flow editor (#12081) ### Changes 🏗️ This PR completes the migration from the legacy builder to the new Flow editor by removing all legacy code and feature flags. Removed: - Old builder view toggle functionality (`BuilderViewTabs.tsx`) - Legacy debug panel (`RightSidebar.tsx`) - Feature flags: `NEW_FLOW_EDITOR` and `BUILDER_VIEW_SWITCH` - `useBuilderView` hook and related view-switching logic Updated: - Simplified `build/page.tsx` to always render the new Flow editor - Added CSS styling (`flow.css`) to properly render Phosphor icons in React Flow handles Tests: - Skipped e2e test suite in `build.spec.ts` (legacy builder tests) - Follow-up PR (#12082) will add new e2e tests for the Flow editor ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Create a new flow and verify it loads correctly - [x] Add nodes and connections to verify basic functionality works - [x] Verify that node handles render correctly with the new CSS - [x] Check that the UI is clean without the old debug panel or view toggles #### For configuration changes: - [x] `.env.default` is updated or already compatible with my changes - [x] `docker-compose.yml` is updated or already compatible with my changes	2026-02-12 11:16:01 +00:00
Zamil Majdy	d7f7a2747f	fix(backend/chat): Atomic message append to prevent race condition Replace the read-modify-write pattern in stream_chat_post with an atomic append_and_save_message helper that acquires the session lock before re-fetching and appending. This prevents message loss when concurrent requests modify the same session.	2026-02-12 09:10:43 +04:00
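The lock-then-refetch pattern from `d7f7a2747f` can be illustrated with an in-process sketch. An `asyncio.Lock` and a dict stand in for the real session lock and storage, which are not described in the commit message; the class `SessionStore` is hypothetical, and only the helper name `append_and_save_message` comes from the source.

```python
import asyncio


class SessionStore:
    """Toy session storage with one lock per session."""

    def __init__(self):
        self._sessions = {}
        self._locks = {}

    def get(self, session_id):
        return list(self._sessions.get(session_id, []))

    async def append_and_save_message(self, session_id, message):
        lock = self._locks.setdefault(session_id, asyncio.Lock())
        async with lock:
            # Re-fetch only after the lock is held, so no concurrent
            # writer can slip in between the read and the save.
            messages = self.get(session_id)
            await asyncio.sleep(0)  # simulated I/O; yields to other tasks
            messages.append(message)
            self._sessions[session_id] = messages
            return messages
```

Without the lock, two requests that both read the session before either saved would each write back a list missing the other's message, which is the message loss the commit describes.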
Zamil Majdy	68849e197c	format	2026-02-12 08:26:26 +04:00
Zamil Majdy	211478bb29	Revert "style: run ruff format and isort" This reverts commit `40b58807ab`.	2026-02-12 08:25:22 +04:00


7988 Commits