AutoGPT

mirror of https://github.com/Significant-Gravitas/AutoGPT.git synced 2026-02-08 05:45:07 -05:00

Author	SHA1	Message	Date
Zamil Majdy	8b4eb6f87c	fix(backend): resolve SmartDecisionMaker ChatCompletionMessage error and enhance tool call token counting (#11059 ) ## Summary Fix two critical production issues affecting SmartDecisionMaker functionality and prompt compression accuracy. ### 🔧 Changes Made #### Issue 1: SmartDecisionMaker ChatCompletionMessage Error Problem: PR #11015 introduced code that appended `response.raw_response` (ChatCompletionMessage object) directly to conversation history, causing `'ChatCompletionMessage' object has no attribute 'get'` errors. Root Cause: ChatCompletionMessage objects don't have `.get()` method but conversation history processing expects dictionary objects with `.get()` capability. Solution: Created `_convert_raw_response_to_dict()` helper function for type-safe conversion: - ✅ Helper function: Safely converts raw_response to dictionary format for conversation history - ✅ Type safety: Handles OpenAI (ChatCompletionMessage), Anthropic (Message), and Ollama (string) responses - ✅ Preserves context: Maintains conversation flow for multi-turn tool calling scenarios - ✅ DRY principle: Single helper used in both validation error path (line 624) and success path (line 681) - ✅ No breaking changes: Tool call continuity preserved for complex workflows #### Issue 2: Tool Call Token Counting in Prompt Compression Problem: `_msg_tokens()` function only counted tokens in 'content' field, severely undercounting tool calls which store data in different fields (tool_calls, function.arguments, etc.). Root Cause: Tool calls have no 'content' to calculate length of, causing massive token undercounting during prompt compression that could lead to context overflow. Solution: Enhanced `_msg_tokens()` to handle both OpenAI and Anthropic tool call formats: - ✅ OpenAI format: Count tokens in `tool_calls[].id`, `type`, `function.name`, `function.arguments` - ✅ Anthropic format: Count tokens in `content[].tool_use` (`id`, `name`, `input`) and `content[].tool_result` - ✅ Backward compatibility: Regular string content counted exactly as before - ✅ Comprehensive testing: Added 11 unit tests in `prompt_test.py` ### 📊 Validation Results - ✅ SmartDecisionMaker errors resolved: No more ChatCompletionMessage.get() failures - ✅ Token counting accuracy: OpenAI tool calls 9+ tokens vs previous 3-4 wrapper-only tokens - ✅ Token counting accuracy: Anthropic tool calls 13+ tokens vs previous 3-4 wrapper-only tokens - ✅ Backward compatibility: Regular messages maintain exact same token count - ✅ Type safety: 0 type errors in both modified files - ✅ Test coverage: All 11 new unit tests pass + existing SmartDecisionMaker tests pass - ✅ Multi-turn conversations: Tool call workflows continue working correctly ### 🎯 Impact - Resolves Sentry issue OPEN-2750: ChatCompletionMessage errors eliminated - Prevents context overflow: Accurate token counting during prompt compression for long tool call conversations - Production stability: SmartDecisionMaker retry mechanism works correctly with proper conversation flow - Resource efficiency: Better memory management through accurate token accounting - Zero breaking changes: Full backward compatibility maintained ### 🧪 Test Plan - [x] Verified SmartDecisionMaker no longer crashes with ChatCompletionMessage errors - [x] Validated tool call token counting accuracy with comprehensive unit tests (11 tests all pass) - [x] Confirmed backward compatibility for regular message token counting - [x] Tested both OpenAI and Anthropic tool call formats - [x] Verified type safety with pyright checks - [x] Ensured conversation history flows correctly with helper function - [x] Confirmed multi-turn tool calling scenarios work with preserved context ### 📝 Files Modified - `backend/blocks/smart_decision_maker.py` - Added `_convert_raw_response_to_dict()` helper for safe conversion - `backend/util/prompt.py` - Enhanced tool call token counting for accurate prompt compression - `backend/util/prompt_test.py` - Comprehensive unit tests for token counting (11 tests) ### ⚡ Ready for Review Both fixes are critical for production stability and have been thoroughly tested with zero breaking changes. The helper function approach ensures type safety while preserving essential conversation context for complex tool calling workflows. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-03 00:25:21 +00:00
Reinier van der Leer	4b7d17b9d2	refactor(blocks/code): Clean up & rename code execution blocks (#11019 ) The code execution blocks' implementations are heavily duplicated and their names aren't very clear. E.g. the "InstantiationBlock" just shows up as "Instantiation" in the block list. I would've done this in #11017 but kept the refactoring separate for easier reviewing. ### Changes 🏗️ - Rename "Code Execution" block to "Execute Code" - Rename "Instantiation" block to "Instantiate Code Sandbox" - Rename "Step Execution" block to "Execute Code Step" - Deduplicate implementation of the three code execution blocks - Add `dispose_sandbox` toggle to "Execute Code" and "Execute Code Step" blocks - Note: it's default `True` on the Execute Code block, default `False` on the Execute Code Step block - Update block and input descriptions to clarify behavior - Fix all linting issues <details> <summary>Screenshots</summary> ![the three blocks as they look now](https://github.com/user-attachments/assets/8e4274f7-e006-440c-b2b8-980df546186d) ![updated block names and descriptions in the block list](https://github.com/user-attachments/assets/866c3d9e-13ea-4fc0-87de-a5257bafb6d4) ![the new dispose_sandbox toggle on the Execute Code block](https://github.com/user-attachments/assets/56815dbb-f313-4308-81dd-50d949d9eafb) ![the new dispose_sandbox toggle on the Execute Code Step block](https://github.com/user-attachments/assets/469c140c-4cd2-4210-97b2-f27fc91778de) </details> ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Test all code execution blocks manually	2025-10-02 22:50:49 +00:00
dependabot[bot]	0fc6a44389	chore(backend/deps-dev): Bump the development-dependencies group across 1 directory with 4 updates (#10946 ) Bumps the development-dependencies group with 4 updates in the /autogpt_platform/backend directory: [faker](https://github.com/joke2k/faker), [pyright](https://github.com/RobertCraigie/pyright-python), [pytest-mock](https://github.com/pytest-dev/pytest-mock) and [ruff](https://github.com/astral-sh/ruff). Updates `faker` from 37.6.0 to 37.8.0 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/joke2k/faker/releases">faker's releases</a>.</em></p> <blockquote> <h2>Release v37.8.0</h2> <p>See <a href="https://github.com/joke2k/faker/blob/refs/tags/v37.8.0/CHANGELOG.md">CHANGELOG.md</a>.</p> <h2>Release v37.7.0</h2> <p>See <a href="https://github.com/joke2k/faker/blob/refs/tags/v37.7.0/CHANGELOG.md">CHANGELOG.md</a>.</p> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/joke2k/faker/blob/master/CHANGELOG.md">faker's changelog</a>.</em></p> <blockquote> <h3><a href="https://github.com/joke2k/faker/compare/v37.7.0...v37.8.0">v37.8.0 - 2025-09-15</a></h3> <ul> <li>Add Automotive providers for <code>ja_JP</code> locale. Thanks <a href="https://github.com/ItoRino424"><code>@ItoRino424</code></a>.</li> </ul> <h3><a href="https://github.com/joke2k/faker/compare/v37.6.0...v37.7.0">v37.7.0 - 2025-09-15</a></h3> <ul> <li>Add Nigerian name locales (<code>yo_NG</code>, <code>ha_NG</code>, <code>ig_NG</code>, <code>en_NG</code>). Thanks <a href="https://github.com/ifeoluwaoladeji"><code>@ifeoluwaoladeji</code></a>.</li> </ul> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href="`4bde8f57ad`"><code>4bde8f5</code></a> Bump version: 37.7.0 → 37.8.0</li> <li><a href="`f542f364cb`"><code>f542f36</code></a> 📝 Update CHANGELOG.md</li> <li><a href="`e28d7cb909`"><code>e28d7cb</code></a> fix test</li> <li><a href="`e4305b0e29`"><code>e4305b0</code></a> fix padding</li> <li><a href="`a359441a81`"><code>a359441</code></a> 💄 format code</li> <li><a href="`0e3f0bdf81`"><code>0e3f0bd</code></a> Add Automotive providers for <code>ja_JP</code> locale (<a href="https://redirect.github.com/joke2k/faker/issues/2251">#2251</a>)</li> <li><a href="`d4fa69dfc7`"><code>d4fa69d</code></a> Bump version: 37.6.0 → 37.7.0</li> <li><a href="`f636f06a37`"><code>f636f06</code></a> 📝 Update CHANGELOG.md</li> <li><a href="`9a482dd25b`"><code>9a482dd</code></a> 💄 Format code</li> <li><a href="`2493b2d51a`"><code>2493b2d</code></a> fix: fix minor grammar typo (<a href="https://redirect.github.com/joke2k/faker/issues/2259">#2259</a>)</li> <li>Additional commits viewable in <a href="https://github.com/joke2k/faker/compare/v37.6.0...v37.8.0">compare view</a></li> </ul> </details> <br /> Updates `pyright` from 1.1.404 to 1.1.405 <details> <summary>Commits</summary> <ul> <li><a href="`e211ec8df8`"><code>e211ec8</code></a> Pyright NPM Package update to 1.1.405 (<a href="https://redirect.github.com/RobertCraigie/pyright-python/issues/353">#353</a>)</li> <li>See full diff in <a href="https://github.com/RobertCraigie/pyright-python/compare/v1.1.404...v1.1.405">compare view</a></li> </ul> </details> <br /> Updates `pytest-mock` from 3.14.1 to 3.15.1 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/pytest-dev/pytest-mock/releases">pytest-mock's releases</a>.</em></p> <blockquote> <h2>v3.15.1</h2> <p><em>2025-09-16</em></p> <ul> <li><a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/529">#529</a>: Fixed <code>itertools._tee object has no attribute error</code> -- now <code>duplicate_iterators=True</code> must be passed to <code>mocker.spy</code> to duplicate iterators.</li> </ul> <h2>v3.15.0</h2> <p><em>2025-09-04</em></p> <ul> <li>Python 3.8 (EOL) is no longer supported.</li> <li><a href="https://redirect.github.com/pytest-dev/pytest-mock/pull/524">#524</a>: Added <code>spy_return_iter</code> to <code>mocker.spy</code>, which contains a duplicate of the return value of the spied method if it is an <code>Iterator</code>.</li> </ul> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/pytest-dev/pytest-mock/blob/main/CHANGELOG.rst">pytest-mock's changelog</a>.</em></p> <blockquote> <h2>3.15.1</h2> <p><em>2025-09-16</em></p> <ul> <li><code>[#529](https://github.com/pytest-dev/pytest-mock/issues/529) <https://github.com/pytest-dev/pytest-mock/issues/529></code>_: Fixed <code>itertools._tee object has no attribute error</code> -- now <code>duplicate_iterators=True</code> must be passed to <code>mocker.spy</code> to duplicate iterators.</li> </ul> <h2>3.15.0</h2> <p><em>2025-09-04</em></p> <ul> <li>Python 3.8 (EOL) is no longer supported.</li> <li><code>[#524](https://github.com/pytest-dev/pytest-mock/issues/524) <https://github.com/pytest-dev/pytest-mock/pull/524></code>_: Added <code>spy_return_iter</code> to <code>mocker.spy</code>, which contains a duplicate of the return value of the spied method if it is an <code>Iterator</code>.</li> </ul> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href="`e1b5c62a38`"><code>e1b5c62</code></a> Release 3.15.1</li> <li><a href="`184eb190d6`"><code>184eb19</code></a> Set <code>spy_return_iter</code> only when explicitly requested (<a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/537">#537</a>)</li> <li><a href="`4fa0088a0a`"><code>4fa0088</code></a> [pre-commit.ci] pre-commit autoupdate (<a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/536">#536</a>)</li> <li><a href="`f5aff33ce7`"><code>f5aff33</code></a> Fix test failure with pytest 8+ and verbose mode (<a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/535">#535</a>)</li> <li><a href="`adc41873c9`"><code>adc4187</code></a> Bump actions/setup-python from 5 to 6 in the github-actions group (<a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/533">#533</a>)</li> <li><a href="`95ad570060`"><code>95ad570</code></a> [pre-commit.ci] pre-commit autoupdate (<a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/532">#532</a>)</li> <li><a href="`e696bf02c1`"><code>e696bf0</code></a> Fix standalone mock support (<a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/531">#531</a>)</li> <li><a href="`5b29b03ce9`"><code>5b29b03</code></a> Fix gen-release-notes script</li> <li><a href="`7d22ef4e56`"><code>7d22ef4</code></a> Merge pull request <a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/528">#528</a> from pytest-dev/release-3.15.0</li> <li><a href="`90b29f89e2`"><code>90b29f8</code></a> Update CHANGELOG for 3.15.0</li> <li>Additional commits viewable in <a href="https://github.com/pytest-dev/pytest-mock/compare/v3.14.1...v3.15.1">compare view</a></li> </ul> </details> <br /> Updates `ruff` from 0.12.11 to 0.13.0 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/astral-sh/ruff/releases">ruff's releases</a>.</em></p> <blockquote> <h2>0.13.0</h2> <h2>Release Notes</h2> <p>Check out the <a href="https://astral.sh/blog/ruff-v0.13.0">blog post</a> for a migration guide and overview of the changes!</p> <h3>Breaking changes</h3> <ul> <li> <p><strong>Several rules can now add <code>from __future__ import annotations</code> automatically</strong></p> <p><code>TC001</code>, <code>TC002</code>, <code>TC003</code>, <code>RUF013</code>, and <code>UP037</code> now add <code>from __future__ import annotations</code> as part of their fixes when the <code>lint.future-annotations</code> setting is enabled. This allows the rules to move more imports into <code>TYPE_CHECKING</code> blocks (<code>TC001</code>, <code>TC002</code>, and <code>TC003</code>), use PEP 604 union syntax on Python versions before 3.10 (<code>RUF013</code>), and unquote more annotations (<code>UP037</code>).</p> </li> <li> <p><strong>Full module paths are now used to verify first-party modules</strong></p> <p>Ruff now checks that the full path to a module exists on disk before categorizing it as a first-party import. This change makes first-party import detection more accurate, helping to avoid false positives on local directories with the same name as a third-party dependency, for example. See the <a href="https://docs.astral.sh/ruff/faq/#how-does-ruff-determine-which-of-my-imports-are-first-party-third-party-etc">FAQ section</a> on import categorization for more details.</p> </li> <li> <p><strong>Deprecated rules must now be selected by exact rule code</strong></p> <p>Ruff will no longer activate deprecated rules selected by their group name or prefix. As noted below, the two remaining deprecated rules were also removed in this release, so this won't affect any current rules, but it will still affect any deprecations in the future.</p> </li> <li> <p><strong>The deprecated macOS configuration directory fallback has been removed</strong></p> <p>Ruff will no longer look for a user-level configuration file at <code>~/Library/Application Support/ruff/ruff.toml</code> on macOS. This feature was deprecated in v0.5 in favor of using the <a href="https://specifications.freedesktop.org/basedir-spec/latest/">XDG specification</a> (usually resolving to <code>~/.config/ruff/ruff.toml</code>), like on Linux. The fallback and accompanying deprecation warning have now been removed.</p> </li> </ul> <h3>Removed Rules</h3> <p>The following rules have been removed:</p> <ul> <li><a href="https://docs.astral.sh/ruff/rules/pandas-df-variable-name"><code>pandas-df-variable-name</code></a> (<code>PD901</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/non-pep604-isinstance"><code>non-pep604-isinstance</code></a> (<code>UP038</code>)</li> </ul> <h3>Stabilization</h3> <p>The following rules have been stabilized and are no longer in preview:</p> <ul> <li><a href="https://docs.astral.sh/ruff/rules/airflow-dag-no-schedule-argument"><code>airflow-dag-no-schedule-argument</code></a> (<code>AIR002</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/airflow3-removal"><code>airflow3-removal</code></a> (<code>AIR301</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/airflow3-moved-to-provider"><code>airflow3-moved-to-provider</code></a> (<code>AIR302</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/airflow3-suggested-update"><code>airflow3-suggested-update</code></a> (<code>AIR311</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/airflow3-suggested-to-move-to-provider"><code>airflow3-suggested-to-move-to-provider</code></a> (<code>AIR312</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/long-sleep-not-forever"><code>long-sleep-not-forever</code></a> (<code>ASYNC116</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/f-string-number-format"><code>f-string-number-format</code></a> (<code>FURB116</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/os-symlink"><code>os-symlink</code></a> (<code>PTH211</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/generic-not-last-base-class"><code>generic-not-last-base-class</code></a> (<code>PYI059</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/redundant-none-literal"><code>redundant-none-literal</code></a> (<code>PYI061</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/pytest-raises-ambiguous-pattern"><code>pytest-raises-ambiguous-pattern</code></a> (<code>RUF043</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/unused-unpacked-variable"><code>unused-unpacked-variable</code></a> (<code>RUF059</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/useless-class-metaclass-type"><code>useless-class-metaclass-type</code></a> (<code>UP050</code>)</li> </ul> <p>The following behaviors have been stabilized:</p> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/astral-sh/ruff/blob/main/CHANGELOG.md">ruff's changelog</a>.</em></p> <blockquote> <h2>0.13.0</h2> <p>Check out the <a href="https://astral.sh/blog/ruff-v0.13.0">blog post</a> for a migration guide and overview of the changes!</p> <h3>Breaking changes</h3> <ul> <li> <p><strong>Several rules can now add <code>from __future__ import annotations</code> automatically</strong></p> <p><code>TC001</code>, <code>TC002</code>, <code>TC003</code>, <code>RUF013</code>, and <code>UP037</code> now add <code>from __future__ import annotations</code> as part of their fixes when the <code>lint.future-annotations</code> setting is enabled. This allows the rules to move more imports into <code>TYPE_CHECKING</code> blocks (<code>TC001</code>, <code>TC002</code>, and <code>TC003</code>), use PEP 604 union syntax on Python versions before 3.10 (<code>RUF013</code>), and unquote more annotations (<code>UP037</code>).</p> </li> <li> <p><strong>Full module paths are now used to verify first-party modules</strong></p> <p>Ruff now checks that the full path to a module exists on disk before categorizing it as a first-party import. This change makes first-party import detection more accurate, helping to avoid false positives on local directories with the same name as a third-party dependency, for example. See the <a href="https://docs.astral.sh/ruff/faq/#how-does-ruff-determine-which-of-my-imports-are-first-party-third-party-etc">FAQ section</a> on import categorization for more details.</p> </li> <li> <p><strong>Deprecated rules must now be selected by exact rule code</strong></p> <p>Ruff will no longer activate deprecated rules selected by their group name or prefix. As noted below, the two remaining deprecated rules were also removed in this release, so this won't affect any current rules, but it will still affect any deprecations in the future.</p> </li> <li> <p><strong>The deprecated macOS configuration directory fallback has been removed</strong></p> <p>Ruff will no longer look for a user-level configuration file at <code>~/Library/Application Support/ruff/ruff.toml</code> on macOS. This feature was deprecated in v0.5 in favor of using the <a href="https://specifications.freedesktop.org/basedir-spec/latest/">XDG specification</a> (usually resolving to <code>~/.config/ruff/ruff.toml</code>), like on Linux. The fallback and accompanying deprecation warning have now been removed.</p> </li> </ul> <h3>Removed Rules</h3> <p>The following rules have been removed:</p> <ul> <li><a href="https://docs.astral.sh/ruff/rules/pandas-df-variable-name"><code>pandas-df-variable-name</code></a> (<code>PD901</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/non-pep604-isinstance"><code>non-pep604-isinstance</code></a> (<code>UP038</code>)</li> </ul> <h3>Stabilization</h3> <p>The following rules have been stabilized and are no longer in preview:</p> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`a1fdd66f10`"><code>a1fdd66</code></a> Bump 0.13.0 (<a href="https://redirect.github.com/astral-sh/ruff/issues/20336">#20336</a>)</li> <li><a href="`8770b95509`"><code>8770b95</code></a> [ty] introduce <code>DivergentType</code> (<a href="https://redirect.github.com/astral-sh/ruff/issues/20312">#20312</a>)</li> <li><a href="`65982a1e14`"><code>65982a1</code></a> [ty] Use 'unknown' specialization for upper bound on Self (<a href="https://redirect.github.com/astral-sh/ruff/issues/20325">#20325</a>)</li> <li><a href="`57d1f7132d`"><code>57d1f71</code></a> [ty] Simplify unions of enum literals and subtypes thereof (<a href="https://redirect.github.com/astral-sh/ruff/issues/20324">#20324</a>)</li> <li><a href="`7a75702237`"><code>7a75702</code></a> Ignore deprecated rules unless selected by exact code (<a href="https://redirect.github.com/astral-sh/ruff/issues/20167">#20167</a>)</li> <li><a href="`9ca632c84f`"><code>9ca632c</code></a> Stabilize adding future import via config option (<a href="https://redirect.github.com/astral-sh/ruff/issues/20277">#20277</a>)</li> <li><a href="`64fe7d30a3`"><code>64fe7d3</code></a> [<code>flake8-errmsg</code>] Stabilize extending <code>raw-string-in-exception</code> (<code>EM101</code>) to ...</li> <li><a href="`beeeb8d5c5`"><code>beeeb8d</code></a> Stabilize the remaining Airflow rules (<a href="https://redirect.github.com/astral-sh/ruff/issues/20250">#20250</a>)</li> <li><a href="`b6fca52855`"><code>b6fca52</code></a> [<code>flake8-bugbear</code>] Stabilize support for non-context-manager calls in `assert...</li> <li><a href="`ac7f882c78`"><code>ac7f882</code></a> [<code>flake8-commas</code>] Stabilize support for trailing comma checks in type paramet...</li> <li>Additional commits viewable in <a href="https://github.com/astral-sh/ruff/compare/0.12.11...0.13.0">compare view</a></li> </ul> </details> <br /> Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore <dependency name> major version` will close this group update PR and stop Dependabot creating any more for the specific dependency's major version (unless you unignore this specific dependency's major version or upgrade to it yourself) - `@dependabot ignore <dependency name> minor version` will close this group update PR and stop Dependabot creating any more for the specific dependency's minor version (unless you unignore this specific dependency's minor version or upgrade to it yourself) - `@dependabot ignore <dependency name>` will close this group update PR and stop Dependabot creating any more for the specific dependency (unless you unignore this specific dependency or upgrade to it yourself) - `@dependabot unignore <dependency name>` will remove all of the ignore conditions of the specified dependency - `@dependabot unignore <dependency name> <ignore condition>` will remove the ignore condition of the specified dependency and ignore conditions </details> --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Nicholas Tindle <ntindle@users.noreply.github.com>	2025-10-02 20:57:18 +00:00
dependabot[bot]	f5ee579ab2	chore(backend/deps): Bump firecrawl-py from 2.16.3 to 4.3.1 in /autogpt_platform/backend (#10809 ) Bumps [firecrawl-py](https://github.com/firecrawl/firecrawl) from 2.16.3 to 4.3.1. <details> <summary>Commits</summary> <ul> <li>See full diff in <a href="https://github.com/firecrawl/firecrawl/commits">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=firecrawl-py&package-manager=pip&previous-version=2.16.3&new-version=4.3.1)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) You can trigger a rebase of this PR by commenting `@dependabot rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) </details> <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Upgrade firecrawl-py to v4.3.6 and refactor firecrawl blocks to new v4 API, formats handling, method names, and response fields. > > - Dependencies > - Bump `firecrawl-py` from `2.16.3` to `4.3.6` (adds `httpx`, updates `pydantic>=2`). > - Firecrawl API migration > - Centralize `ScrapeFormat` in `backend/blocks/firecrawl/_api.py`. > - Add `_format_utils.convert_to_format_options` to map `ScrapeFormat` (incl. `screenshot@fullPage`) to v4 `FormatOption`/`ScreenshotFormat`. > - Switch to v4 types (`firecrawl.v2.types.ScrapeOptions`); adopt snake_case fields (`only_main_content`, `max_age`, `wait_for`). > - Rename methods: `crawl_url` → `crawl`, `scrape_url` → `scrape`, `map_url` → `map`. > - Normalize response attributes: `rawHtml` → `raw_html`, `changeTracking` → `change_tracking`. > - Blocks > - `crawl.py`, `scrape.py`, `search.py`: use new formats conversion and updated options/fields; adjust iteration over results (`search`: iterate `web` when present). > - `map.py`: return both `links` and detailed `results` (url/title/description) and update output schema accordingly. > - Project files > - Update `pyproject.toml` and `poetry.lock` for new dependency versions. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit `d872f2e82b`. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY --> > Note > Automatic rebases have been disabled on this pull request as it has been open for over 30 days. --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Nicholas Tindle <ntindle@users.noreply.github.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co>	2025-10-02 20:14:18 +00:00
Zamil Majdy	57a06f7088	fix(blocks, security): Fixes for various DoS vulnerabilities (#10798 ) This PR addresses multiple critical and medium security vulnerabilities that could lead to Denial of Service (DoS) attacks. All fixes implement defense-in-depth strategies with comprehensive testing. ### Changes 🏗️ #### Critical Security Fixes: 1. GHSA-m2wr-7m3r-p52c - ReDoS in CodeExtractionBlock - Fixed catastrophic backtracking in regex patterns `\s+[\s\S]?` and `\s+(.?)` - Replaced with safer patterns: `[ \t]\n([^\s\S]?)` - Files: `backend/blocks/code_extraction_block.py` 2. GHSA-955p-gpfx-r66j - AITextSummarizerBlock Memory Amplification - Added 1MB text size limit and 100 chunk maximum - Prevents 10K input → 50G memory amplification attacks - Files: `backend/blocks/llm.py` 3. GHSA-5cqw-g779-9f9x - RSS Feed XML Bomb DoS - Added 10MB feed size limit and 30s timeout - Prevents deep XML parsing memory exhaustion - Files: `backend/blocks/rss.py` 4. GHSA-7g34-7fvq-xxq6 - File Storage Disk Exhaustion - Added 100MB per file and 1GB per execution directory limits - Prevents disk space exhaustion from file uploads - Files: `backend/util/file.py` 5. GHSA-pppq-xx2w-7jpq - ExtractTextInformationBlock ReDoS - Added 1MB text limit, 1000 match limit, and 5s timeout protection - Prevents lookahead pattern memory exhaustion - Files: `backend/blocks/text.py` 6. GHSA-vw3v-whvp-33v5 - Docker Logging Disk Exhaustion - Added log rotation limits at Docker (10MB × 3 files) and application levels - Prevents unbounded log growth causing disk exhaustion - Files: `docker-compose.platform.yml`, `autogpt_libs/autogpt_libs/logging/config.py` #### Additional Security Improvements: 7. StepThroughItemsBlock DoS Prevention - Added 10,000 item limit and 1MB input size limit - Prevents large iteration DoS attacks - Files: `backend/blocks/iteration.py` 8. XMLParserBlock XML Bomb Prevention - Added 10MB XML input size limit - Files: `backend/blocks/xml_parser.py` #### Code Quality: - Fixed Python 3.10 typing compatibility issues - Added comprehensive security test suite - All code formatted and linted ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Created comprehensive security test suite covering all vulnerabilities - [x] Verified ReDoS patterns are fixed and don't cause timeouts - [x] Confirmed memory limits prevent amplification attacks - [x] Tested file size limits prevent disk exhaustion - [x] Validated log rotation prevents unbounded growth - [x] Ensured backward compatibility for normal usage #### For configuration changes: - [x] `docker-compose.yml` is updated with logging limits - [x] I have included a list of my configuration changes in the PR description (under Changes) ### Test Plan 🧪 Security Tests: 1. ReDoS Protection: Tested with malicious regex inputs (large spaces) - completes without hanging 2. Memory Limits: Verified 2MB text input gets truncated to 1MB, chunk limits enforced 3. File Size Limits: Confirmed 200MB files rejected, directory size limits enforced 4. Iteration Limits: Tested 20K item arrays rejected, large JSON strings rejected 5. Timeout Protection: Dangerous regex patterns timeout after 5s instead of hanging Compatibility Tests: - Normal functionality preserved for all blocks - Existing tests pass with new security limits - Performance impact minimal for typical usage ### Security Impact 🛡️ Before: Multiple attack vectors could cause: - CPU exhaustion (ReDoS attacks) - Memory exhaustion (amplification attacks) - Disk exhaustion (file/log bombs) - Service unavailability After: All attack vectors mitigated with: - Input validation and size limits - Timeout protections - Resource quotas - Defense-in-depth approach All fixes maintain backward compatibility while preventing DoS attacks. 🤖 Generated with [Claude Code](https://claude.ai/code) <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Adds robust DoS protections across blocks (regex, memory, iteration, XML/RSS, file I/O) and enables app/Docker log rotation with comprehensive tests. > > - Security hardening: > - Replace unsafe regex in `backend/blocks/code_extraction_block.py` to prevent ReDoS; add safer extraction/removal patterns. > - Constrain LLM summarizer chunking in `backend/blocks/llm.py` (1MB cap, chunk/overlap validation, chunk count limit). > - Limit RSS fetching in `backend/blocks/rss.py` (scheme validation, 10MB cap, timeout, bounded read) and return empty on failure. > - Impose XML size limit (10MB) in `backend/blocks/xml_parser.py`. > - Add file upload/download limits in `backend/util/file.py` (100MB/file, 1GB dir quota) and enforce scanning before write. > - Enable rotating file logs in `autogpt_libs/logging/config.py` (size + backups) and Docker json-file log rotation in `docker-compose.platform.yml`. > - Iteration block: > - Add item count/string size limits; fix yielded key for dicts; cap iterations in `backend/blocks/iteration.py`. > - Tests: > - New `backend/blocks/test/test_security_fixes.py` covering ReDoS, timeouts, memory/size and iteration limits, XML/file constraints. > - Misc: > - Typing fallback for `NotRequired` in `activity_status_generator.py`. > - Dependency updates in `backend/poetry.lock`. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit `500e1578b1`. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY --> --------- Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co> Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Nicholas Tindle <ntindle@users.noreply.github.com> Co-authored-by: Zamil Majdy <majdyz@users.noreply.github.com> Co-authored-by: Reinier van der Leer <Pwuts@users.noreply.github.com> Co-authored-by: Reinier van der Leer <pwuts@agpt.co>	2025-10-02 12:55:55 +00:00
Zamil Majdy	258bf0b1a5	fix(backend): improve activity status generation accuracy and handle missing blocks gracefully (#11039 ) ## Summary Fix critical issues where activity status generator incorrectly reported failed executions as successful, and enhance AI evaluation logic to be more accurate about actual task accomplishment. ## Changes Made ### 1. Missing Block Handling (`backend/data/graph.py`) - Replace ValueError with graceful degradation: When blocks are deleted/missing, return `_UnknownBlock` placeholder instead of crashing - Comprehensive interface implementation: `_UnknownBlock` implements all expected Block methods to prevent type errors - Warning logging: Log missing blocks for debugging without breaking execution flow - Removed unnecessary caching: Direct constructor calls instead of cached wrapper functions ### 2. Enhanced Activity Status AI Evaluation (`backend/executor/activity_status_generator.py`) #### Intention-Based Success Evaluation - Graph description analysis: AI now reads graph description FIRST to understand intended purpose - Purpose-driven evaluation: Success is measured against what the graph was designed to accomplish - Critical output analysis: Enhanced detection of missing outputs from key blocks (Output, Post, Create, Send, Publish, Generate) - Sub-agent failure detection: Better identification when AgentExecutorBlock produces no outputs #### Improved Prompting - Intent-specific examples: 'blog writing' → check for blog content, 'email automation' → check for sent emails - Primary evaluation criteria: 'Did this execution accomplish what the graph was designed to do?' - Enhanced checklist: 7-point analysis including graph description matching - Technical vs. goal completion: Distinguish between workflow steps completing vs. actual user goals achieved #### Removed Database Error Handling - Eliminated try-catch blocks: No longer needed around `get_graph_metadata` and `get_graph` calls - Direct database calls: Simplified error handling after fixing missing block root cause - Cleaner code flow: More predictable execution path without redundant error handling ## Problem Solved - False success reports: AI previously marked executions as 'successful' when critical output blocks produced no results - Missing block crashes: System would fail when trying to analyze executions with deleted/missing blocks - Intent-blind evaluation: AI evaluated technical completion instead of actual goal achievement - Database service errors: 500 errors when missing blocks caused graph loading failures ## Business Impact - More accurate user feedback: Users get honest assessment of whether their automations actually worked - Better task completion detection: Clear distinction between 'workflow completed' vs. 'goal achieved' - Improved reliability: System handles edge cases gracefully without crashing - Enhanced user trust: Truthful reporting builds confidence in the platform ## Testing - ✅ Tested with problematic executions that previously showed false successes - ✅ Confirmed missing block handling works without warnings - ✅ Verified enhanced prompt correctly identifies failures - ✅ Database calls work without try-catch protection ## Example Before/After Before (False Success): ``` Graph: "Automated SEO Blog Writer" Status: "✅ I successfully completed your blog writing task!" Reality: No blog content was actually created (critical output blocks had no outputs) ``` After (Accurate Failure Detection): ``` Graph: "Automated SEO Blog Writer" Status: "❌ The task failed because the blog post creation step didn't produce any output." Reality: Correctly identifies that the intended blog writing goal was not achieved ``` ## Files Modified - `backend/data/graph.py`: Missing block graceful handling with complete interface - `backend/executor/activity_status_generator.py`: Enhanced AI evaluation with intention-based analysis ## Type of Change - [x] Bug fix (non-breaking change which fixes an issue) - [x] New feature (non-breaking change which adds functionality) - [ ] Breaking change (fix or feature that would cause existing functionality to not work as expected) - [ ] This change requires a documentation update ## Checklist - [x] My code follows the style guidelines of this project - [x] I have performed a self-review of my own code - [x] I have commented my code, particularly in hard-to-understand areas - [x] I have made corresponding changes to the documentation - [x] My changes generate no new warnings - [x] I have added tests that prove my fix is effective or that my feature works - [x] New and existing unit tests pass locally with my changes - [x] Any dependent changes have been merged and published in downstream modules --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-02 12:28:57 +00:00
Ubbe	4a1cb6d64b	fix(frontend): performance and layout issues (#11036 ) ## Changes 🏗️ ### Performance (Onboarding) 🐎 - Moved non-UI logic into `providers/onboarding/helpers.ts` to reduce provider complexity. - Memoized provider value and narrowed state updates to cut unnecessary re-renders. - Deferred non-critical effects until after mount to lower initial JS work. Result: faster initial render and smoother onboarding flows under load. ### Layout and overflow fixes 📐 - Replaced `w-screen` with `w-full` in platform/admin/profile layouts and marketplace wrappers to avoid 100vw scrollbar overflow. - Adjusted mobile navbar position (`right-0` instead of `-right-4`) to prevent off-viewport elements. Result: removed horizontal scrolling on Marketplace, Library, and Settings pages; Build remains unaffected. ### New Generic Error pages - Standardized global error handling in `app/global-error.tsx` for consistent display and user feedback. - Added platform-scoped error page(s) under `app/(platform)/error` for route-level failures with a consistent layout. - Improved retry affordances using existing `ErrorCard`. ## Checklist 📋 ### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Verify onboarding flows render faster and re-render less (DevTools flamegraph) - [x] Confirm no horizontal scrolling on Marketplace, Library, Settings at common widths - [x] Validate mobile navbar stays within viewport - [x] Trigger errors to confirm global and platform error pages render consistently ### For configuration changes: None	2025-10-02 10:21:31 +00:00
Copilot	7c9db7419a	fix(frontend): Display run cost correctly - convert cents to dollars run detail components (#10997 ) Fixed costs being displayed as raw cent values instead of properly formatted dollar amounts in the frontend monitoring and agent run detail pages. ## Problem The platform was showing costs incorrectly in two key areas: - Monitoring page: Total cost displayed as raw cents with incorrect "seconds" unit (e.g., "Total cost: 150 seconds") - Agent run details: Individual run costs displayed as raw cents (e.g., "Cost: $150" for what should be $1.50) ## Solution Updated the affected components to properly convert cents to dollars with consistent formatting: FlowRunsStatus.tsx - Fixed total cost calculation and display: ```tsx // Before {filteredFlowRuns.reduce((total, run) => total + (run.stats?.cost ?? 0), 0)} seconds // After ${(filteredFlowRuns.reduce((total, run) => total + (run.stats?.cost ?? 0), 0) / 100).toFixed(2)} ``` RunDetailHeader.tsx - Fixed individual run cost display: ```tsx // Before Cost: ${run.stats.cost} // After Cost: ${(run.stats.cost / 100).toFixed(2)} ``` ## Validation - Backend correctly stores costs in cents (verified in models and database schemas) - Email notification templates already handle the conversion properly using `(credits_used\|float)/100` - Other components use the existing `formatCredits()` utility which correctly converts cents to dollars - No security vulnerabilities introduced (CodeQL verification passed) - All linting and formatting checks pass The fix ensures users now see accurate dollar amounts (e.g., $1.50 instead of $150 or "150 seconds") across the platform's cost reporting interfaces. ![Cost Display Fix Demo](https://github.com/user-attachments/assets/13c75a1d-7c78-4c11-9293-3dcf4c443097) > [!WARNING] > > <details> > <summary>Firewall rules blocked me from connecting to one or more addresses (expand for details)</summary> > > #### I tried to connect to the following addresses, but was blocked by firewall rules: > > - `checkpoint.prisma.io` > - Triggering command: `/usr/bin/node /root/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/child {"product":"prisma","version":"5.17.0","cli_install_type":"local","information":"","local_timestamp":"2025-09-25T21:41:17Z","project_hash":"a5170f80","cli_path":"/root/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/index.js","cli_path_hash":"40bbdaf9","endpoint":"REDACTED","disable":false,"arch":"x64","os":"linux","node_version":"v20.19.5","ci":false,"ci_name":"","command":"generate","schema_providers":["postgresql"],"schema_preview_features":[],"schema_generators_providers":["prisma-client-py"],"cache_file":"/root/.cache/checkpoint-nodejs/prisma-40bbdaf9","cache_duration":43200000,"remind_duration":172800000,"force":false,"timeout":5000,"unref":true,"child_path":"/root/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/child","client_event_id":"","previous_client_event_id":"","check_if_update_available":false}` (dns block) > - Triggering command: `/usr/bin/node /root/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/child {"product":"prisma","version":"5.17.0","cli_install_type":"local","information":"","local_timestamp":"2025-09-25T21:41:19Z","project_hash":"a5170f80","cli_path":"/root/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/index.js","cli_path_hash":"40bbdaf9","endpoint":"REDACTED","disable":false,"arch":"x64","os":"linux","node_version":"v20.19.5","ci":false,"ci_name":"","command":"migrate deploy","schema_providers":["postgresql"],"schema_preview_features":[],"schema_generators_providers":["prisma-client-py"],"cache_file":"/root/.cache/checkpoint-nodejs/prisma-40bbdaf9","cache_duration":43200000,"remind_duration":172800000,"force":false,"timeout":5000,"unref":true,"child_path":"/root/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/child","client_event_id":"","previous_client_event_id":"","check_if_update_available":false}` (dns block) > - Triggering command: `/opt/hostedtoolcache/node/21.7.3/x64/bin/node /home/REDACTED/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/child {"product":"prisma","version":"5.17.0","cli_install_type":"local","information":"","local_timestamp":"2025-09-25T21:44:58Z","project_hash":"c6190a20","cli_path":"/home/REDACTED/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/index.js","cli_path_hash":"8d85b642","endpoint":"REDACTED","disable":false,"arch":"x64","os":"linux","node_version":"v21.7.3","ci":true,"ci_name":"GitHub Actions","command":"generate","schema_providers":["postgresql"],"schema_preview_features":[],"schema_generators_providers":["prisma-client-py"],"cache_file":"/home/REDACTED/.cache/checkpoint-nodejs/prisma-8d85b642","cache_duration":43200000,"remind_duration":172800000,"force":false,"timeout":5000,"unref":true,"child_path":"/home/REDACTED/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/child","client_event_id":"","previous_client_event_id":"","check_if_update_available":false}` (dns block) > - `fonts.googleapis.com` > - Triggering command: `node /home/REDACTED/work/AutoGPT/AutoGPT/autogpt_platform/frontend/node_modules/.bin/../next/dist/bin/next build` (dns block) > - `https://api.github.com/repos/Significant-Gravitas/Significant-Gravitas%2FAutoGPT/languages` > - Triggering command: `/home/REDACTED/work/_temp/ghcca-node/node/bin/node --enable-source-maps /home/REDACTED/work/_temp/copilot-developer-action-main/dist/index.js` (http block) > - `o1.ingest.sentry.io` > - Triggering command: `node /home/REDACTED/work/AutoGPT/AutoGPT/autogpt_platform/frontend/node_modules/.bin/../next/dist/bin/next build` (dns block) > > If you need me to access, download, or install something from one of these locations, you can either: > > - Configure [Actions setup steps](https://gh.io/copilot/actions-setup-steps) to set up my environment, which run before the firewall is enabled > - Add the appropriate URLs or hosts to the custom allowlist in this repository's [Copilot coding agent settings](https://github.com/Significant-Gravitas/AutoGPT/settings/copilot/coding_agent) (admins only) > > </details> <!-- START COPILOT CODING AGENT SUFFIX --> <details> <summary>Original prompt</summary> > > ---- > > This section details on the original issue you should resolve > > <issue_title>Costs are being shown as dollars rather than cents based on the new runs page</issue_title> > <issue_description></issue_description> > > ## Comments on the Issue (you are @copilot in this section) > > <comments> > </comments> > </details> Fixes Significant-Gravitas/AutoGPT#10886 <!-- START COPILOT CODING AGENT TIPS --> --- 💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more [Copilot coding agent tips](https://gh.io/copilot-coding-agent-tips) in the docs. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: ntindle <8845353+ntindle@users.noreply.github.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co>	2025-10-02 10:08:03 +00:00
Krzysztof Czerwinski	18bbd8e572	fix(frontend): Fix confetti (#11031 ) ### Changes 🏗️ - Fix not being able to complete `MARKETPLACE_RUN_AGENT` task - Fix confetti shooting on every refresh - Fix confetti shooting from top-left corner ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Bugs eradicated autogpt-platform-beta-v0.6.31	2025-10-02 03:19:25 +00:00
Zamil Majdy	047f011520	fix(platform): resolve authentication performance bottlenecks and improve reliability (#11028 ) ## Summary Fix critical authentication performance bottlenecks causing infinite loading during login and malformed redirect URL handling. ## Root Cause Analysis - OnboardingProvider was running expensive `isOnboardingEnabled()` database queries on every route for all users - Timezone detection was calling backend APIs during authentication flow instead of only during onboarding - Malformed redirect URLs like `/marketplace,%20/marketplace` causing authentication callback failures - Arbitrary setTimeout creating race conditions instead of proper authentication state management ## Changes Made ### 1. Backend: Cache Expensive Onboarding Queries (`backend/data/onboarding.py`) - Add `@cached(maxsize=1, ttl_seconds=300)` decorator to `onboarding_enabled()` - Cache expensive database queries for 5 minutes to prevent repeated execution during auth - Optimize query with `take=MIN_AGENT_COUNT + 1` to stop counting early - Fix typo: "Onboading" → "Onboarding" ### 2. Frontend: Optimize OnboardingProvider (`providers/onboarding/onboarding-provider.tsx`) - Route-based optimization: Only call `isOnboardingEnabled()` when user is actually on `/onboarding/` routes - Preserve functionality: Still fetch `getUserOnboarding()` for step completion tracking on all routes - Smart redirects: Only handle onboarding completion redirects when on onboarding routes - Performance improvement: Eliminates expensive database calls for 95% of page loads ### 3. Frontend: Fix Timezone Detection Race Conditions (`hooks/useOnboardingTimezoneDetection.ts`) - Remove setTimeout hack: Replace arbitrary 1000ms timeout with proper authentication state checks - Add route filtering: Only run timezone detection on `/onboarding/` routes using `pathname.startsWith()` - Proper auth dependencies: Use `useSupabase()` hook to wait for `user` and `!isUserLoading` - Fire-and-forget updates: Change from `mutateAsync()` to `mutate()` to prevent blocking UI ### 4. Frontend: Apply Fire-and-Forget Pattern (`hooks/useTimezoneDetection.ts`) - Change timezone auto-detection from `mutateAsync()` to `mutate()` - Prevents blocking user interactions during background timezone updates - API still executes successfully, user doesn't wait for response ### 5. Frontend: Enhanced URL Validation (`auth/callback/route.ts`) - Add malformed URL detection: Check for commas and spaces in redirect URLs - Constants: Use `DEFAULT_REDIRECT_PATH = "/marketplace"` instead of hardcoded strings - Better error handling: Try-catch with fallback to safe default path - Path depth limits: Reject suspiciously deep URLs (>5 segments) - Race condition mitigation: Default to `/marketplace` for corrupted URLs with warning logs ## Technical Implementation ### Performance Optimizations - Database caching: 5-minute cache prevents repeated expensive onboarding queries - Route-aware logic: Heavy operations only run where needed (`/onboarding/` routes) - Non-blocking updates: Timezone updates don't block authentication flow - Proper state management: Wait for actual authentication instead of arbitrary delays ### Authentication Flow Improvements - Eliminate race conditions: No more setTimeout guessing - wait for proper auth state - Faster auth: Remove blocking timezone API calls during login flow - Better UX: Handle malformed URLs gracefully instead of failing ## Files Changed - `backend/data/onboarding.py` - Add caching to expensive queries - `providers/onboarding/onboarding-provider.tsx` - Route-based optimization - `hooks/useOnboardingTimezoneDetection.ts` - Proper auth state + route filtering + fire-and-forget - `hooks/useTimezoneDetection.ts` - Fire-and-forget pattern - `auth/callback/route.ts` - Enhanced URL validation ## Impact - Eliminates infinite loading* during authentication flow - Improves auth response times from 5-11+ seconds to sub-second - Prevents malformed redirect URLs that confused users - Reduces database load through intelligent caching - Maintains all existing functionality with better performance - Eliminates race conditions from arbitrary timeouts ## Validation - ✅ All pre-commit hooks pass (format, lint, typecheck) - ✅ No breaking changes to existing functionality - ✅ Backward compatible with all onboarding flows - ✅ Enhanced error logging and graceful fallbacks 🤖 Generated with [Claude Code](https://claude.ai/code) --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-02 01:26:49 +00:00
Reinier van der Leer	d11917eb10	feat(blocks): Improve data output of code execution block (#11017 ) - Resolves #11016 ### Changes 🏗️ - Add more extensive outputs to Code Execution Block - Rename "Response" output to "Main Text Output" ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Object outputs can be accessed now	2025-10-01 10:38:04 +00:00
Copilot	4663066e65	feat(blocks): Implement AI Condition Block for natural language condition evaluation (#10996 ) This PR implements the AI Condition Block as requested in issue AUTOMAT-60. The new block enables users to define conditional logic using natural language descriptions instead of traditional comparison operators, while maintaining the same yes/no data pass-through functionality as the existing ConditionBlock. ## Overview The AI Condition Block uses Large Language Models to evaluate conditions written in plain English, such as: - "the input is the body of an email" - "the input is a City in the USA" - "the input is an error or a refusal" ## Key Features Natural Language Processing: Users can express complex conditions in everyday English rather than programming logic, making agent workflows more intuitive and accessible. Consistent Interface: Maintains the same input/output schema as the standard ConditionBlock: - Boolean `result` output indicating condition evaluation - `yes_output` and `no_output` for conditional data flow - Optional custom values for yes/no cases Robust Error Handling: Defaults to `false` on AI evaluation failures to ensure safe operation and prevent workflow interruption. Performance Optimized: Uses minimal token limits (10 tokens) for true/false responses to reduce latency and API costs. ## Implementation Details The block is implemented as `AIConditionBlock` in `backend/blocks/ai_condition.py` and inherits from `AIBlockBase` following established platform patterns. It includes: - Proper LLM integration with credential management - Token usage tracking and statistics - Comprehensive test mocking for reliable CI/CD - Full documentation with examples and use cases ## Use Cases This block enables more sophisticated conditional logic for: - Content Classification: Automatically categorize text, emails, or documents - Data Validation: Validate inputs using natural language rules - Smart Routing: Route data based on AI-evaluated conditions - Error Detection: Identify and handle error messages or problematic inputs - Quality Control: Check content against flexible quality standards ## Testing The implementation includes comprehensive testing that integrates with the existing platform test suite. All tests pass, including: - Unit tests with proper LLM response mocking - Code quality checks (linting, formatting, type checking) - Security analysis via CodeQL - Integration testing to ensure proper block discovery and loading The block is automatically discovered by the platform's block loading system and is immediately available for use in agent workflows. ## PR Checklist - [x] Have you listed your changes in the description? - Added new `AIConditionBlock` in `backend/blocks/ai_condition.py` - Added comprehensive documentation in `docs/content/platform/blocks/ai_condition.md` - Implemented natural language condition evaluation using LLMs - [x] Have you included a test plan? - Unit tests with mocked LLM responses - Integration tests for block discovery and loading - Error handling validation - Token usage tracking verification - [x] Have you tested your changes according to the test plan? - All existing tests pass - Linting and formatting checks pass - Type checking passes - Security analysis via CodeQL passes - Fixed `json_format` parameter to `force_json_output` per recent API changes > [!WARNING] > > <details> > <summary>Firewall rules blocked me from connecting to one or more addresses (expand for details)</summary> > > #### I tried to connect to the following addresses, but was blocked by firewall rules: > > - `api.openai.com` > - Triggering command: `/home/REDACTED/.cache/pypoetry/virtualenvs/autogpt-platform-backend-Ajv4iu2i-py3.11/bin/python /home/REDACTED/.cache/pypoetry/virtualenvs/autogpt-platform-backend-Ajv4iu2i-py3.11/bin/pytest backend/blocks/test/test_block.py::test_available_blocks -k AIConditionBlock -v` (dns block) > - `https://api.github.com/repos/Significant-Gravitas/Significant-Gravitas%2FAutoGPT/languages` > - Triggering command: `/home/REDACTED/work/_temp/ghcca-node/node/bin/node --enable-source-maps /home/REDACTED/work/_temp/copilot-developer-action-main/dist/index.js` (http block) > > If you need me to access, download, or install something from one of these locations, you can either: > > - Configure [Actions setup steps](https://gh.io/copilot/actions-setup-steps) to set up my environment, which run before the firewall is enabled > - Add the appropriate URLs or hosts to the custom allowlist in this repository's [Copilot coding agent settings](https://github.com/Significant-Gravitas/AutoGPT/settings/copilot/coding_agent) (admins only) > > </details> <!-- START COPILOT CODING AGENT SUFFIX --> <details> <summary>Original prompt</summary> > Issue Title: AI Condition Block > Issue Description: A version of the condition/if block that uses an AI powered condition. > > It should have the same yes/no data pass throughs, as well as outputting a result Boolean. > > The condition is plaintext English, provided by the user, and could be anything. > > e.g > If `[the input] is the body of an email` > If `[the input] is a City in the USA` > If `[the input] is an error or a refusal` > Fixes https://linear.app/autogpt/issue/AUTOMAT-60/ai-condition-block > > > Comment by User 4bcbb358-1758-43e4-abef-a0a42b63442f: > 📋 I need a repo label on this issue to determine which GitHub repository to work in. > > Please add a repo label to this issue with the format `owner/repository-name` (e.g., `github/copilot`), then I'll automatically start working on it! > > Comment by User : > This thread is for an agent session with githubcopilotcodingagent. > > </details> <!-- START COPILOT CODING AGENT TIPS --> --- ✨ Let Copilot coding agent [set things up for you](https://github.com/Significant-Gravitas/AutoGPT/issues/new?title=✨+Set+up+Copilot+instructions&body=Configure%20instructions%20for%20this%20repository%20as%20documented%20in%20%5BBest%20practices%20for%20Copilot%20coding%20agent%20in%20your%20repository%5D%28https://gh.io/copilot-coding-agent-tips%29%2E%0A%0A%3COnboard%20this%20repo%3E&assignees=copilot) — coding agent works faster and does higher quality work when set up for your repo. <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Introduces `AIConditionBlock` that uses an LLM to evaluate natural-language conditions and outputs boolean result with yes/no pass-through, plus accompanying documentation. > > - Backend: > - New block: `backend/blocks/ai_condition.py` > - Evaluates natural-language conditions via `llm_call` using selectable `LlmModel` and credentials. > - Parses strict true/false responses (with fallback token matching), yields `result`, `yes_output`/`no_output`, and `error` on ambiguity/failure. > - Tracks token usage via `NodeExecutionStats`; includes test inputs/mocks and `force_json_output=False`. > - Docs: > - Adds `docs/content/platform/blocks/ai_condition.md` with usage, inputs/outputs, examples, and considerations. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit `06e9586bd3`. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY --> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: ntindle <8845353+ntindle@users.noreply.github.com> Co-authored-by: Nicholas Tindle <nicktindle@outlook.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co> Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Nicholas Tindle <ntindle@users.noreply.github.com>	2025-10-01 05:02:57 +00:00
Krzysztof Czerwinski	48a0faa611	feat(frontend): Restore onboarding steps (#11027 ) Wallet update removed `BUILDER_OPEN` and `BUILDER_RUN_AGENT`. ### Changes 🏗️ - Restore completion codepaths for `BUILDER_OPEN` and `BUILDER_RUN_AGENT` for analytical purposes ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Tasks are completed silently	2025-10-01 04:53:51 +00:00
Nicholas Tindle	70d00b4104	fix(ci): Delete pr_reviewer section in .pr_agent.toml (#11024 ) Remove pr_reviewer section from configuration <!-- Clearly explain the need for these changes: --> ### Changes 🏗️ removes the out of config status section <!-- Concisely describe all of the changes made in this pull request: --> ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: <!-- Put your test plan here: --> - [x] validated by global config	2025-10-01 03:01:24 +00:00
Nicholas Tindle	aad0434cb2	feat(frontend): Enhance Sentry integration and TallyPopup telemetry (#10862 ) Added Sentry captureConsoleIntegration and extraErrorDataIntegration to client, edge, and server configs. Improved replay integration with unmasking support. Updated TallyPopup to collect and expose Sentry replay data, user agent, and page URL for enhanced telemetry and debugging. Improved event handling and error logging for Tally events. Marked CustomNode title for Sentry unmasking.<!-- Clearly explain the need for these changes: --> ### Changes 🏗️ Reconfigure sentry Pass the id with sentry replay to tally alongside prefilling email, and passing non user identifying attributes like platform url, full url, and is authenticated. <!-- Concisely describe all of the changes made in this pull request: --> ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: <!-- Put your test plan here: --> - [x] Test the results show up in sentry - [x] Test the url works in tally	2025-10-01 03:00:20 +00:00
Krzysztof Czerwinski	f33ec1f2ec	feat(platform): New retention-focused tasks and wallet update (#10977 ) ### Changes 🏗️ - Rename wallet and update design - Update tasks and add Hidden Tasks section - Update onboarding backend code and related db migration - Add progress bar for some tasks ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] All tasks can be finished - [x] Finished tasks add correct amount of credits --------- Co-authored-by: Zamil Majdy <zamil.majdy@agpt.co> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co> Co-authored-by: Reinier van der Leer <pwuts@agpt.co>	2025-10-01 01:29:30 +00:00
dependabot[bot]	e68b873bcf	chore(frontend/deps): Bump @faker-js/faker from 9.9.0 to 10.0.0 in /autogpt_platform/frontend (#10806 ) Bumps [@faker-js/faker](https://github.com/faker-js/faker) from 9.9.0 to 10.0.0. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/faker-js/faker/releases"><code>@faker-js/faker</code>'s releases</a>.</em></p> <blockquote> <h2>v10.0.0</h2> <h2>New & Noteworthy</h2> <ul> <li>esm only (for cjs support look into migration guide, we got you covered 😉)</li> <li>remove v9 deprecations</li> <li>change default error strategy to 'fail' in word module</li> <li>remove invalid credit card issuer patterns</li> <li>see our <a href="https://v10.fakerjs.dev/guide/upgrading.html">migration guide</a></li> </ul> <h2>What's Changed</h2> <ul> <li>ci: use node 24 by <a href="https://github.com/Shinigami92"><code>@Shinigami92</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3543">faker-js/faker#3543</a></li> <li>infra: stop using node 18 by <a href="https://github.com/Shinigami92"><code>@Shinigami92</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3536">faker-js/faker#3536</a></li> <li>infra: use import.meta.dirname by <a href="https://github.com/Shinigami92"><code>@Shinigami92</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3542">faker-js/faker#3542</a></li> <li>chore(deps): update devdependencies (major) by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3512">faker-js/faker#3512</a></li> <li>chore(deps): update eslint by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3555">faker-js/faker#3555</a></li> <li>chore(deps): update dependency <code>@vitest/eslint-plugin</code> to v1.3.4 by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3554">faker-js/faker#3554</a></li> <li>chore(deps): update devdependencies by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3556">faker-js/faker#3556</a></li> <li>chore(deps): lock file maintenance by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3557">faker-js/faker#3557</a></li> <li>feat!: esm only by <a href="https://github.com/Shinigami92"><code>@Shinigami92</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3540">faker-js/faker#3540</a></li> <li>refactor!: remove deprecations by <a href="https://github.com/Shinigami92"><code>@Shinigami92</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3553">faker-js/faker#3553</a></li> <li>docs: migration guide for v10 by <a href="https://github.com/matthewmayer"><code>@matthewmayer</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3559">faker-js/faker#3559</a></li> <li>infra: more precise engines field by <a href="https://github.com/matthewmayer"><code>@matthewmayer</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3561">faker-js/faker#3561</a></li> <li>refactor(word)!: change default error strategy to 'fail' by <a href="https://github.com/xDivisionByZerox"><code>@xDivisionByZerox</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3560">faker-js/faker#3560</a></li> <li>chore(release): 10.0.0-beta.0 by <a href="https://github.com/fakerjs-bot"><code>@fakerjs-bot</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3565">faker-js/faker#3565</a></li> <li>docs: Minor improvements to migration guide by <a href="https://github.com/matthewmayer"><code>@matthewmayer</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3569">faker-js/faker#3569</a></li> <li>chore(deps): update pnpm to v10.13.1 by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3570">faker-js/faker#3570</a></li> <li>chore(deps): update devdependencies by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3571">faker-js/faker#3571</a></li> <li>chore(deps): update eslint by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3572">faker-js/faker#3572</a></li> <li>chore(deps): lock file maintenance by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3562">faker-js/faker#3562</a></li> <li>chore(deps): update dependency typescript to v5.9.2 by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3576">faker-js/faker#3576</a></li> <li>chore(deps): update pnpm to v10.14.0 by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3579">faker-js/faker#3579</a></li> <li>chore(deps): update mcr.microsoft.com/devcontainers/typescript-node:22 docker digest to 2baa40a by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3575">faker-js/faker#3575</a></li> <li>chore(deps): update devdependencies by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3577">faker-js/faker#3577</a></li> <li>chore(deps): update eslint (major) by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3580">faker-js/faker#3580</a></li> <li>chore(deps): update eslint by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3578">faker-js/faker#3578</a></li> <li>feat(locale): extended list of colors in Polish by <a href="https://github.com/pkuczynski"><code>@pkuczynski</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3586">faker-js/faker#3586</a></li> <li>refactor(locale): remove invalid credit card issuer patterns by <a href="https://github.com/xDivisionByZerox"><code>@xDivisionByZerox</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3568">faker-js/faker#3568</a></li> <li>docs: update migration guide with findings from playground update by <a href="https://github.com/xDivisionByZerox"><code>@xDivisionByZerox</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3587">faker-js/faker#3587</a></li> <li>chore: fix typo in test by <a href="https://github.com/noritaka1166"><code>@noritaka1166</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3591">faker-js/faker#3591</a></li> <li>chore(deps): update all non-major dependencies by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3596">faker-js/faker#3596</a></li> <li>chore(deps): update amannn/action-semantic-pull-request action to v6 by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3598">faker-js/faker#3598</a></li> <li>chore(deps): update devdependencies by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3599">faker-js/faker#3599</a></li> <li>chore(deps): update actions/checkout action to v5 by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3597">faker-js/faker#3597</a></li> <li>chore(deps): update dependency cypress to v15 by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3603">faker-js/faker#3603</a></li> <li>chore(deps): update dependency vitepress to v1.6.4 by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3601">faker-js/faker#3601</a></li> <li>chore(deps): pin dependency node to 24.6.0 by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3600">faker-js/faker#3600</a></li> <li>chore(deps): update dependency typescript-eslint to v8.40.0 by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3602">faker-js/faker#3602</a></li> <li>chore(deps): update dependency eslint-plugin-jsdoc to v54 by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3604">faker-js/faker#3604</a></li> <li>chore(deps): lock file maintenance by <a href="https://github.com/renovate"><code>@renovate</code></a>[bot] in <a href="https://redirect.github.com/faker-js/faker/pull/3584">faker-js/faker#3584</a></li> <li>chore(release): 10.0.0 by <a href="https://github.com/fakerjs-bot"><code>@fakerjs-bot</code></a> in <a href="https://redirect.github.com/faker-js/faker/pull/3605">faker-js/faker#3605</a></li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/faker-js/faker/blob/next/CHANGELOG.md"><code>@faker-js/faker</code>'s changelog</a>.</em></p> <blockquote> <h2><a href="https://github.com/faker-js/faker/compare/v10.0.0-beta.0...v10.0.0">10.0.0</a> (2025-08-21)</h2> <h3>New Locales</h3> <ul> <li><strong>locale:</strong> extended list of colors in Polish (<a href="https://redirect.github.com/faker-js/faker/issues/3586">#3586</a>) (<a href="`9940d54f75`">9940d54</a>)</li> </ul> <h3>Features</h3> <ul> <li><strong>locales:</strong> add animal vocabulary(bear, bird, cat, rabbit, pet_name) in Korean (<a href="https://redirect.github.com/faker-js/faker/issues/3535">#3535</a>) (<a href="`0d2143c75d`">0d2143c</a>)</li> </ul> <h3>Changed Locales</h3> <ul> <li><strong>locale:</strong> remove invalid credit card issuer patterns (<a href="https://redirect.github.com/faker-js/faker/issues/3568">#3568</a>) (<a href="`9783d95a8e`">9783d95</a>)</li> </ul> <h2><a href="https://github.com/faker-js/faker/compare/v9.9.0...v10.0.0-beta.0">10.0.0-beta.0</a> (2025-07-09)</h2> <h3>⚠ BREAKING CHANGES</h3> <ul> <li> <p><strong>word:</strong> change default error strategy to 'fail' (<a href="https://redirect.github.com/faker-js/faker/issues/3560">#3560</a>)</p> </li> <li> <p>remove deprecations (<a href="https://redirect.github.com/faker-js/faker/issues/3553">#3553</a>)</p> </li> <li> <p>esm only (<a href="https://redirect.github.com/faker-js/faker/issues/3540">#3540</a>)</p> </li> <li> <p>remove deprecations (<a href="https://redirect.github.com/faker-js/faker/issues/3553">#3553</a>) (<a href="`623d2741a4`">623d274</a>)</p> </li> <li> <p><strong>word:</strong> change default error strategy to 'fail' (<a href="https://redirect.github.com/faker-js/faker/issues/3560">#3560</a>) (<a href="`93416f71cf`">93416f7</a>)</p> </li> </ul> <h3>Features</h3> <ul> <li>esm only (<a href="https://redirect.github.com/faker-js/faker/issues/3540">#3540</a>) (<a href="`160960b797`">160960b</a>)</li> </ul> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href="`51943aecb9`"><code>51943ae</code></a> chore(release): 10.0.0 (<a href="https://redirect.github.com/faker-js/faker/issues/3605">#3605</a>)</li> <li><a href="`96d7517b9b`"><code>96d7517</code></a> chore(deps): lock file maintenance (<a href="https://redirect.github.com/faker-js/faker/issues/3584">#3584</a>)</li> <li><a href="`2eb6fa0a7a`"><code>2eb6fa0</code></a> chore(deps): update dependency eslint-plugin-jsdoc to v54 (<a href="https://redirect.github.com/faker-js/faker/issues/3604">#3604</a>)</li> <li><a href="`1fcfe4830d`"><code>1fcfe48</code></a> chore(deps): pin dependency node to 24.6.0 (<a href="https://redirect.github.com/faker-js/faker/issues/3600">#3600</a>)</li> <li><a href="`2bd4807fa2`"><code>2bd4807</code></a> chore(deps): update dependency typescript-eslint to v8.40.0 (<a href="https://redirect.github.com/faker-js/faker/issues/3602">#3602</a>)</li> <li><a href="`09a88eb100`"><code>09a88eb</code></a> chore(deps): update dependency vitepress to v1.6.4 (<a href="https://redirect.github.com/faker-js/faker/issues/3601">#3601</a>)</li> <li><a href="`5418574bf7`"><code>5418574</code></a> chore(deps): update dependency cypress to v15 (<a href="https://redirect.github.com/faker-js/faker/issues/3603">#3603</a>)</li> <li><a href="`9e4f463ecf`"><code>9e4f463</code></a> chore(deps): update actions/checkout action to v5 (<a href="https://redirect.github.com/faker-js/faker/issues/3597">#3597</a>)</li> <li><a href="`287ecdaa39`"><code>287ecda</code></a> chore(deps): update devdependencies (<a href="https://redirect.github.com/faker-js/faker/issues/3599">#3599</a>)</li> <li><a href="`2b1495956f`"><code>2b14959</code></a> chore(deps): update amannn/action-semantic-pull-request action to v6 (<a href="https://redirect.github.com/faker-js/faker/issues/3598">#3598</a>)</li> <li>Additional commits viewable in <a href="https://github.com/faker-js/faker/compare/v9.9.0...v10.0.0">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=@faker-js/faker&package-manager=npm_and_yarn&previous-version=9.9.0&new-version=10.0.0)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) </details> <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Upgrades `@faker-js/faker` to v10 and updates test utilities to dynamically import Faker and make password generation async. > > - Frontend dependencies: > - Bump `@faker-js/faker` from `9.9.0` to `10.0.0`. > - Tests: > - Replace static imports with dynamic `import("@faker-js/faker")` in `src/tests/utils/{auth.ts,signup.ts}`. > - Change `generateTestPassword` to `async` returning `Promise<string>` to use ESM Faker. > - Adjust test user creation to use dynamically generated `email`/`password` via Faker. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit `334f4a264d`. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY --> --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co> Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Nicholas Tindle <ntindle@users.noreply.github.com>	2025-09-30 21:12:18 +00:00
Nicholas Tindle	4530e97e59	feat(platform/blocks): Add table input UI and builder block (#10829 ) <!-- Clearly explain the need for these changes: --> https://github.com/user-attachments/assets/909a6ecf-5731-424c-8dee-fe25db907365 ### Need 💡 This PR introduces a new "Table Input" block and corresponding UI component, allowing users to easily input structured, tabular data directly within the agent builder. This addresses the need for a user-friendly way to define custom column headers and populate rows of data, which is then output as a list of dictionaries. ### Changes 🏗️ <!-- Concisely describe all of the changes made in this pull request: --> * New `TableInputBlock` (backend): A new block (`backend/backend/blocks/table_input.py`) has been added. It defines an `Input` schema with `headers` (a list of strings for column names) and `value` (a list of dictionaries representing table rows). The block outputs the `value` data in the specified dictionary format. * New `NodeTableInput` Component (frontend): A new React component (`frontend/src/components/node-table-input.tsx`) was created to render an editable table UI, supporting dynamic row addition/removal and cell editing. * Frontend Integration: * `NodeGenericInputField` and `NodeObjectInputTree` were updated to pass `parentContext` down the component hierarchy. * `NodeArrayInput` was modified to conditionally render the new `NodeTableInput` component. It now detects when an array field (`selfKey` is "value") is part of a parent context that defines `headers`, indicating it should be rendered as a table. ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Add a "Table Input" block to the builder. - [x] Define custom headers (e.g., "Name", "Email"). - [x] Add several rows of data using the table UI. - [x] Verify that adding, editing, and removing rows works as expected. - [x] Connect the output of the "Table Input" block to another block (e.g., a "Print" block) and confirm the output format is a list of dictionaries with the defined headers as keys. - [x] Test with an empty table (no rows). - [x] Test with no headers defined (should default). - [x] Test that an empty row returns empty data (is this a good behavior? example output of the block ``` { "advanced": false, "column_headers": [ "Col 1", "Col 2", "Col 3" ], "name": "table_input", "value": [ { "Col 1": "row 1", "Col 2": "row 1", "Col 3": "row 1" }, { "Col 1": "val 1", "Col 2": "val 2", "Col 3": "val 3" } ] } ``` --- <a href="https://cursor.com/background-agent?bcId=bc-b8d31867-1034-4374-852c-b92ca69cc399"> <picture> <source media="(prefers-color-scheme: dark)" srcset="https://cursor.com/open-in-cursor-dark.svg"> <source media="(prefers-color-scheme: light)" srcset="https://cursor.com/open-in-cursor-light.svg"> <img alt="Open in Cursor" src="https://cursor.com/open-in-cursor.svg"> </picture> </a> <a href="https://cursor.com/agents?id=bc-b8d31867-1034-4374-852c-b92ca69cc399"> <picture> <source media="(prefers-color-scheme: dark)" srcset="https://cursor.com/open-in-web-dark.svg"> <source media="(prefers-color-scheme: light)" srcset="https://cursor.com/open-in-web-light.svg"> <img alt="Open in Web" src="https://cursor.com/open-in-web.svg"> </picture> </a> --------- Co-authored-by: Cursor Agent <cursoragent@cursor.com> Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Nicholas Tindle <ntindle@users.noreply.github.com>	2025-09-30 19:41:03 +00:00
Bently	477c261488	feat(blocks): Add claude-sonnet-4.5 (#11023 ) ## Summary Adds claude-sonnet-4.5 model to the platform and sets the price to 9 ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: <!-- Put your test plan here: --> - [x] test the new claude-sonnet-4.5 model on the platform to make sure it works	2025-09-30 19:30:58 +00:00
dependabot[bot]	8ac2228e1e	chore(frontend/deps): Upgrade @sentry/nextjs from 9.42.0 to 10.8.0 (#10802 ) Bumps [@sentry/nextjs](https://github.com/getsentry/sentry-javascript) from 9.42.0 to 10.8.0. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/getsentry/sentry-javascript/releases"><code>@sentry/nextjs</code>'s releases</a>.</em></p> <blockquote> <h2>10.8.0</h2> <h3>Important Changes</h3> <ul> <li> <p><strong>feat(sveltekit): Add Compatibility for builtin SvelteKit Tracing (<a href="https://redirect.github.com/getsentry/sentry-javascript/pull/17423">#17423</a>)</strong></p> <p>This release makes the <code>@sentry/sveltekit</code> SDK compatible with SvelteKit's native <a href="https://svelte.dev/docs/kit/observability">observability support</a> introduced in SvelteKit version <code>2.31.0</code>. If you enable both, instrumentation and tracing, the SDK will now initialize early enough to set up additional instrumentation like database queries and it will pick up spans emitted from SvelteKit.</p> <p>We will follow up with docs how to set up the SDK soon. For now, If you're on SvelteKit version <code>2.31.0</code> or newer, you can easily opt into the new feature:</p> <ol> <li> <p>Enable <a href="https://svelte.dev/docs/kit/observability">experimental tracing and instrumentation support</a> in <code>svelte.config.js</code>:</p> </li> <li> <p>Move your <code>Sentry.init()</code> call from <code>src/hooks.server.(js\|ts)</code> to the new <code>instrumentation.server.(js\|ts)</code> file:</p> <pre lang="ts"><code>// instrumentation.server.ts import * as Sentry from '@sentry/sveltekit'; <p>Sentry.init({<br /> dsn: '...',<br /> // rest of your config<br /> });<br /> </code></pre></p> <p>The rest of your Sentry config in <code>hooks.server.ts</code> (<code>sentryHandle</code> and <code>handleErrorWithSentry</code>) should stay the same.</p> </li> </ol> <p>If you prefer to stay on the hooks-file based config for now, the SDK will continue to work as previously.</p> <p>Thanks to the Svelte team and <a href="https://github.com/elliott-with-the-longest-name-on-github"><code>@elliott-with-the-longest-name-on-github</code></a> for implementing observability support and for reviewing our PR!</p> </li> </ul> <h3>Other Changes</h3> <ul> <li>fix(react): Avoid multiple name updates on navigation spans (<a href="https://redirect.github.com/getsentry/sentry-javascript/pull/17438">#17438</a>)</li> </ul> <!-- raw HTML omitted --> <ul> <li>test(profiling): Add tests for current state of profiling (<a href="https://redirect.github.com/getsentry/sentry-javascript/pull/17470">#17470</a>)</li> </ul> <!-- raw HTML omitted --> <h2>Bundle size 📦</h2> <table> <thead> <tr> <th>Path</th> <th>Size</th> </tr> </thead> <tbody> <tr> <td><code>@sentry/browser</code></td> <td>23.59 KB</td> </tr> <tr> <td><code>@sentry/browser</code> - with treeshaking flags</td> <td>22.2 KB</td> </tr> <tr> <td><code>@sentry/browser</code> (incl. Tracing)</td> <td>38.94 KB</td> </tr> <tr> <td><code>@sentry/browser</code> (incl. Tracing, Replay)</td> <td>76.4 KB</td> </tr> <tr> <td><code>@sentry/browser</code> (incl. Tracing, Replay) - with treeshaking flags</td> <td>66.43 KB</td> </tr> </tbody> </table> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/getsentry/sentry-javascript/blob/develop/CHANGELOG.md"><code>@sentry/nextjs</code>'s changelog</a>.</em></p> <blockquote> <h2>10.8.0</h2> <h3>Important Changes</h3> <ul> <li> <p><strong>feat(sveltekit): Add Compatibility for builtin SvelteKit Tracing (<a href="https://redirect.github.com/getsentry/sentry-javascript/pull/17423">#17423</a>)</strong></p> <p>This release makes the <code>@sentry/sveltekit</code> SDK compatible with SvelteKit's native <a href="https://svelte.dev/docs/kit/observability">observability support</a> introduced in SvelteKit version <code>2.31.0</code>. If you enable both, instrumentation and tracing, the SDK will now initialize early enough to set up additional instrumentation like database queries and it will pick up spans emitted from SvelteKit.</p> <p>We will follow up with docs how to set up the SDK soon. For now, If you're on SvelteKit version <code>2.31.0</code> or newer, you can easily opt into the new feature:</p> <ol> <li> <p>Enable <a href="https://svelte.dev/docs/kit/observability">experimental tracing and instrumentation support</a> in <code>svelte.config.js</code>:</p> </li> <li> <p>Move your <code>Sentry.init()</code> call from <code>src/hooks.server.(js\|ts)</code> to the new <code>instrumentation.server.(js\|ts)</code> file:</p> <pre lang="ts"><code>// instrumentation.server.ts import * as Sentry from '@sentry/sveltekit'; <p>Sentry.init({<br /> dsn: '...',<br /> // rest of your config<br /> });<br /> </code></pre></p> <p>The rest of your Sentry config in <code>hooks.server.ts</code> (<code>sentryHandle</code> and <code>handleErrorWithSentry</code>) should stay the same.</p> </li> </ol> <p>If you prefer to stay on the hooks-file based config for now, the SDK will continue to work as previously.</p> <p>Thanks to the Svelte team and <a href="https://github.com/elliott-with-the-longest-name-on-github"><code>@elliott-with-the-longest-name-on-github</code></a> for implementing observability support and for reviewing our PR!</p> </li> </ul> <h3>Other Changes</h3> <ul> <li>fix(react): Avoid multiple name updates on navigation spans (<a href="https://redirect.github.com/getsentry/sentry-javascript/pull/17438">#17438</a>)</li> </ul> <!-- raw HTML omitted --> <ul> <li>test(profiling): Add tests for current state of profiling (<a href="https://redirect.github.com/getsentry/sentry-javascript/pull/17470">#17470</a>)</li> </ul> <!-- raw HTML omitted --> <h2>10.7.0</h2> <h3>Important Changes</h3> <ul> <li><strong>feat(cloudflare): Add <code>instrumentPrototypeMethods</code> option to instrument RPC methods for DurableObjects (<a href="https://redirect.github.com/getsentry/sentry-javascript/pull/17424">#17424</a>)</strong></li> </ul> <p>By default, <code>Sentry.instrumentDurableObjectWithSentry</code> will not wrap any RPC methods on the prototype. To enable wrapping for RPC methods, set <code>instrumentPrototypeMethods</code> to <code>true</code> or, if performance is a concern, a list of only the methods you want to instrument:</p> <pre lang="js"><code></tr></table> </code></pre> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`bd8458e659`"><code>bd8458e</code></a> release: 10.8.0</li> <li><a href="`dbdddc896f`"><code>dbdddc8</code></a> Merge pull request <a href="https://redirect.github.com/getsentry/sentry-javascript/issues/17481">#17481</a> from getsentry/prepare-release/10.8.0</li> <li><a href="`f5d4bd616e`"><code>f5d4bd6</code></a> meta(changelog): Update changelog for 10.8.0</li> <li><a href="`dfdc3b0ab9`"><code>dfdc3b0</code></a> test(profiling): Add tests for current state of profiling (<a href="https://redirect.github.com/getsentry/sentry-javascript/issues/17470">#17470</a>)</li> <li><a href="`895b38590c`"><code>895b385</code></a> fix(react): Avoid multiple name updates on navigation spans (<a href="https://redirect.github.com/getsentry/sentry-javascript/issues/17438">#17438</a>)</li> <li><a href="`e6e20d847c`"><code>e6e20d8</code></a> feat(sveltekit): Add Compatibility for builtin SvelteKit Tracing (<a href="https://redirect.github.com/getsentry/sentry-javascript/issues/17423">#17423</a>)</li> <li><a href="`7e24422327`"><code>7e24422</code></a> Merge pull request <a href="https://redirect.github.com/getsentry/sentry-javascript/issues/17472">#17472</a> from getsentry/master</li> <li><a href="`27e97b0cec`"><code>27e97b0</code></a> Merge branch 'release/10.7.0'</li> <li><a href="`b7e4816824`"><code>b7e4816</code></a> release: 10.7.0</li> <li><a href="`0bc8417d50`"><code>0bc8417</code></a> Merge pull request <a href="https://redirect.github.com/getsentry/sentry-javascript/issues/17471">#17471</a> from getsentry/prepare-release/10.7.0</li> <li>Additional commits viewable in <a href="https://github.com/getsentry/sentry-javascript/compare/9.42.0...10.8.0">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=@sentry/nextjs&package-manager=npm_and_yarn&previous-version=9.42.0&new-version=10.8.0)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) </details> <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Upgrades `@sentry/nextjs` to 10.15.0, updating numerous related `@sentry/`, OpenTelemetry (v2), and build/dev dependencies via the lockfile. > > - Dependencies (frontend): > - Upgrade `@sentry/nextjs` from `9.42.0` to `10.15.0`. > - Cascading updates in `pnpm-lock.yaml`: > - `@sentry/` packages (`browser`, `core`, `node`, `opentelemetry`, `react`, `vercel-edge`, `webpack-plugin`, `bundler-plugin-core`, `cli`, etc.). > - OpenTelemetry stack to newer major versions (`@opentelemetry/core`/`resources`/`sdk-trace-base` 2.x; multiple `instrumentation-` packages). > - Build tooling: `rollup` 4.52.x and platform binaries; `@rollup/plugin-`. > - Misc dev typings and utilities (e.g., `@types/mysql`, `@types/pg`, `debug`, `@prisma/instrumentation`). > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit `5b4b37e551`. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY --> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co>	2025-09-30 16:42:05 +00:00
Zamil Majdy	91dd9364bb	fix(backend): implement retry mechanism for SmartDecisionMaker tool call validation (#11015 ) <!-- Clearly explain the need for these changes: --> This PR fixes a critical production issue where SmartDecisionMakerBlock was silently accepting tool calls with typo'd parameter names (e.g., 'maximum_keyword_difficulty' instead of 'max_keyword_difficulty'), causing downstream blocks to receive null values and execution failures. The solution implements comprehensive parameter validation with automatic retry when the LLM provides malformed tool calls, giving the LLM specific feedback to correct the errors. ### Changes 🏗️ <!-- Concisely describe all of the changes made in this pull request: --> Core Validation & Retry Logic (`backend/blocks/smart_decision_maker.py`) - Add tool call parameter validation against function schema - Implement retry mechanism using existing `create_retry_decorator` from `backend.util.retry` - Validate provided parameters against expected schema properties and required fields - Generate specific error messages for unknown parameters (typos) and missing required parameters - Add error feedback to conversation history for LLM learning on retry attempts - Use `input_data.retry` field to configure number of retry attempts Comprehensive Test Coverage (`backend/blocks/test/test_smart_decision_maker.py`) - Add `test_smart_decision_maker_parameter_validation` with 4 comprehensive test scenarios: 1. Tool call with typo'd parameter (should retry and eventually fail with clear error) 2. Tool call missing required parameter (should fail immediately with clear error) 3. Valid tool call with optional parameter missing (should succeed) 4. Valid tool call with all parameters provided (should succeed) - Verify retry mechanism works correctly and respects retry count - Mock LLM responses for controlled testing of validation logic Load Tests Documentation Update (`load-tests/README.md`) - Update documentation to reflect current orchestrator-based architecture - Remove references to deprecated `run-tests.js` and `comprehensive-orchestrator.js` - Streamline documentation to focus on working `orchestrator/orchestrator.js` - Update NPM scripts and command examples for current workflow - Clean up outdated file references to match actual infrastructure Production Impact - Prevents silent failures: Tool call parameter typos now cause retries instead of null downstream values - Maintains compatibility: No breaking changes to existing SmartDecisionMaker functionality - Improves reliability: LLM receives feedback to correct parameter errors - Configurable retries: Uses existing `retry` field for user control - Accurate documentation: Load-tests docs now match actual working infrastructure ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: <!-- Put your test plan here: --> - [x] Run existing SmartDecisionMaker tests to ensure no regressions: `poetry run pytest backend/blocks/test/test_smart_decision_maker.py -xvs` ✅ All 4 tests passed - [x] Run new parameter validation test specifically: `poetry run pytest backend/blocks/test/test_smart_decision_maker.py::test_smart_decision_maker_parameter_validation -xvs` ✅ Passed with retry behavior confirmed - [x] Verify retry mechanism works by checking log output for retry attempts ✅ Confirmed in test logs - [x] Test tool call validation with different scenarios (typos, missing params, valid calls) ✅ All scenarios covered and working - [x] Run code formatting and linting: `poetry run format` ✅ All formatters passed - [x] Verify no breaking changes to existing SmartDecisionMaker functionality ✅ All existing tests pass - [x] Verify load-tests documentation accuracy ✅ README now matches actual orchestrator infrastructure #### For configuration changes: - [x] `.env.default` is updated or already compatible with my changes - [x] `docker-compose.yml` is updated or already compatible with my changes - [x] I have included a list of my configuration changes in the PR description (under Changes) Note: No configuration changes were needed as this uses existing retry infrastructure and block schema validation. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-09-30 16:18:05 +00:00
Zamil Majdy	f314fbf14f	fix(backend): resolve two critical long-running agent execution failures (#11011 ) ## Summary Fix two production issues causing agent execution failures that occurred this morning: 1. AsyncRedisLock Release Error (ExecutionID: 08b2c251-ee27-45de-b88d-1792823ca3ee) - Error: "Cannot release a lock that's no longer owned" - Root cause: Race condition where lock expires during long database operations - Location: backend/executor/manager.py synchronized context manager 2. Tool Call Parameter Validation (ExecutionID: 766fd9a0-5f22-4a77-96e8-14c9d02f3292) - Issue: LLM used typo'd parameter 'maximum_keyword_difficulty' instead of 'max_keyword_difficulty' - SmartDecisionMakerBlock silently accepted typo, setting correct parameter to null - Result: Downstream blocks received null values causing execution failures ## Changes Made ### AsyncRedisLock Error Handling - Add try-catch blocks around AsyncRedisLock.release() calls in ExecutionManager and OAuth refresh - Prevent crashes when locks expire between ownership check and release - Log warnings instead of crashing execution ### Tool Call Parameter Validation - Reject unknown parameters: Raise ValueError for typo'd parameter names with detailed error messages - Allow optional parameters: Only validate missing REQUIRED parameters - Safe parameter access: Use .get() to handle optional parameters with defaults - Clean code: Extract parameters object once to eliminate duplication ## Technical Implementation Lock Release Protection: ```python if await lock.locked() and await lock.owned(): try: await lock.release() except Exception as e: logger.warning(f"Failed to release lock for key {key}: {e}") ``` Parameter Validation Logic: ```python # Get parameters schema from tool definition if tool_def and "function" in tool_def and "parameters" in tool_def["function"]: parameters = tool_def["function"]["parameters"] expected_args = parameters.get("properties", {}) required_params = set(parameters.get("required", [])) # Detect parameter typos and missing required params unexpected_args = provided_args - expected_args_set missing_required_args = required_params - provided_args if unexpected_args or missing_required_args: raise ValueError(error_msg) # Detailed error explaining the problem ``` ## Testing - [x] All existing tests pass - [x] Lock error handling prevents execution crashes - [x] Tool validation catches typos while allowing optional parameters - [x] Maintains backward compatibility with existing workflows ## Impact - ✅ No more "Cannot release a lock" crashes during long database operations - ✅ Tool calls with typo'd parameters are rejected with clear error messages - ✅ Optional parameters work correctly with default values - ✅ Production stability improved with graceful error handling ## Files Modified - `backend/executor/manager.py` - AsyncRedisLock error handling in synchronized context - `backend/integrations/creds_manager.py` - OAuth refresh lock error handling - `backend/blocks/smart_decision_maker.py` - Tool call parameter validation with typo detection Fixes two critical production failures that were causing 2/5 agent runs to fail this morning. --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-09-29 15:34:20 +00:00
Zamil Majdy	a97ff641c3	feat(backend): optimize FastAPI endpoints performance and alert system (#11000 ) ## Summary Comprehensive performance optimization fixing event loop binding issues and addressing all PR feedback. ### Original Performance Issues Fixed Event Loop Binding Problems: - JWT authentication dependencies were synchronous, causing thread pool bottlenecks under high concurrency - FastAPI's default thread pool (40 threads) was insufficient for high-load scenarios - Backend services lacked proper event loop configuration Security & Performance Improvements: - Security middleware converted from BaseHTTPMiddleware to pure ASGI for better performance - Added blocks endpoint to cacheable paths for improved response times - Cross-platform uvloop detection with Windows compatibility ### Key Changes Made #### 1. JWT Authentication Async Conversion - Files: `autogpt_libs/auth/dependencies.py`, `autogpt_libs/auth/jwt_utils.py` - Change: Convert all JWT functions to async (`requires_user`, `requires_admin_user`, `get_user_id`, `get_jwt_payload`) - Impact: Eliminates thread pool blocking, improves concurrency handling - Tests: All 25+ authentication tests updated to async patterns #### 2. FastAPI Thread Pool Optimization - File: `backend/server/rest_api.py:82-93` - Change: Configure thread pool size via `config.fastapi_thread_pool_size` - Default: Increased from 40 to higher limit for sync operations - Impact: Better handling of remaining sync dependencies #### 3. Performance-Optimized Security Middleware - File: `backend/server/middleware/security.py` - Change: Pure ASGI implementation replacing BaseHTTPMiddleware - Headers: HTTP spec compliant capitalization (X-Content-Type-Options, X-Frame-Options, etc.) - Caching: Added `/api/blocks` and `/api/v1/blocks` to cacheable paths - Impact: Reduced middleware overhead, improved header compliance #### 4. Cross-Platform Event Loop Configuration - File: `backend/server/rest_api.py:311-312` - Change: Platform-aware uvloop detection: `'uvloop' if platform.system() != 'Windows' else 'auto'` - Impact: Windows compatibility while maintaining Unix performance benefits - Verified: 'auto' is valid uvicorn default parameter #### 5. Enhanced Caching Infrastructure - File: `autogpt_libs/utils/cache.py:118-132` - Change: Per-event-loop asyncio.Lock instances prevent cross-loop deadlocks - Impact: Thread-safe caching across multiple event loops #### 6. Database Query Limits & Performance - Files: Multiple data layer files - Change: Added configurable limits to prevent unbounded queries - Constants: `MAX_GRAPH_VERSIONS_FETCH=50`, `MAX_USER_API_KEYS_FETCH=500`, etc. - Impact: Consistent performance regardless of data volume #### 7. OpenAPI Documentation Improvements - File: `backend/server/routers/v1.py:68-85` - Change: Added proper response model and schema for blocks endpoint - Impact: Better API documentation and type safety #### 8. Error Handling & Retry Logic Fixes - File: `backend/util/retry.py:63` - Change: Accurate retry threshold comments referencing EXCESSIVE_RETRY_THRESHOLD - Impact: Clear documentation for debugging retry scenarios ### ntindle Feedback Addressed ✅ HTTP Header Capitalization: All headers now use proper HTTP spec capitalization ✅ Windows uvloop Compatibility: Clean platform detection with inline conditional ✅ OpenAPI Response Model: Blocks endpoint properly documented in schema ✅ Retry Comment Accuracy: References actual threshold constants instead of hardcoded numbers ✅ Code Cleanliness: Inline conditionals preferred over verbose if statements ### Performance Testing Results Before Optimization: - High latency under concurrent load - Thread pool exhaustion at ~40 concurrent requests - Event loop binding issues causing timeouts After Optimization: - Improved concurrency handling with async JWT pipeline - Configurable thread pool scaling - Cross-platform event loop optimization - Reduced middleware overhead ### Backward Compatibility ✅ All existing functionality preserved ✅ No breaking API changes ✅ Enhanced test coverage with async patterns ✅ Windows and Unix compatibility maintained ### Files Modified Core Authentication & Performance: - `autogpt_libs/auth/dependencies.py` - Async JWT dependencies - `autogpt_libs/auth/jwt_utils.py` - Async JWT utilities - `backend/server/rest_api.py` - Thread pool config + uvloop detection - `backend/server/middleware/security.py` - ASGI security middleware Database & Limits: - `backend/data/includes.py` - Performance constants and configurable includes - `backend/data/api_key.py`, `backend/data/credit.py`, `backend/data/graph.py`, `backend/data/integrations.py` - Query limits Caching & Infrastructure: - `autogpt_libs/utils/cache.py` - Per-event-loop lock safety - `backend/server/routers/v1.py` - OpenAPI improvements - `backend/util/retry.py` - Comment accuracy Testing: - `autogpt_libs/auth/dependencies_test.py` - 25+ async test conversions - `autogpt_libs/auth/jwt_utils_test.py` - Async JWT test patterns Ready for review and production deployment. 🚀 --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-09-29 05:32:48 +00:00
Zamil Majdy	114f604d7b	Merge branch 'master' of github.com:Significant-Gravitas/AutoGPT into dev	2025-09-27 18:43:26 +07:00
Zamil Majdy	3abea1ed96	fix(backend): prevent duplicate graph executions across multiple executor pods (#11008 ) ## Problem Multiple executor pods could simultaneously execute the same graph, leading to: - Duplicate executions and wasted resources - Inconsistent execution states and results - Race conditions in graph execution management - Inefficient resource utilization in cluster environments ## Solution Implement distributed locking using ClusterLock to ensure only one executor pod can process a specific graph execution at a time. ## Key Changes ### Core Fix: Distributed Execution Coordination - ClusterLock implementation: Redis-based distributed locking prevents duplicate executions - Atomic lock acquisition: Only one executor can hold the lock for a specific graph execution - Automatic lock expiry: Prevents deadlocks if executor pods crash or become unresponsive - Graceful degradation: System continues operating even if Redis becomes temporarily unavailable ### Technical Implementation - Move ClusterLock to `backend/executor/` alongside ExecutionManager (its primary consumer) - Comprehensive integration tests (27 test scenarios) ensure reliability under all conditions - Redis client compatibility for different deployment configurations - Rate-limited lock refresh to minimize Redis load ### Reliability Improvements - Context manager support: Automatic lock cleanup prevents resource leaks - Ownership verification: Locks can only be refreshed/released by the owner - Concurrency testing: Thread-safe operations verified under high contention - Error handling: Robust failure scenarios including network partitions ## Test Coverage - ✅ Concurrent executor coordination (prevents duplicate executions) - ✅ Lock expiry and refresh mechanisms (prevents deadlocks) - ✅ Redis connection failures (graceful degradation) - ✅ Thread safety under high load (production scenarios) - ✅ Long-running executions with periodic refresh ## Impact - No more duplicate executions: Eliminates wasted compute resources and inconsistent results - Improved reliability: Robust distributed coordination across executor pods - Better resource utilization: Only one pod processes each execution - Scalable architecture: Supports multiple executor pods without conflicts ## Validation - All integration tests pass ✅ - Existing ExecutionManager functionality preserved ✅ - No breaking changes to APIs ✅ - Production-ready distributed locking ✅ 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-09-27 11:42:40 +00:00
Abhimanyu Yadav	da6e1ad26d	refactor(frontend): enhance builder UI for better performance (#10922 ) ### Changes 🏗️ This PR introduces a new high-performance builder interface for the AutoGPT platform, implementing a React Flow-based visual editor with optimized state management and rendering. #### Key Changes: 1. New Flow Editor Implementation - Built on React Flow for efficient graph rendering and interaction - Implements a node-based visual workflow builder with custom nodes and edges - Dynamic form generation using React JSON Schema Form (RJSF) for block inputs - Intelligent connection handling with visual feedback 2. State Management Optimization - Added Zustand for lightweight, performant state management - Separated node and edge stores for better data isolation - Reduced unnecessary re-renders through granular state updates 3. Dual Builder View (Temporary) - Added toggle between old and new builder implementations - Allows A/B testing and gradual migration - Feature flagged for controlled rollout 4. Enhanced UI Components - Custom form widgets for various input types (date, time, file, etc.) - Array and object editors with improved UX - Connection handles with visual state indicators - Advanced mode toggle for complex configurations 5. Architecture Improvements - Modular component structure for better code organization - Comprehensive documentation for the new system - Type-safe implementation with TypeScript #### Dependencies Added: - `zustand` (v5.0.2) - State management - `@rjsf/core` (v5.22.8) - JSON Schema Form core - `@rjsf/utils` (v5.22.8) - RJSF utilities - `@rjsf/validator-ajv8` (v5.22.8) - Schema validation ### Performance Improvements 🚀 - Reduced Re-renders: Zustand's shallow comparison and selective subscriptions minimize unnecessary component updates - Optimized Graph Rendering: React Flow provides efficient canvas-based rendering for large workflows - Lazy Loading: Components are loaded on-demand reducing initial bundle size - Memoized Computations: Heavy calculations are cached to avoid redundant processing ### Test Plan 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: #### Test Checklist: - [x] Create a new agent from scratch with at least 5 blocks - [x] Connect blocks and verify connections render correctly - [x] Switch between old and new builder views - [x] Test all form input types (text, number, boolean, array, object) - [x] Verify data persistence when switching views - [x] Test advanced mode toggle functionality - [x] Performance test with 50+ blocks to verify smooth interaction ### Migration Strategy The implementation includes a temporary toggle to switch between the old and new builder. This allows for: - Gradual user migration - A/B testing to measure performance improvements - Fallback option if issues are discovered - Collecting user feedback before full rollout ### Documentation Comprehensive documentation has been added: - `/components/FlowEditor/docs/README.md` - Architecture overview and store management - `/components/FlowEditor/docs/FORM_CREATOR.md` - Detailed form system documentation --------- Co-authored-by: Zamil Majdy <zamil.majdy@agpt.co> Co-authored-by: Claude <noreply@anthropic.com>	2025-09-26 10:42:05 +00:00
Swifty	634fffb967	fix(blocks): Handle NoneType in DataForSEO Blocks and Add missing Err (#11004 ) This PR fixes critical issues in the DataForSEO blocks to improve error handling and prevent runtime exceptions. ### Changes 🏗️ 1. Fixed NoneType error in DataForSEO Related Keywords Block (#10990) - Added null check to ensure `items` is always a list before iteration - Prevents TypeError when API returns None for items field - Ensures robust handling of unexpected API responses 2. Added error output pins to DataForSEO blocks (#10981) - Added `error` field to Output schema in both `related_keywords.py` and `keyword_suggestions.py` - Wrapped entire `run` methods in try-except blocks - Errors are now properly yielded to the error output pin, allowing agents to handle failures gracefully ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Verified that DataForSEO blocks handle None responses without throwing TypeError - [x] Confirmed error output pins capture and yield exceptions properly - [x] Ensured backwards compatibility with existing block implementations - [x] Tested both Related Keywords and Keyword Suggestions blocks #### For configuration changes: - [x] `.env.default` is updated or already compatible with my changes - [x] `docker-compose.yml` is updated or already compatible with my changes - [x] I have included a list of my configuration changes in the PR description (under Changes) --- Fixes #10990 Fixes #10981 Generated with [Claude Code](https://claude.ai/code) <!-- Clearly explain the need for these changes: --> ### Changes 🏗️ <!-- Concisely describe all of the changes made in this pull request: --> ### Checklist 📋 #### For code changes: - [ ] I have clearly listed my changes in the PR description - [ ] I have made a test plan - [ ] I have tested my changes according to the test plan: <!-- Put your test plan here: --> - [ ] ... <details> <summary>Example test plan</summary> - [ ] Create from scratch and execute an agent with at least 3 blocks - [ ] Import an agent from file upload, and confirm it executes correctly - [ ] Upload agent to marketplace - [ ] Import an agent from marketplace and confirm it executes correctly - [ ] Edit an agent from monitor, and confirm it executes correctly </details> #### For configuration changes: - [ ] `.env.default` is updated or already compatible with my changes - [ ] `docker-compose.yml` is updated or already compatible with my changes - [ ] I have included a list of my configuration changes in the PR description (under Changes) <details> <summary>Examples of configuration changes</summary> - Changing ports - Adding new services that need to communicate with each other - Secrets or environment variable changes - New or infrastructure changes such as databases </details> Co-authored-by: Toran Bruce Richards <toran.richards@gmail.com>	2025-09-26 11:23:14 +02:00
Toran Bruce Richards	f3ec426c82	fix(blocks): Handle NoneType in DataForSEO Blocks and Add missing Error pins (#10995 ) This PR fixes critical issues in the DataForSEO blocks to improve error handling and prevent runtime exceptions. ### Changes 🏗️ 1. Fixed NoneType error in DataForSEO Related Keywords Block (#10990) - Added null check to ensure `items` is always a list before iteration - Prevents TypeError when API returns None for items field - Ensures robust handling of unexpected API responses 2. Added error output pins to DataForSEO blocks (#10981) - Added `error` field to Output schema in both `related_keywords.py` and `keyword_suggestions.py` - Wrapped entire `run` methods in try-except blocks - Errors are now properly yielded to the error output pin, allowing agents to handle failures gracefully ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Verified that DataForSEO blocks handle None responses without throwing TypeError - [x] Confirmed error output pins capture and yield exceptions properly - [x] Ensured backwards compatibility with existing block implementations - [x] Tested both Related Keywords and Keyword Suggestions blocks #### For configuration changes: - [x] `.env.default` is updated or already compatible with my changes - [x] `docker-compose.yml` is updated or already compatible with my changes - [x] I have included a list of my configuration changes in the PR description (under Changes) --- Fixes #10990 Fixes #10981 Generated with [Claude Code](https://claude.ai/code)	2025-09-25 20:22:10 +00:00
Reinier van der Leer	0b267f573e	feat(blocks): Improve JSON generation+parsing in AI Structured Response block (#10960 ) The AI Structured Response Generator block currently doesn't support responses that aren't pure JSON. This prohibits multi-step prompting because reasoning content is not allowed in the response, which in turn limits performance. ### Changes 🏗️ - Adjust prompt to enclose JSON in pre-defined tags so we can extract it from a response that isn't pure JSON - Adjust mechanism to extract and parse JSON - Add `force_json_output` input (advanced, default `False`) - Update incorrect `max_output_tokens` values for Claude 4 and 3.7 to prevent responses from being cut off due to `max_tokens` ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] LLMs correctly follows response generation instructions - [x] LLMs follow system response format instructions even if user prompt contains conflicting instructions - [x] JSON is extracted from response successfully - [x] `force_json_output` works (at least for models that support it) Tested with Claude 4 Sonnet, various GPT models, and Llama 3.3 70B. --------- Co-authored-by: Zamil Majdy <zamil.majdy@agpt.co> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co>	2025-09-25 17:42:25 +00:00
Reinier van der Leer	7bd571d9ce	fix(blocks): Default disable HTML escaping in all blocks with templating features (#10955 ) - Resolves #10954 Unnecessary escaping distorts content and so should be disabled wherever the output isn't used in HTML. ### Changes 🏗️ - Disable HTML escaping on prompt value insertion in AI blocks - Make HTML escaping optional in text formatting and output blocks ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] `SandboxedEnvironment(autoescape=False).from_string(template_str).render(values)` doesn't escape characters with HTML entities	2025-09-25 12:04:38 +00:00
Zamil Majdy	7a331651ba	feat(backend): enhance database indexes for AgentGraph and AgentGraphExecution performance (#10985 ) ## Summary Enhances database performance by improving indexes on `AgentGraph` and `AgentGraphExecution` tables for better query efficiency. ### Changes 🏗️ - Database Schema: Updated Prisma schema to enhance database indexes - Modified `AgentGraph` index from `[userId, isActive]` to `[userId, isActive, id, version]` for better compound query performance - Enhanced `AgentGraphExecution` index from `[userId]` to `[userId, isDeleted, createdAt]` to support filtered queries with sorting - Migration: Auto-generated Prisma migration to implement the index changes - Drops existing indexes: `AgentGraph_userId_isActive_idx` and `AgentGraphExecution_userId_idx` - Creates new compound indexes: `AgentGraph_userId_isActive_id_version_idx` and `AgentGraphExecution_userId_isDeleted_createdAt_idx` ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Verified migration runs successfully - [x] Confirmed database queries continue to work with new indexes - [x] Tested that existing functionality remains unaffected #### For configuration changes: - [x] `.env.default` is updated or already compatible with my changes - [x] `docker-compose.yml` is updated or already compatible with my changes - [x] I have included a list of my configuration changes in the PR description (under Changes) 🤖 Generated with [Claude Code](https://claude.ai/code) Co-authored-by: Claude <noreply@anthropic.com>	2025-09-25 09:05:53 +00:00
Zamil Majdy	5bc69adc33	Merge branch 'master' of github.com:Significant-Gravitas/AutoGPT into dev	2025-09-25 16:09:01 +07:00
Krzysztof Czerwinski	f4bcc8494f	hotfix: Fix Agent node missing inputs and outputs (#10987 ) Restore `include=AGENT_GRAPH_INCLUDE` that is needed to build schema from the nodes. ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] I/O is back on the Agent node	2025-09-25 08:56:32 +00:00
Zamil Majdy	4c000086e6	feat(backend): implement clean k6 load testing infrastructure (#10978 ) ## Summary Implement comprehensive k6 load testing infrastructure for the AutoGPT Platform with clean file organization, unified test runner, and cloud integration. ## Key Features ### 🗂️ Clean File Organization - tests/basic/: Simple validation tests (connectivity, single endpoints) - tests/api/: Core functionality tests (API endpoints, graph execution) - tests/marketplace/: User-facing feature tests (public/library access) - tests/comprehensive/: End-to-end scenario tests (complete user journeys) - orchestrator/: Advanced test orchestration for full suites ### 🚀 Unified Test Runner - Single entry point: `run-tests.js` for both local and cloud execution - 7 available tests: From basic connectivity to comprehensive platform journeys - Flexible execution: Run individual tests, comma-separated lists, or all tests - Auto-configuration: Different VU/duration settings for local vs cloud execution ### 🔐 Advanced Authentication - Pre-authenticated tokens: 24-hour JWT tokens eliminate Supabase rate limiting - Configurable generation: Default 10 tokens, scalable to 150+ for high concurrency - Graceful handling: Proper auth failure detection and recovery - ES module compatible: Modern JavaScript with full import/export support ### ☁️ k6 Cloud Integration - Cloud execution: Tests run on k6 cloud infrastructure for consistent results - Real-time monitoring: Live dashboards with performance metrics - URL tracking: Automatic test result URL capture and storage - Sequential orchestration: Proper delays between tests for resource management ## Test Coverage ### Performance Validated - Core API: 100 VUs successfully testing `/api/credits`, `/api/graphs`, `/api/blocks`, `/api/executions` - Graph Execution: 80 VUs for complete workflow pipeline testing - Marketplace: 150 VUs for public browsing, 100 VUs for authenticated library operations - Authentication: 150+ concurrent users with pre-authenticated token scaling ### User Journey Simulation - Dashboard workflows: Credits checking, graph management, execution monitoring - Marketplace browsing: Public search, agent discovery, category filtering - Library operations: Agent adding, favoriting, forking, detailed views - Complete workflows: End-to-end platform usage with realistic user behavior ## Technical Implementation ### ES Module Compatibility - Full ES module support with modern JavaScript imports/exports - Proper module execution patterns for Node.js compatibility - Clean separation between CommonJS legacy and modern ES modules ### Error Handling & Monitoring - Separate metrics: HTTP status, authentication, JSON validation, overall success - Graceful degradation: Auth failures don't crash VUs, proper error tracking - Performance thresholds: Configurable P95/P99 latency and error rate limits - Custom counters: Track operation types, success rates, user journey completion ### Infrastructure Benefits - Rate limit protection: Pre-auth tokens prevent Supabase auth bottlenecks - Scalable testing: Support for 150+ concurrent users with proper token management - Cloud consistency: Tests run on dedicated k6 cloud servers for reliable results - Development workflow: Local execution for debugging, cloud for performance validation ## Usage ### Quick Start ```bash # Setup and verification export SUPABASE_SERVICE_KEY="your-service-key" node generate-tokens.js node run-tests.js verify # Local testing (development) node run-tests.js run core-api-test DEV # Cloud testing (performance) node run-tests.js cloud all DEV ``` ### NPM Scripts ```bash npm run verify # Quick setup check npm test # All tests locally npm run cloud # All tests in k6 cloud ``` ## Validation Results ✅ Authentication: 100% success rate with fresh 24-hour tokens ✅ File Structure: All imports and references verified correct ✅ Test Execution: All 7 tests execute successfully with proper metrics ✅ Cloud Integration: k6 cloud execution working with proper credentials ✅ Documentation: Complete README with usage examples and troubleshooting ## Files Changed ### Core Infrastructure - `run-tests.js`: Unified test runner supporting local/cloud execution - `generate-tokens.js`: ES module compatible token generation with 24-hour expiry - `README.md`: Comprehensive documentation with updated file references ### Organized Test Structure - `tests/basic/connectivity-test.js`: Basic connectivity validation - `tests/basic/single-endpoint-test.js`: Individual API endpoint testing - `tests/api/core-api-test.js`: Core authenticated API endpoints - `tests/api/graph-execution-test.js`: Graph workflow pipeline testing - `tests/marketplace/public-access-test.js`: Public marketplace browsing - `tests/marketplace/library-access-test.js`: Authenticated marketplace/library operations - `tests/comprehensive/platform-journey-test.js`: Complete user journey simulation ### Configuration - `configs/environment.js`: Environment URLs and performance settings - `package.json`: NPM scripts and dependencies for unified workflow This infrastructure provides a solid foundation for continuous performance monitoring and load testing of the AutoGPT Platform. 🤖 Generated with [Claude Code](https://claude.ai/code) --------- Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co> Co-authored-by: Reinier van der Leer <pwuts@agpt.co>	2025-09-25 12:51:54 +07:00
Krzysztof Czerwinski	9c6cc5b29d	Merge branch 'dev' autogpt-platform-beta-v0.6.30	2025-09-25 13:28:17 +09:00
Toran Bruce Richards	b34973ca47	feat: Add 'depth' parameter to DataForSEO Related Keywords block (#10983 ) Fixes #10982 <!-- Clearly explain the need for these changes: --> The DataForSEO Related Keywords block was missing the `depth` parameter, which is a critical parameter that controls the comprehensiveness of keyword research. The depth parameter determines the number of related keywords returned by the API, ranging from 1 keyword at depth 0 to approximately 4680 keywords at depth 4. ### Changes 🏗️ <!-- Concisely describe all of the changes made in this pull request: --> - Added `depth` parameter to the DataForSEO Related Keywords block as an integer input field (range 0-4) - Added `depth` parameter to the `related_keywords` method signature in the API client - Updated the API client to include the depth parameter in the request payload when provided - Added documentation explaining the depth parameter's effect on the number of returned keywords - Fixed missing parameter in function signature that was causing runtime errors ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: <!-- Put your test plan here: --> - [x] Verified the depth parameter appears correctly in the block UI with appropriate range validation (0-4) - [x] Confirmed the parameter is passed correctly to the API client - [x] Tested that omitting the depth parameter doesn't break existing functionality (defaults to None) - [x] Verified the implementation follows the existing pattern for optional parameters in the DataForSEO blocks #### For configuration changes: - [x] `.env.default` is updated or already compatible with my changes - [x] `docker-compose.yml` is updated or already compatible with my changes - [ ] I have included a list of my configuration changes in the PR description (under Changes) Note: No configuration changes were required for this feature addition. --------- Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Toran Bruce Richards <Torantulino@users.noreply.github.com>	2025-09-24 21:29:47 +00:00
Nicholas Tindle	2bc6a56877	fix(backend): Fix GCS timeout error in FileInput blocks (#10976 ) ## Summary - Fixed "Timeout context manager should be used inside a task" error occurring intermittently in FileInput blocks when downloading files from Google Cloud Storage - Implemented proper async session management for GCS client to ensure operations run within correct task context - Added comprehensive logging to help diagnose and monitor the issue in production ## Changes ### Core Fix - Modified `CloudStorageHandler._retrieve_file_gcs()` to create a fresh GCS client and session for each download operation - This ensures the aiohttp session is always created within the proper async task context, preventing the timeout error - The fix trades a small amount of efficiency for reliability, but only affects download operations ### Logging Enhancements - Added detailed logging in `store_media_file()` to track execution context and async task state - Enhanced `scan_content_safe()` to specifically catch and log timeout errors with CRITICAL level - Added context logging in virus scanner around `asyncio.create_task()` calls - Upgraded key debug logs to info level for visibility in production ### Code Quality - Fixed unbound variable issue where `async_client` could be referenced before initialization - Replaced bare `except:` clauses with proper exception handling - Fixed unused parameters warning in `__aexit__` method ## Testing - The timeout error was occurring intermittently in production when FileInput blocks processed GCS files - With these changes, the error should be eliminated as the session is always created in the correct context - Comprehensive logging allows monitoring of the fix effectiveness in production ## Context The root cause was that `gcloud-aio-storage` was creating its internal aiohttp session/timeout context outside of an async task context when called by the executor. This happened intermittently depending on how the executor scheduled block execution. ## Related Issues - Addresses timeout errors reported in FileInput block execution - Improves reliability of file uploads from the platform ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: <!-- Put your test plan here: --> - [x] Test a multiple file input agent and it works - [x] Test the agent that is causing the failure and it works 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Zamil Majdy <zamil.majdy@agpt.co> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Reinier van der Leer <pwuts@agpt.co>	2025-09-24 16:21:41 -05:00
Nicholas Tindle	87c773d03a	fix(backend): Fix GCS timeout error in FileInput blocks (#10976 ) ## Summary - Fixed "Timeout context manager should be used inside a task" error occurring intermittently in FileInput blocks when downloading files from Google Cloud Storage - Implemented proper async session management for GCS client to ensure operations run within correct task context - Added comprehensive logging to help diagnose and monitor the issue in production ## Changes ### Core Fix - Modified `CloudStorageHandler._retrieve_file_gcs()` to create a fresh GCS client and session for each download operation - This ensures the aiohttp session is always created within the proper async task context, preventing the timeout error - The fix trades a small amount of efficiency for reliability, but only affects download operations ### Logging Enhancements - Added detailed logging in `store_media_file()` to track execution context and async task state - Enhanced `scan_content_safe()` to specifically catch and log timeout errors with CRITICAL level - Added context logging in virus scanner around `asyncio.create_task()` calls - Upgraded key debug logs to info level for visibility in production ### Code Quality - Fixed unbound variable issue where `async_client` could be referenced before initialization - Replaced bare `except:` clauses with proper exception handling - Fixed unused parameters warning in `__aexit__` method ## Testing - The timeout error was occurring intermittently in production when FileInput blocks processed GCS files - With these changes, the error should be eliminated as the session is always created in the correct context - Comprehensive logging allows monitoring of the fix effectiveness in production ## Context The root cause was that `gcloud-aio-storage` was creating its internal aiohttp session/timeout context outside of an async task context when called by the executor. This happened intermittently depending on how the executor scheduled block execution. ## Related Issues - Addresses timeout errors reported in FileInput block execution - Improves reliability of file uploads from the platform ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: <!-- Put your test plan here: --> - [x] Test a multiple file input agent and it works - [x] Test the agent that is causing the failure and it works 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Zamil Majdy <zamil.majdy@agpt.co> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Reinier van der Leer <pwuts@agpt.co>	2025-09-24 21:06:51 +00:00
Swifty	ebeefc96e8	feat(backend): implement caching layer for store API endpoints (Part 1) (#10975 ) ## Summary This PR introduces comprehensive caching for the Store API endpoints to improve performance and reduce database load. This is Part 1 in a series of PRs to add comprehensive caching across our entire API. ### Key improvements: - Implements caching layer using the existing `@cached` decorator from `autogpt_libs.utils.cache` - Reduces database queries by 80-90% for frequently accessed public data - Built-in thundering herd protection prevents database overload during cache expiry - Selective cache invalidation ensures data freshness when mutations occur ## Details ### Cached endpoints with TTLs: - Public data (5-10 min TTL): - `/agents` - Store agents list (2 min) - `/agents/{username}/{agent_name}` - Agent details (5 min) - `/graph/{store_listing_version_id}` - Agent graphs (10 min) - `/agents/{store_listing_version_id}` - Agent by version (10 min) - `/creators` - Creators list (5 min) - `/creator/{username}` - Creator details (5 min) - User-specific data (1 min TTL): - `/profile` - User profiles (5 min) - `/myagents` - User's own agents (1 min) - `/submissions` - User's submissions (1 min) ### Cache invalidation strategy: - Profile updates → clear user's profile cache - New reviews → clear specific agent cache + agents list - New submissions → clear agents list + user's caches - Submission edits → clear related version caches ### Cache management endpoints: - `GET /cache/info` - Monitor cache statistics - `POST /cache/clear` - Clear all caches - `POST /cache/clear/{cache_name}` - Clear specific cache ## Changes <!-- REQUIRED: Bullet point summary of changes --> - Added caching decorators to all suitable GET endpoints in store routes - Implemented cache invalidation on data mutations (POST/PUT/DELETE) - Added cache management endpoints for monitoring and manual clearing - Created comprehensive test suite for cache_delete functionality - Verified thundering herd protection works correctly ## Testing <!-- How to test your changes --> - ✅ Created comprehensive test suite (`test_cache_delete.py`) validating: - Selective cache deletion works correctly - Cache entries are properly invalidated on mutations - Other cache entries remain unaffected - cache_info() accurately reflects state - ✅ Tested thundering herd protection with concurrent requests - ✅ Verified all endpoints return correct data with and without cache ## Checklist <!-- REQUIRED: Be sure to check these off before marking the PR ready for review. --> - [x] I have self-reviewed this PR's diff, line by line - [x] I have updated and tested the software architecture documentation (if applicable) - [x] I have run the agent to verify that it still works (if applicable) --------- Co-authored-by: Zamil Majdy <zamil.majdy@agpt.co> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co> Co-authored-by: Reinier van der Leer <pwuts@agpt.co>	2025-09-24 10:01:52 +00:00
Nicholas Tindle	83fe8d5b94	fix(backend): make preset migration not crash the system (#10966 ) <!-- Clearly explain the need for these changes: --> For those who develop blocks, they may or may not exist in the code at the same time as the database. > Create block in one branch, test, then move to another branch the block is not in This migration will prevent startup in that case. ### Changes 🏗️ Adds a try except around the migration <!-- Concisely describe all of the changes made in this pull request: --> ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: <!-- Put your test plan here: --> - [x] Test that startup actually works --------- Co-authored-by: Reinier van der Leer <pwuts@agpt.co>	2025-09-24 14:23:22 +07:00
Zamil Majdy	50689218ed	feat(backend): implement comprehensive load testing performance fixes + database health improvements (#10965 )	2025-09-24 14:22:57 +07:00
Aayush Shah	ddff09a8e4	feat(blocks): add NotionReadPage block (#10760 ) Introduces a Notion Read Page block that fetches a page by ID via the Notion REST API. This is a first step toward Notion integration in the AutoGPT Platform. Motivation - Notion was not integrated yet. Im starting with a small block to add capability incrementally. ### Notes - I referred to the Todoist block implementation as a reference since I’m a beginner. - This is my first PR here - The block passed `docker compose run --rm rest_server pytest -q` successfully <!-- Clearly explain the need for these changes: --> <!-- Concisely describe all of the changes made in this pull request: --> ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: ### Test plan - [x] Ran `docker compose run --rm rest_server pytest -q backend/blocks/test/test_block.py -k notion` - [x] Confirmed tests passed (2 passed, 652 deselected, warnings only). - [x] Ran poetry run format to fix linters and tests --------- Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co> Co-authored-by: Nicholas Tindle <nicktindle@outlook.com>	2025-09-19 18:54:47 +00:00
Ubbe	0c363a1cea	fix(frontend): force dynamic rendering on marketplace (#10957 ) ## Changes 🏗️ When building on Vercel: ``` at Object.start (.next/server/chunks/2744.js:1:312830) { description: "Route /marketplace couldn't be rendered statically because it used `cookies`. See more info here: https://nextjs.org/docs/messages/dynamic-server-error", digest: 'DYNAMIC_SERVER_USAGE' } Failed to get server auth token: Error: Dynamic server usage: Route /marketplace couldn't be rendered statically because it used `cookies`. See more info here: https://nextjs.org/docs/messages/dynamic-server-error at r (.next/server/chunks/8450.js:22:7298) at n (.next/server/chunks/4735.js:1:37020) at g (.next/server/chunks/555.js:1:31925) at m (.next/server/chunks/555.js:1:87056) at h (.next/server/chunks/555.js:1:932) at k (.next/server/chunks/555.js:1:25195) at queryFn (.next/server/chunks/555.js:1:25590) at Object.f [as fn] (.next/server/chunks/2744.js:1:316625) at q (.next/server/chunks/2744.js:1:312288) at Object.start (.next/server/chunks/2744.js:1:312830) { description: "Route /marketplace couldn't be rendered statically because it used `cookies`. See more info here: https://nextjs.org/docs/messages/dynamic-server-error", digest: 'DYNAMIC_SERVER_USAGE' } ``` That's because the `/marketplace` page prefetches the store agents data on the server, and that query uses `cookies` for Auth. In theory, those endpoints can be called without auth, but I think if you are logged that affects the results. The simpler for now is to tell Next.js to not try to statically render it and render on the fly with caching. According to AI we shouldn't see much difference performance wise: > Short answer: Usually no noticeable slowdown. You’ll trade a small TTFB increase (server renders per request) for correct behavior with cookies. Overall interactivity stays the same since we still dehydrate React Query data. Why it’s fine: Server already had to fetch marketplace data; doing it at request-time vs build-time is roughly the same cost for users. Hydration uses the prefetched data, avoiding extra client round-trips. If you want extra speed: If those endpoints don’t need auth, we can skip reading cookies during server prefetch and enable ISR (e.g., revalidate=60) for partial caching. Or move the cookie-dependent parts to the client and keep the page static. ## Checklist 📋 ### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Run the app - [x] Page load marketplace is fine and not slow - [x] No build cookies errors ### For configuration changes: None	2025-09-19 08:24:08 +00:00
Ubbe	e5d870a348	refactor(frontend): move old components to `__legacy__` (#10953 ) ## Changes 🏗️ Moving non-design-system components ( old ) to a `components/__legacy__` folder 📁 so like this is more obvious for developers that they should not import them or use them on new features. What is now top-level in `/components` is what it is actively maintained. Document some existing components like `<Alert />`. More on this coming on follow-up PRs. ## Checklist 📋 ### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Test and types pass on the CI - [x] Run app locally, click around, looks good ### For configuration changes: None	2025-09-18 21:37:43 +00:00
Reinier van der Leer	3f19cba28f	fix(frontend/builder): Fix moved blocks disappearing on save (#10951 ) - Resolves #10926 - Fixes a bug introduced in #10779 ### Changes 🏗️ - Fix `.metadata.position` in graph save payload - Make node reconciliation after graph save more robust ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Moved nodes don't disappear on graph save autogpt-platform-beta-v0.6.29	2025-09-18 13:34:06 +00:00
Reinier van der Leer	a978e91271	fix(ci, backend): Update Redis image & amend config to work with it (#10952 ) CI is currently broken because Bitnami has pulled all `bitnami/redis` images. The current official Redis image on Docker Hub is `redis`. ### Changes 🏗️ - Replace `bitnami/redis:6.2` by `redis:latest` in Backend CI workflow file - Make `REDIS_PASSWORD` optional in the backend settings ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] CI no longer broken	2025-09-18 13:02:49 +00:00
Ubbe	f283e6c514	refactor(frontend): cleanup of `components` folder (2/3) (#10942 ) ## Changes 🏗️ Following up my initial PR to tidy up the `components` folder https://github.com/Significant-Gravitas/AutoGPT/pull/10940. This is mostly moving files around and renaming some + documenting them on the design system as needed. Should be pretty safe as long as types on the CI pass. ## Checklist 📋 ### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Run the app locally - [x] Click around, looks ok - [x] Test and types pass on the CI ### For configuration changes: None	2025-09-18 16:21:18 +09:00
Ubbe	9fc2101e7e	refactor(frontend): tidy up on components folder (#10940 ) ## Changes 🏗️ Re-organise the `components` folder, moving things which are not re-used across screens or part of the design system out of it. ## Checklist 📋 ### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Run the app locally - [x] It works and test/types pass CI wise ### For configuration changes: None	2025-09-17 12:56:49 +00:00
Bentlybro	634f826d82	Merge branch 'master' into dev autogpt-platform-beta-v0.6.28	2025-09-17 11:35:29 +01:00
Ubbe	6d6bf308fc	fix(frontend): marketplace page load and caching (#10934 ) ## Changes 🏗️ ### Server-Side: - ✅ ISR Cache: Page cached for 60 seconds, served instantly - ✅ Prefetch: All API calls made on server, not client - ✅ Static Generation: HTML pre-rendered with data - ✅ Streaming: Loading states show immediately ### Client-Side: - ✅ No API Calls: Data hydrated from server cache - ✅ Fast Hydration: React Query uses prefetched data - ✅ Smart Caching: 60s stale time prevents unnecessary requests - ✅ Progressive Loading: Suspense boundaries for better UX ### 🔄 Caching Strategy: 1. Server: ISR cache (60s) → API calls → Static HTML 2. CDN: Cached HTML served instantly 3. Client: Hydrated data from server → No additional API calls 4. Background: ISR regenerates stale pages automatically ### 🎯 Result: - First Visit: Instant HTML + hydrated data (no client API calls) - Subsequent Visits: Instant cached page - Background Updates: Automatic revalidation every 60s - Optimal Performance: Server-side rendering + client-side caching ## Checklist 📋 ### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Run the app locally - [x] Marketplace page loads are faster ### For configuration changes: None	2025-09-17 07:23:43 +00:00

... 10 11 12 13 14 ...

7872 Commits