AutoGPT

mirror of https://github.com/Significant-Gravitas/AutoGPT.git synced 2026-01-14 09:38:00 -05:00

Author	SHA1	Message	Date
Swifty	01bab66f5c	Merge branch 'swiftyos/caching-pt2' into swiftyos/shared-cache	2025-10-07 14:35:52 +02:00
Swifty	30b2d6b50d	update the frontend ci to generate the api files before running the tests	2025-10-07 14:13:33 +02:00
Swifty	97e77339fd	Merge branch 'dev' into swiftyos/caching-pt2	2025-10-07 14:06:44 +02:00
Abhimanyu Yadav	7c47f54e25	feat(frontend): add an API key modal for adding credentials in new builder. (#11105 ) In this PR, I’ve added an API Key modal to the new builder so users can add API key credentials. https://github.com/user-attachments/assets/68da226c-3787-4950-abb0-7a715910355e ### Changes - Updated the credential field to support API key. - Added a modal for creating new API keys and improved the selection UI for credentials. - Refactored components for better modularity and maintainability. - Enhanced styling and user experience in the FlowEditor components. - Updated OpenAPI documentation for better clarity on credential operations. ### Checklist 📋 - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Able to create API key perfectly. - [x] can select the correct credentials.	2025-10-07 11:19:17 +00:00
Swifty	6a360a49b1	Merge branch 'dev' into swiftyos/caching-pt2	2025-10-07 02:40:13 +02:00
Lluis Agusti	927042d93e	fix(frontend): more turnstile experiments (2)	2025-10-07 00:40:49 +09:00
Lluis Agusti	4244979a45	fix(frontend): more turnstile experiments	2025-10-07 00:22:20 +09:00
Lluis Agusti	aa27365e7f	fix(frontend): fix captcha reset	2025-10-06 23:57:42 +09:00
Nicholas Tindle	b86aa8b14e	feat(frontend): launchdarkly tracking on frontend browser (#11076 ) <!-- Clearly explain the need for these changes: --> We struggle to identify where issues are coming from feature flags and which are from normal use. This adds that split on the frontend. ### Changes 🏗️ Include sentry in the LD initialization <!-- Concisely describe all of the changes made in this pull request: --> ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: <!-- Put your test plan here: --> - [x] Test that launch darkly flags get attached to the frontend (browser only)	2025-10-06 13:48:13 +00:00
Lluis Agusti	e7ab2626f5	fix(frontend): remove captcha ref reset	2025-10-06 22:34:08 +09:00
Ubbe	ff58ce174b	fix(frontend): possible login issues related to turnstile (#11094 ) ## Changes 🏗️ We are seeing login and authentication issues in production and staging. Locally though, the app behaves fine. We also had issues related to the CAPTCHA in the past. Our CAPTCHA code is less than ideal, with some heavy `useEffect` that will load the Turnstile script into the DOM. I have the impression that is loading the script multiple times ( due to dependencies on the effects array not being well set ), or the like causing associated login issues. Created a new Turnstile component using [`react-turnstile`](https://docs.page/marsidev/react-turnstile) that is way simpler and should hopefully be more stable. I also fixed an issue with the Credits popover layout rendering cropped on the window. ## Checklist 📋 ### For code changes - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Login/logout on the app multiple times with Turnstile ON, everything is stable - [x] Credits popover appears on the right place ### For configuration changes: None	2025-10-06 12:59:27 +00:00
Abhimanyu Yadav	2d8ab6b7c0	feat(frontend): add selecting UI for custom node in new builder (#11091 ) React Flow has built-in functionality to select multiple nodes by using `cmd` + click. You can also select using rectangle selection by holding the shift key. However, we need to design a custom node after it’s selected. <img width="845" height="510" alt="Screenshot 2025-10-06 at 12 41 16 PM" src="https://github.com/user-attachments/assets/c91f22e3-2211-46b6-b3d3-fbc89148e99a" /> ### Tests - [x] Selecting Ui is visible after selecting a node, using cmd + click, and after rectangle selection.	2025-10-06 12:53:59 +00:00
Abhimanyu Yadav	a7306970b8	refactor(frontend): simplify marketplace search page and update data fetching (#11061 ) This PR refactors the marketplace search page to improve code maintainability, readability, and follows modern React patterns by extracting complex logic into a custom hook and creating dedicated components. ### 🔄 Changes #### Architecture Improvements - Component Extraction: Replaced the monolithic `SearchResults` component with a cleaner `MainSearchResultPage` component that focuses solely on presentation - Custom Hook Pattern: Extracted all business logic and state management into `useMainSearchResultPage` hook for better separation of concerns - Loading State Component: Added dedicated `MainSearchResultPageLoading` component for consistent loading UI #### Code Simplification - Reduced search page to 19 lines (from 175 lines) by removing inline logic and state management - Centralized data fetching using auto-generated API endpoints (`useGetV2ListStoreAgents`, `useGetV2ListStoreCreators`) - Improved error handling with dedicated error states and loading states #### Feature Updates - Sort Options: Commented out "Most Recent" and "Highest Rated" sort options due to backend limitations (no date/rating data available) - Client-side Sorting: Implemented client-side sorting for "runs" and "rating" as a temporary solution - Search Filters: Maintained filter functionality for agents/creators with improved state management ### 📊 Impact - Better Developer Experience: Code is now more modular and easier to understand - Improved Maintainability: Business logic separated from presentation layer - Future-Ready: Structure prepared for backend improvements when date/rating data becomes available - Type Safety: Leveraging TypeScript with auto-generated API types ### 🧪 Testing Checklist - [x] Search functionality works correctly with various search terms - [x] Filter chips correctly toggle between "All", "Agents", and "Creators" - [x] Sort dropdown displays only "Most Runs" option - [x] Client-side sorting correctly sorts agents and creators by runs - [x] Loading state displays while fetching data - [x] Error state displays when API calls fail - [x] "No results found" message appears for empty searches - [x] Search bar in results page is functional - [x] Results display correctly with proper layout and styling	2025-10-06 12:53:45 +00:00
Abhimanyu Yadav	c42f94ce2a	feat(frontend): add new credential field for new builder (#11066 ) In this PR, I’ve added a feature to select a credential from a list and also provided a UI to create a new credential if desired. <img width="443" height="157" alt="Screenshot 2025-10-06 at 9 28 07 AM" src="https://github.com/user-attachments/assets/d9e72a14-255d-45b6-aa61-b55c2465dd7e" /> #### Frontend Changes: - Refactored credential field from a single component to a modular architecture: - Created `CredentialField/` directory with separated concerns - Added `SelectCredential.tsx` component for credential selection UI with provider details display - Implemented `useCredentialField.ts` custom hook for credential data fetching with 10-minute caching - Added `helpers.ts` with credential filtering and provider name formatting utilities - Added loading states with skeleton UI while fetching credentials - Enhanced UI/UX features: - Dropdown selector showing credentials with provider, title, username, and host details - Visual key icon for each credential option - Placeholder "Add API Key" button (implementation pending) - Loading skeleton UI for better perceived performance - Smart filtering of credentials based on provider requirements - Template improvements: - Updated `FieldTemplate.tsx` to properly handle credential field display - Special handling for credential field labels showing provider-specific names - Removed input handle for credential fields in the node editor #### Backend Changes: - API Documentation improvements: - Added OpenAPI summaries to `/credentials` endpoint ("List Credentials") - Added summary to `/{provider}/credentials/{cred_id}` endpoint ("Get Specific Credential By ID") ### Test Plan 📋 - [x] Navigate to the flow builder - [x] Add a block that requires credentials (e.g., API block) - [x] Verify the credential dropdown loads and displays available credentials - [x] Check that only credentials matching the provider requirements are shown	2025-10-06 12:52:45 +00:00
Zamil Majdy	4e1557e498	fix(backend): Add dynamic input pin support for Smart Decision Maker Block (#11082 ) ## Summary - Centralize dynamic field delimiters and helpers in backend/data/dynamic_fields.py. - Refactor SmartDecisionMaker: build function signatures with dynamic-field mapping and re-map tool outputs back to original dynamic names. - Deterministic retry loop with retry-only feedback to avoid polluting final conversation history. - Update executor/utils.py and data/graph.py to use centralized utilities. - Update and extend tests: dynamic-field E2E flow, mapping verification, output yielding, and retry validation; switch mocked llm_call to AsyncMock; align tool-name expectations. - Add a single-tool fallback in schema lookup to support mocked scenarios. ## Validation - Full backend test suite: 1125 passed, 88 skipped, 53 warnings (local). - Backend lint/format pass. ## Scope - Minimal and localized to SmartDecisionMaker and dynamic-field utilities; unrelated pyright warnings remain unchanged. ## Risks/Mitigations - Behavior is backward-compatible; dynamic-field constants are centralized and reused. - Output re-mapping only affects SmartDecisionMaker tool outputs and matches existing link naming conventions. ## Checklist - [x] Formatted and linted - [x] All updated tests pass locally - [x] No secrets introduced --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-04 14:23:13 +00:00
Swifty	045009a84a	fix test	2025-10-03 20:36:27 +02:00
Swifty	70cb7824fd	Merge branch 'swiftyos/caching-pt2' into swiftyos/shared-cache	2025-10-03 20:30:40 +02:00
seer-by-sentry[bot]	7f8cf36ceb	feat(frontend): Add description to Upload Agent dialog (#11053 ) ### Changes 🏗️ - Added a description to the Upload Agent dialog to provide more context for users. Fixes [BUILDER-3N1](https://sentry.io/organizations/significant-gravitas/issues/6915512912/). The issue was that: DialogContent in LibraryUploadAgentDialog lacks an accessible description, violating WAI-ARIA standards. <img width="2066" height="1740" alt="image" src="https://github.com/user-attachments/assets/c876fb33-4375-4a66-a6a2-6b13c00ef8d3" /> ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: <!-- Put your test plan here: --> - [x] Test it works - [x] Get design approval Co-authored-by: seer-by-sentry[bot] <157164994+seer-by-sentry[bot]@users.noreply.github.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co>	2025-10-03 16:38:49 +00:00
Swifty	df1d15fcfe	added more tests and central place to set page_size to make the caching more robust	2025-10-03 17:56:35 +02:00
Swifty	eb022e50a7	moved clear cache helper function to cache	2025-10-03 15:21:18 +02:00
Swifty	431042a391	fix cache clear helper function	2025-10-03 15:10:34 +02:00
Swifty	7c6a9146f0	Merge branch 'dev' into swiftyos/caching-pt2	2025-10-03 15:06:39 +02:00
Swifty	0264cb56d3	update page_size invalidation	2025-10-03 12:17:24 +02:00
Zamil Majdy	8b4eb6f87c	fix(backend): resolve SmartDecisionMaker ChatCompletionMessage error and enhance tool call token counting (#11059 ) ## Summary Fix two critical production issues affecting SmartDecisionMaker functionality and prompt compression accuracy. ### 🔧 Changes Made #### Issue 1: SmartDecisionMaker ChatCompletionMessage Error Problem: PR #11015 introduced code that appended `response.raw_response` (ChatCompletionMessage object) directly to conversation history, causing `'ChatCompletionMessage' object has no attribute 'get'` errors. Root Cause: ChatCompletionMessage objects don't have `.get()` method but conversation history processing expects dictionary objects with `.get()` capability. Solution: Created `_convert_raw_response_to_dict()` helper function for type-safe conversion: - ✅ Helper function: Safely converts raw_response to dictionary format for conversation history - ✅ Type safety: Handles OpenAI (ChatCompletionMessage), Anthropic (Message), and Ollama (string) responses - ✅ Preserves context: Maintains conversation flow for multi-turn tool calling scenarios - ✅ DRY principle: Single helper used in both validation error path (line 624) and success path (line 681) - ✅ No breaking changes: Tool call continuity preserved for complex workflows #### Issue 2: Tool Call Token Counting in Prompt Compression Problem: `_msg_tokens()` function only counted tokens in 'content' field, severely undercounting tool calls which store data in different fields (tool_calls, function.arguments, etc.). Root Cause: Tool calls have no 'content' to calculate length of, causing massive token undercounting during prompt compression that could lead to context overflow. Solution: Enhanced `_msg_tokens()` to handle both OpenAI and Anthropic tool call formats: - ✅ OpenAI format: Count tokens in `tool_calls[].id`, `type`, `function.name`, `function.arguments` - ✅ Anthropic format: Count tokens in `content[].tool_use` (`id`, `name`, `input`) and `content[].tool_result` - ✅ Backward compatibility: Regular string content counted exactly as before - ✅ Comprehensive testing: Added 11 unit tests in `prompt_test.py` ### 📊 Validation Results - ✅ SmartDecisionMaker errors resolved: No more ChatCompletionMessage.get() failures - ✅ Token counting accuracy: OpenAI tool calls 9+ tokens vs previous 3-4 wrapper-only tokens - ✅ Token counting accuracy: Anthropic tool calls 13+ tokens vs previous 3-4 wrapper-only tokens - ✅ Backward compatibility: Regular messages maintain exact same token count - ✅ Type safety: 0 type errors in both modified files - ✅ Test coverage: All 11 new unit tests pass + existing SmartDecisionMaker tests pass - ✅ Multi-turn conversations: Tool call workflows continue working correctly ### 🎯 Impact - Resolves Sentry issue OPEN-2750: ChatCompletionMessage errors eliminated - Prevents context overflow: Accurate token counting during prompt compression for long tool call conversations - Production stability: SmartDecisionMaker retry mechanism works correctly with proper conversation flow - Resource efficiency: Better memory management through accurate token accounting - Zero breaking changes: Full backward compatibility maintained ### 🧪 Test Plan - [x] Verified SmartDecisionMaker no longer crashes with ChatCompletionMessage errors - [x] Validated tool call token counting accuracy with comprehensive unit tests (11 tests all pass) - [x] Confirmed backward compatibility for regular message token counting - [x] Tested both OpenAI and Anthropic tool call formats - [x] Verified type safety with pyright checks - [x] Ensured conversation history flows correctly with helper function - [x] Confirmed multi-turn tool calling scenarios work with preserved context ### 📝 Files Modified - `backend/blocks/smart_decision_maker.py` - Added `_convert_raw_response_to_dict()` helper for safe conversion - `backend/util/prompt.py` - Enhanced tool call token counting for accurate prompt compression - `backend/util/prompt_test.py` - Comprehensive unit tests for token counting (11 tests) ### ⚡ Ready for Review Both fixes are critical for production stability and have been thoroughly tested with zero breaking changes. The helper function approach ensures type safety while preserving essential conversation context for complex tool calling workflows. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-03 00:25:21 +00:00
Reinier van der Leer	4b7d17b9d2	refactor(blocks/code): Clean up & rename code execution blocks (#11019 ) The code execution blocks' implementations are heavily duplicated and their names aren't very clear. E.g. the "InstantiationBlock" just shows up as "Instantiation" in the block list. I would've done this in #11017 but kept the refactoring separate for easier reviewing. ### Changes 🏗️ - Rename "Code Execution" block to "Execute Code" - Rename "Instantiation" block to "Instantiate Code Sandbox" - Rename "Step Execution" block to "Execute Code Step" - Deduplicate implementation of the three code execution blocks - Add `dispose_sandbox` toggle to "Execute Code" and "Execute Code Step" blocks - Note: it's default `True` on the Execute Code block, default `False` on the Execute Code Step block - Update block and input descriptions to clarify behavior - Fix all linting issues <details> <summary>Screenshots</summary> ![the three blocks as they look now](https://github.com/user-attachments/assets/8e4274f7-e006-440c-b2b8-980df546186d) ![updated block names and descriptions in the block list](https://github.com/user-attachments/assets/866c3d9e-13ea-4fc0-87de-a5257bafb6d4) ![the new dispose_sandbox toggle on the Execute Code block](https://github.com/user-attachments/assets/56815dbb-f313-4308-81dd-50d949d9eafb) ![the new dispose_sandbox toggle on the Execute Code Step block](https://github.com/user-attachments/assets/469c140c-4cd2-4210-97b2-f27fc91778de) </details> ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Test all code execution blocks manually	2025-10-02 22:50:49 +00:00
dependabot[bot]	0fc6a44389	chore(backend/deps-dev): Bump the development-dependencies group across 1 directory with 4 updates (#10946 ) Bumps the development-dependencies group with 4 updates in the /autogpt_platform/backend directory: [faker](https://github.com/joke2k/faker), [pyright](https://github.com/RobertCraigie/pyright-python), [pytest-mock](https://github.com/pytest-dev/pytest-mock) and [ruff](https://github.com/astral-sh/ruff). Updates `faker` from 37.6.0 to 37.8.0 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/joke2k/faker/releases">faker's releases</a>.</em></p> <blockquote> <h2>Release v37.8.0</h2> <p>See <a href="https://github.com/joke2k/faker/blob/refs/tags/v37.8.0/CHANGELOG.md">CHANGELOG.md</a>.</p> <h2>Release v37.7.0</h2> <p>See <a href="https://github.com/joke2k/faker/blob/refs/tags/v37.7.0/CHANGELOG.md">CHANGELOG.md</a>.</p> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/joke2k/faker/blob/master/CHANGELOG.md">faker's changelog</a>.</em></p> <blockquote> <h3><a href="https://github.com/joke2k/faker/compare/v37.7.0...v37.8.0">v37.8.0 - 2025-09-15</a></h3> <ul> <li>Add Automotive providers for <code>ja_JP</code> locale. Thanks <a href="https://github.com/ItoRino424"><code>@ItoRino424</code></a>.</li> </ul> <h3><a href="https://github.com/joke2k/faker/compare/v37.6.0...v37.7.0">v37.7.0 - 2025-09-15</a></h3> <ul> <li>Add Nigerian name locales (<code>yo_NG</code>, <code>ha_NG</code>, <code>ig_NG</code>, <code>en_NG</code>). Thanks <a href="https://github.com/ifeoluwaoladeji"><code>@ifeoluwaoladeji</code></a>.</li> </ul> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href="`4bde8f57ad`"><code>4bde8f5</code></a> Bump version: 37.7.0 → 37.8.0</li> <li><a href="`f542f364cb`"><code>f542f36</code></a> 📝 Update CHANGELOG.md</li> <li><a href="`e28d7cb909`"><code>e28d7cb</code></a> fix test</li> <li><a href="`e4305b0e29`"><code>e4305b0</code></a> fix padding</li> <li><a href="`a359441a81`"><code>a359441</code></a> 💄 format code</li> <li><a href="`0e3f0bdf81`"><code>0e3f0bd</code></a> Add Automotive providers for <code>ja_JP</code> locale (<a href="https://redirect.github.com/joke2k/faker/issues/2251">#2251</a>)</li> <li><a href="`d4fa69dfc7`"><code>d4fa69d</code></a> Bump version: 37.6.0 → 37.7.0</li> <li><a href="`f636f06a37`"><code>f636f06</code></a> 📝 Update CHANGELOG.md</li> <li><a href="`9a482dd25b`"><code>9a482dd</code></a> 💄 Format code</li> <li><a href="`2493b2d51a`"><code>2493b2d</code></a> fix: fix minor grammar typo (<a href="https://redirect.github.com/joke2k/faker/issues/2259">#2259</a>)</li> <li>Additional commits viewable in <a href="https://github.com/joke2k/faker/compare/v37.6.0...v37.8.0">compare view</a></li> </ul> </details> <br /> Updates `pyright` from 1.1.404 to 1.1.405 <details> <summary>Commits</summary> <ul> <li><a href="`e211ec8df8`"><code>e211ec8</code></a> Pyright NPM Package update to 1.1.405 (<a href="https://redirect.github.com/RobertCraigie/pyright-python/issues/353">#353</a>)</li> <li>See full diff in <a href="https://github.com/RobertCraigie/pyright-python/compare/v1.1.404...v1.1.405">compare view</a></li> </ul> </details> <br /> Updates `pytest-mock` from 3.14.1 to 3.15.1 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/pytest-dev/pytest-mock/releases">pytest-mock's releases</a>.</em></p> <blockquote> <h2>v3.15.1</h2> <p><em>2025-09-16</em></p> <ul> <li><a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/529">#529</a>: Fixed <code>itertools._tee object has no attribute error</code> -- now <code>duplicate_iterators=True</code> must be passed to <code>mocker.spy</code> to duplicate iterators.</li> </ul> <h2>v3.15.0</h2> <p><em>2025-09-04</em></p> <ul> <li>Python 3.8 (EOL) is no longer supported.</li> <li><a href="https://redirect.github.com/pytest-dev/pytest-mock/pull/524">#524</a>: Added <code>spy_return_iter</code> to <code>mocker.spy</code>, which contains a duplicate of the return value of the spied method if it is an <code>Iterator</code>.</li> </ul> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/pytest-dev/pytest-mock/blob/main/CHANGELOG.rst">pytest-mock's changelog</a>.</em></p> <blockquote> <h2>3.15.1</h2> <p><em>2025-09-16</em></p> <ul> <li><code>[#529](https://github.com/pytest-dev/pytest-mock/issues/529) <https://github.com/pytest-dev/pytest-mock/issues/529></code>_: Fixed <code>itertools._tee object has no attribute error</code> -- now <code>duplicate_iterators=True</code> must be passed to <code>mocker.spy</code> to duplicate iterators.</li> </ul> <h2>3.15.0</h2> <p><em>2025-09-04</em></p> <ul> <li>Python 3.8 (EOL) is no longer supported.</li> <li><code>[#524](https://github.com/pytest-dev/pytest-mock/issues/524) <https://github.com/pytest-dev/pytest-mock/pull/524></code>_: Added <code>spy_return_iter</code> to <code>mocker.spy</code>, which contains a duplicate of the return value of the spied method if it is an <code>Iterator</code>.</li> </ul> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href="`e1b5c62a38`"><code>e1b5c62</code></a> Release 3.15.1</li> <li><a href="`184eb190d6`"><code>184eb19</code></a> Set <code>spy_return_iter</code> only when explicitly requested (<a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/537">#537</a>)</li> <li><a href="`4fa0088a0a`"><code>4fa0088</code></a> [pre-commit.ci] pre-commit autoupdate (<a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/536">#536</a>)</li> <li><a href="`f5aff33ce7`"><code>f5aff33</code></a> Fix test failure with pytest 8+ and verbose mode (<a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/535">#535</a>)</li> <li><a href="`adc41873c9`"><code>adc4187</code></a> Bump actions/setup-python from 5 to 6 in the github-actions group (<a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/533">#533</a>)</li> <li><a href="`95ad570060`"><code>95ad570</code></a> [pre-commit.ci] pre-commit autoupdate (<a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/532">#532</a>)</li> <li><a href="`e696bf02c1`"><code>e696bf0</code></a> Fix standalone mock support (<a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/531">#531</a>)</li> <li><a href="`5b29b03ce9`"><code>5b29b03</code></a> Fix gen-release-notes script</li> <li><a href="`7d22ef4e56`"><code>7d22ef4</code></a> Merge pull request <a href="https://redirect.github.com/pytest-dev/pytest-mock/issues/528">#528</a> from pytest-dev/release-3.15.0</li> <li><a href="`90b29f89e2`"><code>90b29f8</code></a> Update CHANGELOG for 3.15.0</li> <li>Additional commits viewable in <a href="https://github.com/pytest-dev/pytest-mock/compare/v3.14.1...v3.15.1">compare view</a></li> </ul> </details> <br /> Updates `ruff` from 0.12.11 to 0.13.0 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/astral-sh/ruff/releases">ruff's releases</a>.</em></p> <blockquote> <h2>0.13.0</h2> <h2>Release Notes</h2> <p>Check out the <a href="https://astral.sh/blog/ruff-v0.13.0">blog post</a> for a migration guide and overview of the changes!</p> <h3>Breaking changes</h3> <ul> <li> <p><strong>Several rules can now add <code>from __future__ import annotations</code> automatically</strong></p> <p><code>TC001</code>, <code>TC002</code>, <code>TC003</code>, <code>RUF013</code>, and <code>UP037</code> now add <code>from __future__ import annotations</code> as part of their fixes when the <code>lint.future-annotations</code> setting is enabled. This allows the rules to move more imports into <code>TYPE_CHECKING</code> blocks (<code>TC001</code>, <code>TC002</code>, and <code>TC003</code>), use PEP 604 union syntax on Python versions before 3.10 (<code>RUF013</code>), and unquote more annotations (<code>UP037</code>).</p> </li> <li> <p><strong>Full module paths are now used to verify first-party modules</strong></p> <p>Ruff now checks that the full path to a module exists on disk before categorizing it as a first-party import. This change makes first-party import detection more accurate, helping to avoid false positives on local directories with the same name as a third-party dependency, for example. See the <a href="https://docs.astral.sh/ruff/faq/#how-does-ruff-determine-which-of-my-imports-are-first-party-third-party-etc">FAQ section</a> on import categorization for more details.</p> </li> <li> <p><strong>Deprecated rules must now be selected by exact rule code</strong></p> <p>Ruff will no longer activate deprecated rules selected by their group name or prefix. As noted below, the two remaining deprecated rules were also removed in this release, so this won't affect any current rules, but it will still affect any deprecations in the future.</p> </li> <li> <p><strong>The deprecated macOS configuration directory fallback has been removed</strong></p> <p>Ruff will no longer look for a user-level configuration file at <code>~/Library/Application Support/ruff/ruff.toml</code> on macOS. This feature was deprecated in v0.5 in favor of using the <a href="https://specifications.freedesktop.org/basedir-spec/latest/">XDG specification</a> (usually resolving to <code>~/.config/ruff/ruff.toml</code>), like on Linux. The fallback and accompanying deprecation warning have now been removed.</p> </li> </ul> <h3>Removed Rules</h3> <p>The following rules have been removed:</p> <ul> <li><a href="https://docs.astral.sh/ruff/rules/pandas-df-variable-name"><code>pandas-df-variable-name</code></a> (<code>PD901</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/non-pep604-isinstance"><code>non-pep604-isinstance</code></a> (<code>UP038</code>)</li> </ul> <h3>Stabilization</h3> <p>The following rules have been stabilized and are no longer in preview:</p> <ul> <li><a href="https://docs.astral.sh/ruff/rules/airflow-dag-no-schedule-argument"><code>airflow-dag-no-schedule-argument</code></a> (<code>AIR002</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/airflow3-removal"><code>airflow3-removal</code></a> (<code>AIR301</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/airflow3-moved-to-provider"><code>airflow3-moved-to-provider</code></a> (<code>AIR302</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/airflow3-suggested-update"><code>airflow3-suggested-update</code></a> (<code>AIR311</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/airflow3-suggested-to-move-to-provider"><code>airflow3-suggested-to-move-to-provider</code></a> (<code>AIR312</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/long-sleep-not-forever"><code>long-sleep-not-forever</code></a> (<code>ASYNC116</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/f-string-number-format"><code>f-string-number-format</code></a> (<code>FURB116</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/os-symlink"><code>os-symlink</code></a> (<code>PTH211</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/generic-not-last-base-class"><code>generic-not-last-base-class</code></a> (<code>PYI059</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/redundant-none-literal"><code>redundant-none-literal</code></a> (<code>PYI061</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/pytest-raises-ambiguous-pattern"><code>pytest-raises-ambiguous-pattern</code></a> (<code>RUF043</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/unused-unpacked-variable"><code>unused-unpacked-variable</code></a> (<code>RUF059</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/useless-class-metaclass-type"><code>useless-class-metaclass-type</code></a> (<code>UP050</code>)</li> </ul> <p>The following behaviors have been stabilized:</p> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/astral-sh/ruff/blob/main/CHANGELOG.md">ruff's changelog</a>.</em></p> <blockquote> <h2>0.13.0</h2> <p>Check out the <a href="https://astral.sh/blog/ruff-v0.13.0">blog post</a> for a migration guide and overview of the changes!</p> <h3>Breaking changes</h3> <ul> <li> <p><strong>Several rules can now add <code>from __future__ import annotations</code> automatically</strong></p> <p><code>TC001</code>, <code>TC002</code>, <code>TC003</code>, <code>RUF013</code>, and <code>UP037</code> now add <code>from __future__ import annotations</code> as part of their fixes when the <code>lint.future-annotations</code> setting is enabled. This allows the rules to move more imports into <code>TYPE_CHECKING</code> blocks (<code>TC001</code>, <code>TC002</code>, and <code>TC003</code>), use PEP 604 union syntax on Python versions before 3.10 (<code>RUF013</code>), and unquote more annotations (<code>UP037</code>).</p> </li> <li> <p><strong>Full module paths are now used to verify first-party modules</strong></p> <p>Ruff now checks that the full path to a module exists on disk before categorizing it as a first-party import. This change makes first-party import detection more accurate, helping to avoid false positives on local directories with the same name as a third-party dependency, for example. See the <a href="https://docs.astral.sh/ruff/faq/#how-does-ruff-determine-which-of-my-imports-are-first-party-third-party-etc">FAQ section</a> on import categorization for more details.</p> </li> <li> <p><strong>Deprecated rules must now be selected by exact rule code</strong></p> <p>Ruff will no longer activate deprecated rules selected by their group name or prefix. As noted below, the two remaining deprecated rules were also removed in this release, so this won't affect any current rules, but it will still affect any deprecations in the future.</p> </li> <li> <p><strong>The deprecated macOS configuration directory fallback has been removed</strong></p> <p>Ruff will no longer look for a user-level configuration file at <code>~/Library/Application Support/ruff/ruff.toml</code> on macOS. This feature was deprecated in v0.5 in favor of using the <a href="https://specifications.freedesktop.org/basedir-spec/latest/">XDG specification</a> (usually resolving to <code>~/.config/ruff/ruff.toml</code>), like on Linux. The fallback and accompanying deprecation warning have now been removed.</p> </li> </ul> <h3>Removed Rules</h3> <p>The following rules have been removed:</p> <ul> <li><a href="https://docs.astral.sh/ruff/rules/pandas-df-variable-name"><code>pandas-df-variable-name</code></a> (<code>PD901</code>)</li> <li><a href="https://docs.astral.sh/ruff/rules/non-pep604-isinstance"><code>non-pep604-isinstance</code></a> (<code>UP038</code>)</li> </ul> <h3>Stabilization</h3> <p>The following rules have been stabilized and are no longer in preview:</p> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`a1fdd66f10`"><code>a1fdd66</code></a> Bump 0.13.0 (<a href="https://redirect.github.com/astral-sh/ruff/issues/20336">#20336</a>)</li> <li><a href="`8770b95509`"><code>8770b95</code></a> [ty] introduce <code>DivergentType</code> (<a href="https://redirect.github.com/astral-sh/ruff/issues/20312">#20312</a>)</li> <li><a href="`65982a1e14`"><code>65982a1</code></a> [ty] Use 'unknown' specialization for upper bound on Self (<a href="https://redirect.github.com/astral-sh/ruff/issues/20325">#20325</a>)</li> <li><a href="`57d1f7132d`"><code>57d1f71</code></a> [ty] Simplify unions of enum literals and subtypes thereof (<a href="https://redirect.github.com/astral-sh/ruff/issues/20324">#20324</a>)</li> <li><a href="`7a75702237`"><code>7a75702</code></a> Ignore deprecated rules unless selected by exact code (<a href="https://redirect.github.com/astral-sh/ruff/issues/20167">#20167</a>)</li> <li><a href="`9ca632c84f`"><code>9ca632c</code></a> Stabilize adding future import via config option (<a href="https://redirect.github.com/astral-sh/ruff/issues/20277">#20277</a>)</li> <li><a href="`64fe7d30a3`"><code>64fe7d3</code></a> [<code>flake8-errmsg</code>] Stabilize extending <code>raw-string-in-exception</code> (<code>EM101</code>) to ...</li> <li><a href="`beeeb8d5c5`"><code>beeeb8d</code></a> Stabilize the remaining Airflow rules (<a href="https://redirect.github.com/astral-sh/ruff/issues/20250">#20250</a>)</li> <li><a href="`b6fca52855`"><code>b6fca52</code></a> [<code>flake8-bugbear</code>] Stabilize support for non-context-manager calls in `assert...</li> <li><a href="`ac7f882c78`"><code>ac7f882</code></a> [<code>flake8-commas</code>] Stabilize support for trailing comma checks in type paramet...</li> <li>Additional commits viewable in <a href="https://github.com/astral-sh/ruff/compare/0.12.11...0.13.0">compare view</a></li> </ul> </details> <br /> Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore <dependency name> major version` will close this group update PR and stop Dependabot creating any more for the specific dependency's major version (unless you unignore this specific dependency's major version or upgrade to it yourself) - `@dependabot ignore <dependency name> minor version` will close this group update PR and stop Dependabot creating any more for the specific dependency's minor version (unless you unignore this specific dependency's minor version or upgrade to it yourself) - `@dependabot ignore <dependency name>` will close this group update PR and stop Dependabot creating any more for the specific dependency (unless you unignore this specific dependency or upgrade to it yourself) - `@dependabot unignore <dependency name>` will remove all of the ignore conditions of the specified dependency - `@dependabot unignore <dependency name> <ignore condition>` will remove the ignore condition of the specified dependency and ignore conditions </details> --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Nicholas Tindle <ntindle@users.noreply.github.com>	2025-10-02 20:57:18 +00:00
dependabot[bot]	f5ee579ab2	chore(backend/deps): Bump firecrawl-py from 2.16.3 to 4.3.1 in /autogpt_platform/backend (#10809 ) Bumps [firecrawl-py](https://github.com/firecrawl/firecrawl) from 2.16.3 to 4.3.1. <details> <summary>Commits</summary> <ul> <li>See full diff in <a href="https://github.com/firecrawl/firecrawl/commits">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=firecrawl-py&package-manager=pip&previous-version=2.16.3&new-version=4.3.1)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) You can trigger a rebase of this PR by commenting `@dependabot rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) </details> <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Upgrade firecrawl-py to v4.3.6 and refactor firecrawl blocks to new v4 API, formats handling, method names, and response fields. > > - Dependencies > - Bump `firecrawl-py` from `2.16.3` to `4.3.6` (adds `httpx`, updates `pydantic>=2`). > - Firecrawl API migration > - Centralize `ScrapeFormat` in `backend/blocks/firecrawl/_api.py`. > - Add `_format_utils.convert_to_format_options` to map `ScrapeFormat` (incl. `screenshot@fullPage`) to v4 `FormatOption`/`ScreenshotFormat`. > - Switch to v4 types (`firecrawl.v2.types.ScrapeOptions`); adopt snake_case fields (`only_main_content`, `max_age`, `wait_for`). > - Rename methods: `crawl_url` → `crawl`, `scrape_url` → `scrape`, `map_url` → `map`. > - Normalize response attributes: `rawHtml` → `raw_html`, `changeTracking` → `change_tracking`. > - Blocks > - `crawl.py`, `scrape.py`, `search.py`: use new formats conversion and updated options/fields; adjust iteration over results (`search`: iterate `web` when present). > - `map.py`: return both `links` and detailed `results` (url/title/description) and update output schema accordingly. > - Project files > - Update `pyproject.toml` and `poetry.lock` for new dependency versions. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit `d872f2e82b`. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY --> > Note > Automatic rebases have been disabled on this pull request as it has been open for over 30 days. --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Nicholas Tindle <ntindle@users.noreply.github.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co>	2025-10-02 20:14:18 +00:00
Swifty	ef552c189f	feat(cache): refactor caching system and address PR review feedback Addresses all 14 review comments from @majdyz on PR #11030 Major architectural improvements: - Move cache.py from autogpt_libs to backend/util for proper dependency management - Add Redis configuration to Settings class for centralized config management - Remove duplicate retry.py from autogpt_libs (use backend.util.retry) - Implement dedicated Redis connection pool with 50 max connections Cache API enhancements: - Make ttl_seconds a required parameter (no infinite TTLs allowed) - Add CachedValue dataclass to eliminate tuple ambiguity when caching tuple results - Implement LRU with TTL refresh using Redis GETEX command - Add pattern-based cache clearing: cache_clear(pattern="user:*") - Simplify wrapper logic by extracting helper functions Redis integration: - Create separate connection pool for cache (binary mode for pickle) - Add recommended Redis production configuration in comments - Use Settings class for Redis config instead of environment variables directly - Update both cache.py and redis_client.py to use centralized settings Test improvements: - Move test file from autogpt_libs to backend/test - Fix tests to use pickleable data structures instead of MagicMock objects - Update all @cached() decorators to include ttl_seconds parameter - All 51 cache tests passing Breaking changes: - @cached() decorator now requires ttl_seconds parameter - Import path changed: autogpt_libs.utils.cache -> backend.util.cache - Tests using shared_cache=True must return pickleable objects	2025-10-02 15:45:43 +02:00
Zamil Majdy	57a06f7088	fix(blocks, security): Fixes for various DoS vulnerabilities (#10798 ) This PR addresses multiple critical and medium security vulnerabilities that could lead to Denial of Service (DoS) attacks. All fixes implement defense-in-depth strategies with comprehensive testing. ### Changes 🏗️ #### Critical Security Fixes: 1. GHSA-m2wr-7m3r-p52c - ReDoS in CodeExtractionBlock - Fixed catastrophic backtracking in regex patterns `\s+[\s\S]?` and `\s+(.?)` - Replaced with safer patterns: `[ \t]\n([^\s\S]?)` - Files: `backend/blocks/code_extraction_block.py` 2. GHSA-955p-gpfx-r66j - AITextSummarizerBlock Memory Amplification - Added 1MB text size limit and 100 chunk maximum - Prevents 10K input → 50G memory amplification attacks - Files: `backend/blocks/llm.py` 3. GHSA-5cqw-g779-9f9x - RSS Feed XML Bomb DoS - Added 10MB feed size limit and 30s timeout - Prevents deep XML parsing memory exhaustion - Files: `backend/blocks/rss.py` 4. GHSA-7g34-7fvq-xxq6 - File Storage Disk Exhaustion - Added 100MB per file and 1GB per execution directory limits - Prevents disk space exhaustion from file uploads - Files: `backend/util/file.py` 5. GHSA-pppq-xx2w-7jpq - ExtractTextInformationBlock ReDoS - Added 1MB text limit, 1000 match limit, and 5s timeout protection - Prevents lookahead pattern memory exhaustion - Files: `backend/blocks/text.py` 6. GHSA-vw3v-whvp-33v5 - Docker Logging Disk Exhaustion - Added log rotation limits at Docker (10MB × 3 files) and application levels - Prevents unbounded log growth causing disk exhaustion - Files: `docker-compose.platform.yml`, `autogpt_libs/autogpt_libs/logging/config.py` #### Additional Security Improvements: 7. StepThroughItemsBlock DoS Prevention - Added 10,000 item limit and 1MB input size limit - Prevents large iteration DoS attacks - Files: `backend/blocks/iteration.py` 8. XMLParserBlock XML Bomb Prevention - Added 10MB XML input size limit - Files: `backend/blocks/xml_parser.py` #### Code Quality: - Fixed Python 3.10 typing compatibility issues - Added comprehensive security test suite - All code formatted and linted ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Created comprehensive security test suite covering all vulnerabilities - [x] Verified ReDoS patterns are fixed and don't cause timeouts - [x] Confirmed memory limits prevent amplification attacks - [x] Tested file size limits prevent disk exhaustion - [x] Validated log rotation prevents unbounded growth - [x] Ensured backward compatibility for normal usage #### For configuration changes: - [x] `docker-compose.yml` is updated with logging limits - [x] I have included a list of my configuration changes in the PR description (under Changes) ### Test Plan 🧪 Security Tests: 1. ReDoS Protection: Tested with malicious regex inputs (large spaces) - completes without hanging 2. Memory Limits: Verified 2MB text input gets truncated to 1MB, chunk limits enforced 3. File Size Limits: Confirmed 200MB files rejected, directory size limits enforced 4. Iteration Limits: Tested 20K item arrays rejected, large JSON strings rejected 5. Timeout Protection: Dangerous regex patterns timeout after 5s instead of hanging Compatibility Tests: - Normal functionality preserved for all blocks - Existing tests pass with new security limits - Performance impact minimal for typical usage ### Security Impact 🛡️ Before: Multiple attack vectors could cause: - CPU exhaustion (ReDoS attacks) - Memory exhaustion (amplification attacks) - Disk exhaustion (file/log bombs) - Service unavailability After: All attack vectors mitigated with: - Input validation and size limits - Timeout protections - Resource quotas - Defense-in-depth approach All fixes maintain backward compatibility while preventing DoS attacks. 🤖 Generated with [Claude Code](https://claude.ai/code) <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Adds robust DoS protections across blocks (regex, memory, iteration, XML/RSS, file I/O) and enables app/Docker log rotation with comprehensive tests. > > - Security hardening: > - Replace unsafe regex in `backend/blocks/code_extraction_block.py` to prevent ReDoS; add safer extraction/removal patterns. > - Constrain LLM summarizer chunking in `backend/blocks/llm.py` (1MB cap, chunk/overlap validation, chunk count limit). > - Limit RSS fetching in `backend/blocks/rss.py` (scheme validation, 10MB cap, timeout, bounded read) and return empty on failure. > - Impose XML size limit (10MB) in `backend/blocks/xml_parser.py`. > - Add file upload/download limits in `backend/util/file.py` (100MB/file, 1GB dir quota) and enforce scanning before write. > - Enable rotating file logs in `autogpt_libs/logging/config.py` (size + backups) and Docker json-file log rotation in `docker-compose.platform.yml`. > - Iteration block: > - Add item count/string size limits; fix yielded key for dicts; cap iterations in `backend/blocks/iteration.py`. > - Tests: > - New `backend/blocks/test/test_security_fixes.py` covering ReDoS, timeouts, memory/size and iteration limits, XML/file constraints. > - Misc: > - Typing fallback for `NotRequired` in `activity_status_generator.py`. > - Dependency updates in `backend/poetry.lock`. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit `500e1578b1`. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY --> --------- Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co> Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Nicholas Tindle <ntindle@users.noreply.github.com> Co-authored-by: Zamil Majdy <majdyz@users.noreply.github.com> Co-authored-by: Reinier van der Leer <Pwuts@users.noreply.github.com> Co-authored-by: Reinier van der Leer <pwuts@agpt.co>	2025-10-02 12:55:55 +00:00
Zamil Majdy	258bf0b1a5	fix(backend): improve activity status generation accuracy and handle missing blocks gracefully (#11039 ) ## Summary Fix critical issues where activity status generator incorrectly reported failed executions as successful, and enhance AI evaluation logic to be more accurate about actual task accomplishment. ## Changes Made ### 1. Missing Block Handling (`backend/data/graph.py`) - Replace ValueError with graceful degradation: When blocks are deleted/missing, return `_UnknownBlock` placeholder instead of crashing - Comprehensive interface implementation: `_UnknownBlock` implements all expected Block methods to prevent type errors - Warning logging: Log missing blocks for debugging without breaking execution flow - Removed unnecessary caching: Direct constructor calls instead of cached wrapper functions ### 2. Enhanced Activity Status AI Evaluation (`backend/executor/activity_status_generator.py`) #### Intention-Based Success Evaluation - Graph description analysis: AI now reads graph description FIRST to understand intended purpose - Purpose-driven evaluation: Success is measured against what the graph was designed to accomplish - Critical output analysis: Enhanced detection of missing outputs from key blocks (Output, Post, Create, Send, Publish, Generate) - Sub-agent failure detection: Better identification when AgentExecutorBlock produces no outputs #### Improved Prompting - Intent-specific examples: 'blog writing' → check for blog content, 'email automation' → check for sent emails - Primary evaluation criteria: 'Did this execution accomplish what the graph was designed to do?' - Enhanced checklist: 7-point analysis including graph description matching - Technical vs. goal completion: Distinguish between workflow steps completing vs. actual user goals achieved #### Removed Database Error Handling - Eliminated try-catch blocks: No longer needed around `get_graph_metadata` and `get_graph` calls - Direct database calls: Simplified error handling after fixing missing block root cause - Cleaner code flow: More predictable execution path without redundant error handling ## Problem Solved - False success reports: AI previously marked executions as 'successful' when critical output blocks produced no results - Missing block crashes: System would fail when trying to analyze executions with deleted/missing blocks - Intent-blind evaluation: AI evaluated technical completion instead of actual goal achievement - Database service errors: 500 errors when missing blocks caused graph loading failures ## Business Impact - More accurate user feedback: Users get honest assessment of whether their automations actually worked - Better task completion detection: Clear distinction between 'workflow completed' vs. 'goal achieved' - Improved reliability: System handles edge cases gracefully without crashing - Enhanced user trust: Truthful reporting builds confidence in the platform ## Testing - ✅ Tested with problematic executions that previously showed false successes - ✅ Confirmed missing block handling works without warnings - ✅ Verified enhanced prompt correctly identifies failures - ✅ Database calls work without try-catch protection ## Example Before/After Before (False Success): ``` Graph: "Automated SEO Blog Writer" Status: "✅ I successfully completed your blog writing task!" Reality: No blog content was actually created (critical output blocks had no outputs) ``` After (Accurate Failure Detection): ``` Graph: "Automated SEO Blog Writer" Status: "❌ The task failed because the blog post creation step didn't produce any output." Reality: Correctly identifies that the intended blog writing goal was not achieved ``` ## Files Modified - `backend/data/graph.py`: Missing block graceful handling with complete interface - `backend/executor/activity_status_generator.py`: Enhanced AI evaluation with intention-based analysis ## Type of Change - [x] Bug fix (non-breaking change which fixes an issue) - [x] New feature (non-breaking change which adds functionality) - [ ] Breaking change (fix or feature that would cause existing functionality to not work as expected) - [ ] This change requires a documentation update ## Checklist - [x] My code follows the style guidelines of this project - [x] I have performed a self-review of my own code - [x] I have commented my code, particularly in hard-to-understand areas - [x] I have made corresponding changes to the documentation - [x] My changes generate no new warnings - [x] I have added tests that prove my fix is effective or that my feature works - [x] New and existing unit tests pass locally with my changes - [x] Any dependent changes have been merged and published in downstream modules --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-02 12:28:57 +00:00
Ubbe	4a1cb6d64b	fix(frontend): performance and layout issues (#11036 ) ## Changes 🏗️ ### Performance (Onboarding) 🐎 - Moved non-UI logic into `providers/onboarding/helpers.ts` to reduce provider complexity. - Memoized provider value and narrowed state updates to cut unnecessary re-renders. - Deferred non-critical effects until after mount to lower initial JS work. Result: faster initial render and smoother onboarding flows under load. ### Layout and overflow fixes 📐 - Replaced `w-screen` with `w-full` in platform/admin/profile layouts and marketplace wrappers to avoid 100vw scrollbar overflow. - Adjusted mobile navbar position (`right-0` instead of `-right-4`) to prevent off-viewport elements. Result: removed horizontal scrolling on Marketplace, Library, and Settings pages; Build remains unaffected. ### New Generic Error pages - Standardized global error handling in `app/global-error.tsx` for consistent display and user feedback. - Added platform-scoped error page(s) under `app/(platform)/error` for route-level failures with a consistent layout. - Improved retry affordances using existing `ErrorCard`. ## Checklist 📋 ### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Verify onboarding flows render faster and re-render less (DevTools flamegraph) - [x] Confirm no horizontal scrolling on Marketplace, Library, Settings at common widths - [x] Validate mobile navbar stays within viewport - [x] Trigger errors to confirm global and platform error pages render consistently ### For configuration changes: None	2025-10-02 10:21:31 +00:00
Copilot	7c9db7419a	fix(frontend): Display run cost correctly - convert cents to dollars run detail components (#10997 ) Fixed costs being displayed as raw cent values instead of properly formatted dollar amounts in the frontend monitoring and agent run detail pages. ## Problem The platform was showing costs incorrectly in two key areas: - Monitoring page: Total cost displayed as raw cents with incorrect "seconds" unit (e.g., "Total cost: 150 seconds") - Agent run details: Individual run costs displayed as raw cents (e.g., "Cost: $150" for what should be $1.50) ## Solution Updated the affected components to properly convert cents to dollars with consistent formatting: FlowRunsStatus.tsx - Fixed total cost calculation and display: ```tsx // Before {filteredFlowRuns.reduce((total, run) => total + (run.stats?.cost ?? 0), 0)} seconds // After ${(filteredFlowRuns.reduce((total, run) => total + (run.stats?.cost ?? 0), 0) / 100).toFixed(2)} ``` RunDetailHeader.tsx - Fixed individual run cost display: ```tsx // Before Cost: ${run.stats.cost} // After Cost: ${(run.stats.cost / 100).toFixed(2)} ``` ## Validation - Backend correctly stores costs in cents (verified in models and database schemas) - Email notification templates already handle the conversion properly using `(credits_used\|float)/100` - Other components use the existing `formatCredits()` utility which correctly converts cents to dollars - No security vulnerabilities introduced (CodeQL verification passed) - All linting and formatting checks pass The fix ensures users now see accurate dollar amounts (e.g., $1.50 instead of $150 or "150 seconds") across the platform's cost reporting interfaces. ![Cost Display Fix Demo](https://github.com/user-attachments/assets/13c75a1d-7c78-4c11-9293-3dcf4c443097) > [!WARNING] > > <details> > <summary>Firewall rules blocked me from connecting to one or more addresses (expand for details)</summary> > > #### I tried to connect to the following addresses, but was blocked by firewall rules: > > - `checkpoint.prisma.io` > - Triggering command: `/usr/bin/node /root/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/child {"product":"prisma","version":"5.17.0","cli_install_type":"local","information":"","local_timestamp":"2025-09-25T21:41:17Z","project_hash":"a5170f80","cli_path":"/root/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/index.js","cli_path_hash":"40bbdaf9","endpoint":"REDACTED","disable":false,"arch":"x64","os":"linux","node_version":"v20.19.5","ci":false,"ci_name":"","command":"generate","schema_providers":["postgresql"],"schema_preview_features":[],"schema_generators_providers":["prisma-client-py"],"cache_file":"/root/.cache/checkpoint-nodejs/prisma-40bbdaf9","cache_duration":43200000,"remind_duration":172800000,"force":false,"timeout":5000,"unref":true,"child_path":"/root/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/child","client_event_id":"","previous_client_event_id":"","check_if_update_available":false}` (dns block) > - Triggering command: `/usr/bin/node /root/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/child {"product":"prisma","version":"5.17.0","cli_install_type":"local","information":"","local_timestamp":"2025-09-25T21:41:19Z","project_hash":"a5170f80","cli_path":"/root/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/index.js","cli_path_hash":"40bbdaf9","endpoint":"REDACTED","disable":false,"arch":"x64","os":"linux","node_version":"v20.19.5","ci":false,"ci_name":"","command":"migrate deploy","schema_providers":["postgresql"],"schema_preview_features":[],"schema_generators_providers":["prisma-client-py"],"cache_file":"/root/.cache/checkpoint-nodejs/prisma-40bbdaf9","cache_duration":43200000,"remind_duration":172800000,"force":false,"timeout":5000,"unref":true,"child_path":"/root/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/child","client_event_id":"","previous_client_event_id":"","check_if_update_available":false}` (dns block) > - Triggering command: `/opt/hostedtoolcache/node/21.7.3/x64/bin/node /home/REDACTED/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/child {"product":"prisma","version":"5.17.0","cli_install_type":"local","information":"","local_timestamp":"2025-09-25T21:44:58Z","project_hash":"c6190a20","cli_path":"/home/REDACTED/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/index.js","cli_path_hash":"8d85b642","endpoint":"REDACTED","disable":false,"arch":"x64","os":"linux","node_version":"v21.7.3","ci":true,"ci_name":"GitHub Actions","command":"generate","schema_providers":["postgresql"],"schema_preview_features":[],"schema_generators_providers":["prisma-client-py"],"cache_file":"/home/REDACTED/.cache/checkpoint-nodejs/prisma-8d85b642","cache_duration":43200000,"remind_duration":172800000,"force":false,"timeout":5000,"unref":true,"child_path":"/home/REDACTED/.cache/prisma-python/binaries/5.17.0/393aa359c9ad4a4bb28630fb5613f9c281cde053/node_modules/prisma/build/child","client_event_id":"","previous_client_event_id":"","check_if_update_available":false}` (dns block) > - `fonts.googleapis.com` > - Triggering command: `node /home/REDACTED/work/AutoGPT/AutoGPT/autogpt_platform/frontend/node_modules/.bin/../next/dist/bin/next build` (dns block) > - `https://api.github.com/repos/Significant-Gravitas/Significant-Gravitas%2FAutoGPT/languages` > - Triggering command: `/home/REDACTED/work/_temp/ghcca-node/node/bin/node --enable-source-maps /home/REDACTED/work/_temp/copilot-developer-action-main/dist/index.js` (http block) > - `o1.ingest.sentry.io` > - Triggering command: `node /home/REDACTED/work/AutoGPT/AutoGPT/autogpt_platform/frontend/node_modules/.bin/../next/dist/bin/next build` (dns block) > > If you need me to access, download, or install something from one of these locations, you can either: > > - Configure [Actions setup steps](https://gh.io/copilot/actions-setup-steps) to set up my environment, which run before the firewall is enabled > - Add the appropriate URLs or hosts to the custom allowlist in this repository's [Copilot coding agent settings](https://github.com/Significant-Gravitas/AutoGPT/settings/copilot/coding_agent) (admins only) > > </details> <!-- START COPILOT CODING AGENT SUFFIX --> <details> <summary>Original prompt</summary> > > ---- > > This section details on the original issue you should resolve > > <issue_title>Costs are being shown as dollars rather than cents based on the new runs page</issue_title> > <issue_description></issue_description> > > ## Comments on the Issue (you are @copilot in this section) > > <comments> > </comments> > </details> Fixes Significant-Gravitas/AutoGPT#10886 <!-- START COPILOT CODING AGENT TIPS --> --- 💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more [Copilot coding agent tips](https://gh.io/copilot-coding-agent-tips) in the docs. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: ntindle <8845353+ntindle@users.noreply.github.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co>	2025-10-02 10:08:03 +00:00
Swifty	e75cf2b765	create helper functions for clearing caches	2025-10-02 11:09:31 +02:00
Krzysztof Czerwinski	18bbd8e572	fix(frontend): Fix confetti (#11031 ) ### Changes 🏗️ - Fix not being able to complete `MARKETPLACE_RUN_AGENT` task - Fix confetti shooting on every refresh - Fix confetti shooting from top-left corner ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Bugs eradicated autogpt-platform-beta-v0.6.31	2025-10-02 03:19:25 +00:00
Zamil Majdy	047f011520	fix(platform): resolve authentication performance bottlenecks and improve reliability (#11028 ) ## Summary Fix critical authentication performance bottlenecks causing infinite loading during login and malformed redirect URL handling. ## Root Cause Analysis - OnboardingProvider was running expensive `isOnboardingEnabled()` database queries on every route for all users - Timezone detection was calling backend APIs during authentication flow instead of only during onboarding - Malformed redirect URLs like `/marketplace,%20/marketplace` causing authentication callback failures - Arbitrary setTimeout creating race conditions instead of proper authentication state management ## Changes Made ### 1. Backend: Cache Expensive Onboarding Queries (`backend/data/onboarding.py`) - Add `@cached(maxsize=1, ttl_seconds=300)` decorator to `onboarding_enabled()` - Cache expensive database queries for 5 minutes to prevent repeated execution during auth - Optimize query with `take=MIN_AGENT_COUNT + 1` to stop counting early - Fix typo: "Onboading" → "Onboarding" ### 2. Frontend: Optimize OnboardingProvider (`providers/onboarding/onboarding-provider.tsx`) - Route-based optimization: Only call `isOnboardingEnabled()` when user is actually on `/onboarding/` routes - Preserve functionality: Still fetch `getUserOnboarding()` for step completion tracking on all routes - Smart redirects: Only handle onboarding completion redirects when on onboarding routes - Performance improvement: Eliminates expensive database calls for 95% of page loads ### 3. Frontend: Fix Timezone Detection Race Conditions (`hooks/useOnboardingTimezoneDetection.ts`) - Remove setTimeout hack: Replace arbitrary 1000ms timeout with proper authentication state checks - Add route filtering: Only run timezone detection on `/onboarding/` routes using `pathname.startsWith()` - Proper auth dependencies: Use `useSupabase()` hook to wait for `user` and `!isUserLoading` - Fire-and-forget updates: Change from `mutateAsync()` to `mutate()` to prevent blocking UI ### 4. Frontend: Apply Fire-and-Forget Pattern (`hooks/useTimezoneDetection.ts`) - Change timezone auto-detection from `mutateAsync()` to `mutate()` - Prevents blocking user interactions during background timezone updates - API still executes successfully, user doesn't wait for response ### 5. Frontend: Enhanced URL Validation (`auth/callback/route.ts`) - Add malformed URL detection: Check for commas and spaces in redirect URLs - Constants: Use `DEFAULT_REDIRECT_PATH = "/marketplace"` instead of hardcoded strings - Better error handling: Try-catch with fallback to safe default path - Path depth limits: Reject suspiciously deep URLs (>5 segments) - Race condition mitigation: Default to `/marketplace` for corrupted URLs with warning logs ## Technical Implementation ### Performance Optimizations - Database caching: 5-minute cache prevents repeated expensive onboarding queries - Route-aware logic: Heavy operations only run where needed (`/onboarding/` routes) - Non-blocking updates: Timezone updates don't block authentication flow - Proper state management: Wait for actual authentication instead of arbitrary delays ### Authentication Flow Improvements - Eliminate race conditions: No more setTimeout guessing - wait for proper auth state - Faster auth: Remove blocking timezone API calls during login flow - Better UX: Handle malformed URLs gracefully instead of failing ## Files Changed - `backend/data/onboarding.py` - Add caching to expensive queries - `providers/onboarding/onboarding-provider.tsx` - Route-based optimization - `hooks/useOnboardingTimezoneDetection.ts` - Proper auth state + route filtering + fire-and-forget - `hooks/useTimezoneDetection.ts` - Fire-and-forget pattern - `auth/callback/route.ts` - Enhanced URL validation ## Impact - Eliminates infinite loading* during authentication flow - Improves auth response times from 5-11+ seconds to sub-second - Prevents malformed redirect URLs that confused users - Reduces database load through intelligent caching - Maintains all existing functionality with better performance - Eliminates race conditions from arbitrary timeouts ## Validation - ✅ All pre-commit hooks pass (format, lint, typecheck) - ✅ No breaking changes to existing functionality - ✅ Backward compatible with all onboarding flows - ✅ Enhanced error logging and graceful fallbacks 🤖 Generated with [Claude Code](https://claude.ai/code) --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-02 01:26:49 +00:00
Swifty	0f6d1f54ee	update lock file	2025-10-01 16:21:58 +02:00
Swifty	b3fe2b84ce	added shared caching	2025-10-01 16:17:26 +02:00
Swifty	e13861ad33	revet over logging in test	2025-10-01 15:38:27 +02:00
Swifty	e2c24bd463	Merge branch 'swiftyos/caching-pt2' of github.com:Significant-Gravitas/AutoGPT into swiftyos/caching-pt2	2025-10-01 15:25:18 +02:00
Swifty	9f5afff83e	update caching rules	2025-10-01 15:24:59 +02:00
Swifty	ced61e2640	Merge branch 'dev' into swiftyos/caching-pt2	2025-10-01 15:07:12 +02:00
Swifty	c9a7cc63da	invalidate more caches - we have many with very similar names...	2025-10-01 14:52:58 +02:00
Reinier van der Leer	d11917eb10	feat(blocks): Improve data output of code execution block (#11017 ) - Resolves #11016 ### Changes 🏗️ - Add more extensive outputs to Code Execution Block - Rename "Response" output to "Main Text Output" ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Object outputs can be accessed now	2025-10-01 10:38:04 +00:00
Swifty	7afa01a168	add detailed log messages	2025-10-01 11:54:57 +02:00
Swifty	2f9aba0420	moved cache invalidation earlier in flow to avoid race condition	2025-10-01 11:23:53 +02:00
Swifty	ff4b0929e1	fix cache invalidation for agent activity dropdown	2025-10-01 10:26:37 +02:00
Swifty	2230c76863	Merge branch 'dev' into swiftyos/caching-pt2	2025-10-01 10:18:32 +02:00
Copilot	4663066e65	feat(blocks): Implement AI Condition Block for natural language condition evaluation (#10996 ) This PR implements the AI Condition Block as requested in issue AUTOMAT-60. The new block enables users to define conditional logic using natural language descriptions instead of traditional comparison operators, while maintaining the same yes/no data pass-through functionality as the existing ConditionBlock. ## Overview The AI Condition Block uses Large Language Models to evaluate conditions written in plain English, such as: - "the input is the body of an email" - "the input is a City in the USA" - "the input is an error or a refusal" ## Key Features Natural Language Processing: Users can express complex conditions in everyday English rather than programming logic, making agent workflows more intuitive and accessible. Consistent Interface: Maintains the same input/output schema as the standard ConditionBlock: - Boolean `result` output indicating condition evaluation - `yes_output` and `no_output` for conditional data flow - Optional custom values for yes/no cases Robust Error Handling: Defaults to `false` on AI evaluation failures to ensure safe operation and prevent workflow interruption. Performance Optimized: Uses minimal token limits (10 tokens) for true/false responses to reduce latency and API costs. ## Implementation Details The block is implemented as `AIConditionBlock` in `backend/blocks/ai_condition.py` and inherits from `AIBlockBase` following established platform patterns. It includes: - Proper LLM integration with credential management - Token usage tracking and statistics - Comprehensive test mocking for reliable CI/CD - Full documentation with examples and use cases ## Use Cases This block enables more sophisticated conditional logic for: - Content Classification: Automatically categorize text, emails, or documents - Data Validation: Validate inputs using natural language rules - Smart Routing: Route data based on AI-evaluated conditions - Error Detection: Identify and handle error messages or problematic inputs - Quality Control: Check content against flexible quality standards ## Testing The implementation includes comprehensive testing that integrates with the existing platform test suite. All tests pass, including: - Unit tests with proper LLM response mocking - Code quality checks (linting, formatting, type checking) - Security analysis via CodeQL - Integration testing to ensure proper block discovery and loading The block is automatically discovered by the platform's block loading system and is immediately available for use in agent workflows. ## PR Checklist - [x] Have you listed your changes in the description? - Added new `AIConditionBlock` in `backend/blocks/ai_condition.py` - Added comprehensive documentation in `docs/content/platform/blocks/ai_condition.md` - Implemented natural language condition evaluation using LLMs - [x] Have you included a test plan? - Unit tests with mocked LLM responses - Integration tests for block discovery and loading - Error handling validation - Token usage tracking verification - [x] Have you tested your changes according to the test plan? - All existing tests pass - Linting and formatting checks pass - Type checking passes - Security analysis via CodeQL passes - Fixed `json_format` parameter to `force_json_output` per recent API changes > [!WARNING] > > <details> > <summary>Firewall rules blocked me from connecting to one or more addresses (expand for details)</summary> > > #### I tried to connect to the following addresses, but was blocked by firewall rules: > > - `api.openai.com` > - Triggering command: `/home/REDACTED/.cache/pypoetry/virtualenvs/autogpt-platform-backend-Ajv4iu2i-py3.11/bin/python /home/REDACTED/.cache/pypoetry/virtualenvs/autogpt-platform-backend-Ajv4iu2i-py3.11/bin/pytest backend/blocks/test/test_block.py::test_available_blocks -k AIConditionBlock -v` (dns block) > - `https://api.github.com/repos/Significant-Gravitas/Significant-Gravitas%2FAutoGPT/languages` > - Triggering command: `/home/REDACTED/work/_temp/ghcca-node/node/bin/node --enable-source-maps /home/REDACTED/work/_temp/copilot-developer-action-main/dist/index.js` (http block) > > If you need me to access, download, or install something from one of these locations, you can either: > > - Configure [Actions setup steps](https://gh.io/copilot/actions-setup-steps) to set up my environment, which run before the firewall is enabled > - Add the appropriate URLs or hosts to the custom allowlist in this repository's [Copilot coding agent settings](https://github.com/Significant-Gravitas/AutoGPT/settings/copilot/coding_agent) (admins only) > > </details> <!-- START COPILOT CODING AGENT SUFFIX --> <details> <summary>Original prompt</summary> > Issue Title: AI Condition Block > Issue Description: A version of the condition/if block that uses an AI powered condition. > > It should have the same yes/no data pass throughs, as well as outputting a result Boolean. > > The condition is plaintext English, provided by the user, and could be anything. > > e.g > If `[the input] is the body of an email` > If `[the input] is a City in the USA` > If `[the input] is an error or a refusal` > Fixes https://linear.app/autogpt/issue/AUTOMAT-60/ai-condition-block > > > Comment by User 4bcbb358-1758-43e4-abef-a0a42b63442f: > 📋 I need a repo label on this issue to determine which GitHub repository to work in. > > Please add a repo label to this issue with the format `owner/repository-name` (e.g., `github/copilot`), then I'll automatically start working on it! > > Comment by User : > This thread is for an agent session with githubcopilotcodingagent. > > </details> <!-- START COPILOT CODING AGENT TIPS --> --- ✨ Let Copilot coding agent [set things up for you](https://github.com/Significant-Gravitas/AutoGPT/issues/new?title=✨+Set+up+Copilot+instructions&body=Configure%20instructions%20for%20this%20repository%20as%20documented%20in%20%5BBest%20practices%20for%20Copilot%20coding%20agent%20in%20your%20repository%5D%28https://gh.io/copilot-coding-agent-tips%29%2E%0A%0A%3COnboard%20this%20repo%3E&assignees=copilot) — coding agent works faster and does higher quality work when set up for your repo. <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Introduces `AIConditionBlock` that uses an LLM to evaluate natural-language conditions and outputs boolean result with yes/no pass-through, plus accompanying documentation. > > - Backend: > - New block: `backend/blocks/ai_condition.py` > - Evaluates natural-language conditions via `llm_call` using selectable `LlmModel` and credentials. > - Parses strict true/false responses (with fallback token matching), yields `result`, `yes_output`/`no_output`, and `error` on ambiguity/failure. > - Tracks token usage via `NodeExecutionStats`; includes test inputs/mocks and `force_json_output=False`. > - Docs: > - Adds `docs/content/platform/blocks/ai_condition.md` with usage, inputs/outputs, examples, and considerations. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit `06e9586bd3`. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY --> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: ntindle <8845353+ntindle@users.noreply.github.com> Co-authored-by: Nicholas Tindle <nicktindle@outlook.com> Co-authored-by: Nicholas Tindle <nicholas.tindle@agpt.co> Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Nicholas Tindle <ntindle@users.noreply.github.com>	2025-10-01 05:02:57 +00:00
Krzysztof Czerwinski	48a0faa611	feat(frontend): Restore onboarding steps (#11027 ) Wallet update removed `BUILDER_OPEN` and `BUILDER_RUN_AGENT`. ### Changes 🏗️ - Restore completion codepaths for `BUILDER_OPEN` and `BUILDER_RUN_AGENT` for analytical purposes ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: - [x] Tasks are completed silently	2025-10-01 04:53:51 +00:00
Nicholas Tindle	70d00b4104	fix(ci): Delete pr_reviewer section in .pr_agent.toml (#11024 ) Remove pr_reviewer section from configuration <!-- Clearly explain the need for these changes: --> ### Changes 🏗️ removes the out of config status section <!-- Concisely describe all of the changes made in this pull request: --> ### Checklist 📋 #### For code changes: - [x] I have clearly listed my changes in the PR description - [x] I have made a test plan - [x] I have tested my changes according to the test plan: <!-- Put your test plan here: --> - [x] validated by global config	2025-10-01 03:01:24 +00:00

1 2 3 4 5 ...

7377 Commits