fix(git): use public HTTPS clone URL when token is missing/empty for public repos

Co-authored-by: openhands <openhands@all-hands.dev>
fix(frontend): treat GitLab branch listing errors as no branches (enable Launch without token)
2026-04-29 03:00:45 -04:00 · 2025-08-18 18:50:34 -04:00 · 2025-08-18 18:50:33 -04:00 · 2025-08-18 18:50:32 -04:00 · 2025-08-18 19:11:10 +00:00 · 2025-08-18 18:40:25 +00:00
451 changed files with 10632 additions and 22641 deletions
--- a/.github/workflows/e2e-tests.yml
+++ b/.github/workflows/e2e-tests.yml
@@ -22,7 +22,7 @@ jobs:
        uses: actions/checkout@v4

      - name: Install poetry via pipx
-        uses: abatilo/actions-poetry@v4
+        uses: abatilo/actions-poetry@v3
        with:
          poetry-version: 2.1.3

@@ -169,6 +169,7 @@ jobs:
      - name: Run end-to-end tests
        env:
          GITHUB_TOKEN: ${{ secrets.E2E_TEST_GITHUB_TOKEN }}
+          GITLAB_TOKEN: ${{ secrets.GITLAB_TOKEN }}
          LLM_MODEL: ${{ secrets.LLM_MODEL || 'gpt-4o' }}
          LLM_API_KEY: ${{ secrets.LLM_API_KEY || 'test-key' }}
          LLM_BASE_URL: ${{ secrets.LLM_BASE_URL }}
@@ -187,7 +188,7 @@ jobs:
            test_settings.py::test_github_token_configuration \
            test_conversation.py::test_conversation_start \
            test_browsing_catchphrase.py::test_browsing_catchphrase \
-            test_multi_conversation_resume.py::test_multi_conversation_resume \
+            test_gitlab_integration.py::test_gitlab_repository_cloning \
            -v --no-header --capture=no --timeout=900

      - name: Upload test results
--- a/.github/workflows/ghcr-build.yml
+++ b/.github/workflows/ghcr-build.yml
@@ -225,7 +225,7 @@ jobs:
          SANDBOX_RUNTIME_CONTAINER_IMAGE=$image_name \
          TEST_IN_CI=true \
          RUN_AS_OPENHANDS=false \
-          poetry run pytest -n 0 -raRs --reruns 2 --reruns-delay 5 -s ./tests/runtime --ignore=tests/runtime/test_browsergym_envs.py --durations=10
+          poetry run pytest -n 7 -raRs --reruns 2 --reruns-delay 5 -s ./tests/runtime --ignore=tests/runtime/test_browsergym_envs.py --durations=10
        env:
          DEBUG: "1"

@@ -284,7 +284,7 @@ jobs:
          SANDBOX_RUNTIME_CONTAINER_IMAGE=$image_name \
          TEST_IN_CI=true \
          RUN_AS_OPENHANDS=true \
-          poetry run pytest -n 0 -raRs --reruns 2 --reruns-delay 5 -s ./tests/runtime --ignore=tests/runtime/test_browsergym_envs.py --durations=10
+          poetry run pytest -n 7 -raRs --reruns 2 --reruns-delay 5 -s ./tests/runtime --ignore=tests/runtime/test_browsergym_envs.py --durations=10
        env:
          DEBUG: "1"

--- a/.github/workflows/py-tests.yml
+++ b/.github/workflows/py-tests.yml
@@ -73,7 +73,7 @@ jobs:
      - name: Install Python dependencies using Poetry
        run: poetry install --with dev,test,runtime
      - name: Run Windows unit tests
-        run: poetry run pytest -svv tests/unit/runtime/utils/test_windows_bash.py
+        run: poetry run pytest -svv tests/unit/test_windows_bash.py
        env:
          PYTHONPATH: ".;$env:PYTHONPATH"
          DEBUG: "1"
--- a/.github/workflows/stale.yml
+++ b/.github/workflows/stale.yml
@@ -15,7 +15,7 @@ jobs:
          stale-issue-message: 'This issue is stale because it has been open for 40 days with no activity. Remove the stale label or leave a comment, otherwise it will be closed in 10 days.'
          stale-pr-message: 'This PR is stale because it has been open for 40 days with no activity. Remove the stale label or leave a comment, otherwise it will be closed in 10 days.'
          days-before-stale: 40
-          exempt-issue-labels: roadmap,backlog
+          exempt-issue-labels: 'roadmap'
          close-issue-message: 'This issue was automatically closed due to 50 days of inactivity. We do this to help keep the issues somewhat manageable and focus on active issues.'
          close-pr-message: 'This PR was closed because it had no activity for 50 days. If you feel this was closed in error, and you would like to continue the PR, please resubmit or let us know.'
          days-before-close: 10
--- a/.github/workflows/welcome-good-first-issue.yml
+++ b/.github/workflows/welcome-good-first-issue.yml
@@ -1,51 +0,0 @@
-name: Welcome Good First Issue
-
-on:
-  issues:
-    types: [labeled]
-
-permissions:
-  issues: write
-
-jobs:
-  comment-on-good-first-issue:
-    if: github.event.label.name == 'good first issue'
-    runs-on: ubuntu-latest
-    steps:
-      - name: Check if welcome comment already exists
-        id: check_comment
-        uses: actions/github-script@v7
-        with:
-          result-encoding: string
-          script: |
-            const issueNumber = context.issue.number;
-            const comments = await github.rest.issues.listComments({
-              ...context.repo,
-              issue_number: issueNumber
-            });
-
-            const alreadyCommented = comments.data.some(
-              (comment) =>
-                comment.body.includes('<!-- auto-comment:good-first-issue -->')
-            );
-
-            return alreadyCommented ? 'true' : 'false';
-
-      - name: Leave welcome comment
-        if: steps.check_comment.outputs.result == 'false'
-        uses: actions/github-script@v7
-        with:
-          script: |
-            const repoUrl = `https://github.com/${context.repo.owner}/${context.repo.repo}`;
-
-            await github.rest.issues.createComment({
-              ...context.repo,
-              issue_number: context.issue.number,
-              body: "🙌 **Hey there, future contributor!** 🙌\n\n" +
-                    "This issue has been labeled as **good first issue**, which means it's a great place to get started with the OpenHands project.\n\n" +
-                    "If you're interested in working on it, feel free to! No need to ask for permission.\n\n" +
-                    "Be sure to check out our [development setup guide](" + repoUrl + "/blob/main/Development.md) to get your environment set up, and follow our [contribution guidelines](" + repoUrl + "/blob/main/CONTRIBUTING.md) when you're ready to submit a fix.\n\n" +
-                    "Feel free to join our developer community on [Slack](dub.sh/openhands). You can ask for [help](https://openhands-ai.slack.com/archives/C078L0FUGUX), [feedback](https://openhands-ai.slack.com/archives/C086ARSNMGA), and even ask for a [PR review](https://openhands-ai.slack.com/archives/C08D8FJ5771).\n\n" +
-                    "🙌 Happy hacking! 🙌\n\n" +
-                    "<!-- auto-comment:good-first-issue -->"
-            });
--- a/.gitignore
+++ b/.gitignore
@@ -257,5 +257,3 @@ containers/runtime/code

 # test results
 test-results
-.sessions
-.eval_sessions
--- a/Development.md
+++ b/Development.md
@@ -159,7 +159,7 @@ poetry run pytest ./tests/unit/test_*.py
 To reduce build time (e.g., if no changes were made to the client-runtime component), you can use an existing Docker
 container image by setting the SANDBOX_RUNTIME_CONTAINER_IMAGE environment variable to the desired Docker image.

-Example: `export SANDBOX_RUNTIME_CONTAINER_IMAGE=ghcr.io/all-hands-ai/runtime:0.55-nikolaik`
+Example: `export SANDBOX_RUNTIME_CONTAINER_IMAGE=ghcr.io/all-hands-ai/runtime:0.53-nikolaik`

 ## Develop inside Docker container

--- a/9
+++ b/9
@@ -1,12 +1,7 @@
-Portions of this software are licensed as follows:
-* All content that resides under the enterprise/ directory is licensed under the license defined in "enterprise/LICENSE".
-* Content outside of the above mentioned directories or restrictions above is available under the MIT license as defined below.
-
+The MIT License (MIT)
 =====================

-The MIT License (MIT)
-
-Copyright © 2025
+Copyright © 2023

 Permission is hereby granted, free of charge, to any person
 obtaining a copy of this software and associated documentation
--- a/MIGRATION_GUIDE.md
+++ b/MIGRATION_GUIDE.md
@@ -1,277 +0,0 @@
-# Migration Guide: From Shared Globals to Context System
-
-This guide explains how to migrate from the deprecated `openhands.server.shared` globals to the new context system.
-
-## Overview
-
-The new context system replaces global variables with dependency injection, providing:
-
- **Better testability**: Easy to mock dependencies in tests
- **SaaS extensibility**: Custom contexts for multi-tenant scenarios
- **Per-request contexts**: Different configurations per request
- **No import-time side effects**: Lazy initialization of dependencies
- **Type safety**: Better IDE support and type checking
-
-## Quick Migration
-
-### Before (Deprecated)
-```python
-from openhands.server.shared import config, server_config, file_store, sio
-
-def my_function():
-    # Use global variables
-    workspace_dir = config.workspace_dir
-    app_mode = server_config.app_mode
-    file_store.save_file(...)
-```
-
-### After (Recommended)
-```python
-from fastapi import Depends, Request
-from openhands.server.context import get_server_context, ServerContext
-
-@app.get('/my-endpoint')
-async def my_endpoint(
-    request: Request,
-    context: ServerContext = Depends(get_server_context)
-):
-    # Use context instead of globals
-    config = context.get_config()
-    server_config = context.get_server_config()
-    file_store = context.get_file_store()
-
-    workspace_dir = config.workspace_dir
-    app_mode = server_config.app_mode
-    file_store.save_file(...)
-```
-
-## Detailed Migration Steps
-
-### 1. Route Handlers
-
-**Before:**
-```python
-from openhands.server.shared import config, conversation_manager
-
-@app.post('/conversations')
-async def create_conversation(request: ConversationRequest):
-    conversation = conversation_manager.create_conversation(
-        request.user_id,
-        config.default_agent
-    )
-    return conversation
-```
-
-**After:**
-```python
-from fastapi import Depends
-from openhands.server.context import get_server_context, ServerContext
-
-@app.post('/conversations')
-async def create_conversation(
-    request: ConversationRequest,
-    context: ServerContext = Depends(get_server_context)
-):
-    config = context.get_config()
-    conversation_manager = context.get_conversation_manager()
-
-    conversation = conversation_manager.create_conversation(
-        request.user_id,
-        config.default_agent
-    )
-    return conversation
-```
-
-### 2. Service Classes
-
-**Before:**
-```python
-from openhands.server.shared import file_store, monitoring_listener
-
-class MyService:
-    def process_file(self, file_path: str):
-        content = file_store.read(file_path)
-        monitoring_listener.log_event('file_processed')
-        return content
-```
-
-**After:**
-```python
-from openhands.server.context import ServerContext
-
-class MyService:
-    def __init__(self, context: ServerContext):
-        self.context = context
-
-    def process_file(self, file_path: str):
-        file_store = self.context.get_file_store()
-        monitoring_listener = self.context.get_monitoring_listener()
-
-        content = file_store.read(file_path)
-        monitoring_listener.log_event('file_processed')
-        return content
-
-# In route handler:
-@app.post('/process')
-async def process_endpoint(
-    request: ProcessRequest,
-    context: ServerContext = Depends(get_server_context)
-):
-    service = MyService(context)
-    return service.process_file(request.file_path)
-```
-
-### 3. Store Classes
-
-**Before:**
-```python
-from openhands.server.shared import SettingsStoreImpl
-
-def get_user_settings(user_id: str):
-    store = SettingsStoreImpl(user_id)
-    return store.load()
-```
-
-**After:**
-```python
-from openhands.server.context import ServerContext
-
-def get_user_settings(user_id: str, context: ServerContext):
-    SettingsStoreClass = context.get_settings_store_class()
-    store = SettingsStoreClass(user_id)
-    return store.load()
-
-# In route handler:
-@app.get('/settings/{user_id}')
-async def get_settings(
-    user_id: str,
-    context: ServerContext = Depends(get_server_context)
-):
-    return get_user_settings(user_id, context)
-```
-
-### 4. Testing
-
-**Before:**
-```python
-import pytest
-from unittest.mock import patch
-
-def test_my_function():
-    with patch('openhands.server.shared.config') as mock_config:
-        mock_config.workspace_dir = '/test'
-        result = my_function()
-        assert result == expected
-```
-
-**After:**
-```python
-import pytest
-from openhands.server.context import create_server_context
-
-class MockServerContext:
-    def get_config(self):
-        mock_config = Mock()
-        mock_config.workspace_dir = '/test'
-        return mock_config
-
-def test_my_function():
-    context = MockServerContext()
-    result = my_function(context)
-    assert result == expected
-```
-
-## SaaS Extension Example
-
-The new context system makes it easy to extend OpenHands for SaaS scenarios:
-
-```python
-from openhands.server.context import ServerContext, set_context_class
-
-class SaaSServerContext(ServerContext):
-    def __init__(self, user_id: str, org_id: str):
-        self.user_id = user_id
-        self.org_id = org_id
-
-    def get_file_store(self):
-        # Return tenant-isolated file store
-        return MultiTenantFileStore(self.user_id, self.org_id)
-
-    def get_server_config(self):
-        # Return SaaS-specific configuration
-        return SaaSServerConfig(org_id=self.org_id)
-
-# Configure globally
-set_context_class('myapp.context.SaaSServerContext')
-
-# Use in routes with tenant context
-@app.get('/tenant/{org_id}/files')
-async def get_tenant_files(
-    org_id: str,
-    context: SaaSServerContext = Depends(get_server_context)
-):
-    file_store = context.get_file_store()
-    return file_store.list_files()
-```
-
-## Migration Checklist
-
- [ ] Replace `from openhands.server.shared import ...` with context injection
- [ ] Update route handlers to use `Depends(get_server_context)`
- [ ] Modify service classes to accept `ServerContext` parameter
- [ ] Update tests to use mock contexts instead of patching globals
- [ ] Remove direct imports of shared globals
- [ ] Test that all functionality still works
-
-## Backward Compatibility
-
-The old `openhands.server.shared` module still works but is deprecated. It will show deprecation warnings when imported. The globals are now implemented using the new context system internally.
-
-## Benefits After Migration
-
-1. **Better Testing**: Easy to mock dependencies without patching globals
-2. **Type Safety**: Better IDE support and type checking
-3. **Extensibility**: Easy to create custom contexts for different scenarios
-4. **Performance**: Lazy initialization reduces startup time
-5. **Maintainability**: Clear dependency relationships
-
-## Common Issues
-
-### Issue: Import errors during migration
-**Solution**: Make sure to import the context system correctly:
-```python
-from openhands.server.context import get_server_context, ServerContext
-```
-
-### Issue: Context not available in non-route functions
-**Solution**: Pass the context as a parameter:
-```python
-def helper_function(data: str, context: ServerContext):
-    config = context.get_config()
-    # ... use config
-```
-
-### Issue: Testing becomes more complex
-**Solution**: Create reusable mock contexts:
-```python
-# test_utils.py
-class TestServerContext(ServerContext):
-    def __init__(self):
-        self.mock_config = create_mock_config()
-        self.mock_file_store = create_mock_file_store()
-
-    def get_config(self):
-        return self.mock_config
-
-    def get_file_store(self):
-        return self.mock_file_store
-```
-
-## Getting Help
-
-If you encounter issues during migration:
-
-1. Check the examples in `examples/saas_extension.py`
-2. Look at the implementation in `openhands/server/context/`
-3. Review existing route handlers that have been migrated
-4. Create an issue if you find bugs or need clarification
--- a/README.md
+++ b/README.md
@@ -79,17 +79,17 @@ You'll find OpenHands running at [http://localhost:3000](http://localhost:3000)
 You can also run OpenHands directly with Docker:

 ```bash
-docker pull docker.all-hands.dev/all-hands-ai/runtime:0.55-nikolaik
+docker pull docker.all-hands.dev/all-hands-ai/runtime:0.53-nikolaik

 docker run -it --rm --pull=always \
-    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0.55-nikolaik \
+    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0.53-nikolaik \
    -e LOG_ALL_EVENTS=true \
    -v /var/run/docker.sock:/var/run/docker.sock \
    -v ~/.openhands:/.openhands \
    -p 3000:3000 \
    --add-host host.docker.internal:host-gateway \
    --name openhands-app \
-    docker.all-hands.dev/all-hands-ai/openhands:0.55
+    docker.all-hands.dev/all-hands-ai/openhands:0.53
 ```

 </details>
@@ -130,6 +130,7 @@ If you want to modify the OpenHands source code, check out [Development.md](http
 Having issues? The [Troubleshooting Guide](https://docs.all-hands.dev/usage/troubleshooting) can help.

 ## 📖 Documentation
+  <a href="https://deepwiki.com/All-Hands-AI/OpenHands"><img src="https://deepwiki.com/badge.svg" alt="Ask DeepWiki" title="Autogenerated Documentation by DeepWiki"></a>

 To learn more about the project, and for tips on using OpenHands,
 check out our [documentation](https://docs.all-hands.dev/usage/getting-started).
--- a/README_CN.md
+++ b/README_CN.md
@@ -51,17 +51,17 @@ OpenHands也可以使用Docker在本地系统上运行。


 ```bash
-docker pull docker.all-hands.dev/all-hands-ai/runtime:0.55-nikolaik
+docker pull docker.all-hands.dev/all-hands-ai/runtime:0.53-nikolaik

 docker run -it --rm --pull=always \
-    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0.55-nikolaik \
+    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0.53-nikolaik \
    -e LOG_ALL_EVENTS=true \
    -v /var/run/docker.sock:/var/run/docker.sock \
    -v ~/.openhands:/.openhands \
    -p 3000:3000 \
    --add-host host.docker.internal:host-gateway \
    --name openhands-app \
-    docker.all-hands.dev/all-hands-ai/openhands:0.55
+    docker.all-hands.dev/all-hands-ai/openhands:0.53
 ```

 > **注意**: 如果您在0.44版本之前使用过OpenHands，您可能需要运行 `mv ~/.openhands-state ~/.openhands` 来将对话历史迁移到新位置。
--- a/README_JA.md
+++ b/README_JA.md
@@ -42,17 +42,17 @@ OpenHandsはDockerを利用してローカル環境でも実行できます。
 > 公共ネットワークで実行していますか？[Hardened Docker Installation Guide](https://docs.all-hands.dev/usage/runtimes/docker#hardened-docker-installation)を参照して、ネットワークバインディングの制限や追加のセキュリティ対策を実施してください。

 ```bash
-docker pull docker.all-hands.dev/all-hands-ai/runtime:0.55-nikolaik
+docker pull docker.all-hands.dev/all-hands-ai/runtime:0.53-nikolaik

 docker run -it --rm --pull=always \
-    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0.55-nikolaik \
+    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0.53-nikolaik \
    -e LOG_ALL_EVENTS=true \
    -v /var/run/docker.sock:/var/run/docker.sock \
    -v ~/.openhands:/.openhands \
    -p 3000:3000 \
    --add-host host.docker.internal:host-gateway \
    --name openhands-app \
-    docker.all-hands.dev/all-hands-ai/openhands:0.55
+    docker.all-hands.dev/all-hands-ai/openhands:0.53
 ```

 **注**: バージョン0.44以前のOpenHandsを使用していた場合は、会話履歴を移行するために `mv ~/.openhands-state ~/.openhands` を実行してください。
--- a/REFACTOR_PLAN.md
+++ b/REFACTOR_PLAN.md
@@ -1,230 +0,0 @@
-# OpenHands Server Context Refactoring Plan
-
-## Problem Statement
-
-The current OpenHands architecture has globals in `server/shared.py` that are initialized at import time based on environment variables. This creates several issues for the SaaS version:
-
-1. **Import-time dependencies**: All globals are created when modules are imported
-2. **Hard to extend**: SaaS can't easily override or extend components
-3. **CI/CD issues**: Everything depends on env vars being set correctly at import time
-4. **Per-user behavior**: Difficult to implement per-user/per-request behavior
-5. **Outside repo issues**: Hard to run SaaS from outside repo due to import dependencies
-
-## Current Problematic Globals
-
-From `openhands/server/shared.py`:
- `config: OpenHandsConfig` - Core app configuration
- `server_config: ServerConfig` - Server-specific configuration
- `file_store: FileStore` - File storage implementation
- `sio: socketio.AsyncServer` - Socket.IO server instance
- `conversation_manager` - Conversation management implementation
- `monitoring_listener` - Monitoring implementation
- `SettingsStoreImpl`, `SecretsStoreImpl`, `ConversationStoreImpl` - Storage implementations
-
-## Solution: ServerContext Pattern
-
-### 1. Create ServerContext Base Class
-
-Create `openhands/server/context/server_context.py`:
-
-```python
-from abc import ABC, abstractmethod
-from typing import Optional
-import socketio
-from openhands.core.config.openhands_config import OpenHandsConfig
-from openhands.server.config.server_config import ServerConfig
-from openhands.storage.files import FileStore
-# ... other imports
-
-class ServerContext(ABC):
-    """Base class for server context that holds all server dependencies.
-
-    This replaces the global variables in shared.py and allows for:
-    - Dependency injection
-    - Easy extensibility for SaaS
-    - Per-request contexts
-    - Testability
-    """
-
-    def __init__(self):
-        self._config: Optional[OpenHandsConfig] = None
-        self._server_config: Optional[ServerConfig] = None
-        self._file_store: Optional[FileStore] = None
-        # ... other cached instances
-
-    @abstractmethod
-    def get_config(self) -> OpenHandsConfig:
-        """Get the OpenHands configuration"""
-
-    @abstractmethod
-    def get_server_config(self) -> ServerConfig:
-        """Get the server configuration"""
-
-    @abstractmethod
-    def get_file_store(self) -> FileStore:
-        """Get the file store implementation"""
-
-    # ... other abstract methods for all current globals
-```
-
-### 2. Create Default Implementation
-
-Create `openhands/server/context/default_server_context.py`:
-
-```python
-class DefaultServerContext(ServerContext):
-    """Default implementation that maintains current behavior"""
-
-    def get_config(self) -> OpenHandsConfig:
-        if self._config is None:
-            self._config = load_openhands_config()
-        return self._config
-
-    def get_server_config(self) -> ServerConfig:
-        if self._server_config is None:
-            self._server_config = load_server_config()
-        return self._server_config
-
-    # ... implement all methods with current logic
-```
-
-### 3. Context Provider System
-
-Create `openhands/server/context/context_provider.py`:
-
-```python
-from fastapi import Request
-from openhands.utils.import_utils import get_impl
-
-_context_class: Optional[str] = None
-
-def set_context_class(context_class: str):
-    """Set the server context class to use"""
-    global _context_class
-    _context_class = context_class
-
-async def get_server_context(request: Request) -> ServerContext:
-    """Get server context from request, with caching"""
-    context = getattr(request.state, 'server_context', None)
-    if context:
-        return context
-
-    # Use configured context class or default
-    context_cls_name = _context_class or 'openhands.server.context.default_server_context.DefaultServerContext'
-    context_cls = get_impl(ServerContext, context_cls_name)
-    context = context_cls()
-
-    request.state.server_context = context
-    return context
-```
-
-### 4. Update Shared.py (Backward Compatibility)
-
-Keep `shared.py` for backward compatibility but make it use the context:
-
-```python
-# openhands/server/shared.py
-from openhands.server.context.default_server_context import DefaultServerContext
-
-# Create default context for backward compatibility
-_default_context = DefaultServerContext()
-
-# Expose globals for backward compatibility
-config = _default_context.get_config()
-server_config = _default_context.get_server_config()
-file_store = _default_context.get_file_store()
-# ... etc
-```
-
-### 5. Update Routes to Use Context
-
-Update all route files to use dependency injection:
-
-```python
-# Example: openhands/server/routes/settings.py
-from openhands.server.context import get_server_context
-
-@app.get('/settings')
-async def get_settings(
-    request: Request,
-    context: ServerContext = Depends(get_server_context)
-):
-    config = context.get_config()
-    # ... use config instead of importing from shared
-```
-
-## Benefits for SaaS
-
-### 1. Easy Extension
-
-SaaS can create their own context:
-
-```python
-# In SaaS repo: saas/server_context.py
-from openhands.server.context import ServerContext
-
-class SaaSServerContext(ServerContext):
-    def get_server_config(self) -> ServerConfig:
-        # Return SaaS-specific config with enterprise features
-        return SaaSServerConfig()
-
-    def get_conversation_manager(self) -> ConversationManager:
-        # Return multi-tenant conversation manager
-        return MultiTenantConversationManager()
-```
-
-### 2. Per-Request Contexts
-
-SaaS can implement per-user contexts:
-
-```python
-class PerUserServerContext(ServerContext):
-    def __init__(self, user_id: str, org_id: str):
-        super().__init__()
-        self.user_id = user_id
-        self.org_id = org_id
-
-    def get_file_store(self) -> FileStore:
-        # Return user-specific file store
-        return UserFileStore(self.user_id, self.org_id)
-```
-
-### 3. No Import-Time Dependencies
-
-SaaS can run without setting environment variables at import time:
-
-```python
-# In SaaS startup
-from openhands.server.context import set_context_class
-set_context_class('saas.server_context.SaaSServerContext')
-```
-
-## Migration Strategy
-
-### Phase 1: Create Context System
-1. Create ServerContext base class and default implementation
-2. Create context provider system
-3. Update shared.py for backward compatibility
-
-### Phase 2: Update Routes Gradually
-1. Update one route at a time to use context injection
-2. Test each route to ensure no regressions
-3. Keep backward compatibility during transition
-
-### Phase 3: Clean Up
-1. Remove globals from shared.py once all routes are updated
-2. Update documentation
-3. Create examples for SaaS extension
-
-## Implementation Order
-
-1. `openhands/server/context/server_context.py` - Base class
-2. `openhands/server/context/default_server_context.py` - Default implementation
-3. `openhands/server/context/context_provider.py` - Provider system
-4. `openhands/server/context/__init__.py` - Public API
-5. Update `openhands/server/shared.py` for backward compatibility
-6. Update routes one by one to use context injection
-7. Update tests to use context system
-8. Documentation and examples
-
-This approach provides a clean migration path while maintaining backward compatibility and enabling the SaaS extensibility requirements.
--- a/REFACTOR_SUMMARY.md
+++ b/REFACTOR_SUMMARY.md
@@ -1,206 +0,0 @@
-# OpenHands Server Globals Refactoring - Summary
-
-## Overview
-
-Successfully refactored OpenHands server globals in `shared.py` and `server_config.py` to enable SaaS extensibility without import-time dependencies. The refactoring introduces a dependency injection pattern using a `ServerContext` system that maintains backward compatibility while enabling multi-tenant SaaS scenarios.
-
-## Problem Solved
-
-### Before Refactoring
- **Global variables on import**: `shared.py` created globals like `config`, `server_config`, `file_store`, `sio`, etc. on module import
- **Import-time side effects**: Loading the module triggered configuration loading and dependency initialization
- **SaaS integration issues**: External SaaS repos had CI/CD problems due to environment variable dependencies
- **Testing difficulties**: Hard to mock dependencies due to global state
- **No extensibility**: Impossible to customize behavior for different tenants or environments
-
-### After Refactoring
- **Dependency injection**: Clean `ServerContext` pattern with lazy initialization
- **No import-time side effects**: Dependencies only loaded when actually needed
- **SaaS extensibility**: Easy to create custom contexts for multi-tenant scenarios
- **Better testability**: Easy to mock contexts for testing
- **Backward compatibility**: Existing code continues to work with deprecation warnings
-
-## Architecture Changes
-
-### New Context System
-
-```
-openhands/server/context/
-├── __init__.py                 # Public API
-├── server_context.py          # Abstract base class
-├── default_server_context.py  # Default implementation
-└── context_provider.py        # Dependency injection system
-```
-
-### Key Components
-
-1. **ServerContext (Abstract Base Class)**
-   - Defines interface for all server dependencies
-   - 9 abstract methods for different dependency types
-   - Extensible for SaaS implementations
-
-2. **DefaultServerContext**
-   - Maintains exact behavior of original shared.py
-   - Lazy initialization of all dependencies
-   - No import-time side effects
-
-3. **Context Provider System**
-   - `get_server_context()` for FastAPI dependency injection
-   - `set_context_class()` for global configuration
-   - `create_server_context()` for testing/CLI usage
-
-4. **Backward Compatibility Layer**
-   - `shared.py` now uses `__getattr__` for lazy loading
-   - All existing imports continue to work
-   - Deprecation warnings guide migration
-
-## SaaS Extensibility
-
-### Multi-Tenant Context Example
-
-```python
-class SaaSServerContext(ServerContext):
-    def __init__(self, user_id: str, org_id: str):
-        self.user_id = user_id
-        self.org_id = org_id
-
-    def get_file_store(self):
-        # Return tenant-isolated file store
-        return MultiTenantFileStore(self.user_id, self.org_id)
-
-    def get_server_config(self):
-        # Return SaaS-specific configuration
-        return SaaSServerConfig(org_id=self.org_id)
-
-# Configure globally
-set_context_class('myapp.context.SaaSServerContext')
-```
-
-### Benefits for SaaS
- **Per-tenant isolation**: Different storage, config, and features per organization
- **Enterprise features**: Easy to add billing, advanced monitoring, etc.
- **Scalable architecture**: Context per request enables horizontal scaling
- **Clean separation**: SaaS code stays in external repo, extends OpenHands cleanly
-
-## Migration Path
-
-### For OpenHands Core
- **Phase 1**: Refactoring complete, backward compatibility maintained
- **Phase 2**: Gradually migrate routes to use dependency injection
- **Phase 3**: Remove deprecated shared.py (future release)
-
-### For SaaS Implementations
- **Immediate**: Can use new context system for new features
- **Gradual**: Migrate existing code using migration guide
- **Benefits**: Cleaner architecture, better testing, easier deployment
-
-## Files Created/Modified
-
-### New Files
- `openhands/server/context/__init__.py` - Public API
- `openhands/server/context/server_context.py` - Abstract base class
- `openhands/server/context/default_server_context.py` - Default implementation
- `openhands/server/context/context_provider.py` - Dependency injection
- `examples/saas_extension.py` - SaaS extension example
- `MIGRATION_GUIDE.md` - Detailed migration instructions
- `test_refactor.py` - Comprehensive test suite
-
-### Modified Files
- `openhands/server/shared.py` - Backward compatibility layer
-
-## Testing Results
-
-Comprehensive test suite with 5 test categories:
-
-1. ✅ **Context System**: Import, creation, class switching
-2. ✅ **Backward Compatibility**: Lazy loading, attribute access
-3. ✅ **Abstract Base Class**: Proper abstraction, required methods
-4. ✅ **Default Context**: Instantiation, method availability
-5. ✅ **SaaS Example**: Multi-tenant context structure
-
-**Result: 5/5 tests passed** 🎉
-
-## Usage Examples
-
-### New Way (Recommended)
-```python
-from fastapi import Depends
-from openhands.server.context import get_server_context, ServerContext
-
-@app.get('/conversations')
-async def get_conversations(
-    context: ServerContext = Depends(get_server_context)
-):
-    config = context.get_config()
-    conversation_manager = context.get_conversation_manager()
-    return conversation_manager.list_conversations()
-```
-
-### Old Way (Still Works)
-```python
-from openhands.server.shared import config, conversation_manager
-
-@app.get('/conversations')
-async def get_conversations():
-    # Shows deprecation warning but works
-    return conversation_manager.list_conversations()
-```
-
-### SaaS Extension
-```python
-# In SaaS application startup
-from openhands.server.context import set_context_class
-set_context_class('myapp.context.SaaSServerContext')
-
-# Routes automatically get tenant-aware context
-@app.get('/tenant/{org_id}/conversations')
-async def get_tenant_conversations(
-    org_id: str,
-    context: SaaSServerContext = Depends(get_server_context)
-):
-    # context.org_id and context.user_id available
-    # All dependencies are tenant-isolated
-    conversation_manager = context.get_conversation_manager()
-    return conversation_manager.list_conversations()
-```
-
-## Benefits Achieved
-
-### For OpenHands Core
- ✅ **Better Architecture**: Clean dependency injection pattern
- ✅ **Improved Testing**: Easy to mock dependencies
- ✅ **No Breaking Changes**: Full backward compatibility
- ✅ **Performance**: Lazy loading reduces startup time
- ✅ **Type Safety**: Better IDE support and type checking
-
-### For SaaS Implementations
- ✅ **Multi-Tenancy**: Per-organization contexts and isolation
- ✅ **Extensibility**: Easy to add enterprise features
- ✅ **Clean Integration**: No need to fork OpenHands
- ✅ **Deployment Flexibility**: Can run from external repos
- ✅ **CI/CD Fixes**: No more environment variable dependencies
-
-### For Development
- ✅ **Maintainability**: Clear dependency relationships
- ✅ **Debugging**: Easier to trace dependency issues
- ✅ **Documentation**: Clear migration path and examples
- ✅ **Future-Proof**: Extensible architecture for new features
-
-## Next Steps
-
-1. **Immediate**: Refactoring is complete and tested
-2. **Short-term**: Begin migrating core routes to use dependency injection
-3. **Medium-term**: SaaS implementations can adopt new context system
-4. **Long-term**: Remove deprecated shared.py in future major release
-
-## Conclusion
-
-The refactoring successfully addresses all the original problems:
-
- ❌ **Import-time dependencies** → ✅ **Lazy initialization**
- ❌ **Global state pollution** → ✅ **Clean dependency injection**
- ❌ **SaaS integration issues** → ✅ **Multi-tenant context system**
- ❌ **Testing difficulties** → ✅ **Easy mocking and testing**
- ❌ **No extensibility** → ✅ **Pluggable context implementations**
-
-The new architecture enables OpenHands to support SaaS scenarios while maintaining full backward compatibility and improving the overall codebase quality.
--- a/config.template.toml
+++ b/config.template.toml
@@ -363,11 +363,10 @@ classpath = "my_package.my_module.MyCustomAgent"
 #confirmation_mode = false

 # The security analyzer to use (For Headless / CLI only -  In Web this is overridden by Session Init)
-# Available options: 'llm' (default), 'invariant'
-#security_analyzer = "llm"
+#security_analyzer = ""

 # Whether to enable security analyzer
-#enable_security_analyzer = true
+#enable_security_analyzer = false

 #################################### Condenser #################################
 # Condensers control how conversation history is managed and compressed when
--- a/containers/app/Dockerfile
+++ b/containers/app/Dockerfile
@@ -21,7 +21,7 @@ ENV POETRY_NO_INTERACTION=1 \
    POETRY_CACHE_DIR=/tmp/poetry_cache

 RUN apt-get update -y \
-    && apt-get install -y curl make git build-essential jq gettext \
+    && apt-get install -y curl make git build-essential \
    && python3 -m pip install poetry --break-system-packages

 COPY pyproject.toml poetry.lock ./
@@ -58,34 +58,34 @@ RUN sed -i 's/^UID_MIN.*/UID_MIN 499/' /etc/login.defs
 # Default is 60000, but we've seen up to 200000
 RUN sed -i 's/^UID_MAX.*/UID_MAX 1000000/' /etc/login.defs

-RUN groupadd --gid $OPENHANDS_USER_ID openhands
+RUN groupadd --gid $OPENHANDS_USER_ID app
 RUN useradd -l -m -u $OPENHANDS_USER_ID --gid $OPENHANDS_USER_ID -s /bin/bash openhands && \
-    usermod -aG openhands openhands && \
+    usermod -aG app openhands && \
    usermod -aG sudo openhands && \
    echo '%sudo ALL=(ALL) NOPASSWD:ALL' >> /etc/sudoers
-RUN chown -R openhands:openhands /app && chmod -R 770 /app
-RUN sudo chown -R openhands:openhands $WORKSPACE_BASE && sudo chmod -R 770 $WORKSPACE_BASE
+RUN chown -R openhands:app /app && chmod -R 770 /app
+RUN sudo chown -R openhands:app $WORKSPACE_BASE && sudo chmod -R 770 $WORKSPACE_BASE
 USER openhands

 ENV VIRTUAL_ENV=/app/.venv \
    PATH="/app/.venv/bin:$PATH" \
    PYTHONPATH='/app'

-COPY --chown=openhands:openhands --chmod=770 --from=backend-builder ${VIRTUAL_ENV} ${VIRTUAL_ENV}
+COPY --chown=openhands:app --chmod=770 --from=backend-builder ${VIRTUAL_ENV} ${VIRTUAL_ENV}

-COPY --chown=openhands:openhands --chmod=770 ./microagents ./microagents
-COPY --chown=openhands:openhands --chmod=770 ./openhands ./openhands
-COPY --chown=openhands:openhands --chmod=777 ./openhands/runtime/plugins ./openhands/runtime/plugins
-COPY --chown=openhands:openhands pyproject.toml poetry.lock README.md MANIFEST.in LICENSE ./
+COPY --chown=openhands:app --chmod=770 ./microagents ./microagents
+COPY --chown=openhands:app --chmod=770 ./openhands ./openhands
+COPY --chown=openhands:app --chmod=777 ./openhands/runtime/plugins ./openhands/runtime/plugins
+COPY --chown=openhands:app pyproject.toml poetry.lock README.md MANIFEST.in LICENSE ./

 # This is run as "openhands" user, and will create __pycache__ with openhands:openhands ownership
 RUN python openhands/core/download.py # No-op to download assets
 # Add this line to set group ownership of all files/directories not already in "app" group
-# openhands:openhands -> openhands:openhands
-RUN find /app \! -group openhands -exec chgrp openhands {} +
+# openhands:openhands -> openhands:app
+RUN find /app \! -group app -exec chgrp app {} +

-COPY --chown=openhands:openhands --chmod=770 --from=frontend-builder /app/build ./frontend/build
-COPY --chown=openhands:openhands --chmod=770 ./containers/app/entrypoint.sh /app/entrypoint.sh
+COPY --chown=openhands:app --chmod=770 --from=frontend-builder /app/build ./frontend/build
+COPY --chown=openhands:app --chmod=770 ./containers/app/entrypoint.sh /app/entrypoint.sh

 USER root

--- a/containers/app/entrypoint.sh
+++ b/containers/app/entrypoint.sh
@@ -54,7 +54,7 @@ else
      fi
    fi
  fi
-  usermod -aG openhands enduser
+  usermod -aG app enduser
  # get the user group of /var/run/docker.sock and set openhands to that group
  DOCKER_SOCKET_GID=$(stat -c '%g' /var/run/docker.sock)
  echo "Docker socket group id: $DOCKER_SOCKET_GID"
--- a/containers/dev/compose.yml
+++ b/containers/dev/compose.yml
@@ -12,7 +12,7 @@ services:
      - SANDBOX_API_HOSTNAME=host.docker.internal
      - DOCKER_HOST_ADDR=host.docker.internal
      #
-      - SANDBOX_RUNTIME_CONTAINER_IMAGE=${SANDBOX_RUNTIME_CONTAINER_IMAGE:-ghcr.io/all-hands-ai/runtime:0.55-nikolaik}
+      - SANDBOX_RUNTIME_CONTAINER_IMAGE=${SANDBOX_RUNTIME_CONTAINER_IMAGE:-ghcr.io/all-hands-ai/runtime:0.53-nikolaik}
      - SANDBOX_USER_ID=${SANDBOX_USER_ID:-1234}
      - WORKSPACE_MOUNT_PATH=${WORKSPACE_BASE:-$PWD/workspace}
    ports:
--- a/docker-compose.yml
+++ b/docker-compose.yml
@@ -7,7 +7,7 @@ services:
    image: openhands:latest
    container_name: openhands-app-${DATE:-}
    environment:
-      - SANDBOX_RUNTIME_CONTAINER_IMAGE=${SANDBOX_RUNTIME_CONTAINER_IMAGE:-docker.all-hands.dev/all-hands-ai/runtime:0.55-nikolaik}
+      - SANDBOX_RUNTIME_CONTAINER_IMAGE=${SANDBOX_RUNTIME_CONTAINER_IMAGE:-docker.all-hands.dev/all-hands-ai/runtime:0.53-nikolaik}
      #- SANDBOX_USER_ID=${SANDBOX_USER_ID:-1234} # enable this only if you want a specific non-root sandbox user but you will have to manually adjust permissions of ~/.openhands for this user
      - WORKSPACE_MOUNT_PATH=${WORKSPACE_BASE:-$PWD/workspace}
    ports:
--- a/docs/EXTENSIBILITY_MIGRATION.md
+++ b/docs/EXTENSIBILITY_MIGRATION.md
@@ -1,272 +0,0 @@
-# OpenHands Extensibility Migration Guide
-
-This guide explains how to migrate from the old global variable approach to the new factory-based extensibility system.
-
-## Overview
-
-OpenHands has been refactored to eliminate import-time dependencies on environment variables and global state. This enables external repositories to cleanly extend OpenHands without configuration conflicts.
-
-## The Problem We Solved
-
-### Before (Problematic)
-```python
-# In OpenHands shared.py - loaded at import time
-config = Config()  # Reads environment variables
-server_config = ServerConfig()  # More environment variables
-
-# External repos had to:
-# 1. Set environment variables before importing OpenHands
-# 2. Deal with global state conflicts
-# 3. Couldn't easily override specific behaviors
-```
-
-### After (Clean)
-```python
-# External repos can now:
-from openhands.server.factory import create_openhands_app
-
-app = create_openhands_app(
-    context_factory=lambda: MyCustomContext(),
-    include_oss_routes=False
-)
-```
-
-## Migration Paths
-
-### 1. For External Repositories (Recommended)
-
-**Old Way (Don't do this):**
-```python
-# external_repo/main.py
-import os
-os.environ['OPENHANDS_CONFIG_CLS'] = 'my_config.MyConfig'
-os.environ['CONVERSATION_MANAGER_CLASS'] = 'my_manager.MyManager'
-
-from openhands.server.app import app  # Imports with global state
-```
-
-**New Way (Recommended):**
-```python
-# external_repo/main.py
-from openhands.server.factory import create_openhands_app
-from external_repo.context import ExternalRepoContext
-
-def create_app():
-    return create_openhands_app(
-        context_factory=lambda: ExternalRepoContext(),
-        include_oss_routes=False,  # Skip OSS-specific routes
-        title='My Enterprise Platform'
-    )
-
-app = create_app()
-
-# Add your own routes
-@app.get('/enterprise/dashboard')
-async def dashboard():
-    return {'status': 'enterprise'}
-```
-
-### 2. For OpenHands Core Development
-
-**Old Way:**
-```python
-# In route handlers
-from openhands.server.shared import config, server_config
-
-@app.get('/example')
-async def example_route():
-    storage_path = config.workspace_base
-    app_mode = server_config.app_mode
-```
-
-**New Way:**
-```python
-# In route handlers
-from fastapi import Depends
-from openhands.server.context import get_server_context, ServerContext
-
-@app.get('/example')
-async def example_route(
-    context: ServerContext = Depends(get_server_context)
-):
-    config = context.get_config()
-    server_config = context.get_server_config()
-    storage_path = config.workspace_base
-    app_mode = server_config.app_mode
-```
-
-## Custom Context Implementation
-
-### Step 1: Create Your Context Class
-
-```python
-# my_extension/context.py
-from openhands.server.context.server_context import ServerContext
-
-class MyCustomContext(ServerContext):
-    def __init__(self, tenant_id: str = 'default'):
-        super().__init__()
-        self.tenant_id = tenant_id
-    
-    def get_config(self):
-        """Override with tenant-specific configuration."""
-        config = super().get_config()
-        config.workspace_base = f'/data/tenants/{self.tenant_id}/workspace'
-        return config
-    
-    def get_server_config(self):
-        """Override server configuration."""
-        server_config = super().get_server_config()
-        server_config.app_mode = 'ENTERPRISE'
-        server_config.enable_billing = True
-        return server_config
-```
-
-### Step 2: Create Your FastAPI App
-
-```python
-# my_extension/app.py
-from openhands.server.factory import create_openhands_app
-from my_extension.context import MyCustomContext
-
-def create_my_app():
-    # Option A: Extend OpenHands app directly
-    app = create_openhands_app(
-        context_factory=lambda: MyCustomContext(),
-        title='My Enterprise Platform'
-    )
-    
-    # Add your routes
-    @app.get('/enterprise/status')
-    async def enterprise_status():
-        return {'mode': 'enterprise'}
-    
-    return app
-
-# Option B: Create your own app and mount OpenHands
-from fastapi import FastAPI
-
-def create_my_app_with_mount():
-    main_app = FastAPI(title='My Platform')
-    
-    openhands_app = create_openhands_app(
-        context_factory=lambda: MyCustomContext()
-    )
-    
-    main_app.mount('/openhands', openhands_app)
-    
-    @main_app.get('/my-dashboard')
-    async def dashboard():
-        return {'dashboard': 'data'}
-    
-    return main_app
-```
-
-### Step 3: Run Your Application
-
-```python
-# my_extension/main.py
-import uvicorn
-from my_extension.app import create_my_app
-
-if __name__ == '__main__':
-    app = create_my_app()
-    uvicorn.run(app, host='0.0.0.0', port=8000)
-```
-
-## Advanced Patterns
-
-### Multi-Tenant Context
-
-```python
-class MultiTenantContext(ServerContext):
-    def __init__(self, request: Request):
-        super().__init__()
-        # Extract tenant from request
-        self.tenant_id = request.headers.get('X-Tenant-ID', 'default')
-    
-    def get_file_store(self):
-        # Return tenant-isolated file store
-        return TenantFileStore(tenant_id=self.tenant_id)
-
-# Use with factory
-def create_tenant_context(request: Request):
-    return MultiTenantContext(request)
-
-app = create_openhands_app(
-    context_factory=create_tenant_context
-)
-```
-
-### Custom Lifespan Management
-
-```python
-from contextlib import asynccontextmanager
-
-@asynccontextmanager
-async def my_lifespan(app: FastAPI):
-    # Startup
-    print("Starting my custom services...")
-    await initialize_my_database()
-    
-    yield
-    
-    # Shutdown
-    print("Shutting down my custom services...")
-    await cleanup_my_database()
-
-app = create_openhands_app(
-    context_factory=MyContext,
-    custom_lifespan=my_lifespan
-)
-```
-
-## Testing Your Extension
-
-```python
-# tests/test_my_extension.py
-from fastapi.testclient import TestClient
-from my_extension.app import create_my_app
-
-def test_my_extension():
-    app = create_my_app()
-    client = TestClient(app)
-    
-    # Test your custom routes
-    response = client.get('/enterprise/status')
-    assert response.status_code == 200
-    assert response.json()['mode'] == 'enterprise'
-    
-    # Test OpenHands routes still work
-    response = client.get('/api/health')
-    assert response.status_code == 200
-```
-
-## Benefits of the New Approach
-
-1. **No Environment Variables**: Configuration is done through code, not environment variables
-2. **Clean Separation**: External repos don't modify OpenHands globals
-3. **Dependency Injection**: Proper FastAPI dependency injection patterns
-4. **Testability**: Easy to mock contexts for testing
-5. **Flexibility**: Can create multiple apps with different configurations
-6. **No Import-Time Side Effects**: Safe to import OpenHands modules
-
-## Backward Compatibility
-
-The old `openhands.server.shared` module still works but is deprecated. It will show deprecation warnings and should be migrated to the new context system.
-
-## Common Pitfalls
-
-1. **Don't set environment variables**: Use the factory pattern instead
-2. **Don't import `openhands.server.app` directly**: Use the factory to create your own app
-3. **Don't modify global state**: Use dependency injection through contexts
-4. **Don't forget to override dependencies**: Use `app.dependency_overrides` if needed
-
-## Getting Help
-
-If you need help migrating your extension, please:
-1. Check the examples in `examples/external_repo_extension.py`
-2. Look at the test cases for patterns
-3. Open an issue with your specific use case
-
-The new system is designed to be more flexible and maintainable while enabling clean extensibility for all types of OpenHands deployments.
--- a/docs/openapi.json
+++ b/docs/openapi.json
--- a/docs/usage/cloud/project-management/jira-dc-integration.mdx
+++ b/docs/usage/cloud/project-management/jira-dc-integration.mdx
@@ -1,5 +1,5 @@
 ---
-title: Jira Data Center Integration (Coming soon...)
+title: Jira Data Center Integration (Beta)
 description: Complete guide for setting up Jira Data Center integration with OpenHands Cloud, including service account creation, personal access token generation, webhook configuration, and workspace integration setup.
 ---

--- a/docs/usage/cloud/project-management/jira-integration.mdx
+++ b/docs/usage/cloud/project-management/jira-integration.mdx
@@ -1,5 +1,5 @@
 ---
-title: Jira Cloud Integration (Coming soon...)
+title: Jira Cloud Integration
 description: Complete guide for setting up Jira Cloud integration with OpenHands Cloud, including service account creation, API token generation, webhook configuration, and workspace integration setup.
 ---

--- a/docs/usage/cloud/project-management/linear-integration.mdx
+++ b/docs/usage/cloud/project-management/linear-integration.mdx
@@ -1,5 +1,5 @@
 ---
-title: Linear Integration (Coming soon...)
+title: Linear Integration
 description: Complete guide for setting up Linear integration with OpenHands Cloud, including service account creation, API key generation, webhook configuration, and workspace integration setup.
 ---

--- a/docs/usage/cloud/project-management/overview.mdx
+++ b/docs/usage/cloud/project-management/overview.mdx
@@ -1,5 +1,5 @@
 ---
-title: Project Management Tool Integrations (Coming soon...)
+title: Project Management Tool Integrations
 description: Overview of OpenHands Cloud integrations with project management platforms including Jira Cloud, Jira Data Center, and Linear. Learn about setup requirements, usage methods, and troubleshooting.
 ---

@@ -18,9 +18,9 @@ Integration requires two levels of setup:
 2. **Workspace Integration** - Self-service configuration through the OpenHands Cloud UI to link your OpenHands account to the target workspace

 ### Platform-Specific Setup Guides:
- [Jira Cloud Integration (Coming soon...)](./jira-integration.md)
- [Jira Data Center Integration (Coming soon...)](./jira-dc-integration.md)
- [Linear Integration (Coming soon...)](./linear-integration.md)
+- [Jira Cloud Integration](./jira-integration.md)
+- [Jira Data Center Integration](./jira-dc-integration.md)
+- [Linear Integration](./linear-integration.md)

 ## Usage

--- a/docs/usage/confirmation-mode.mdx
+++ b/docs/usage/confirmation-mode.mdx
@@ -1,52 +0,0 @@
-# Confirmation Mode and Security Analyzers
-
-OpenHands provides a security framework to help protect users from potentially risky actions through **Confirmation Mode** and **Security Analyzers**. This system analyzes agent actions and prompts users for confirmation when high-risk operations are detected.
-
-## Overview
-
-The security system consists of two main components:
-
-1. **Confirmation Mode**: When enabled, the agent will pause and ask for user confirmation before executing actions that are flagged as high-risk by the security analyzer.
-
-2. **Security Analyzers**: These are modules that evaluate the risk level of agent actions and determine whether user confirmation is required.
-
-## Configuration
-
-### CLI
-In CLI mode, confirmation is enabled by default. You will have an option to uses the LLM Analyzer and will automatically confirm LOW and MEDIUM risk actions, only prompting for HIGH risk actions.
-
-## Security Analyzers
-
-OpenHands includes multiple analyzers:
-
- **No Analyzer**: Do not use any security analyzer. The agent will prompt you to confirm *EVERY* action.
- **LLM Risk Analyzer** (default): Uses the same LLM as the agent to assess action risk levels
- **Invariant Analyzer**: Uses Invariant Labs' policy engine to evaluate action traces against security policies
-
-### LLM Risk Analyzer
-The default analyzer that leverages the agent's LLM to evaluate the security risk of each action. It considers the action type, parameters, and context to assign risk levels.
-
-### Invariant Analyzer
-An advanced analyzer that:
- Collects conversation events and parses them into a trace
- Checks the trace against an Invariant policy to classify risk (low, medium, high)
- Manages an Invariant server container automatically if needed
- Supports optional browsing-alignment and harmful-content checks
-
-## How It Works
-
-1. **Action Analysis**: When the agent wants to perform an action, the selected security analyzer evaluates its risk level.
-
-2. **Risk Assessment**: The analyzer returns one of three risk levels:
-   - **LOW**: Action proceeds without confirmation
-   - **MEDIUM**: Action proceeds without confirmation (may be configurable in future)
-   - **HIGH**: Action is paused, and user confirmation is requested
-
-3. **User Confirmation**: For high-risk actions, a confirmation dialog appears with:
-   - Description of the action
-   - Risk assessment explanation
-   - Options to approve or deny action
-
-4. **Action Execution**: Based on user response:
-   - **Approve**: Action proceeds as planned
-   - **Deny**: Action is cancelled
--- a/docs/usage/how-to/cli-mode.mdx
+++ b/docs/usage/how-to/cli-mode.mdx
@@ -87,13 +87,19 @@ source ~/.bashrc  # or source ~/.zshrc

 </AccordionGroup>

+3. Launch an interactive OpenHands conversation from the command line:
+```bash
+# If using uvx (recommended)
+uvx --python 3.12 --from openhands-ai openhands
+```
+
 <Note>
  If you have cloned the repository, you can also run the CLI directly using Poetry:

  poetry run openhands
 </Note>

-3. Set your model, API key, and other preferences using the UI (or alternatively environment variables, below).
+4. Set your model, API key, and other preferences using the UI (or alternatively environment variables, below).

 This command opens an interactive prompt where you can type tasks or commands and get responses from OpenHands.
 The first time you run the CLI, it will take you through configuring the required LLM
@@ -113,7 +119,7 @@ The conversation history will be saved in `~/.openhands/sessions`.
 ```bash
 docker run -it \
    --pull=always \
-    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0.55-nikolaik \
+    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0.53-nikolaik \
    -e SANDBOX_USER_ID=$(id -u) \
    -e SANDBOX_VOLUMES=$SANDBOX_VOLUMES \
    -e LLM_API_KEY=$LLM_API_KEY \
@@ -122,8 +128,8 @@ docker run -it \
    -v ~/.openhands:/.openhands \
    --add-host host.docker.internal:host-gateway \
    --name openhands-app-$(date +%Y%m%d%H%M%S) \
-    docker.all-hands.dev/all-hands-ai/openhands:0.55 \
-    python -m openhands.cli.entry --override-cli-mode true
+    docker.all-hands.dev/all-hands-ai/openhands:0.53 \
+    python -m openhands.cli.main --override-cli-mode true
 ```

 <Note>
--- a/docs/usage/how-to/headless-mode.mdx
+++ b/docs/usage/how-to/headless-mode.mdx
@@ -61,7 +61,7 @@ export GITHUB_TOKEN="your-token"  # Required for repository operations
 # Run OpenHands
 docker run -it \
    --pull=always \
-    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0.55-nikolaik \
+    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0.53-nikolaik \
    -e SANDBOX_USER_ID=$(id -u) \
    -e SANDBOX_VOLUMES=$SANDBOX_VOLUMES \
    -e LLM_API_KEY=$LLM_API_KEY \
@@ -73,7 +73,7 @@ docker run -it \
    -v ~/.openhands:/.openhands \
    --add-host host.docker.internal:host-gateway \
    --name openhands-app-$(date +%Y%m%d%H%M%S) \
-    docker.all-hands.dev/all-hands-ai/openhands:0.55 \
+    docker.all-hands.dev/all-hands-ai/openhands:0.53 \
    python -m openhands.core.main -t "write a bash script that prints hi"
 ```

--- a/docs/usage/llms/local-llms.mdx
+++ b/docs/usage/llms/local-llms.mdx
@@ -68,23 +68,23 @@ Download and install the LM Studio desktop app from [lmstudio.ai](https://lmstud
 1. Check [the installation guide](/usage/local-setup) and ensure all prerequisites are met before running OpenHands, then run:

 ```bash
-docker pull docker.all-hands.dev/all-hands-ai/runtime:0.55-nikolaik
+docker pull docker.all-hands.dev/all-hands-ai/runtime:0.53-nikolaik

 docker run -it --rm --pull=always \
-    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0.55-nikolaik \
+    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0.53-nikolaik \
    -e LOG_ALL_EVENTS=true \
    -v /var/run/docker.sock:/var/run/docker.sock \
    -v ~/.openhands:/.openhands \
    -p 3000:3000 \
    --add-host host.docker.internal:host-gateway \
    --name openhands-app \
-    docker.all-hands.dev/all-hands-ai/openhands:0.55
+    docker.all-hands.dev/all-hands-ai/openhands:0.53
 ```

 2. Wait until the server is running (see log below):
 ```
 Digest: sha256:e72f9baecb458aedb9afc2cd5bc935118d1868719e55d50da73190d3a85c674f
-Status: Image is up to date for docker.all-hands.dev/all-hands-ai/openhands:0.55
+Status: Image is up to date for docker.all-hands.dev/all-hands-ai/openhands:0.53
 Starting OpenHands...
 Running OpenHands as root
 14:22:13 - openhands:INFO: server_config.py:50 - Using config class None
--- a/docs/usage/local-setup.mdx
+++ b/docs/usage/local-setup.mdx
@@ -45,13 +45,6 @@ A system with a modern processor and a minimum of **4GB RAM** is recommended to
  1. [Install WSL](https://learn.microsoft.com/en-us/windows/wsl/install).
  2. Run `wsl --version` in powershell and confirm `Default Version: 2`.

-  **Ubuntu (Linux Distribution)**
-
-  1. Install Ubuntu: `wsl --install -d Ubuntu` in PowerShell as Administrator.
-  2. Restart computer when prompted.
-  3. Open Ubuntu from Start menu to complete setup.
-  4. Verify installation: `wsl --list` should show Ubuntu.
-
  **Docker Desktop**

  1. [Install Docker Desktop on Windows](https://docs.docker.com/desktop/setup/install/windows-install).
@@ -60,7 +53,7 @@ A system with a modern processor and a minimum of **4GB RAM** is recommended to
  - Resources > WSL Integration: `Enable integration with my default WSL distro` is enabled.

  <Note>
-  The docker command below to start the app must be run inside the WSL terminal. Use `wsl -d Ubuntu` in PowerShell or search "Ubuntu" in the Start menu to access the Ubuntu terminal.
+  The docker command below to start the app must be run inside the WSL terminal.
  </Note>

  **Alternative: Windows without WSL**
@@ -116,17 +109,17 @@ Note that you'll still need `uv` installed for the default MCP servers to work p
 <Accordion title="Docker Command (Click to expand)">

 ```bash
-docker pull docker.all-hands.dev/all-hands-ai/runtime:0.55-nikolaik
+docker pull docker.all-hands.dev/all-hands-ai/runtime:0.53-nikolaik

 docker run -it --rm --pull=always \
-    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0.55-nikolaik \
+    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0.53-nikolaik \
    -e LOG_ALL_EVENTS=true \
    -v /var/run/docker.sock:/var/run/docker.sock \
    -v ~/.openhands:/.openhands \
    -p 3000:3000 \
    --add-host host.docker.internal:host-gateway \
    --name openhands-app \
-    docker.all-hands.dev/all-hands-ai/openhands:0.55
+    docker.all-hands.dev/all-hands-ai/openhands:0.53
 ```

 </Accordion>
--- a/docs/usage/runtimes/docker.mdx
+++ b/docs/usage/runtimes/docker.mdx
@@ -130,28 +130,3 @@ docker run # ... \
 <Note>
 **Docker Desktop Required**: Network isolation features, including custom networks and `host.docker.internal` routing, require Docker Desktop. Docker Engine alone does not support these features on localhost across custom networks. If you're using Docker Engine without Docker Desktop, network isolation may not work as expected.
 </Note>
-
-### Sidecar Containers
-
-If you want to run sidecar containers to the sandbox 'runner' containers without exposing the sandbox containers to the host network, you can use the `SANDBOX_ADDITIONAL_NETWORKS` environment variable to specify additional Docker network names that should be added to the sandbox containers.
-
-```bash
-docker network create openhands-sccache
-
-docker run -d \
-  --hostname openhandsredis \
-  --network openhands-sccache \
-  redis
-
-docker run # ...
-    -e SANDBOX_ADDITIONAL_NETWORKS='["openhands-sccache"]' \
-    # ...
-```
-
-Then all sandbox instances will have to access a shared redis instance at `openhandsredis:6379`.
-
-#### Docker Compose gotcha
-
-Note that Docker Compose adds a prefix (a scope) by default to created networks, which is not taken into account by the additional networks config. Therefore when using docker compose you have to either:
- specify a network name via the `name` field to remove the scoping (https://docs.docker.com/reference/compose-file/networks/#name) 
- or provide the scope within the given config (e.g. `SANDBOX_ADDITIONAL_NETWORKS: '["myscope_openhands-sccache"]'` where `myscope` is the docker-compose assigned prefix). 
--- a/docs/usage/runtimes/e2b.mdx
+++ b/docs/usage/runtimes/e2b.mdx
@@ -22,7 +22,7 @@ SDK to spawn and control these sandboxes.

 You can use the E2B CLI to create a custom sandbox with a Dockerfile. Read the full guide
 [here](https://e2b.dev/docs/guide/custom-sandbox). The premade OpenHands sandbox for E2B is set up in the `containers`
-directory, and it's called `openhands`.
+directory. and it's called `openhands`.

 ## Debugging

--- a/docs/usage/troubleshooting/troubleshooting.mdx
+++ b/docs/usage/troubleshooting/troubleshooting.mdx
@@ -38,23 +38,6 @@ On initial prompt, an error is seen with `Permission Denied` or `PermissionError
 * If mounting a local directory, ensure your `WORKSPACE_BASE` has the necessary permissions for the user running
  OpenHands.

-### On Linux, Getting ConnectTimeout Error
-
-**Description**
-
-When running on Linux, you might run into the error `ERROR:root:<class 'httpx.ConnectTimeout'>: timed out`.
-
-**Resolution**
-
-If you installed Docker from your distribution’s package repository (e.g., docker.io on Debian/Ubuntu), be aware that
-these packages can sometimes be outdated or include changes that cause compatibility issues. try reinstalling Docker
-[using the official instructions](https://docs.docker.com/engine/install/) to ensure you are running a compatible version.
-
-If that does not solve the issue, try incrementally adding the following parameters to the docker run command:
-* `--network host`
-* `-e SANDBOX_USE_HOST_NETWORK=true`
-* `-e DOCKER_HOST_ADDR=127.0.0.1`
-
 ### Internal Server Error. Ports are not available

 **Description**
--- a/enterprise/LICENSE
+++ b/enterprise/LICENSE
@@ -1,89 +0,0 @@
-# PolyForm Free Trial License 1.0.0
-
-## Acceptance
-
-In order to get any license under these terms, you must agree
-to them as both strict obligations and conditions to all
-your licenses.
-
-## Copyright License
-
-The licensor grants you a copyright license for the software
-to do everything you might do with the software that would
-otherwise infringe the licensor's copyright in it for any
-permitted purpose.  However, you may only make changes or
-new works based on the software according to [Changes and New
-Works License](#changes-and-new-works-license), and you may
-not distribute copies of the software.
-
-## Changes and New Works License
-
-The licensor grants you an additional copyright license to
-make changes and new works based on the software for any
-permitted purpose.
-
-## Patent License
-
-The licensor grants you a patent license for the software that
-covers patent claims the licensor can license, or becomes able
-to license, that you would infringe by using the software.
-
-## Fair Use
-
-You may have "fair use" rights for the software under the
-law. These terms do not limit them.
-
-## Free Trial
-
-Use of the software for more than 30 days per calendar year is not allowed without a commercial license.
-
-## No Other Rights
-
-These terms do not allow you to sublicense or transfer any of
-your licenses to anyone else, or prevent the licensor from
-granting licenses to anyone else.  These terms do not imply
-any other licenses.
-
-## Patent Defense
-
-If you make any written claim that the software infringes or
-contributes to infringement of any patent, your patent license
-for the software granted under these terms ends immediately. If
-your company makes such a claim, your patent license ends
-immediately for work on behalf of your company.
-
-## Violations
-
-If you violate any of these terms, or do anything with the
-software not covered by your licenses, all your licenses
-end immediately.
-
-## No Liability
-
-***As far as the law allows, the software comes as is, without
-any warranty or condition, and the licensor will not be liable
-to you for any damages arising out of these terms or the use
-or nature of the software, under any kind of legal claim.***
-
-## Definitions
-
-The **licensor** is the individual or entity offering these
-terms, and the **software** is the software the licensor makes
-available under these terms.
-
-**You** refers to the individual or entity agreeing to these
-terms.
-
-**Your company** is any legal entity, sole proprietorship,
-or other kind of organization that you work for, plus all
-organizations that have control over, are under the control of,
-or are under common control with that organization.  **Control**
-means ownership of substantially all the assets of an entity,
-or the power to direct its management and policies by vote,
-contract, or otherwise.  Control can be direct or indirect.
-
-**Your licenses** are all the licenses granted to you for the
-software under these terms.
-
-**Use** means anything you do with the software requiring one
-of your licenses.
--- a/evaluation/benchmarks/EDA/run_infer.py
+++ b/evaluation/benchmarks/EDA/run_infer.py
@@ -9,8 +9,7 @@ from evaluation.utils.shared import (
    EvalMetadata,
    EvalOutput,
    compatibility_for_eval_history_pairs,
-    get_metrics,
-    get_openhands_config_for_eval,
+    get_default_sandbox_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -61,15 +60,18 @@ AGENT_CLS_TO_INST_SUFFIX = {
 def get_config(
    metadata: EvalMetadata,
 ) -> OpenHandsConfig:
-    # Create config with EDA-specific container image
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    sandbox_config = get_default_sandbox_config_for_eval()
+    sandbox_config.base_container_image = 'python:3.12-bookworm'
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
-
-    # Override the container image for EDA
-    config.sandbox.base_container_image = 'python:3.12-bookworm'
-
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
    agent_config.enable_prompt_extensions = False
@@ -144,7 +146,7 @@ def process_instance(

    logger.info(f'Final message: {final_message} | Ground truth: {instance["text"]}')
    test_result = game.reward()
-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None

    # history is now available as a stream of events, rather than list of pairs of (Action, Observation)
    # for compatibility with the existing output format, we can remake the pairs here
--- a/evaluation/benchmarks/agent_bench/run_infer.py
+++ b/evaluation/benchmarks/agent_bench/run_infer.py
@@ -17,8 +17,7 @@ from evaluation.utils.shared import (
    EvalMetadata,
    EvalOutput,
    compatibility_for_eval_history_pairs,
-    get_metrics,
-    get_openhands_config_for_eval,
+    get_default_sandbox_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -41,12 +40,19 @@ from openhands.utils.async_utils import call_async_from_sync
 def get_config(
    metadata: EvalMetadata,
 ) -> OpenHandsConfig:
-    # Create config with agent_bench-specific container image
-    config = get_openhands_config_for_eval(metadata=metadata)
-
-    # Override the container image for agent_bench
-    config.sandbox.base_container_image = 'python:3.12-slim'
+    sandbox_config = get_default_sandbox_config_for_eval()
+    sandbox_config.base_container_image = 'python:3.12-slim'

+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
+        runtime=os.environ.get('RUNTIME', 'docker'),
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
+    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
    agent_config.enable_prompt_extensions = False
@@ -267,7 +273,7 @@ def process_instance(
    # remove when it becomes unnecessary
    histories = compatibility_for_eval_history_pairs(state.history)

-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None

    # Save the output
    output = EvalOutput(
--- a/evaluation/benchmarks/aider_bench/run_infer.py
+++ b/evaluation/benchmarks/aider_bench/run_infer.py
@@ -17,8 +17,6 @@ from evaluation.utils.shared import (
    EvalOutput,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -51,10 +49,15 @@ def get_config(
 ) -> OpenHandsConfig:
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = 'python:3.11-bookworm'
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
-        sandbox_config=sandbox_config,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime=os.environ.get('RUNTIME', 'docker'),
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
@@ -243,7 +246,7 @@ def process_instance(
    # for compatibility with the existing output format, we can remake the pairs here
    # remove when it becomes unnecessary
    histories = compatibility_for_eval_history_pairs(state.history)
-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None

    # Save the output
    output = EvalOutput(
--- a/evaluation/benchmarks/biocoder/run_infer.py
+++ b/evaluation/benchmarks/biocoder/run_infer.py
@@ -15,8 +15,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -62,10 +60,15 @@ def get_config(
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = BIOCODER_BENCH_CONTAINER_IMAGE

-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
@@ -291,7 +294,7 @@ def process_instance(
        raise ValueError('State should not be None.')

    test_result = complete_runtime(runtime, instance)
-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None
    # history is now available as a stream of events, rather than list of pairs of (Action, Observation)
    # for compatibility with the existing output format, we can remake the pairs here
    # remove when it becomes unnecessary
--- a/evaluation/benchmarks/bird/run_infer.py
+++ b/evaluation/benchmarks/bird/run_infer.py
@@ -18,8 +18,6 @@ from evaluation.utils.shared import (
    EvalOutput,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -76,10 +74,15 @@ def get_config(
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = 'python:3.12-bookworm'

-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
@@ -419,7 +422,7 @@ def process_instance(
    # You can simply get the LAST `MessageAction` from the returned `state.history` and parse it for evaluation.
    if state is None:
        raise ValueError('State should not be None.')
-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None

    # history is now available as a stream of events, rather than list of pairs of (Action, Observation)
    # for compatibility with the existing output format, we can remake the pairs here
--- a/evaluation/benchmarks/browsing_delegation/run_infer.py
+++ b/evaluation/benchmarks/browsing_delegation/run_infer.py
@@ -11,8 +11,6 @@ from evaluation.utils.shared import (
    EvalOutput,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -41,8 +39,14 @@ def get_config(
    )
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = 'python:3.12-bookworm'
-    config = get_openhands_config_for_eval(
-        metadata=metadata, runtime='docker', sandbox_config=sandbox_config
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
+        runtime='docker',
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
@@ -84,7 +88,7 @@ def process_instance(
    if state is None:
        raise ValueError('State should not be None.')

-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None
    # history is now available as a stream of events, rather than list of pairs of (Action, Observation)
    # for compatibility with the existing output format, we can remake the pairs here
    # remove when it becomes unnecessary
--- a/evaluation/benchmarks/commit0/run_infer.py
+++ b/evaluation/benchmarks/commit0/run_infer.py
@@ -16,8 +16,6 @@ from evaluation.utils.shared import (
    assert_and_raise,
    codeact_user_response,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -115,11 +113,16 @@ def get_config(
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = base_container_image

-    config = get_openhands_config_for_eval(
-        metadata=metadata,
-        sandbox_config=sandbox_config,
-        runtime=os.environ.get('RUNTIME', 'docker'),
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
+        max_iterations=metadata.max_iterations,
        enable_browser=RUN_WITH_BROWSING,
+        runtime=os.environ.get('RUNTIME', 'docker'),
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(
        update_llm_config_for_completions_logging(
@@ -477,7 +480,7 @@ def process_instance(

    # NOTE: this is NO LONGER the event stream, but an agent history that includes delegate agent's events
    histories = [event_to_dict(event) for event in state.history]
-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None

    # Save the output
    output = EvalOutput(
--- a/evaluation/benchmarks/discoverybench/run_infer.py
+++ b/evaluation/benchmarks/discoverybench/run_infer.py
@@ -17,8 +17,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -66,10 +64,15 @@ def get_config(
 ) -> OpenHandsConfig:
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = 'python:3.12-bookworm'
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
@@ -291,7 +294,7 @@ def process_instance(
    if state is None:
        raise ValueError('State should not be None.')

-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None
    test_result = complete_runtime(state)

    # history is now available as a stream of events, rather than list of pairs of (Action, Observation)
--- a/evaluation/benchmarks/gaia/run_infer.py
+++ b/evaluation/benchmarks/gaia/run_infer.py
@@ -22,8 +22,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -61,10 +59,15 @@ def get_config(
 ) -> OpenHandsConfig:
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = 'nikolaik/python-nodejs:python3.12-nodejs22'
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
-        sandbox_config=sandbox_config,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    if metadata.agent_config:
@@ -266,7 +269,7 @@ Here is the task:
        'model_answer': model_answer,
        'ground_truth': instance['Final answer'],
    }
-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None

    # history is now available as a stream of events, rather than list of pairs of (Action, Observation)
    # for compatibility with the existing output format, we can remake the pairs here
--- a/evaluation/benchmarks/gorilla/run_infer.py
+++ b/evaluation/benchmarks/gorilla/run_infer.py
@@ -12,8 +12,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -44,10 +42,15 @@ def get_config(
 ) -> OpenHandsConfig:
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = 'python:3.12-bookworm'
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
@@ -105,7 +108,7 @@ def process_instance(
    # attempt to parse model_answer
    ast_eval_fn = instance['ast_eval']
    correct, hallucination = ast_eval_fn(instance_id, model_answer_raw)
-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None
    logger.info(
        f'Final message: {model_answer_raw} | Correctness: {correct} | Hallucination: {hallucination}'
    )
--- a/evaluation/benchmarks/gpqa/run_infer.py
+++ b/evaluation/benchmarks/gpqa/run_infer.py
@@ -30,8 +30,6 @@ from evaluation.utils.shared import (
    EvalOutput,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -65,10 +63,15 @@ def get_config(
 ) -> OpenHandsConfig:
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = 'python:3.12-bookworm'
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
@@ -289,7 +292,7 @@ Ok now its time to start solving the question. Good luck!
    if state is None:
        raise ValueError('State should not be None.')

-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None

    # Save the output
    output = EvalOutput(
--- a/evaluation/benchmarks/humanevalfix/run_infer.py
+++ b/evaluation/benchmarks/humanevalfix/run_infer.py
@@ -23,8 +23,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -86,10 +84,15 @@ def get_config(
 ) -> OpenHandsConfig:
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = 'python:3.12-bookworm'
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
@@ -245,7 +248,7 @@ def process_instance(

    if state is None:
        raise ValueError('State should not be None.')
-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None
    test_result = complete_runtime(runtime, instance)

    # history is now available as a stream of events, rather than list of pairs of (Action, Observation)
--- a/evaluation/benchmarks/lca_ci_build_repair/eval_infer.py
+++ b/evaluation/benchmarks/lca_ci_build_repair/eval_infer.py
@@ -16,7 +16,6 @@ import ruamel.yaml
 from evaluation.utils.shared import (
    EvalMetadata,
    get_default_sandbox_config_for_eval,
-    get_openhands_config_for_eval,
    make_metadata,
 )
 from openhands.core.config import (
@@ -38,10 +37,15 @@ def get_config(
 ) -> OpenHandsConfig:
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = 'python:3.12-bookworm'
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
--- a/evaluation/benchmarks/lca_ci_build_repair/run_infer.py
+++ b/evaluation/benchmarks/lca_ci_build_repair/run_infer.py
@@ -22,8 +22,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -49,10 +47,15 @@ def get_config(
 ) -> OpenHandsConfig:
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = 'python:3.12-bookworm'
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
@@ -332,7 +335,7 @@ Be thorough in your exploration, testing, and reasoning. It's fine if your think
        )
    )
    assert state is not None
-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else {}

    test_result = complete_runtime(runtime, instance)

--- a/evaluation/benchmarks/logic_reasoning/run_infer.py
+++ b/evaluation/benchmarks/logic_reasoning/run_infer.py
@@ -10,8 +10,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -53,10 +51,15 @@ def get_config(
        '$OH_INTERPRETER_PATH -m pip install scitools-pyke'
    )

-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
@@ -244,7 +247,7 @@ def process_instance(
    )
    test_result['final_message'] = final_message

-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None
    # history is now available as a stream of events, rather than list of pairs of (Action, Observation)
    # for compatibility with the existing output format, we can remake the pairs here
    # remove when it becomes unnecessary
--- a/evaluation/benchmarks/miniwob/run_infer.py
+++ b/evaluation/benchmarks/miniwob/run_infer.py
@@ -13,8 +13,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -59,10 +57,15 @@ def get_config(
 ) -> OpenHandsConfig:
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = 'xingyaoww/od-eval-miniwob:v1.0'
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime=os.environ.get('RUNTIME', 'docker'),
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(
        update_llm_config_for_completions_logging(
@@ -171,7 +174,7 @@ def process_instance(
    if state is None:
        raise ValueError('State should not be None.')

-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None

    # Instruction is the first message from the USER
    instruction = ''
--- a/evaluation/benchmarks/mint/run_infer.py
+++ b/evaluation/benchmarks/mint/run_infer.py
@@ -15,8 +15,6 @@ from evaluation.utils.shared import (
    EvalOutput,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -111,10 +109,15 @@ def get_config(
        f'$OH_INTERPRETER_PATH -m pip install {" ".join(MINT_DEPENDENCIES)}'
    )

-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
@@ -202,7 +205,7 @@ def process_instance(
        task_state = state.extra_data['task_state']
        logger.info('Task state: ' + str(task_state.to_dict()))

-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None

    # history is now available as a stream of events, rather than list of pairs of (Action, Observation)
    # for compatibility with the existing output format, we can remake the pairs here
--- a/evaluation/benchmarks/ml_bench/run_infer.py
+++ b/evaluation/benchmarks/ml_bench/run_infer.py
@@ -26,8 +26,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -81,10 +79,15 @@ def get_config(
 ) -> OpenHandsConfig:
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = 'public.ecr.aws/i5g0m1f6/ml-bench'
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
@@ -247,7 +250,7 @@ def process_instance(instance: Any, metadata: EvalMetadata, reset_logger: bool =
        )
    )
    assert state is not None
-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else {}

    test_result = complete_runtime(runtime)

--- a/evaluation/benchmarks/multi_swe_bench/eval_infer.py
+++ b/evaluation/benchmarks/multi_swe_bench/eval_infer.py
@@ -23,7 +23,6 @@ from evaluation.utils.shared import (
    EvalMetadata,
    EvalOutput,
    get_default_sandbox_config_for_eval,
-    get_openhands_config_for_eval,
    prepare_dataset,
    reset_logger_for_multiprocessing,
    run_evaluation,
@@ -88,9 +87,13 @@ def get_config(metadata: EvalMetadata, instance: pd.Series) -> OpenHandsConfig:
        dataset_name=metadata.dataset,
        instance_id=instance['instance_id'],
    )
-    config = get_openhands_config_for_eval(
+    config = OpenHandsConfig(
+        run_as_openhands=False,
        runtime=os.environ.get('RUNTIME', 'docker'),
-        sandbox_config=sandbox_config,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    return config

--- a/evaluation/benchmarks/multi_swe_bench/run_infer.py
+++ b/evaluation/benchmarks/multi_swe_bench/run_infer.py
@@ -21,7 +21,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    get_default_sandbox_config_for_eval,
    get_metrics,
-    get_openhands_config_for_eval,
    is_fatal_evaluation_error,
    make_metadata,
    prepare_dataset,
@@ -342,11 +341,16 @@ def get_config(
        instance_id=instance['instance_id'],
    )

-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
+        max_iterations=metadata.max_iterations,
        enable_browser=RUN_WITH_BROWSING,
        runtime=os.environ.get('RUNTIME', 'docker'),
-        sandbox_config=sandbox_config,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(
        update_llm_config_for_completions_logging(
--- a/evaluation/benchmarks/nocode_bench/run_infer_nc.py
+++ b/evaluation/benchmarks/nocode_bench/run_infer_nc.py
@@ -31,7 +31,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    get_default_sandbox_config_for_eval,
    get_metrics,
-    get_openhands_config_for_eval,
    is_fatal_evaluation_error,
    make_metadata,
    prepare_dataset,
@@ -175,10 +174,15 @@ def get_config(
        instance_id=instance['instance_id'],
    )

-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
+        max_iterations=metadata.max_iterations,
        runtime=os.environ.get('RUNTIME', 'docker'),
-        sandbox_config=sandbox_config,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )

    config.set_llm_config(
--- a/evaluation/benchmarks/scienceagentbench/run_infer.py
+++ b/evaluation/benchmarks/scienceagentbench/run_infer.py
@@ -12,8 +12,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -65,10 +63,16 @@ def get_config(
    sandbox_config.base_container_image = (
        'docker.io/xingyaoww/openhands-eval-scienceagentbench'
    )
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime=os.environ.get('RUNTIME', 'docker'),
-        sandbox_config=sandbox_config,
+        max_budget_per_task=4,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(
        update_llm_config_for_completions_logging(
@@ -214,7 +218,7 @@ If the program uses some packages that are incompatible, please figure out alter
    # You can simply get the LAST `MessageAction` from the returned `state.history` and parse it for evaluation.
    if state is None:
        raise ValueError('State should not be None.')
-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None

    # history is now available as a stream of events, rather than list of pairs of (Action, Observation)
    # for compatibility with the existing output format, we can remake the pairs here
--- a/evaluation/benchmarks/swe_bench/eval_infer.py
+++ b/evaluation/benchmarks/swe_bench/eval_infer.py
@@ -19,7 +19,6 @@ from evaluation.utils.shared import (
    EvalMetadata,
    EvalOutput,
    get_default_sandbox_config_for_eval,
-    get_openhands_config_for_eval,
    prepare_dataset,
    reset_logger_for_multiprocessing,
    run_evaluation,
@@ -84,9 +83,13 @@ def get_config(metadata: EvalMetadata, instance: pd.Series) -> OpenHandsConfig:
        dataset_name=metadata.dataset,
        instance_id=instance['instance_id'],
    )
-    config = get_openhands_config_for_eval(
+    config = OpenHandsConfig(
+        run_as_openhands=False,
        runtime=os.environ.get('RUNTIME', 'docker'),
-        sandbox_config=sandbox_config,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    return config

--- a/evaluation/benchmarks/swe_bench/run_infer.py
+++ b/evaluation/benchmarks/swe_bench/run_infer.py
@@ -32,7 +32,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    get_default_sandbox_config_for_eval,
    get_metrics,
-    get_openhands_config_for_eval,
    is_fatal_evaluation_error,
    make_metadata,
    prepare_dataset,
@@ -228,11 +227,16 @@ def get_config(
        instance_id=instance['instance_id'],
    )

-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
+        max_iterations=metadata.max_iterations,
        enable_browser=RUN_WITH_BROWSING,
        runtime=os.environ.get('RUNTIME', 'docker'),
-        sandbox_config=sandbox_config,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )

    config.set_llm_config(
--- a/evaluation/benchmarks/swe_bench/run_infer_interact.py
+++ b/evaluation/benchmarks/swe_bench/run_infer_interact.py
@@ -21,7 +21,6 @@ from evaluation.utils.shared import (
    EvalException,
    EvalMetadata,
    EvalOutput,
-    get_metrics,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -180,7 +179,7 @@ def process_instance(
        raise ValueError('State should not be None.')

    histories = [event_to_dict(event) for event in state.history]
-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None

    # Save the output
    instruction = message_action.content
--- a/evaluation/benchmarks/swe_bench/run_localize.py
+++ b/evaluation/benchmarks/swe_bench/run_localize.py
@@ -20,7 +20,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    get_default_sandbox_config_for_eval,
    get_metrics,
-    get_openhands_config_for_eval,
    is_fatal_evaluation_error,
    make_metadata,
    prepare_dataset,
@@ -200,11 +199,16 @@ def get_config(
        'REPO_PATH': f'/workspace/{workspace_dir_name}/',
    }

-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
+        max_iterations=metadata.max_iterations,
        enable_browser=RUN_WITH_BROWSING,
        runtime=os.environ.get('RUNTIME', 'docker'),
-        sandbox_config=sandbox_config,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(
        update_llm_config_for_completions_logging(
--- a/evaluation/benchmarks/swe_bench/scripts/rollout_swegym.sh
+++ b/evaluation/benchmarks/swe_bench/scripts/rollout_swegym.sh
@@ -13,7 +13,6 @@ N_RUNS=${4:-1}
 export EXP_NAME=$EXP_NAME
 # use 2x resources for rollout since some codebases are pretty resource-intensive
 export DEFAULT_RUNTIME_RESOURCE_FACTOR=2
-export ITERATIVE_EVAL_MODE=false
 echo "MODEL: $MODEL"
 echo "EXP_NAME: $EXP_NAME"
 DATASET="SWE-Gym/SWE-Gym"  # change this to the "/SWE-Gym-Lite" if you want to rollout the lite subset
--- a/evaluation/benchmarks/testgeneval/eval_infer.py
+++ b/evaluation/benchmarks/testgeneval/eval_infer.py
@@ -37,7 +37,6 @@ from evaluation.benchmarks.testgeneval.utils import load_testgeneval_dataset
 from evaluation.utils.shared import (
    EvalMetadata,
    EvalOutput,
-    get_openhands_config_for_eval,
    prepare_dataset,
    reset_logger_for_multiprocessing,
    run_evaluation,
@@ -59,21 +58,20 @@ def get_config(instance: pd.Series) -> OpenHandsConfig:
        f'Invalid container image for instance {instance["instance_id_swebench"]}.'
    )
    logger.info(f'Using instance container image: {base_container_image}.')
-
-    # Create custom sandbox config for testgeneval with specific requirements
-    sandbox_config = SandboxConfig(
-        base_container_image=base_container_image,
-        use_host_network=False,
-        timeout=1800,  # Longer timeout than default (300)
-        api_key=os.environ.get('ALLHANDS_API_KEY'),
-        remote_runtime_api_url=os.environ.get(
-            'SANDBOX_REMOTE_RUNTIME_API_URL', 'http://localhost:8000'
+    return OpenHandsConfig(
+        run_as_openhands=False,
+        runtime=os.environ.get('RUNTIME', 'eventstream'),
+        sandbox=SandboxConfig(
+            base_container_image=base_container_image,
+            use_host_network=False,
+            timeout=1800,
+            api_key=os.environ.get('ALLHANDS_API_KEY'),
+            remote_runtime_api_url=os.environ.get(
+                'SANDBOX_REMOTE_RUNTIME_API_URL', 'http://localhost:8000'
+            ),
        ),
-    )
-
-    return get_openhands_config_for_eval(
-        sandbox_config=sandbox_config,
-        runtime=os.environ.get('RUNTIME', 'docker'),  # Different default runtime
+        workspace_base=None,
+        workspace_mount_path=None,
    )


--- a/evaluation/benchmarks/testgeneval/run_infer.py
+++ b/evaluation/benchmarks/testgeneval/run_infer.py
@@ -25,7 +25,6 @@ from evaluation.utils.shared import (
    assert_and_raise,
    codeact_user_response,
    get_metrics,
-    get_openhands_config_for_eval,
    is_fatal_evaluation_error,
    make_metadata,
    prepare_dataset,
@@ -127,26 +126,29 @@ def get_config(
        f'Submit an issue on https://github.com/All-Hands-AI/OpenHands if you run into any issues.'
    )

-    sandbox_config = SandboxConfig(
-        base_container_image=base_container_image,
-        enable_auto_lint=True,
-        use_host_network=False,
-        # large enough timeout, since some testcases take very long to run
-        timeout=300,
-        # Add platform to the sandbox config to solve issue 4401
-        platform='linux/amd64',
-        api_key=os.environ.get('ALLHANDS_API_KEY', None),
-        remote_runtime_api_url=os.environ.get(
-            'SANDBOX_REMOTE_RUNTIME_API_URL', 'http://localhost:8000'
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
+        max_iterations=metadata.max_iterations,
+        runtime=os.environ.get('RUNTIME', 'eventstream'),
+        sandbox=SandboxConfig(
+            base_container_image=base_container_image,
+            enable_auto_lint=True,
+            use_host_network=False,
+            # large enough timeout, since some testcases take very long to run
+            timeout=300,
+            # Add platform to the sandbox config to solve issue 4401
+            platform='linux/amd64',
+            api_key=os.environ.get('ALLHANDS_API_KEY', None),
+            remote_runtime_api_url=os.environ.get(
+                'SANDBOX_REMOTE_RUNTIME_API_URL', 'http://localhost:8000'
+            ),
+            keep_runtime_alive=False,
+            remote_runtime_init_timeout=3600,
        ),
-        keep_runtime_alive=False,
-        remote_runtime_init_timeout=3600,
-    )
-
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
-        sandbox_config=sandbox_config,
-        runtime=os.environ.get('RUNTIME', 'docker'),
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(
        update_llm_config_for_completions_logging(
--- a/evaluation/benchmarks/the_agent_company/run_infer.py
+++ b/evaluation/benchmarks/the_agent_company/run_infer.py
@@ -12,10 +12,7 @@ import tempfile
 import yaml
 from browsing import pre_login

-from evaluation.utils.shared import (
-    get_default_sandbox_config_for_eval,
-    get_openhands_config_for_eval,
-)
+from evaluation.utils.shared import get_default_sandbox_config_for_eval
 from openhands.controller.state.state import State
 from openhands.core.config import (
    LLMConfig,
@@ -45,17 +42,19 @@ def get_config(
    sandbox_config.enable_auto_lint = True
    # If the web services are running on the host machine, this must be set to True
    sandbox_config.use_host_network = True
-    config = get_openhands_config_for_eval(
+    config = OpenHandsConfig(
+        run_as_openhands=False,
+        max_budget_per_task=4,
        max_iterations=100,
+        save_trajectory_path=os.path.join(
+            mount_path_on_host, f'traj_{task_short_name}.json'
+        ),
+        sandbox=sandbox_config,
        # we mount trajectories path so that trajectories, generated by OpenHands
        # controller, can be accessible to the evaluator file in the runtime container
-        sandbox_config=sandbox_config,
        workspace_mount_path=mount_path_on_host,
+        workspace_mount_path_in_sandbox='/outputs',
    )
-    config.save_trajectory_path = os.path.join(
-        mount_path_on_host, f'traj_{task_short_name}.json'
-    )
-    config.max_budget_per_task = 4
    config.set_llm_config(llm_config)
    if agent_config:
        config.set_agent_config(agent_config)
--- a/evaluation/benchmarks/toolqa/run_infer.py
+++ b/evaluation/benchmarks/toolqa/run_infer.py
@@ -11,8 +11,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -45,10 +43,15 @@ def get_config(
 ) -> OpenHandsConfig:
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.base_container_image = 'python:3.12-bookworm'
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
@@ -131,7 +134,7 @@ def process_instance(instance: Any, metadata: EvalMetadata, reset_logger: bool =
    correct = eval_answer(str(model_answer_raw), str(answer))
    logger.info(f'Final message: {model_answer_raw} | Correctness: {correct}')

-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None

    # history is now available as a stream of events, rather than list of pairs of (Action, Observation)
    # for compatibility with the existing output format, we can remake the pairs here
--- a/evaluation/benchmarks/visual_swe_bench/run_infer.py
+++ b/evaluation/benchmarks/visual_swe_bench/run_infer.py
@@ -20,7 +20,6 @@ from evaluation.utils.shared import (
    codeact_user_response,
    get_default_sandbox_config_for_eval,
    get_metrics,
-    get_openhands_config_for_eval,
    is_fatal_evaluation_error,
    make_metadata,
    prepare_dataset,
@@ -161,11 +160,16 @@ def get_config(
        instance_id=instance['instance_id'],
    )

-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
+        max_iterations=metadata.max_iterations,
        enable_browser=RUN_WITH_BROWSING,
        runtime=os.environ.get('RUNTIME', 'docker'),
-        sandbox_config=sandbox_config,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(
        update_llm_config_for_completions_logging(
--- a/evaluation/benchmarks/visualwebarena/run_infer.py
+++ b/evaluation/benchmarks/visualwebarena/run_infer.py
@@ -12,8 +12,6 @@ from evaluation.utils.shared import (
    EvalOutput,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -74,10 +72,16 @@ def get_config(
        'VWA_WIKIPEDIA': f'{base_url}:8888',
        'VWA_HOMEPAGE': f'{base_url}:4399',
    }
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
+        attach_to_existing=True,
    )
    config.set_llm_config(
        update_llm_config_for_completions_logging(
@@ -175,7 +179,7 @@ def process_instance(
    if state is None:
        raise ValueError('State should not be None.')

-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None

    # Instruction obtained from the first message from the USER
    instruction = ''
--- a/evaluation/benchmarks/webarena/run_infer.py
+++ b/evaluation/benchmarks/webarena/run_infer.py
@@ -12,8 +12,6 @@ from evaluation.utils.shared import (
    EvalOutput,
    compatibility_for_eval_history_pairs,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -66,10 +64,15 @@ def get_config(
        'MAP': f'{base_url}:3000',
        'HOMEPAGE': f'{base_url}:4399',
    }
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime='docker',
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
    )
    config.set_llm_config(metadata.llm_config)
    agent_config = config.get_agent_config(metadata.agent_class)
@@ -160,7 +163,7 @@ def process_instance(
    if state is None:
        raise ValueError('State should not be None.')

-    metrics = get_metrics(state)
+    metrics = state.metrics.get() if state.metrics else None

    # Instruction is the first message from the USER
    instruction = ''
--- a/evaluation/integration_tests/run_infer.py
+++ b/evaluation/integration_tests/run_infer.py
@@ -9,8 +9,6 @@ from evaluation.utils.shared import (
    EvalMetadata,
    EvalOutput,
    get_default_sandbox_config_for_eval,
-    get_metrics,
-    get_openhands_config_for_eval,
    make_metadata,
    prepare_dataset,
    reset_logger_for_multiprocessing,
@@ -46,12 +44,18 @@ def get_config(
 ) -> OpenHandsConfig:
    sandbox_config = get_default_sandbox_config_for_eval()
    sandbox_config.platform = 'linux/amd64'
-    config = get_openhands_config_for_eval(
-        metadata=metadata,
+    config = OpenHandsConfig(
+        default_agent=metadata.agent_class,
+        run_as_openhands=False,
        runtime=os.environ.get('RUNTIME', 'docker'),
-        sandbox_config=sandbox_config,
+        max_iterations=metadata.max_iterations,
+        sandbox=sandbox_config,
+        # do not mount workspace
+        workspace_base=None,
+        workspace_mount_path=None,
+        # debug
+        debug=True,
    )
-    config.debug = True
    config.set_llm_config(
        update_llm_config_for_completions_logging(
            metadata.llm_config, metadata.eval_output_dir, instance_id
@@ -131,7 +135,7 @@ def process_instance(
        assert len(histories) > 0, 'History should not be empty'

        test_result: TestResult = test_class.verify_result(runtime, histories)
-        metrics = get_metrics(state)
+        metrics = state.metrics.get() if state.metrics else None
    finally:
        runtime.close()

--- a/evaluation/utils/scripts/aggregate_token_usage.py
+++ b/evaluation/utils/scripts/aggregate_token_usage.py
@@ -1,209 +0,0 @@
-#!/usr/bin/env python3
-"""
-Script to aggregate token usage metrics from LLM completion files.
-
-Usage:
-    python aggregate_token_usage.py <directory_path> [--input-cost <cost>] [--output-cost <cost>] [--cached-cost <cost>]
-
-Arguments:
-    directory_path: Path to the directory containing completion files
-    --input-cost: Cost per input token (default: 0.0)
-    --output-cost: Cost per output token (default: 0.0)
-    --cached-cost: Cost per cached token (default: 0.0)
-"""
-
-import argparse
-import json
-import os
-from pathlib import Path
-
-
-def aggregate_token_usage(
-    directory_path, input_cost=0.0, output_cost=0.0, cached_cost=0.0
-):
-    """
-    Aggregate token usage metrics from all JSON completion files in the directory.
-
-    Args:
-        directory_path (str): Path to directory containing completion files
-        input_cost (float): Cost per input token
-        output_cost (float): Cost per output token
-        cached_cost (float): Cost per cached token
-    """
-
-    # Initialize counters
-    totals = {
-        'input_tokens': 0,
-        'output_tokens': 0,
-        'cached_tokens': 0,
-        'total_tokens': 0,
-        'files_processed': 0,
-        'files_with_errors': 0,
-        'cost': 0,
-    }
-
-    # Find all JSON files recursively
-    json_files = list(Path(directory_path).rglob('*.json'))
-
-    print(f'Found {len(json_files)} JSON files to process...')
-
-    for json_file in json_files:
-        try:
-            with open(json_file, 'r', encoding='utf-8') as f:
-                data = json.load(f)
-
-            # Look for usage data in response or fncall_response
-            usage_data = None
-            if (
-                'response' in data
-                and isinstance(data['response'], dict)
-                and 'usage' in data['response']
-            ):
-                usage_data = data['response']['usage']
-            elif (
-                'fncall_response' in data
-                and isinstance(data['fncall_response'], dict)
-                and 'usage' in data['fncall_response']
-            ):
-                usage_data = data['fncall_response']['usage']
-
-            if usage_data:
-                # Extract token counts
-                completion_tokens = usage_data.get('completion_tokens', 0)
-                prompt_tokens = usage_data.get('prompt_tokens', 0)
-                cached_tokens = usage_data.get('cached_tokens', 0)
-
-                # Handle cases where cached_tokens might be in prompt_tokens_details
-                if cached_tokens == 0 and 'prompt_tokens_details' in usage_data:
-                    details = usage_data['prompt_tokens_details']
-                    if isinstance(details, dict) and 'cached_tokens' in details:
-                        cached_tokens = details.get('cached_tokens', 0) or 0
-
-                # Calculate non-cached input tokens
-                non_cached_input = prompt_tokens - cached_tokens
-
-                # Update totals
-                totals['input_tokens'] += non_cached_input
-                totals['output_tokens'] += completion_tokens
-                totals['cached_tokens'] += cached_tokens
-                totals['total_tokens'] += prompt_tokens + completion_tokens
-
-            if 'cost' in data:
-                totals['cost'] += data['cost']
-            totals['files_processed'] += 1
-
-            # Progress indicator
-            if totals['files_processed'] % 1000 == 0:
-                print(f'Processed {totals["files_processed"]} files...')
-
-        except Exception as e:
-            totals['files_with_errors'] += 1
-            if totals['files_with_errors'] <= 5:  # Only show first 5 errors
-                print(f'Error processing {json_file}: {e}')
-
-    # Calculate costs
-    input_cost_total = totals['input_tokens'] * input_cost
-    output_cost_total = totals['output_tokens'] * output_cost
-    cached_cost_total = totals['cached_tokens'] * cached_cost
-    total_cost = input_cost_total + output_cost_total + cached_cost_total
-
-    # Print results
-    print('\n' + '=' * 60)
-    print('TOKEN USAGE AGGREGATION RESULTS')
-    print('=' * 60)
-    print(f'Files processed: {totals["files_processed"]:,}')
-    print(f'Files with errors: {totals["files_with_errors"]:,}')
-    print()
-    print('TOKEN COUNTS:')
-    print(f'  Input tokens (non-cached):             {totals["input_tokens"]:,}')
-    print(f'  Output tokens:                         {totals["output_tokens"]:,}')
-    print(f'  Cached tokens:                         {totals["cached_tokens"]:,}')
-    print(f'  Total tokens:                          {totals["total_tokens"]:,}')
-    print(f'  Total costs (based on returned value): ${totals["cost"]:.6f}')
-    print()
-
-    if input_cost > 0 or output_cost > 0 or cached_cost > 0:
-        print('COST CALCULATED BASED ON PROVIDED RATE:')
-        print(
-            f'  Input cost:   ${input_cost_total:.6f} ({totals["input_tokens"]:,} × ${input_cost:.6f})'
-        )
-        print(
-            f'  Output cost:  ${output_cost_total:.6f} ({totals["output_tokens"]:,} × ${output_cost:.6f})'
-        )
-        print(
-            f'  Cached cost:  ${cached_cost_total:.6f} ({totals["cached_tokens"]:,} × ${cached_cost:.6f})'
-        )
-        print(f'  Total cost:   ${total_cost:.6f}')
-        print()
-
-    print('SUMMARY:')
-    print(
-        f'  Total input tokens:  {totals["input_tokens"] + totals["cached_tokens"]:,}'
-    )
-    print(f'  Total output tokens: {totals["output_tokens"]:,}')
-    print(f'  Grand total tokens:  {totals["total_tokens"]:,}')
-
-    return totals
-
-
-def main():
-    parser = argparse.ArgumentParser(
-        description='Aggregate token usage metrics from LLM completion files',
-        formatter_class=argparse.RawDescriptionHelpFormatter,
-        epilog="""
-Examples:
-  python aggregate_token_usage.py /path/to/completions
-  python aggregate_token_usage.py /path/to/completions --input-cost 0.000001 --output-cost 0.000002
-  python aggregate_token_usage.py /path/to/completions --input-cost 0.000001 --output-cost 0.000002 --cached-cost 0.0000005
-        """,
-    )
-
-    parser.add_argument(
-        'directory_path', help='Path to directory containing completion files'
-    )
-
-    parser.add_argument(
-        '--input-cost',
-        type=float,
-        default=0.0,
-        help='Cost per input token (default: 0.0)',
-    )
-
-    parser.add_argument(
-        '--output-cost',
-        type=float,
-        default=0.0,
-        help='Cost per output token (default: 0.0)',
-    )
-
-    parser.add_argument(
-        '--cached-cost',
-        type=float,
-        default=0.0,
-        help='Cost per cached token (default: 0.0)',
-    )
-
-    args = parser.parse_args()
-
-    # Validate directory path
-    if not os.path.exists(args.directory_path):
-        print(f"Error: Directory '{args.directory_path}' does not exist.")
-        return 1
-
-    if not os.path.isdir(args.directory_path):
-        print(f"Error: '{args.directory_path}' is not a directory.")
-        return 1
-
-    # Run aggregation
-    try:
-        aggregate_token_usage(
-            args.directory_path, args.input_cost, args.output_cost, args.cached_cost
-        )
-        return 0
-    except Exception as e:
-        print(f'Error during aggregation: {e}')
-        return 1
-
-
-if __name__ == '__main__':
-    exit(main())
--- a/evaluation/utils/shared.py
+++ b/evaluation/utils/shared.py
@@ -668,23 +668,8 @@ def is_fatal_runtime_error(error: str | None) -> bool:


 def get_metrics(state: State) -> dict[str, Any]:
-    """Extract metrics for evaluations.
-
-    Prefer ConversationStats (source of truth) and fall back to state.metrics for
-    backward compatibility.
-    """
-    metrics: dict[str, Any]
-    try:
-        if getattr(state, 'conversation_stats', None):
-            combined = state.conversation_stats.get_combined_metrics()
-            metrics = combined.get()
-        elif getattr(state, 'metrics', None):
-            metrics = state.metrics.get()
-        else:
-            metrics = {}
-    except Exception:
-        metrics = state.metrics.get() if getattr(state, 'metrics', None) else {}
-
+    """Extract metrics from the state."""
+    metrics = state.metrics.get() if state.metrics else {}
    metrics['condenser'] = get_condensation_metadata(state)
    return metrics

@@ -703,79 +688,3 @@ def get_default_sandbox_config_for_eval() -> SandboxConfig:
        remote_runtime_enable_retries=True,
        remote_runtime_class='sysbox',
    )
-
-
-def get_openhands_config_for_eval(
-    metadata: EvalMetadata | None = None,
-    sandbox_config: SandboxConfig | None = None,
-    runtime: str | None = None,
-    max_iterations: int | None = None,
-    default_agent: str | None = None,
-    enable_browser: bool = False,
-    workspace_base: str | None = None,
-    workspace_mount_path: str | None = None,
-):
-    """Create an OpenHandsConfig with common patterns used across evaluation scripts.
-
-    This function provides a standardized way to create OpenHands configurations
-    for evaluation runs, with sensible defaults that match the patterns used in
-    most run_infer.py scripts. Individual evaluation scripts can override specific
-    attributes as needed.
-
-    Args:
-        metadata: EvalMetadata containing agent class, max iterations, etc.
-        sandbox_config: Custom sandbox config. If None, uses get_default_sandbox_config_for_eval()
-        runtime: Runtime type. If None, uses environment RUNTIME or 'docker'
-        max_iterations: Max iterations for the agent. If None, uses metadata.max_iterations
-        default_agent: Agent class name. If None, uses metadata.agent_class
-        enable_browser: Whether to enable browser functionality
-        workspace_base: Workspace base path. Defaults to None
-        workspace_mount_path: Workspace mount path. Defaults to None
-
-    Returns:
-        OpenHandsConfig: Configured for evaluation with eval-specific overrides applied
-    """
-    # Defer import to avoid circular imports at module load time
-    from openhands.core.config.openhands_config import (
-        OpenHandsConfig as _OHConfig,  # type: ignore
-    )
-
-    # Use provided sandbox config or get default
-    if sandbox_config is None:
-        sandbox_config = get_default_sandbox_config_for_eval()
-
-    # Extract values from metadata if provided
-    if metadata is not None:
-        if max_iterations is None:
-            max_iterations = metadata.max_iterations
-        if default_agent is None:
-            default_agent = metadata.agent_class
-
-    # Use environment runtime or default
-    if runtime is None:
-        runtime = os.environ.get('RUNTIME', 'docker')
-
-    # Provide sensible defaults if still None
-    if default_agent is None:
-        default_agent = 'CodeActAgent'
-    if max_iterations is None:
-        max_iterations = 50
-
-    # Always use repo-local .eval_sessions directory (absolute path)
-    eval_store = os.path.abspath(os.path.join(os.getcwd(), '.eval_sessions'))
-
-    # Create the base config with evaluation-specific overrides
-    config = _OHConfig(
-        default_agent=default_agent,
-        run_as_openhands=False,
-        runtime=runtime,
-        max_iterations=max_iterations,
-        enable_browser=enable_browser,
-        sandbox=sandbox_config,
-        workspace_base=workspace_base,
-        workspace_mount_path=workspace_mount_path,
-        file_store='local',
-        file_store_path=eval_store,
-    )
-
-    return config
--- a/examples/external_repo_extension.py
+++ b/examples/external_repo_extension.py
@@ -1,217 +0,0 @@
-"""Example of how an external repository can extend OpenHands.
-
-This demonstrates the proper way for external repositories to build upon OpenHands
-without relying on environment variables or global state. The external repo can:
-
-1. Create its own FastAPI app with custom context
-2. Add its own routes and middleware
-3. Include OpenHands routes as needed
-4. Override specific behaviors through dependency injection
-
-This approach eliminates the need for environment variable configuration
-and allows clean separation between OpenHands core and extensions.
-"""
-
-from contextlib import asynccontextmanager
-from typing import AsyncIterator, Optional
-
-from fastapi import Depends, FastAPI, Request
-from fastapi.responses import JSONResponse
-
-from openhands.server.context.server_context import ServerContext
-from openhands.server.factory import create_openhands_app
-
-
-# Step 1: Create your custom ServerContext
-class ExternalRepoContext(ServerContext):
-    """Custom context for external repository with enterprise features."""
-    
-    def __init__(self, tenant_id: str = 'default', user_id: Optional[str] = None):
-        super().__init__()
-        self.tenant_id = tenant_id
-        self.user_id = user_id
-        self._custom_config = None
-    
-    def get_config(self):
-        """Override config with tenant-specific settings."""
-        config = super().get_config()
-        
-        # Add tenant-specific configuration
-        config.update({
-            'tenant_id': self.tenant_id,
-            'custom_storage_path': f'/data/tenants/{self.tenant_id}',
-            'custom_feature_flags': {
-                'enterprise_features': True,
-                'advanced_analytics': True,
-            }
-        })
-        
-        return config
-    
-    def get_server_config(self):
-        """Override server config for enterprise deployment."""
-        server_config = super().get_server_config()
-        
-        # Customize for enterprise
-        server_config.app_mode = 'ENTERPRISE'  # Custom app mode
-        server_config.enable_billing = True
-        server_config.hide_llm_settings = False
-        
-        return server_config
-    
-    def get_file_store(self):
-        """Use tenant-isolated file storage."""
-        # In a real implementation, this would return a tenant-aware file store
-        file_store = super().get_file_store()
-        # Customize file store for tenant isolation
-        return file_store
-
-
-# Step 2: Create your custom lifespan (optional)
-@asynccontextmanager
-async def external_repo_lifespan(app: FastAPI) -> AsyncIterator[None]:
-    """Custom lifespan for external repo initialization."""
-    print("🚀 Starting external repo services...")
-    
-    # Initialize your custom services here
-    # e.g., database connections, external API clients, etc.
-    
-    yield
-    
-    print("🛑 Shutting down external repo services...")
-    # Cleanup your custom services here
-
-
-# Step 3: Create context factory for your needs
-def create_external_context(tenant_id: str = 'default') -> ExternalRepoContext:
-    """Factory function to create context instances."""
-    return ExternalRepoContext(tenant_id=tenant_id)
-
-
-# Step 4: Create your FastAPI app with OpenHands integration
-def create_external_app() -> FastAPI:
-    """Create the external repository's FastAPI application."""
-    
-    # Option A: Create OpenHands app with your custom context
-    openhands_app = create_openhands_app(
-        context_factory=lambda: create_external_context(),
-        include_oss_routes=False,  # Skip OSS routes for enterprise
-        custom_lifespan=external_repo_lifespan,
-        title='My Enterprise Platform',
-        description='Enterprise platform built on OpenHands'
-    )
-    
-    # Option B: Create your own app and mount OpenHands
-    main_app = FastAPI(
-        title='My Enterprise Platform',
-        description='Enterprise platform with OpenHands integration',
-        version='1.0.0'
-    )
-    
-    # Add your custom routes
-    @main_app.get('/enterprise/status')
-    async def enterprise_status():
-        return {'status': 'running', 'mode': 'enterprise'}
-    
-    @main_app.get('/enterprise/tenant/{tenant_id}/info')
-    async def tenant_info(
-        tenant_id: str,
-        request: Request,
-        # Use dependency injection to get context
-        context: ServerContext = Depends(lambda r: create_external_context(tenant_id))
-    ):
-        config = context.get_config()
-        return {
-            'tenant_id': tenant_id,
-            'storage_path': config.get('custom_storage_path'),
-            'features': config.get('custom_feature_flags', {})
-        }
-    
-    # Add custom middleware
-    @main_app.middleware('http')
-    async def tenant_middleware(request: Request, call_next):
-        # Extract tenant from header or path
-        tenant_id = request.headers.get('X-Tenant-ID', 'default')
-        request.state.tenant_id = tenant_id
-        
-        response = await call_next(request)
-        response.headers['X-Tenant-ID'] = tenant_id
-        return response
-    
-    # Mount OpenHands app at a subpath
-    main_app.mount('/openhands', openhands_app)
-    
-    return main_app
-
-
-# Step 5: Alternative approach - extend OpenHands app directly
-def create_extended_openhands_app() -> FastAPI:
-    """Alternative: extend OpenHands app directly with custom routes."""
-    
-    app = create_openhands_app(
-        context_factory=lambda: create_external_context(),
-        custom_lifespan=external_repo_lifespan
-    )
-    
-    # Add your routes to the OpenHands app
-    @app.get('/api/enterprise/dashboard')
-    async def enterprise_dashboard(
-        request: Request,
-        context: ServerContext = Depends(lambda r: create_external_context())
-    ):
-        config = context.get_config()
-        return {
-            'dashboard_data': 'enterprise_metrics',
-            'tenant_features': config.get('custom_feature_flags', {})
-        }
-    
-    return app
-
-
-# Example usage in external repo's main.py
-if __name__ == '__main__':
-    import uvicorn
-    
-    # Choose your approach
-    app = create_external_app()  # Full custom app with OpenHands mounted
-    # app = create_extended_openhands_app()  # Extended OpenHands app
-    
-    # Run the server
-    uvicorn.run(
-        app,
-        host='0.0.0.0',
-        port=8000,
-        reload=True
-    )
-
-
-# Example of how to test the integration
-def test_external_integration():
-    """Test that the external integration works correctly."""
-    from fastapi.testclient import TestClient
-    
-    app = create_external_app()
-    client = TestClient(app)
-    
-    # Test custom routes
-    response = client.get('/enterprise/status')
-    assert response.status_code == 200
-    assert response.json()['mode'] == 'enterprise'
-    
-    # Test tenant-specific routes
-    response = client.get('/enterprise/tenant/acme-corp/info')
-    assert response.status_code == 200
-    data = response.json()
-    assert data['tenant_id'] == 'acme-corp'
-    assert 'enterprise_features' in data['features']
-    
-    # Test OpenHands routes still work
-    response = client.get('/openhands/api/health')
-    assert response.status_code == 200
-    
-    print("✅ All integration tests passed!")
-
-
-if __name__ == '__main__':
-    # Run tests
-    test_external_integration()
--- a/frontend/tests/components/features/home/repo-connector.test.tsx
+++ b/frontend/tests/components/features/home/repo-connector.test.tsx
@@ -54,14 +54,12 @@ const MOCK_RESPOSITORIES: GitRepository[] = [
    full_name: "rbren/polaris",
    git_provider: "github",
    is_public: true,
-    main_branch: "main",
  },
  {
    id: "2",
    full_name: "All-Hands-AI/OpenHands",
    git_provider: "github",
    is_public: true,
-    main_branch: "main",
  },
 ];

@@ -101,15 +99,16 @@ describe("RepoConnector", () => {

    // First select the provider
    const providerDropdown = await waitFor(() =>
-      screen.getByTestId("git-provider-dropdown"),
+      screen.getByText("Select Provider"),
    );
    await userEvent.click(providerDropdown);
-    await userEvent.click(screen.getByText("GitHub"));
+    await userEvent.click(screen.getByText("Github"));

    // Then interact with the repository dropdown
-    const repoInput = await waitFor(() =>
-      screen.getByTestId("git-repo-dropdown"),
+    const repoDropdown = await waitFor(() =>
+      screen.getByTestId("repo-dropdown"),
    );
+    const repoInput = within(repoDropdown).getByRole("combobox");
    await userEvent.click(repoInput);

    // Wait for the options to be loaded and displayed
@@ -135,23 +134,23 @@ describe("RepoConnector", () => {
    expect(launchButton).toBeDisabled();

    // Mock the repository branches API call
-    vi.spyOn(OpenHands, "getRepositoryBranches").mockResolvedValue({ branches: [
+    vi.spyOn(OpenHands, "getRepositoryBranches").mockResolvedValue([
      { name: "main", commit_sha: "123", protected: false },
      { name: "develop", commit_sha: "456", protected: false },
-    ], has_next_page: false, current_page: 1, per_page: 30, total_count: 2 });
+    ]);

    // First select the provider
    const providerDropdown = await waitFor(() =>
-      screen.getByTestId("git-provider-dropdown"),
+      screen.getByText("Select Provider"),
    );
    await userEvent.click(providerDropdown);
-    await userEvent.click(screen.getByText("GitHub"));
+    await userEvent.click(screen.getByText("Github"));

    // Then select the repository
-    const repoInput = await waitFor(() =>
-      screen.getByTestId("git-repo-dropdown"),
+    const repoDropdown = await waitFor(() =>
+      screen.getByTestId("repo-dropdown"),
    );
-
+    const repoInput = within(repoDropdown).getByRole("combobox");
    await userEvent.click(repoInput);

    // Wait for the options to be loaded and displayed
@@ -162,8 +161,7 @@ describe("RepoConnector", () => {

    // Wait for the branch to be auto-selected
    await waitFor(() => {
-      const branchInput = screen.getByTestId("git-branch-dropdown-input");
-      expect(branchInput).toHaveValue("main");
+      expect(screen.getByText("main")).toBeInTheDocument();
    });

    expect(launchButton).toBeEnabled();
@@ -226,19 +224,6 @@ describe("RepoConnector", () => {

  it("should create a conversation and redirect with the selected repo when pressing the launch button", async () => {
    const createConversationSpy = vi.spyOn(OpenHands, "createConversation");
-    createConversationSpy.mockResolvedValue({
-      conversation_id: "mock-conversation-id",
-      title: "Test Conversation",
-      selected_repository: "user/repo1",
-      selected_branch: "main",
-      git_provider: "github",
-      last_updated_at: "2023-01-01T00:00:00Z",
-      created_at: "2023-01-01T00:00:00Z",
-      status: "STARTING",
-      runtime_status: null,
-      url: null,
-      session_api_key: null,
-    });
    const retrieveUserGitRepositoriesSpy = vi.spyOn(
      OpenHands,
      "retrieveUserGitRepositories",
@@ -259,23 +244,23 @@ describe("RepoConnector", () => {
    expect(createConversationSpy).not.toHaveBeenCalled();

    // Mock the repository branches API call
-    vi.spyOn(OpenHands, "getRepositoryBranches").mockResolvedValue({ branches: [
+    vi.spyOn(OpenHands, "getRepositoryBranches").mockResolvedValue([
      { name: "main", commit_sha: "123", protected: false },
      { name: "develop", commit_sha: "456", protected: false },
-    ], has_next_page: false, current_page: 1, per_page: 30, total_count: 2 });
+    ]);

    // First select the provider
    const providerDropdown = await waitFor(() =>
-      screen.getByTestId("git-provider-dropdown"),
+      screen.getByText("Select Provider"),
    );
    await userEvent.click(providerDropdown);
-    await userEvent.click(screen.getByText("GitHub"));
+    await userEvent.click(screen.getByText("Github"));

    // Then select the repository
-    const repoInput = await waitFor(() =>
-      within(repoConnector).getByTestId("git-repo-dropdown"),
+    const repoDropdown = await waitFor(() =>
+      within(repoConnector).getByTestId("repo-dropdown"),
    );
-
+    const repoInput = within(repoDropdown).getByRole("combobox");
    await userEvent.click(repoInput);

    // Wait for the options to be loaded and displayed
@@ -286,8 +271,7 @@ describe("RepoConnector", () => {

    // Wait for the branch to be auto-selected
    await waitFor(() => {
-      const branchInput = screen.getByTestId("git-branch-dropdown-input");
-      expect(branchInput).toHaveValue("main");
+      expect(screen.getByText("main")).toBeInTheDocument();
    });

    await userEvent.click(launchButton);
@@ -304,8 +288,6 @@ describe("RepoConnector", () => {
  });

  it("should change the launch button text to 'Loading...' when creating a conversation", async () => {
-    const createConversationSpy = vi.spyOn(OpenHands, "createConversation");
-    createConversationSpy.mockImplementation(() => new Promise(() => {})); // Never resolves to keep loading state
    const retrieveUserGitRepositoriesSpy = vi.spyOn(
      OpenHands,
      "retrieveUserGitRepositories",
@@ -316,10 +298,10 @@ describe("RepoConnector", () => {
    });

    // Mock the repository branches API call
-    vi.spyOn(OpenHands, "getRepositoryBranches").mockResolvedValue({ branches: [
+    vi.spyOn(OpenHands, "getRepositoryBranches").mockResolvedValue([
      { name: "main", commit_sha: "123", protected: false },
      { name: "develop", commit_sha: "456", protected: false },
-    ], has_next_page: false, current_page: 1, per_page: 30, total_count: 2 });
+    ]);

    renderRepoConnector();

@@ -327,16 +309,16 @@ describe("RepoConnector", () => {

    // First select the provider
    const providerDropdown = await waitFor(() =>
-      screen.getByTestId("git-provider-dropdown"),
+      screen.getByText("Select Provider"),
    );
    await userEvent.click(providerDropdown);
-    await userEvent.click(screen.getByText("GitHub"));
+    await userEvent.click(screen.getByText("Github"));

    // Then select the repository
-    const repoInput = await waitFor(() =>
-      screen.getByTestId("git-repo-dropdown"),
+    const repoDropdown = await waitFor(() =>
+      screen.getByTestId("repo-dropdown"),
    );
-
+    const repoInput = within(repoDropdown).getByRole("combobox");
    await userEvent.click(repoInput);

    // Wait for the options to be loaded and displayed
@@ -347,8 +329,7 @@ describe("RepoConnector", () => {

    // Wait for the branch to be auto-selected
    await waitFor(() => {
-      const branchInput = screen.getByTestId("git-branch-dropdown-input");
-      expect(branchInput).toHaveValue("main");
+      expect(screen.getByText("main")).toBeInTheDocument();
    });

    await userEvent.click(launchButton);
@@ -377,7 +358,7 @@ describe("RepoConnector", () => {
    const goToSettingsButton = await screen.findByTestId(
      "navigate-to-settings-button",
    );
-    const dropdown = screen.queryByTestId("git-repo-dropdown");
+    const dropdown = screen.queryByTestId("repo-dropdown");
    const launchButton = screen.queryByTestId("repo-launch-button");
    const providerLinks = screen.queryAllByText(/add git(hub|lab) repos/i);

--- a/frontend/tests/components/features/home/repo-selection-form.test.tsx
+++ b/frontend/tests/components/features/home/repo-selection-form.test.tsx
@@ -151,7 +151,7 @@ describe("RepositorySelectionForm", () => {
    });

    renderForm();
-    expect(await screen.findByTestId("git-repo-dropdown")).toBeInTheDocument();
+    expect(await screen.findByTestId("repo-dropdown")).toBeInTheDocument();
  });

  it("shows error message when repository fetch fails", async () => {
@@ -168,10 +168,10 @@ describe("RepositorySelectionForm", () => {
    renderForm();

    expect(
-      await screen.findByTestId("dropdown-error"),
+      await screen.findByTestId("repo-dropdown-error"),
    ).toBeInTheDocument();
    expect(
-      screen.getByText("Failed to load data"),
+      screen.getByText("HOME$FAILED_TO_LOAD_REPOSITORIES"),
    ).toBeInTheDocument();
  });

@@ -231,13 +231,14 @@ describe("RepositorySelectionForm", () => {

    renderForm();

-    const input = await screen.findByTestId("git-repo-dropdown");
+    const dropdown = await screen.findByTestId("repo-dropdown");
+    const input = dropdown.querySelector('input[type="text"]') as HTMLInputElement;
+    expect(input).toBeInTheDocument();

    await userEvent.type(input, "https://github.com/kubernetes/kubernetes");
    expect(searchGitReposSpy).toHaveBeenLastCalledWith(
      "kubernetes/kubernetes",
      3,
-      "github",
    );
  });

@@ -266,13 +267,14 @@ describe("RepositorySelectionForm", () => {

    renderForm();

-    const input = await screen.findByTestId("git-repo-dropdown");
+    const dropdown = await screen.findByTestId("repo-dropdown");
+    const input = dropdown.querySelector('input[type="text"]') as HTMLInputElement;
+    expect(input).toBeInTheDocument();

    await userEvent.type(input, "https://github.com/kubernetes/kubernetes");
    expect(searchGitReposSpy).toHaveBeenLastCalledWith(
      "kubernetes/kubernetes",
      3,
-      "github",
    );
  });
 });
--- a/frontend/tests/components/features/microagent-management/microagent-management.test.tsx
+++ b/frontend/tests/components/features/microagent-management/microagent-management.test.tsx
--- a/frontend/tests/routes/home-screen.test.tsx
+++ b/frontend/tests/routes/home-screen.test.tsx
@@ -37,27 +37,34 @@ const selectRepository = async (repoName: string) => {

  // First select the provider
  const providerDropdown = await waitFor(() =>
-    screen.getByTestId("git-provider-dropdown"),
+    screen.getByText("Select Provider"),
  );
  await userEvent.click(providerDropdown);
-  await userEvent.click(screen.getByText("GitHub"));
+  await userEvent.click(screen.getByText("Github"));

  // Then select the repository
-  const repoInput = within(repoConnector).getByTestId("git-repo-dropdown");
+  const dropdown = within(repoConnector).getByTestId("repo-dropdown");
+  const repoInput = within(dropdown).getByRole("combobox");
  await userEvent.click(repoInput);

  // Wait for the options to be loaded and displayed
  await waitFor(() => {
-    const dropdownMenu = screen.getByTestId("git-repo-dropdown-menu");
-    expect(within(dropdownMenu).getByText(repoName)).toBeInTheDocument();
+    const options = screen.getAllByText(repoName);
+    // Find the option in the dropdown (it will have role="option")
+    const dropdownOption = options.find(
+      (el) => el.getAttribute("role") === "option",
+    );
+    expect(dropdownOption).toBeInTheDocument();
  });
-  const dropdownMenu = screen.getByTestId("git-repo-dropdown-menu");
-  await userEvent.click(within(dropdownMenu).getByText(repoName));
+  const options = screen.getAllByText(repoName);
+  const dropdownOption = options.find(
+    (el) => el.getAttribute("role") === "option",
+  );
+  await userEvent.click(dropdownOption!);

  // Wait for the branch to be auto-selected
  await waitFor(() => {
-    const branchInput = screen.getByTestId("git-branch-dropdown-input");
-    expect(branchInput).toHaveValue("main");
+    expect(screen.getByText("main")).toBeInTheDocument();
  });
 };

@@ -78,14 +85,12 @@ const MOCK_RESPOSITORIES: GitRepository[] = [
    full_name: "octocat/hello-world",
    git_provider: "github",
    is_public: true,
-    main_branch: "main",
  },
  {
    id: "2",
    full_name: "octocat/earth",
    git_provider: "github",
    is_public: true,
-    main_branch: "main",
  },
 ];

@@ -135,10 +140,10 @@ describe("HomeScreen", () => {
        await screen.findAllByTestId("task-launch-button");

      // Mock the repository branches API call
-      vi.spyOn(OpenHands, "getRepositoryBranches").mockResolvedValue({ branches: [
+      vi.spyOn(OpenHands, "getRepositoryBranches").mockResolvedValue([
        { name: "main", commit_sha: "123", protected: false },
        { name: "develop", commit_sha: "456", protected: false },
-      ], has_next_page: false, current_page: 1, per_page: 30, total_count: 2 });
+      ]);

      // Select a repository to enable the repo launch button
      await selectRepository("octocat/hello-world");
--- a/frontend/tests/routes/llm-settings.test.tsx
+++ b/frontend/tests/routes/llm-settings.test.tsx
@@ -79,35 +79,6 @@ describe("Content", () => {
        expect(screen.getByTestId("set-indicator")).toBeInTheDocument();
      });
    });
-
-    it("should conditionally show security analyzer based on confirmation mode", async () => {
-      renderLlmSettingsScreen();
-      await screen.findByTestId("llm-settings-screen");
-
-      const confirmation = screen.getByTestId("enable-confirmation-mode-switch");
-
-      // Initially confirmation mode is false, so security analyzer should not be visible
-      expect(confirmation).not.toBeChecked();
-      expect(
-        screen.queryByTestId("security-analyzer-input"),
-      ).not.toBeInTheDocument();
-
-      // Enable confirmation mode
-      await userEvent.click(confirmation);
-      expect(confirmation).toBeChecked();
-
-      // Security analyzer should now be visible
-      screen.getByTestId("security-analyzer-input");
-
-      // Disable confirmation mode again
-      await userEvent.click(confirmation);
-      expect(confirmation).not.toBeChecked();
-
-      // Security analyzer should be hidden again
-      expect(
-        screen.queryByTestId("security-analyzer-input"),
-      ).not.toBeInTheDocument();
-    });
  });

  describe("Advanced form", () => {
@@ -136,6 +107,7 @@ describe("Content", () => {
      within(advancedForm).getByTestId("llm-api-key-input");
      within(advancedForm).getByTestId("llm-api-key-help-anchor-advanced");
      within(advancedForm).getByTestId("agent-input");
+      within(advancedForm).getByTestId("enable-confirmation-mode-switch");
      within(advancedForm).getByTestId("enable-memory-condenser-switch");

      await userEvent.click(advancedSwitch);
@@ -158,6 +130,9 @@ describe("Content", () => {
      const baseUrl = screen.getByTestId("base-url-input");
      const apiKey = screen.getByTestId("llm-api-key-input");
      const agent = screen.getByTestId("agent-input");
+      const confirmation = screen.getByTestId(
+        "enable-confirmation-mode-switch",
+      );
      const condensor = screen.getByTestId("enable-memory-condenser-switch");

      expect(model).toHaveValue("openhands/claude-sonnet-4-20250514");
@@ -165,7 +140,15 @@ describe("Content", () => {
      expect(apiKey).toHaveValue("");
      expect(apiKey).toHaveProperty("placeholder", "");
      expect(agent).toHaveValue("CodeActAgent");
+      expect(confirmation).not.toBeChecked();
      expect(condensor).toBeChecked();
+
+      // check that security analyzer is present
+      expect(
+        screen.queryByTestId("security-analyzer-input"),
+      ).not.toBeInTheDocument();
+      await userEvent.click(confirmation);
+      screen.getByTestId("security-analyzer-input");
    });

    it("should render the advanced form if existings settings are advanced", async () => {
@@ -194,7 +177,7 @@ describe("Content", () => {
        agent: "CoActAgent",
        confirmation_mode: true,
        enable_default_condenser: false,
-        security_analyzer: "none",
+        security_analyzer: "mock-invariant",
      });

      renderLlmSettingsScreen();
@@ -220,7 +203,7 @@ describe("Content", () => {
        expect(agent).toHaveValue("CoActAgent");
        expect(confirmation).toBeChecked();
        expect(condensor).not.toBeChecked();
-        expect(securityAnalyzer).toHaveValue("SETTINGS$SECURITY_ANALYZER_NONE");
+        expect(securityAnalyzer).toHaveValue("mock-invariant");
      });
    });
  });
@@ -310,7 +293,7 @@ describe("Form submission", () => {
    // select security analyzer
    const securityAnalyzer = screen.getByTestId("security-analyzer-input");
    await userEvent.click(securityAnalyzer);
-    const securityAnalyzerOption = screen.getByText("SETTINGS$SECURITY_ANALYZER_NONE");
+    const securityAnalyzerOption = screen.getByText("mock-invariant");
    await userEvent.click(securityAnalyzerOption);

    const submitButton = screen.getByTestId("submit-button");
@@ -323,7 +306,7 @@ describe("Form submission", () => {
        agent: "CoActAgent",
        confirmation_mode: true,
        enable_default_condenser: false,
-        security_analyzer: null,
+        security_analyzer: "mock-invariant",
      }),
    );
  });
@@ -392,10 +375,8 @@ describe("Form submission", () => {
    const baseUrl = await screen.findByTestId("base-url-input");
    const apiKey = await screen.findByTestId("llm-api-key-input");
    const agent = await screen.findByTestId("agent-input");
-    const condensor = await screen.findByTestId("enable-memory-condenser-switch");
-
-    // Confirmation mode switch is now in basic settings, always visible
    const confirmation = await screen.findByTestId("enable-confirmation-mode-switch");
+    const condensor = await screen.findByTestId("enable-memory-condenser-switch");

    // enter custom model
    await userEvent.type(model, "-mini");
@@ -470,17 +451,14 @@ describe("Form submission", () => {
    // select security analyzer
    const securityAnalyzer = await screen.findByTestId("security-analyzer-input");
    await userEvent.click(securityAnalyzer);
-    const securityAnalyzerOption = screen.getByText("SETTINGS$SECURITY_ANALYZER_NONE");
+    const securityAnalyzerOption = screen.getByText("mock-invariant");
    await userEvent.click(securityAnalyzerOption);
-    expect(securityAnalyzer).toHaveValue("SETTINGS$SECURITY_ANALYZER_NONE");
+    expect(securityAnalyzer).toHaveValue("mock-invariant");

    expect(submitButton).not.toBeDisabled();

-    // revert back to original value
-    await userEvent.click(securityAnalyzer);
-    const originalSecurityAnalyzerOption = screen.getByText("SETTINGS$SECURITY_ANALYZER_LLM_DEFAULT");
-    await userEvent.click(originalSecurityAnalyzerOption);
-    expect(securityAnalyzer).toHaveValue("SETTINGS$SECURITY_ANALYZER_LLM_DEFAULT");
+    await userEvent.clear(securityAnalyzer);
+    expect(securityAnalyzer).toHaveValue("");
    expect(submitButton).toBeDisabled();
  });

@@ -574,7 +552,7 @@ describe("Form submission", () => {
      expect.objectContaining({
        llm_model: "openhands/claude-sonnet-4-20250514",
        llm_base_url: "",
-        confirmation_mode: true, // Confirmation mode is now a basic setting, should be preserved
+        confirmation_mode: false,
      }),
    );
  });
--- a/frontend/tests/routes/secrets-settings.test.tsx
+++ b/frontend/tests/routes/secrets-settings.test.tsx
@@ -107,7 +107,9 @@ describe("Content", () => {
      expect(screen.queryByTestId("add-secret-button")).not.toBeInTheDocument(),
    );
    const button = await screen.findByTestId("connect-git-button");
-    expect(button).toHaveAttribute("href", "/settings/integrations");
+    await userEvent.click(button);
+
+    screen.getByTestId("git-settings-screen");
  });

  it("should render an empty table when there are no existing secrets", async () => {
--- a/frontend/tests/routes/settings.test.tsx
+++ b/frontend/tests/routes/settings.test.tsx
@@ -136,7 +136,7 @@ describe("Settings Screen", () => {
      "secrets",
      "api keys",
    ];
-    const sectionsToExclude = ["llm"];
+    const sectionsToExclude = ["llm", "mcp"];

    renderSettingsScreen();

--- a/frontend/tests/utils/has-advanced-settings-set.test.ts
+++ b/frontend/tests/utils/has-advanced-settings-set.test.ts
@@ -29,5 +29,23 @@ describe("hasAdvancedSettingsSet", () => {
        }),
      ).toBe(true);
    });
+
+    test("CONFIRMATION_MODE is true", () => {
+      expect(
+        hasAdvancedSettingsSet({
+          ...DEFAULT_SETTINGS,
+          CONFIRMATION_MODE: true,
+        }),
+      ).toBe(true);
+    });
+
+    test("SECURITY_ANALYZER is set", () => {
+      expect(
+        hasAdvancedSettingsSet({
+          ...DEFAULT_SETTINGS,
+          SECURITY_ANALYZER: "test",
+        }),
+      ).toBe(true);
+    });
  });
 });
--- a/frontend/package-lock.json
+++ b/frontend/package-lock.json
--- a/frontend/package.json
+++ b/frontend/package.json
@@ -1,6 +1,6 @@
 {
  "name": "openhands-frontend",
-  "version": "0.55.0",
+  "version": "0.53.0",
  "private": true,
  "type": "module",
  "engines": {
@@ -11,50 +11,50 @@
    "@heroui/use-infinite-scroll": "^2.2.10",
    "@microlink/react-json-view": "^1.26.2",
    "@monaco-editor/react": "^4.7.0-rc.0",
-    "@react-router/node": "^7.8.2",
-    "@react-router/serve": "^7.8.2",
-    "@react-types/shared": "^3.32.0",
+    "@react-router/node": "^7.8.0",
+    "@react-router/serve": "^7.8.0",
+    "@react-types/shared": "^3.31.0",
    "@reduxjs/toolkit": "^2.8.2",
-    "@stripe/react-stripe-js": "^3.9.2",
-    "@stripe/stripe-js": "^7.9.0",
+    "@stripe/react-stripe-js": "^3.9.0",
+    "@stripe/stripe-js": "^7.8.0",
    "@tailwindcss/postcss": "^4.1.12",
    "@tailwindcss/vite": "^4.1.12",
-    "@tanstack/react-query": "^5.85.5",
+    "@tanstack/react-query": "^5.85.3",
    "@uidotdev/usehooks": "^2.4.1",
-    "@vitejs/plugin-react": "^5.0.2",
+    "@vitejs/plugin-react": "^5.0.0",
    "@xterm/addon-fit": "^0.10.0",
    "@xterm/xterm": "^5.4.0",
    "axios": "^1.11.0",
    "clsx": "^2.1.1",
    "date-fns": "^4.1.0",
-    "downshift": "^9.0.10",
    "eslint-config-airbnb-typescript": "^18.0.0",
    "framer-motion": "^12.23.12",
-    "i18next": "^25.4.2",
+    "i18next": "^25.3.6",
    "i18next-browser-languagedetector": "^8.2.0",
    "i18next-http-backend": "^3.0.2",
-    "isbot": "^5.1.30",
-    "jose": "^6.1.0",
-    "lucide-react": "^0.542.0",
+    "isbot": "^5.1.29",
+    "jose": "^6.0.12",
+    "lucide-react": "^0.539.0",
    "monaco-editor": "^0.52.2",
-    "posthog-js": "^1.261.0",
+    "posthog-js": "^1.260.1",
    "react": "^19.1.1",
    "react-dom": "^19.1.1",
    "react-highlight": "^0.15.0",
-    "react-hot-toast": "^2.6.0",
-    "react-i18next": "^15.7.2",
+    "react-hot-toast": "^2.5.1",
+    "react-i18next": "^15.6.1",
    "react-icons": "^5.5.0",
    "react-markdown": "^10.1.0",
    "react-redux": "^9.2.0",
-    "react-router": "^7.8.2",
-    "react-syntax-highlighter": "^15.6.6",
+    "react-router": "^7.8.0",
+    "react-select": "^5.10.2",
+    "react-syntax-highlighter": "^15.6.1",
    "react-textarea-autosize": "^8.5.9",
    "remark-breaks": "^4.0.0",
    "remark-gfm": "^4.0.1",
    "sirv-cli": "^3.0.1",
    "socket.io-client": "^4.8.1",
    "tailwind-merge": "^3.3.1",
-    "vite": "^7.1.3",
+    "vite": "^7.1.1",
    "web-vitals": "^5.1.0",
    "ws": "^8.18.2"
  },
@@ -88,17 +88,17 @@
    "@babel/traverse": "^7.28.3",
    "@babel/types": "^7.28.2",
    "@mswjs/socket.io-binding": "^0.2.0",
-    "@playwright/test": "^1.55.0",
-    "@react-router/dev": "^7.8.2",
+    "@playwright/test": "^1.54.2",
+    "@react-router/dev": "^7.8.0",
    "@tailwindcss/typography": "^0.5.16",
    "@tanstack/eslint-plugin-query": "^5.83.1",
    "@testing-library/dom": "^10.4.1",
-    "@testing-library/jest-dom": "^6.8.0",
+    "@testing-library/jest-dom": "^6.7.0",
    "@testing-library/react": "^16.3.0",
    "@testing-library/user-event": "^14.6.1",
-    "@types/node": "^24.3.0",
-    "@types/react": "^19.1.12",
-    "@types/react-dom": "^19.1.9",
+    "@types/node": "^24.2.0",
+    "@types/react": "^19.1.9",
+    "@types/react-dom": "^19.1.7",
    "@types/react-highlight": "^0.12.8",
    "@types/react-syntax-highlighter": "^15.5.13",
    "@types/ws": "^8.18.1",
@@ -117,16 +117,16 @@
    "eslint-plugin-prettier": "^5.5.4",
    "eslint-plugin-react": "^7.37.5",
    "eslint-plugin-react-hooks": "^4.6.2",
-    "eslint-plugin-unused-imports": "^4.2.0",
+    "eslint-plugin-unused-imports": "^4.1.4",
    "husky": "^9.1.7",
    "jsdom": "^26.1.0",
    "lint-staged": "^16.1.4",
    "msw": "^2.6.6",
    "prettier": "^3.6.2",
-    "stripe": "^18.5.0",
+    "stripe": "^18.4.0",
    "tailwindcss": "^4.1.8",
    "typescript": "^5.9.2",
-    "vite-plugin-svgr": "^4.5.0",
+    "vite-plugin-svgr": "^4.2.0",
    "vite-tsconfig-paths": "^5.1.4",
    "vitest": "^3.0.2"
  },
--- a/frontend/src/api/open-hands.ts
+++ b/frontend/src/api/open-hands.ts
@@ -21,17 +21,11 @@ import {
 } from "./open-hands.types";
 import { openHands } from "./open-hands-axios";
 import { ApiSettings, PostApiSettings, Provider } from "#/types/settings";
-import {
-  GitUser,
-  GitRepository,
-  PaginatedBranchesResponse,
-  Branch,
-} from "#/types/git";
+import { GitUser, GitRepository, Branch } from "#/types/git";
 import { SuggestedTask } from "#/components/features/home/tasks/task.types";
 import { extractNextPageFromLink } from "#/utils/extract-next-page-from-link";
 import { RepositoryMicroagent } from "#/types/microagent-management";
 import { BatchFeedbackData } from "#/hooks/query/use-batch-feedback";
-import { SubscriptionAccess } from "#/types/billing";

 class OpenHands {
  private static currentConversation: Conversation | null = null;
@@ -434,13 +428,6 @@ class OpenHands {
    return data.credits;
  }

-  static async getSubscriptionAccess(): Promise<SubscriptionAccess | null> {
-    const { data } = await openHands.get<SubscriptionAccess | null>(
-      "/api/billing/subscription-access",
-    );
-    return data;
-  }
-
  static async getGitUser(): Promise<GitUser> {
    const response = await openHands.get<GitUser>("/api/user/info");

@@ -580,35 +567,11 @@ class OpenHands {
    };
  }

-  static async getRepositoryBranches(
-    repository: string,
-    page: number = 1,
-    perPage: number = 30,
-  ): Promise<PaginatedBranchesResponse> {
-    const { data } = await openHands.get<PaginatedBranchesResponse>(
-      `/api/user/repository/branches?repository=${encodeURIComponent(repository)}&page=${page}&per_page=${perPage}`,
-    );
-
-    return data;
-  }
-
-  static async searchRepositoryBranches(
-    repository: string,
-    query: string,
-    perPage: number = 30,
-    selectedProvider?: Provider,
-  ): Promise<Branch[]> {
+  static async getRepositoryBranches(repository: string): Promise<Branch[]> {
    const { data } = await openHands.get<Branch[]>(
-      `/api/user/search/branches`,
-      {
-        params: {
-          repository,
-          query,
-          per_page: perPage,
-          selected_provider: selectedProvider,
-        },
-      },
+      `/api/user/repository/branches?repository=${encodeURIComponent(repository)}`,
    );
+
    return data;
  }

@@ -763,27 +726,6 @@ class OpenHands {
    );
    return data;
  }
-
-  static async getMicroagentManagementConversations(
-    selectedRepository: string,
-    pageId?: string,
-    limit: number = 100,
-  ): Promise<Conversation[]> {
-    const params: Record<string, string | number> = {
-      limit,
-      selected_repository: selectedRepository,
-    };
-
-    if (pageId) {
-      params.page_id = pageId;
-    }
-
-    const { data } = await openHands.get<ResultSet<Conversation>>(
-      "/api/microagent-management/conversations",
-      { params },
-    );
-    return data.results;
-  }
 }

 export default OpenHands;
--- a/frontend/src/api/open-hands.types.ts
+++ b/frontend/src/api/open-hands.types.ts
@@ -49,11 +49,13 @@ export interface GetConfigResponse {
  APP_SLUG?: string;
  GITHUB_CLIENT_ID: string;
  POSTHOG_CLIENT_KEY: string;
+  STRIPE_PUBLISHABLE_KEY?: string;
  PROVIDERS_CONFIGURED?: Provider[];
  AUTH_URL?: string;
  FEATURE_FLAGS: {
    ENABLE_BILLING: boolean;
    HIDE_LLM_SETTINGS: boolean;
+    HIDE_MICROAGENT_MANAGEMENT?: boolean;
    ENABLE_JIRA: boolean;
    ENABLE_JIRA_DC: boolean;
    ENABLE_LINEAR: boolean;
--- a/frontend/src/components/common/git-branch-dropdown.tsx
+++ b/frontend/src/components/common/git-branch-dropdown.tsx
@@ -0,0 +1,69 @@
+import { useMemo } from "react";
+import { useRepositoryBranches } from "../../hooks/query/use-repository-branches";
+import { ReactSelectDropdown, SelectOption } from "./react-select-dropdown";
+
+export interface GitBranchDropdownProps {
+  repositoryName?: string | null;
+  value?: string | null;
+  placeholder?: string;
+  className?: string;
+  errorMessage?: string;
+  disabled?: boolean;
+  onChange?: (branchName: string | null) => void;
+}
+
+export function GitBranchDropdown({
+  repositoryName,
+  value,
+  placeholder = "Select branch...",
+  className,
+  errorMessage,
+  disabled = false,
+  onChange,
+}: GitBranchDropdownProps) {
+  const { data: branches, isLoading } = useRepositoryBranches(
+    repositoryName || null,
+  );
+
+  const options: SelectOption[] = useMemo(
+    () =>
+      branches?.map((branch) => ({
+        value: branch.name,
+        label: branch.name,
+      })) || [],
+    [branches],
+  );
+
+  const hasNoBranches = !isLoading && branches && branches.length === 0;
+
+  const selectedOption = useMemo(
+    () => options.find((option) => option.value === value) || null,
+    [options, value],
+  );
+
+  const handleChange = (option: SelectOption | null) => {
+    onChange?.(option?.value || null);
+  };
+
+  const isDisabled = disabled || !repositoryName || isLoading || hasNoBranches;
+
+  const displayPlaceholder = hasNoBranches ? "No branches found" : placeholder;
+  const displayErrorMessage = hasNoBranches
+    ? "This repository has no branches"
+    : errorMessage;
+
+  return (
+    <ReactSelectDropdown
+      options={options}
+      value={selectedOption}
+      placeholder={displayPlaceholder}
+      className={className}
+      errorMessage={displayErrorMessage}
+      disabled={isDisabled}
+      isClearable={false}
+      isSearchable
+      isLoading={isLoading}
+      onChange={handleChange}
+    />
+  );
+}
--- a/frontend/src/components/common/git-provider-dropdown.tsx
+++ b/frontend/src/components/common/git-provider-dropdown.tsx
@@ -0,0 +1,58 @@
+import { useMemo } from "react";
+import { Provider } from "../../types/settings";
+import { ReactSelectDropdown, SelectOption } from "./react-select-dropdown";
+
+export interface GitProviderDropdownProps {
+  providers: Provider[];
+  value?: Provider | null;
+  placeholder?: string;
+  className?: string;
+  errorMessage?: string;
+  disabled?: boolean;
+  isLoading?: boolean;
+  onChange?: (provider: Provider | null) => void;
+}
+
+export function GitProviderDropdown({
+  providers,
+  value,
+  placeholder = "Select Provider",
+  className,
+  errorMessage,
+  disabled = false,
+  isLoading = false,
+  onChange,
+}: GitProviderDropdownProps) {
+  const options: SelectOption[] = useMemo(
+    () =>
+      providers.map((provider) => ({
+        value: provider,
+        label: provider.charAt(0).toUpperCase() + provider.slice(1),
+      })),
+    [providers],
+  );
+
+  const selectedOption = useMemo(
+    () => options.find((option) => option.value === value) || null,
+    [options, value],
+  );
+
+  const handleChange = (option: SelectOption | null) => {
+    onChange?.(option?.value as Provider | null);
+  };
+
+  return (
+    <ReactSelectDropdown
+      options={options}
+      value={selectedOption}
+      placeholder={placeholder}
+      className={className}
+      errorMessage={errorMessage}
+      disabled={disabled}
+      isClearable={false}
+      isSearchable={false}
+      isLoading={isLoading}
+      onChange={handleChange}
+    />
+  );
+}
--- a/frontend/src/components/common/git-repository-dropdown.tsx
+++ b/frontend/src/components/common/git-repository-dropdown.tsx
@@ -0,0 +1,201 @@
+import { useCallback, useMemo, useRef } from "react";
+import { useTranslation } from "react-i18next";
+import { Provider } from "../../types/settings";
+import { useGitRepositories } from "../../hooks/query/use-git-repositories";
+import OpenHands from "../../api/open-hands";
+import { GitRepository } from "../../types/git";
+import {
+  ReactSelectAsyncDropdown,
+  AsyncSelectOption,
+} from "./react-select-async-dropdown";
+
+export interface GitRepositoryDropdownProps {
+  provider: Provider;
+  value?: string | null;
+  placeholder?: string;
+  className?: string;
+  errorMessage?: string;
+  disabled?: boolean;
+  onChange?: (repository?: GitRepository) => void;
+}
+
+interface SearchCache {
+  [key: string]: GitRepository[];
+}
+
+export function GitRepositoryDropdown({
+  provider,
+  value,
+  placeholder = "Search repositories...",
+  className,
+  errorMessage,
+  disabled = false,
+  onChange,
+}: GitRepositoryDropdownProps) {
+  const { t } = useTranslation();
+  const {
+    data,
+    fetchNextPage,
+    hasNextPage,
+    isLoading,
+    isFetchingNextPage,
+    isError,
+  } = useGitRepositories({
+    provider,
+    enabled: !disabled,
+  });
+
+  const allOptions: AsyncSelectOption[] = useMemo(
+    () =>
+      data?.pages
+        ? data.pages.flatMap((page) =>
+            page.data.map((repo) => ({
+              value: repo.id,
+              label: repo.full_name,
+            })),
+          )
+        : [],
+    [data],
+  );
+
+  // Keep track of search results
+  const searchCache = useRef<SearchCache>({});
+
+  const selectedOption = useMemo(() => {
+    // First check in loaded pages
+    const option = allOptions.find((opt) => opt.value === value);
+    if (option) return option;
+
+    // If not found, check in search cache
+    const repo = Object.values(searchCache.current)
+      .flat()
+      .find((r) => r.id === value);
+
+    if (repo) {
+      return {
+        value: repo.id,
+        label: repo.full_name,
+      };
+    }
+
+    return null;
+  }, [allOptions, value]);
+
+  const loadOptions = useCallback(
+    async (inputValue: string): Promise<AsyncSelectOption[]> => {
+      // If empty input, show all loaded options
+      if (!inputValue.trim()) {
+        return allOptions;
+      }
+
+      // If it looks like a URL, pass the full URL to the API along with the provider
+      if (inputValue.startsWith("https://")) {
+        try {
+          const searchResults = await OpenHands.searchGitRepositories(
+            inputValue,
+            3,
+            provider,
+          );
+          // Cache by URL to preserve mapping
+          searchCache.current[inputValue] = searchResults;
+          return searchResults.map((repo) => ({
+            value: repo.id,
+            label: repo.full_name,
+          }));
+        } catch (_) {
+          // Fallback: attempt with extracted path if server doesn't support URL search
+          const match = inputValue.match(/https:\/\/[^/]+\/([^/]+\/[^/]+)/);
+          if (match) {
+            const repoName = match[1];
+            const searchResults = await OpenHands.searchGitRepositories(
+              repoName,
+              3,
+              provider,
+            );
+            searchCache.current[repoName] = searchResults;
+            return searchResults.map((repo) => ({
+              value: repo.id,
+              label: repo.full_name,
+            }));
+          }
+        }
+      }
+
+      // For any other input, search via API for the selected provider
+      if (inputValue.length >= 2) {
+        const searchResults = await OpenHands.searchGitRepositories(
+          inputValue,
+          10,
+          provider,
+        );
+        // Cache the search results
+        searchCache.current[inputValue] = searchResults;
+        return searchResults.map((repo) => ({
+          value: repo.id,
+          label: repo.full_name,
+        }));
+      }
+
+      // For very short inputs, do local filtering
+      return allOptions.filter((option) =>
+        option.label.toLowerCase().includes(inputValue.toLowerCase()),
+      );
+    },
+    [allOptions, provider],
+  );
+
+  const handleChange = (option: AsyncSelectOption | null) => {
+    if (!option) {
+      onChange?.(undefined);
+      return;
+    }
+
+    // First check in loaded pages
+    let repo = data?.pages
+      ?.flatMap((p) => p.data)
+      .find((r) => r.id === option.value);
+
+    // If not found, check in search results
+    if (!repo) {
+      repo = Object.values(searchCache.current)
+        .flat()
+        .find((r) => r.id === option.value);
+    }
+
+    onChange?.(repo);
+  };
+
+  const handleMenuScrollToBottom = useCallback(() => {
+    if (hasNextPage && !isFetchingNextPage && !isLoading) {
+      fetchNextPage();
+    }
+  }, [hasNextPage, isFetchingNextPage, isLoading, fetchNextPage]);
+
+  return (
+    <>
+      <ReactSelectAsyncDropdown
+        testId="repo-dropdown"
+        loadOptions={loadOptions}
+        value={selectedOption}
+        placeholder={placeholder}
+        className={className}
+        errorMessage={errorMessage}
+        disabled={disabled}
+        isClearable={false}
+        isLoading={isLoading || isLoading || isFetchingNextPage}
+        cacheOptions
+        defaultOptions={allOptions}
+        onChange={handleChange}
+        onMenuScrollToBottom={handleMenuScrollToBottom}
+      />
+      {isError && (
+        <div
+          data-testid="repo-dropdown-error"
+          className="text-red-500 text-sm mt-1"
+        >
+          {t("HOME$FAILED_TO_LOAD_REPOSITORIES")}
+        </div>
+      )}
+    </>
+  );
+}
--- a/frontend/src/components/common/react-select-async-dropdown.tsx
+++ b/frontend/src/components/common/react-select-async-dropdown.tsx
@@ -0,0 +1,79 @@
+import { useCallback, useMemo } from "react";
+import AsyncSelect from "react-select/async";
+import { cn } from "#/utils/utils";
+import { SelectOptionBase, getCustomStyles } from "./react-select-styles";
+
+export type AsyncSelectOption = SelectOptionBase;
+
+export interface ReactSelectAsyncDropdownProps {
+  loadOptions: (inputValue: string) => Promise<AsyncSelectOption[]>;
+  testId?: string;
+  placeholder?: string;
+  value?: AsyncSelectOption | null;
+  defaultValue?: AsyncSelectOption | null;
+  className?: string;
+  errorMessage?: string;
+  disabled?: boolean;
+  isClearable?: boolean;
+  isLoading?: boolean;
+  cacheOptions?: boolean;
+  defaultOptions?: boolean | AsyncSelectOption[];
+  onChange?: (option: AsyncSelectOption | null) => void;
+  onMenuScrollToBottom?: () => void;
+}
+
+export function ReactSelectAsyncDropdown({
+  loadOptions,
+  testId,
+  placeholder = "Search...",
+  value,
+  defaultValue,
+  className,
+  errorMessage,
+  disabled = false,
+  isClearable = false,
+  isLoading = false,
+  cacheOptions = true,
+  defaultOptions = true,
+  onChange,
+  onMenuScrollToBottom,
+}: ReactSelectAsyncDropdownProps) {
+  const customStyles = useMemo(() => getCustomStyles<AsyncSelectOption>(), []);
+
+  const handleLoadOptions = useCallback(
+    (inputValue: string, callback: (options: AsyncSelectOption[]) => void) => {
+      loadOptions(inputValue)
+        .then((options) => callback(options))
+        .catch(() => callback([]));
+    },
+    [loadOptions],
+  );
+
+  return (
+    <div data-testid={testId} className={cn("w-full", className)}>
+      <AsyncSelect
+        loadOptions={handleLoadOptions}
+        value={value}
+        defaultValue={defaultValue}
+        placeholder={placeholder}
+        isDisabled={disabled}
+        isClearable={isClearable}
+        isLoading={isLoading}
+        cacheOptions={cacheOptions}
+        defaultOptions={defaultOptions}
+        onChange={onChange}
+        onMenuScrollToBottom={onMenuScrollToBottom}
+        styles={customStyles}
+        className="w-full"
+      />
+      {errorMessage && (
+        <p
+          data-testid="repo-dropdown-error"
+          className="text-red-500 text-sm mt-1"
+        >
+          {errorMessage}
+        </p>
+      )}
+    </div>
+  );
+}
--- a/frontend/src/components/common/react-select-dropdown.tsx
+++ b/frontend/src/components/common/react-select-dropdown.tsx
@@ -0,0 +1,57 @@
+import { useMemo } from "react";
+import Select from "react-select";
+import { cn } from "#/utils/utils";
+import { SelectOptionBase, getCustomStyles } from "./react-select-styles";
+
+export type SelectOption = SelectOptionBase;
+
+export interface ReactSelectDropdownProps {
+  options: SelectOption[];
+  placeholder?: string;
+  value?: SelectOption | null;
+  defaultValue?: SelectOption | null;
+  className?: string;
+  errorMessage?: string;
+  disabled?: boolean;
+  isClearable?: boolean;
+  isSearchable?: boolean;
+  isLoading?: boolean;
+  onChange?: (option: SelectOption | null) => void;
+}
+
+export function ReactSelectDropdown({
+  options,
+  placeholder = "Select option...",
+  value,
+  defaultValue,
+  className,
+  errorMessage,
+  disabled = false,
+  isClearable = false,
+  isSearchable = true,
+  isLoading = false,
+  onChange,
+}: ReactSelectDropdownProps) {
+  const customStyles = useMemo(() => getCustomStyles<SelectOption>(), []);
+
+  return (
+    <div className={cn("w-full", className)}>
+      <Select
+        options={options}
+        value={value}
+        defaultValue={defaultValue}
+        placeholder={placeholder}
+        isDisabled={disabled}
+        isClearable={isClearable}
+        isSearchable={isSearchable}
+        isLoading={isLoading}
+        onChange={onChange}
+        styles={customStyles}
+        className="w-full"
+      />
+      {errorMessage && (
+        <p className="text-red-500 text-sm mt-1">{errorMessage}</p>
+      )}
+    </div>
+  );
+}
--- a/frontend/src/components/common/react-select-styles.ts
+++ b/frontend/src/components/common/react-select-styles.ts
@@ -0,0 +1,92 @@
+import { StylesConfig } from "react-select";
+
+export interface SelectOptionBase {
+  value: string;
+  label: string;
+}
+
+export const getCustomStyles = <T extends SelectOptionBase>(): StylesConfig<
+  T,
+  false
+> => ({
+  control: (provided, state) => ({
+    ...provided,
+    backgroundColor: state.isDisabled ? "#363636" : "#454545", // darker tertiary when disabled
+    border: "1px solid #717888",
+    borderRadius: "0.125rem",
+    minHeight: "2.5rem",
+    padding: "0 0.5rem",
+    boxShadow: state.isFocused ? "0 0 0 1px #717888" : "none",
+    opacity: state.isDisabled ? 0.6 : 1,
+    cursor: state.isDisabled ? "not-allowed" : "pointer",
+    "&:hover": {
+      borderColor: "#717888",
+    },
+  }),
+  input: (provided) => ({
+    ...provided,
+    color: "#ECEDEE", // content
+  }),
+  placeholder: (provided) => ({
+    ...provided,
+    fontStyle: "italic",
+    color: "#B7BDC2", // tertiary-light
+  }),
+  singleValue: (provided, state) => ({
+    ...provided,
+    color: state.isDisabled ? "#B7BDC2" : "#ECEDEE", // tertiary-light when disabled, content otherwise
+  }),
+  menu: (provided) => ({
+    ...provided,
+    backgroundColor: "#454545", // tertiary
+    border: "1px solid #717888",
+    borderRadius: "0.75rem",
+    overflow: "hidden", // ensure menu items don't overflow rounded corners
+  }),
+  menuList: (provided) => ({
+    ...provided,
+    padding: "0.25rem", // add some padding around menu items
+  }),
+  option: (provided, state) => {
+    let backgroundColor = "transparent";
+    if (state.isSelected) {
+      backgroundColor = "#C9B974"; // primary for selected
+    } else if (state.isFocused) {
+      backgroundColor = "#24272E"; // base-secondary for hover/focus
+    }
+
+    return {
+      ...provided,
+      backgroundColor,
+      color: state.isSelected ? "#000000" : "#ECEDEE", // black text on yellow, white on gray
+      borderRadius: "0.5rem", // rounded menu items
+      margin: "0.125rem 0", // small gap between items
+      "&:hover": {
+        backgroundColor: state.isSelected ? "#C9B974" : "#24272E", // keep yellow if selected, else gray
+        color: state.isSelected ? "#000000" : "#ECEDEE", // maintain text color on hover
+      },
+      "&:active": {
+        backgroundColor: state.isSelected ? "#C9B974" : "#24272E",
+        color: state.isSelected ? "#000000" : "#ECEDEE",
+      },
+    };
+  },
+  clearIndicator: (provided) => ({
+    ...provided,
+    color: "#B7BDC2", // tertiary-light
+    "&:hover": {
+      color: "#ECEDEE", // content
+    },
+  }),
+  dropdownIndicator: (provided) => ({
+    ...provided,
+    color: "#B7BDC2", // tertiary-light
+    "&:hover": {
+      color: "#ECEDEE", // content
+    },
+  }),
+  loadingIndicator: (provided) => ({
+    ...provided,
+    color: "#B7BDC2", // tertiary-light
+  }),
+});
--- a/frontend/src/components/features/chat/chat-message.tsx
+++ b/frontend/src/components/features/chat/chat-message.tsx
@@ -9,7 +9,6 @@ import { CopyToClipboardButton } from "#/components/shared/buttons/copy-to-clipb
 import { anchor } from "../markdown/anchor";
 import { OpenHandsSourceType } from "#/types/core/base";
 import { paragraph } from "../markdown/paragraph";
-import { TooltipButton } from "#/components/shared/buttons/tooltip-button";

 interface ChatMessageProps {
  type: OpenHandsSourceType;
@@ -17,7 +16,6 @@ interface ChatMessageProps {
  actions?: Array<{
    icon: React.ReactNode;
    onClick: () => void;
-    tooltip?: string;
  }>;
 }

@@ -68,35 +66,17 @@ export function ChatMessage({
          "items-center gap-1",
        )}
      >
-        {actions?.map((action, index) =>
-          action.tooltip ? (
-            <TooltipButton
-              key={index}
-              tooltip={action.tooltip}
-              ariaLabel={action.tooltip}
-              placement="top"
-            >
-              <button
-                type="button"
-                onClick={action.onClick}
-                className="button-base p-1 cursor-pointer"
-                aria-label={`Action ${index + 1}`}
-              >
-                {action.icon}
-              </button>
-            </TooltipButton>
-          ) : (
-            <button
-              key={index}
-              type="button"
-              onClick={action.onClick}
-              className="button-base p-1 cursor-pointer"
-              aria-label={`Action ${index + 1}`}
-            >
-              {action.icon}
-            </button>
-          ),
-        )}
+        {actions?.map((action, index) => (
+          <button
+            key={index}
+            type="button"
+            onClick={action.onClick}
+            className="button-base p-1 cursor-pointer"
+            aria-label={`Action ${index + 1}`}
+          >
+            {action.icon}
+          </button>
+        ))}

        <CopyToClipboardButton
          isHidden={!isHovering}
--- a/frontend/src/components/features/chat/event-content-helpers/get-observation-content.ts
+++ b/frontend/src/components/features/chat/event-content-helpers/get-observation-content.ts
@@ -72,9 +72,6 @@ const getRecallObservationContent = (event: RecallObservation): string => {
    if (event.extras.repo_instructions) {
      content += `\n\n**Repository Instructions:**\n\n${event.extras.repo_instructions}`;
    }
-    if (event.extras.conversation_instructions) {
-      content += `\n\n**Conversation Instructions:**\n\n${event.extras.conversation_instructions}`;
-    }
    if (event.extras.additional_agent_instructions) {
      content += `\n\n**Additional Instructions:**\n\n${event.extras.additional_agent_instructions}`;
    }
--- a/frontend/src/components/features/chat/event-message.tsx
+++ b/frontend/src/components/features/chat/event-message.tsx
@@ -46,7 +46,6 @@ interface EventMessageProps {
  actions?: Array<{
    icon: React.ReactNode;
    onClick: () => void;
-    tooltip?: string;
  }>;
  isInLast10Actions: boolean;
 }
--- a/frontend/src/components/features/chat/messages.tsx
+++ b/frontend/src/components/features/chat/messages.tsx
@@ -1,5 +1,4 @@
 import React from "react";
-import { useTranslation } from "react-i18next";
 import { createPortal } from "react-dom";
 import { OpenHandsAction } from "#/types/core/actions";
 import { OpenHandsObservation } from "#/types/core/observations";
@@ -25,17 +24,6 @@ import { AgentState } from "#/types/agent-state";
 import { getFirstPRUrl } from "#/utils/parse-pr-url";
 import MemoryIcon from "#/icons/memory_icon.svg?react";

-const isErrorEvent = (evt: unknown): evt is { error: true; message: string } =>
-  typeof evt === "object" &&
-  evt !== null &&
-  "error" in evt &&
-  evt.error === true;
-
-const isAgentStatusError = (evt: unknown): boolean =>
-  isOpenHandsEvent(evt) &&
-  isAgentStateChangeObservation(evt) &&
-  evt.extras.agent_state === AgentState.ERROR;
-
 interface MessagesProps {
  messages: (OpenHandsAction | OpenHandsObservation)[];
  isAwaitingUserConfirmation: boolean;
@@ -43,11 +31,8 @@ interface MessagesProps {

 export const Messages: React.FC<MessagesProps> = React.memo(
  ({ messages, isAwaitingUserConfirmation }) => {
-    const {
-      createConversationAndSubscribe,
-      isPending,
-      unsubscribeFromConversation,
-    } = useCreateConversationAndSubscribeMultiple();
+    const { createConversationAndSubscribe, isPending } =
+      useCreateConversationAndSubscribeMultiple();
    const { getOptimisticUserMessage } = useOptimisticUserMessage();
    const { conversationId } = useConversationId();
    const { data: conversation } = useUserConversation(conversationId);
@@ -63,8 +48,6 @@ export const Messages: React.FC<MessagesProps> = React.memo(
      EventMicroagentStatus[]
    >([]);

-    const { t } = useTranslation();
-
    const actionHasObservationPair = React.useCallback(
      (event: OpenHandsAction | OpenHandsObservation): boolean => {
        if (isOpenHandsAction(event)) {
@@ -110,6 +93,20 @@ export const Messages: React.FC<MessagesProps> = React.memo(

    const handleMicroagentEvent = React.useCallback(
      (socketEvent: unknown, microagentConversationId: string) => {
+        // Handle error events
+        const isErrorEvent = (
+          evt: unknown,
+        ): evt is { error: true; message: string } =>
+          typeof evt === "object" &&
+          evt !== null &&
+          "error" in evt &&
+          evt.error === true;
+
+        const isAgentStatusError = (evt: unknown): boolean =>
+          isOpenHandsEvent(evt) &&
+          isAgentStateChangeObservation(evt) &&
+          evt.extras.agent_state === AgentState.ERROR;
+
        if (isErrorEvent(socketEvent) || isAgentStatusError(socketEvent)) {
          setMicroagentStatuses((prev) =>
            prev.map((statusEntry) =>
@@ -122,11 +119,7 @@ export const Messages: React.FC<MessagesProps> = React.memo(
          isOpenHandsEvent(socketEvent) &&
          isAgentStateChangeObservation(socketEvent)
        ) {
-          // Handle completion states
-          if (
-            socketEvent.extras.agent_state === AgentState.FINISHED ||
-            socketEvent.extras.agent_state === AgentState.AWAITING_USER_INPUT
-          ) {
+          if (socketEvent.extras.agent_state === AgentState.FINISHED) {
            setMicroagentStatuses((prev) =>
              prev.map((statusEntry) =>
                statusEntry.conversationId === microagentConversationId
@@ -134,8 +127,6 @@ export const Messages: React.FC<MessagesProps> = React.memo(
                  : statusEntry,
              ),
            );
-
-            unsubscribeFromConversation(microagentConversationId);
          }
        } else if (
          isOpenHandsEvent(socketEvent) &&
@@ -156,27 +147,9 @@ export const Messages: React.FC<MessagesProps> = React.memo(
              ),
            );
          }
-
-          unsubscribeFromConversation(microagentConversationId);
-        } else {
-          // For any other event, transition from WAITING to CREATING if still waiting
-          setMicroagentStatuses((prev) => {
-            const currentStatus = prev.find(
-              (entry) => entry.conversationId === microagentConversationId,
-            )?.status;
-
-            if (currentStatus === MicroagentStatus.WAITING) {
-              return prev.map((statusEntry) =>
-                statusEntry.conversationId === microagentConversationId
-                  ? { ...statusEntry, status: MicroagentStatus.CREATING }
-                  : statusEntry,
-              );
-            }
-            return prev; // No change needed
-          });
        }
      },
-      [setMicroagentStatuses, unsubscribeFromConversation],
+      [setMicroagentStatuses],
    );

    const handleLaunchMicroagent = (
@@ -205,13 +178,13 @@ export const Messages: React.FC<MessagesProps> = React.memo(
        },
        onSuccessCallback: (newConversationId: string) => {
          setShowLaunchMicroagentModal(false);
-          // Update status with conversation ID - start with WAITING
+          // Update status with conversation ID
          setMicroagentStatuses((prev) => [
            ...prev.filter((status) => status.eventId !== selectedEventId),
            {
              eventId: selectedEventId,
              conversationId: newConversationId,
-              status: MicroagentStatus.WAITING,
+              status: MicroagentStatus.CREATING,
            },
          ]);
        },
@@ -246,7 +219,6 @@ export const Messages: React.FC<MessagesProps> = React.memo(
                        setSelectedEventId(message.id);
                        setShowLaunchMicroagentModal(true);
                      },
-                      tooltip: t("MICROAGENT$ADD_TO_MEMORY"),
                    },
                  ]
                : undefined
--- a/frontend/src/components/features/chat/microagent/launch-microagent-modal.tsx
+++ b/frontend/src/components/features/chat/microagent/launch-microagent-modal.tsx
@@ -76,10 +76,6 @@ export function LaunchMicroagentModal({
            </button>
          </div>

-          <span className="text-sm text-[#A3A3A3] font-normal leading-5">
-            {t("MICROAGENT$DEFINITION")}
-          </span>
-
          <form
            data-testid="launch-microagent-modal"
            onSubmit={onSubmit}
--- a/frontend/src/components/features/chat/microagent/microagent-status-indicator.tsx
+++ b/frontend/src/components/features/chat/microagent/microagent-status-indicator.tsx
@@ -19,8 +19,6 @@ export function MicroagentStatusIndicator({

  const getStatusText = () => {
    switch (status) {
-      case MicroagentStatus.WAITING:
-        return t("MICROAGENT$STATUS_WAITING");
      case MicroagentStatus.CREATING:
        return t("MICROAGENT$STATUS_CREATING");
      case MicroagentStatus.COMPLETED:
@@ -37,8 +35,6 @@ export function MicroagentStatusIndicator({

  const getStatusIcon = () => {
    switch (status) {
-      case MicroagentStatus.WAITING:
-        return <Spinner size="sm" />;
      case MicroagentStatus.CREATING:
        return <Spinner size="sm" />;
      case MicroagentStatus.COMPLETED:
--- a/frontend/src/components/features/chat/microagent/microagent-status-toast.tsx
+++ b/frontend/src/components/features/chat/microagent/microagent-status-toast.tsx
@@ -10,11 +10,6 @@ interface ConversationCreatedToastProps {
  onClose: () => void;
 }

-interface ConversationStartingToastProps {
-  conversationId: string;
-  onClose: () => void;
-}
-
 function ConversationCreatedToast({
  conversationId,
  onClose,
@@ -42,33 +37,6 @@ function ConversationCreatedToast({
  );
 }

-function ConversationStartingToast({
-  conversationId,
-  onClose,
-}: ConversationStartingToastProps) {
-  const { t } = useTranslation();
-  return (
-    <div className="flex items-start gap-2">
-      <Spinner size="sm" />
-      <div>
-        {t("MICROAGENT$CONVERSATION_STARTING")}
-        <br />
-        <a
-          href={`/conversations/${conversationId}`}
-          target="_blank"
-          rel="noopener noreferrer"
-          className="underline"
-        >
-          {t("MICROAGENT$VIEW_CONVERSATION")}
-        </a>
-      </div>
-      <button type="button" onClick={onClose}>
-        <CloseIcon />
-      </button>
-    </div>
-  );
-}
-
 interface ConversationFinishedToastProps {
  conversationId: string;
  onClose: () => void;
@@ -110,18 +78,10 @@ function ConversationErroredToast({
  errorMessage,
  onClose,
 }: ConversationErroredToastProps) {
-  const { t } = useTranslation();
-
-  // Check if the error message is a translation key
-  const displayMessage =
-    errorMessage === "MICROAGENT$UNKNOWN_ERROR"
-      ? t(errorMessage)
-      : errorMessage;
-
  return (
    <div className="flex items-start gap-2">
      <SuccessIndicator status="error" />
-      <div>{displayMessage}</div>
+      <div>{errorMessage}</div>
      <button type="button" onClick={onClose}>
        <CloseIcon />
      </button>
@@ -176,18 +136,3 @@ export const renderConversationErroredToast = (
      duration: 5000,
    },
  );
-
-export const renderConversationStartingToast = (conversationId: string) =>
-  toast(
-    (toastInstance) => (
-      <ConversationStartingToast
-        conversationId={conversationId}
-        onClose={() => toast.dismiss(toastInstance.id)}
-      />
-    ),
-    {
-      ...TOAST_OPTIONS,
-      id: `starting-${conversationId}`,
-      duration: 10000, // Show for 10 seconds or until dismissed
-    },
-  );
--- a/frontend/src/components/features/controls/controls.tsx
+++ b/frontend/src/components/features/controls/controls.tsx
@@ -7,10 +7,11 @@ import { ConversationCard } from "../conversation-panel/conversation-card";
 import { Provider } from "#/types/settings";

 interface ControlsProps {
+  setSecurityOpen: (isOpen: boolean) => void;
  showSecurityLock: boolean;
 }

-export function Controls({ showSecurityLock }: ControlsProps) {
+export function Controls({ setSecurityOpen, showSecurityLock }: ControlsProps) {
  const { data: conversation } = useActiveConversation();
  const [contextMenuOpen, setContextMenuOpen] = React.useState(false);

@@ -20,7 +21,9 @@ export function Controls({ showSecurityLock }: ControlsProps) {
        <AgentControlBar />
        <AgentStatusBar />

-        {showSecurityLock && <SecurityLock />}
+        {showSecurityLock && (
+          <SecurityLock onClick={() => setSecurityOpen(true)} />
+        )}
      </div>

      <ConversationCard
--- a/frontend/src/components/features/controls/security-lock.tsx
+++ b/frontend/src/components/features/controls/security-lock.tsx
@@ -1,28 +1,17 @@
 import { IoLockClosed } from "react-icons/io5";
-import { Tooltip } from "@heroui/react";
-import { useTranslation } from "react-i18next";
-import { Link } from "react-router";
-import { I18nKey } from "#/i18n/declaration";

-export function SecurityLock() {
-  const { t } = useTranslation();
+interface SecurityLockProps {
+  onClick: () => void;
+}

+export function SecurityLock({ onClick }: SecurityLockProps) {
  return (
-    <Tooltip
-      content={
-        <div className="max-w-xs p-2">
-          {t(I18nKey.SETTINGS$CONFIRMATION_MODE_LOCK_TOOLTIP)}
-        </div>
-      }
-      placement="top"
+    <div
+      className="cursor-pointer hover:opacity-80 transition-all"
+      style={{ marginRight: "8px" }}
+      onClick={onClick}
    >
-      <Link
-        to="/settings"
-        className="mr-2 cursor-pointer hover:opacity-80 transition-all"
-        aria-label={t(I18nKey.SETTINGS$TITLE)}
-      >
-        <IoLockClosed size={20} />
-      </Link>
-    </Tooltip>
+      <IoLockClosed size={20} />
+    </div>
  );
 }
--- a/frontend/src/components/features/conversation-panel/confirm-stop-modal.tsx
+++ b/frontend/src/components/features/conversation-panel/confirm-stop-modal.tsx
@@ -23,9 +23,9 @@ export function ConfirmStopModal({
    <ModalBackdrop>
      <ModalBody className="items-start border border-tertiary">
        <div className="flex flex-col gap-2">
-          <BaseModalTitle title={t(I18nKey.CONVERSATION$CONFIRM_PAUSE)} />
+          <BaseModalTitle title={t(I18nKey.CONVERSATION$CONFIRM_STOP)} />
          <BaseModalDescription
-            description={t(I18nKey.CONVERSATION$PAUSE_WARNING)}
+            description={t(I18nKey.CONVERSATION$STOP_WARNING)}
          />
        </div>
        <div
--- a/Show More
+++ b/Show More
Author	SHA1	Message	Date
Graham Neubig	ccca13c6b9	fix(git): use public HTTPS clone URL when token is missing/empty for public repos Co-authored-by: openhands <openhands@all-hands.dev>	2025-08-18 18:50:34 -04:00
Graham Neubig	75a472cf74	fix(frontend): treat GitLab branch listing errors as no branches (enable Launch without token) Co-authored-by: openhands <openhands@all-hands.dev>	2025-08-18 18:50:33 -04:00
Graham Neubig	dbe5e1628b	fix(gitlab): public repo URL search & repo verification fallback to unauthenticated for public GitLab projects Co-authored-by: openhands <openhands@all-hands.dev>	2025-08-18 18:50:32 -04:00
openhands	a84e02b100	fix(frontend): ensure repo search uses selected provider and supports full URL input for GitLab - Pass provider into searchGitRepositories calls from GitRepositoryDropdown - Allow full https URL to be sent to backend for provider-aware resolution This helps GitLab E2E when listing repos fails, enabling URL-based search. Co-authored-by: openhands <openhands@all-hands.dev>	2025-08-18 19:11:10 +00:00
openhands	46d9e7a633	Merge remote-tracking branch 'origin/main' into e2e-gitlab-integration-test	2025-08-18 18:40:25 +00:00
openhands	61b7053eee	style(e2e): apply ruff-format changes Co-authored-by: openhands <openhands@all-hands.dev>	2025-08-18 18:39:06 +00:00
openhands	ae58f41aa3	ci(e2e): resolve merge conflict; include new browsing test and gitlab integration in E2E matrix\n\nCo-authored-by: openhands <openhands@all-hands.dev>	2025-08-18 17:45:53 +00:00
openhands	d42c4779c0	test(e2e): add URL wait for conversations route; expand conversation selectors\n\nCo-authored-by: openhands <openhands@all-hands.dev>	2025-08-18 17:01:36 +00:00
openhands	022644b4fe	test(e2e): stabilize provider and branch selectors for GitLab flow; use react-select control and option role\n\nCo-authored-by: openhands <openhands@all-hands.dev>	2025-08-18 16:41:22 +00:00
openhands	5378f9f446	test(e2e): fix branch dropdown click interception by clicking react-select control and forcing click; type into input before selecting main\n\nCo-authored-by: openhands <openhands@all-hands.dev>	2025-08-18 16:03:58 +00:00
openhands	f25a2c00b0	test(e2e): stabilize GitLab repo selection by targeting [data-testid=repo-dropdown] and input focus; fallback selectors retained\n\nCo-authored-by: openhands <openhands@all-hands.dev>	2025-08-18 15:06:41 +00:00
openhands	cfe01d4c8a	Merge main: resolve e2e README conflict; keep GitLab e2e docs and align examples\n\nCo-authored-by: openhands <openhands@all-hands.dev>	2025-08-18 11:56:18 +00:00
openhands	b2fec83b9a	feat: Improve GitLab repository selection with fallback strategies - Try multiple GitLab repositories in order of preference - Add fallback to select any available repository if GitLab repos not found - Adapt verification questions based on selected repository - Handle cases where GitLab API access is limited - Provide better error messages and debugging information - Clear search field before typing new repository names - Enhanced repository option detection with multiple selectors	2025-08-15 21:35:18 +00:00
openhands	b96f754b55	fix: Handle provider selection reset after settings navigation - Provider selection gets reset when navigating back from settings page - Added detection for provider dropdown after settings navigation - Enhanced provider dropdown clicking with more selectors and JavaScript fallback - Added more GitLab option selectors for better reliability - This should fix the issue where GitLab provider wasn't selected after settings	2025-08-15 21:20:40 +00:00
openhands	6135aad457	feat: Handle single provider case in GitLab E2E test - Check if provider dropdown exists before trying to interact with it - Provider dropdown only appears when multiple providers are configured - If single provider (GitLab only), it's auto-selected by the UI - This should fix the provider selection issue that was causing test failures - Simplified logic removes complex debugging code	2025-08-15 21:06:55 +00:00
openhands	b40c4e41e4	feat: Add comprehensive debugging and JavaScript-based interaction for provider dropdown - Add detailed debugging to inspect DOM elements - Add JavaScript-based clicking approaches - Try multiple strategies to find and click provider dropdown - Add fallback to Playwright selectors - This should help identify why the dropdown interaction is failing	2025-08-15 20:52:29 +00:00
openhands	f904fa6a56	feat: Add more robust provider dropdown interaction methods - Add multiple approaches to find and click provider dropdown - Try React Select control selectors first - Add fallback to find dropdowns by position in Connect section - Add coordinate-based clicking as last resort - Improve error handling and debugging output - This should handle different React Select implementations	2025-08-15 20:18:47 +00:00
openhands	9576059eda	feat: Improve GitLab E2E test with robust selector fallbacks - Add multiple selector strategies for provider dropdown - Add multiple selector strategies for repository search - Add multiple selector strategies for branch selection - Add fallback logic to find dropdowns by position in Connect section - Improve error handling and debugging output - This should handle different UI implementations and be more resilient	2025-08-15 20:05:44 +00:00
openhands	03d2d9a57a	feat: Update GitLab E2E test for new provider selection UI - Handle new UI flow with provider selection dropdown - Add steps to select GitLab provider first - Then search for repositories in GitLab - Then select branch - Update all step numbers accordingly - This should work with the new multi-step repository selection interface	2025-08-15 19:52:58 +00:00
openhands	0a81d5a977	feat: Update GitLab E2E test to use actual GitLab repository - Use gitlab-org/gitlab-foss as test repository (public GitLab repo) - Update test description to reflect GitLab-specific testing - This will properly test GitLab integration functionality - Repository is publicly accessible so should work with GitLab token	2025-08-15 19:33:55 +00:00
openhands	c53b222ef4	fix: Simplify GitLab E2E test to use exact conversation test logic - Copy exact repository selection logic from working conversation test - Use OpenHands repository to test basic functionality first - Remove complex fallback logic that was causing issues - Update test description to reflect current approach - This should fix repository selection and allow test to proceed	2025-08-15 19:32:26 +00:00
openhands	f45920de09	fix: Improve GitLab E2E test repository selection logic - Use OpenHands repository temporarily to test basic functionality - Add better repository selection fallback logic - Clear existing text before typing repository name - Add repository verification step - Improve error handling and debugging output - Test will verify basic repository cloning functionality first	2025-08-15 19:13:43 +00:00
openhands	cf2971b374	fix: Update GitLab E2E test to follow existing patterns and verify actual repository functionality - Rewrite test to follow same pattern as settings and conversation tests - Configure GitLab token in settings first (similar to GitHub test) - Select gitlab-org/gitlab-foss repository and launch it - Wait for agent initialization and ask it to count lines in README.md - Verify agent can actually access and work with cloned repository - Remove non-existent provider dropdown logic - Add comprehensive error handling and screenshots - Test now verifies end-to-end GitLab integration functionality	2025-08-15 18:53:14 +00:00
openhands	70a59f48d3	ci: Add GitLab integration test to E2E workflow - Add GITLAB_TOKEN environment variable to E2E tests - Include test_gitlab_integration.py::test_gitlab_repository_cloning in test suite - Ensures GitLab repository cloning functionality is tested in CI Co-authored-by: openhands <openhands@all-hands.dev>	2025-08-15 18:00:53 +00:00
openhands	dc2d1fcd9a	feat: Add E2E test for GitLab repository cloning - Add comprehensive end-to-end test for GitLab integration - Test verifies complete flow from GitLab token configuration to repository cloning - Includes GitLab provider selection and workspace verification - Update E2E test documentation with GitLab test instructions - Addresses issue #10380 Co-authored-by: openhands <openhands@all-hands.dev>	2025-08-15 15:56:10 +00:00