fmt

Merge branch 'swiftyos/nightly-autopilot' of github.com:Significant-Gravitas/AutoGPT into swiftyos/nightly-autopilot
updating email templates
2026-03-17 03:00:27 -04:00 · 2026-03-16 18:29:32 +01:00 · 2026-03-16 18:28:16 +01:00 · 2026-03-16 18:28:13 +01:00 · 2026-03-16 16:46:36 +01:00 · 2026-03-16 15:40:55 +01:00
157 changed files with 14963 additions and 1736 deletions
--- a/.claude/skills/backend-check/SKILL.md
+++ b/.claude/skills/backend-check/SKILL.md
@@ -1,17 +0,0 @@
---
-name: backend-check
-description: Run the full backend formatting, linting, and test suite. Ensures code quality before commits and PRs. TRIGGER when backend Python code has been modified and needs validation.
-user-invocable: true
-metadata:
-  author: autogpt-team
-  version: "1.0.0"
---
-
-# Backend Check
-
-## Steps
-
-1. **Format**: `poetry run format` — runs formatting AND linting. NEVER run ruff/black/isort individually
-2. **Fix** any remaining errors manually, re-run until clean
-3. **Test**: `poetry run test` (runs DB setup + pytest). For specific files: `poetry run pytest -s -vvv <test_files>`
-4. **Snapshots** (if needed): `poetry run pytest path/to/test.py --snapshot-update` — review with `git diff`
--- a/.claude/skills/code-style/SKILL.md
+++ b/.claude/skills/code-style/SKILL.md
@@ -1,35 +0,0 @@
---
-name: code-style
-description: Python code style preferences for the AutoGPT backend. Apply when writing or reviewing Python code. TRIGGER when writing new Python code, reviewing PRs, or refactoring backend code.
-user-invocable: false
-metadata:
-  author: autogpt-team
-  version: "1.0.0"
---
-
-# Code Style
-
-## Imports
-
- **Top-level only** — no local/inner imports. Move all imports to the top of the file.
-
-## Typing
-
- **No duck typing** — avoid `hasattr`, `getattr`, `isinstance` for type dispatch. Use proper typed interfaces, unions, or protocols.
- **Pydantic models** over dataclass, namedtuple, or raw dict for structured data.
- **No linter suppressors** — avoid `# type: ignore`, `# noqa`, `# pyright: ignore` etc. 99% of the time the right fix is fixing the type/code, not silencing the tool.
-
-## Code Structure
-
- **List comprehensions** over manual loop-and-append.
- **Early return** — guard clauses first, avoid deep nesting.
- **Flatten inline** — prefer short, concise expressions. Reduce `if/else` chains with direct returns or ternaries when readable.
- **Modular functions** — break complex logic into small, focused functions rather than long blocks with nested conditionals.
-
-## Review Checklist
-
-Before finishing, always ask:
- Can any function be split into smaller pieces?
- Is there unnecessary nesting that an early return would eliminate?
- Can any loop be a comprehension?
- Is there a simpler way to express this logic?
--- a/.claude/skills/frontend-check/SKILL.md
+++ b/.claude/skills/frontend-check/SKILL.md
@@ -1,16 +0,0 @@
---
-name: frontend-check
-description: Run the full frontend formatting, linting, and type checking suite. Ensures code quality before commits and PRs. TRIGGER when frontend TypeScript/React code has been modified and needs validation.
-user-invocable: true
-metadata:
-  author: autogpt-team
-  version: "1.0.0"
---
-
-# Frontend Check
-
-## Steps (in order)
-
-1. **Format**: `pnpm format` — NEVER run individual formatters
-2. **Lint**: `pnpm lint` — fix errors, re-run until clean
-3. **Types**: `pnpm types` — if it keeps failing after multiple attempts, stop and ask the user
--- a/.claude/skills/new-block/SKILL.md
+++ b/.claude/skills/new-block/SKILL.md
@@ -1,29 +0,0 @@
---
-name: new-block
-description: Create a new backend block following the Block SDK Guide. Guides through provider configuration, schema definition, authentication, and testing. TRIGGER when user asks to create a new block, add a new integration, or build a new node for the graph editor.
-user-invocable: true
-metadata:
-  author: autogpt-team
-  version: "1.0.0"
---
-
-# New Block Creation
-
-Read `docs/platform/block-sdk-guide.md` first for the full guide.
-
-## Steps
-
-1. **Provider config** (if external service): create `_config.py` with `ProviderBuilder`
-2. **Block file** in `backend/blocks/` (from `autogpt_platform/backend/`):
-   - Generate a UUID once with `uuid.uuid4()`, then **hard-code that string** as `id` (IDs must be stable across imports)
-   - `Input(BlockSchema)` and `Output(BlockSchema)` classes
-   - `async def run` that `yield`s output fields
-3. **Files**: use `store_media_file()` with `"for_block_output"` for outputs
-4. **Test**: `poetry run pytest 'backend/blocks/test/test_block.py::test_available_blocks[MyBlock]' -xvs`
-5. **Format**: `poetry run format`
-
-## Rules
-
- Analyze interfaces: do inputs/outputs connect well with other blocks in a graph?
- Use top-level imports, avoid duck typing
- Always use `for_block_output` for block outputs
--- a/.claude/skills/openapi-regen/SKILL.md
+++ b/.claude/skills/openapi-regen/SKILL.md
@@ -1,28 +0,0 @@
---
-name: openapi-regen
-description: Regenerate the OpenAPI spec and frontend API client. Starts the backend REST server, fetches the spec, and regenerates the typed frontend hooks. TRIGGER when API routes change, new endpoints are added, or frontend API types are stale.
-user-invocable: true
-metadata:
-  author: autogpt-team
-  version: "1.0.0"
---
-
-# OpenAPI Spec Regeneration
-
-## Steps
-
-1. **Run end-to-end** in a single shell block (so `REST_PID` persists):
-   ```bash
-   cd autogpt_platform/backend && poetry run rest &
-   REST_PID=$!
-   WAIT=0; until curl -sf http://localhost:8006/health > /dev/null 2>&1; do sleep 1; WAIT=$((WAIT+1)); [ $WAIT -ge 60 ] && echo "Timed out" && kill $REST_PID && exit 1; done
-   cd ../frontend && pnpm generate:api:force
-   kill $REST_PID
-   pnpm types && pnpm lint && pnpm format
-   ```
-
-## Rules
-
- Always use `pnpm generate:api:force` (not `pnpm generate:api`)
- Don't manually edit files in `src/app/api/__generated__/`
- Generated hooks follow: `use{Method}{Version}{OperationName}`
--- a/.claude/skills/pr-address/SKILL.md
+++ b/.claude/skills/pr-address/SKILL.md
@@ -0,0 +1,79 @@
+---
+name: pr-address
+description: Address PR review comments and loop until CI green and all comments resolved. TRIGGER when user asks to address comments, fix PR feedback, respond to reviewers, or babysit/monitor a PR.
+user-invocable: true
+args: "[PR number or URL] — if omitted, finds PR for current branch."
+metadata:
+  author: autogpt-team
+  version: "1.0.0"
+---
+
+# PR Address
+
+## Find the PR
+
+```bash
+gh pr list --head $(git branch --show-current) --repo Significant-Gravitas/AutoGPT
+gh pr view {N}
+```
+
+## Fetch comments (all sources)
+
+```bash
+gh api repos/Significant-Gravitas/AutoGPT/pulls/{N}/reviews       # top-level reviews
+gh api repos/Significant-Gravitas/AutoGPT/pulls/{N}/comments      # inline review comments
+gh api repos/Significant-Gravitas/AutoGPT/issues/{N}/comments     # PR conversation comments
+```
+
+**Bots to watch for:**
+- `autogpt-reviewer` — posts "Blockers", "Should Fix", "Nice to Have". Address ALL of them.
+- `sentry[bot]` — bug predictions. Fix real bugs, explain false positives.
+- `coderabbitai[bot]` — automated review. Address actionable items.
+
+## For each unaddressed comment
+
+Address comments **one at a time**: fix → commit → push → inline reply → next.
+
+1. Read the referenced code, make the fix (or reply explaining why it's not needed)
+2. Commit and push the fix
+3. Reply **inline** (not as a new top-level comment) referencing the fixing commit — this is what resolves the conversation for bot reviewers (coderabbitai, sentry):
+
+| Comment type | How to reply |
+|---|---|
+| Inline review (`pulls/{N}/comments`) | `gh api repos/Significant-Gravitas/AutoGPT/pulls/{N}/comments/{ID}/replies -f body="Fixed in <commit-sha>: <description>"` |
+| Conversation (`issues/{N}/comments`) | `gh api repos/Significant-Gravitas/AutoGPT/issues/{N}/comments -f body="Fixed in <commit-sha>: <description>"` |
+
+## Format and commit
+
+After fixing, format the changed code:
+
+- **Backend** (from `autogpt_platform/backend/`): `poetry run format`
+- **Frontend** (from `autogpt_platform/frontend/`): `pnpm format && pnpm lint && pnpm types`
+
+If API routes changed, regenerate the frontend client:
+```bash
+cd autogpt_platform/backend && poetry run rest &
+REST_PID=$!
+trap "kill $REST_PID 2>/dev/null" EXIT
+WAIT=0; until curl -sf http://localhost:8006/health > /dev/null 2>&1; do sleep 1; WAIT=$((WAIT+1)); [ $WAIT -ge 60 ] && echo "Timed out" && exit 1; done
+cd ../frontend && pnpm generate:api:force
+kill $REST_PID 2>/dev/null; trap - EXIT
+```
+Never manually edit files in `src/app/api/__generated__/`.
+
+Then commit and **push immediately** — never batch commits without pushing.
+
+For backend commits in worktrees: `poetry run git commit` (pre-commit hooks).
+
+## The loop
+
+```text
+address comments → format → commit → push
+→ re-check comments → fix new ones → push
+→ wait for CI → re-check comments after CI settles
+→ repeat until: all comments addressed AND CI green AND no new comments arriving
+```
+
+While CI runs, stay productive: run local tests, address remaining comments.
+
+**The loop ends when:** CI fully green + all comments addressed + no new comments since CI settled.
--- a/.claude/skills/pr-create/SKILL.md
+++ b/.claude/skills/pr-create/SKILL.md
@@ -1,31 +0,0 @@
---
-name: pr-create
-description: Create a pull request for the current branch. TRIGGER when user asks to create a PR, open a pull request, push changes for review, or submit work for merging.
-user-invocable: true
-metadata:
-  author: autogpt-team
-  version: "1.0.0"
---
-
-# Create Pull Request
-
-## Steps
-
-1. **Check for existing PR**: `gh pr view --json url -q .url 2>/dev/null` — if a PR already exists, output its URL and stop
-2. **Understand changes**: `git status`, `git diff dev...HEAD`, `git log dev..HEAD --oneline`
-3. **Read PR template**: `.github/PULL_REQUEST_TEMPLATE.md`
-4. **Draft PR title**: Use conventional commits format (see CLAUDE.md for types and scopes)
-5. **Fill out PR template** as the body — be thorough in the Changes section
-6. **Format first** (if relevant changes exist):
-   - Backend: `cd autogpt_platform/backend && poetry run format`
-   - Frontend: `cd autogpt_platform/frontend && pnpm format`
-   - Fix any lint errors, then commit formatting changes before pushing
-7. **Push**: `git push -u origin HEAD`
-8. **Create PR**: `gh pr create --base dev`
-9. **Output** the PR URL
-
-## Rules
-
- Always target `dev` branch
- Do NOT run tests — CI will handle that
- Use the PR template from `.github/PULL_REQUEST_TEMPLATE.md`
--- a/.claude/skills/pr-review/SKILL.md
+++ b/.claude/skills/pr-review/SKILL.md
@@ -1,51 +1,74 @@
 ---
 name: pr-review
-description: Address all open PR review comments systematically. Fetches comments, addresses each one, reacts +1/-1, and replies when clarification is needed. Keeps iterating until all comments are addressed and CI is green. TRIGGER when user shares a PR URL, asks to address review comments, fix PR feedback, or respond to reviewer comments.
+description: Review a PR for correctness, security, code quality, and testing issues. TRIGGER when user asks to review a PR, check PR quality, or give feedback on a PR.
 user-invocable: true
+args: "[PR number or URL] — if omitted, finds PR for current branch."
 metadata:
  author: autogpt-team
  version: "1.0.0"
 ---

-# PR Review Comment Workflow
+# PR Review

-## Steps
+## Find the PR

-1. **Find PR**: `gh pr list --head $(git branch --show-current) --repo Significant-Gravitas/AutoGPT`
-2. **Fetch comments** (all three sources):
-   - `gh api repos/Significant-Gravitas/AutoGPT/pulls/{N}/reviews` (top-level reviews)
-   - `gh api repos/Significant-Gravitas/AutoGPT/pulls/{N}/comments` (inline review comments)
-   - `gh api repos/Significant-Gravitas/AutoGPT/issues/{N}/comments` (PR conversation comments)
-3. **Skip** comments already reacted to by PR author
-4. **For each unreacted comment**:
-   - Read referenced code, make the fix (or reply if you disagree/need info)
-   - **Inline review comments** (`pulls/{N}/comments`):
-     - React: `gh api repos/.../pulls/comments/{ID}/reactions -f content="+1"` (or `-1`)
-     - Reply: `gh api repos/.../pulls/{N}/comments/{ID}/replies -f body="..."`
-   - **PR conversation comments** (`issues/{N}/comments`):
-     - React: `gh api repos/.../issues/comments/{ID}/reactions -f content="+1"` (or `-1`)
-     - No threaded replies — post a new issue comment if needed
-   - **Top-level reviews**: no reaction API — address in code, reply via issue comment if needed
-5. **Include autogpt-reviewer bot fixes** too
-6. **Format**: `cd autogpt_platform/backend && poetry run format`, `cd autogpt_platform/frontend && pnpm format`
-7. **Commit & push**
-8. **Re-fetch comments** immediately — address any new unreacted ones before waiting on CI
-9. **Stay productive while CI runs** — don't idle. In priority order:
-   - Run any pending local tests (`poetry run pytest`, e2e, etc.) and fix failures
-   - Address any remaining comments
-   - Only poll `gh pr checks {N}` as the last resort when there's truly nothing left to do
-10. **If CI fails** — fix, go back to step 6
-11. **Re-fetch comments again** after CI is green — address anything that appeared while CI was running
-12. **Done** only when: all comments reacted AND CI is green.
+```bash
+gh pr list --head $(git branch --show-current) --repo Significant-Gravitas/AutoGPT
+gh pr view {N}
+```

-## CRITICAL: Do Not Stop
+## Read the diff

-**Loop is: address → format → commit → push → re-check comments → run local tests → wait CI → re-check comments → repeat.**
+```bash
+gh pr diff {N}
+```

-Never idle. If CI is running and you have nothing to address, run local tests. Waiting on CI is the last resort.
+## Fetch existing review comments

-## Rules
+Before posting anything, fetch existing inline comments to avoid duplicates:

- One todo per comment
- For inline review comments: reply on existing threads. For PR conversation comments: post a new issue comment (API doesn't support threaded replies)
- React to every comment: +1 addressed, -1 disagreed (with explanation)
+```bash
+gh api repos/Significant-Gravitas/AutoGPT/pulls/{N}/comments
+gh api repos/Significant-Gravitas/AutoGPT/pulls/{N}/reviews
+```
+
+## What to check
+
+**Correctness:** logic errors, off-by-one, missing edge cases, race conditions (TOCTOU in file access, credit charging), error handling gaps, async correctness (missing `await`, unclosed resources).
+
+**Security:** input validation at boundaries, no injection (command, XSS, SQL), secrets not logged, file paths sanitized (`os.path.basename()` in error messages).
+
+**Code quality:** apply rules from backend/frontend CLAUDE.md files.
+
+**Architecture:** DRY, single responsibility, modular functions. `Security()` vs `Depends()` for FastAPI auth. `data:` for SSE events, `: comment` for heartbeats. `transaction=True` for Redis pipelines.
+
+**Testing:** edge cases covered, colocated `*_test.py` (backend) / `__tests__/` (frontend), mocks target where symbol is **used** not defined, `AsyncMock` for async.
+
+## Output format
+
+Every comment **must** be prefixed with `🤖` and a criticality badge:
+
+| Tier | Badge | Meaning |
+|---|---|---|
+| Blocker | `🔴 **Blocker**` | Must fix before merge |
+| Should Fix | `🟠 **Should Fix**` | Important improvement |
+| Nice to Have | `🟡 **Nice to Have**` | Minor suggestion |
+| Nit | `🔵 **Nit**` | Style / wording |
+
+Example: `🤖 🔴 **Blocker**: Missing error handling for X — suggest wrapping in try/except.`
+
+## Post inline comments
+
+For each finding, post an inline comment on the PR (do not just write a local report):
+
+```bash
+# Get the latest commit SHA for the PR
+COMMIT_SHA=$(gh api repos/Significant-Gravitas/AutoGPT/pulls/{N} --jq '.head.sha')
+
+# Post an inline comment on a specific file/line
+gh api repos/Significant-Gravitas/AutoGPT/pulls/{N}/comments \
+  -f body="🤖 🔴 **Blocker**: <description>" \
+  -f commit_id="$COMMIT_SHA" \
+  -f path="<file path>" \
+  -F line=<line number>
+```
--- a/.claude/skills/worktree-setup/SKILL.md
+++ b/.claude/skills/worktree-setup/SKILL.md
@@ -1,45 +0,0 @@
---
-name: worktree-setup
-description: Set up a new git worktree for parallel development. Creates the worktree, copies .env files, installs dependencies, generates Prisma client, and optionally starts the app (with port conflict resolution) or runs tests. TRIGGER when user asks to set up a worktree, work on a branch in isolation, or needs a separate environment for a branch or PR.
-user-invocable: true
-metadata:
-  author: autogpt-team
-  version: "1.0.0"
---
-
-# Worktree Setup
-
-## Preferred: Use Branchlet
-
-The repo has a `.branchlet.json` config — it handles env file copying, dependency installation, and Prisma generation automatically.
-
-```bash
-npm install -g branchlet                                      # install once
-branchlet create -n <name> -s <source-branch> -b <new-branch>
-branchlet list --json   # list all worktrees
-```
-
-## Manual Fallback
-
-If branchlet isn't available:
-
-1. `git worktree add ../<RepoName><N> <branch-name>`
-2. Copy `.env` files: `backend/.env`, `frontend/.env`, `autogpt_platform/.env`, `db/docker/.env`
-3. Install deps:
-   - `cd autogpt_platform/backend && poetry install && poetry run prisma generate`
-   - `cd autogpt_platform/frontend && pnpm install`
-
-## Running the App
-
-Free ports first — backend uses: 8001, 8002, 8003, 8005, 8006, 8007, 8008.
-
-```bash
-for port in 8001 8002 8003 8005 8006 8007 8008; do
-  lsof -ti :$port | xargs kill -9 2>/dev/null || true
-done
-cd <worktree>/autogpt_platform/backend && poetry run app
-```
-
-## CoPilot Testing Gotcha
-
-SDK mode spawns a Claude subprocess — **won't work inside Claude Code**. Set `CHAT_USE_CLAUDE_AGENT_SDK=false` in `backend/.env` to use baseline mode.
--- a/.claude/skills/worktree/SKILL.md
+++ b/.claude/skills/worktree/SKILL.md
@@ -0,0 +1,85 @@
+---
+name: worktree
+description: Set up a new git worktree for parallel development. Creates the worktree, copies .env files, installs dependencies, and generates Prisma client. TRIGGER when user asks to set up a worktree, work on a branch in isolation, or needs a separate environment for a branch or PR.
+user-invocable: true
+args: "[name] — optional worktree name (e.g., 'AutoGPT7'). If omitted, uses next available AutoGPT<N>."
+metadata:
+  author: autogpt-team
+  version: "3.0.0"
+---
+
+# Worktree Setup
+
+## Create the worktree
+
+Derive paths from the git toplevel. If a name is provided as argument, use it. Otherwise, check `git worktree list` and pick the next `AutoGPT<N>`.
+
+```bash
+ROOT=$(git rev-parse --show-toplevel)
+PARENT=$(dirname "$ROOT")
+
+# From an existing branch
+git worktree add "$PARENT/<NAME>" <branch-name>
+
+# From a new branch off dev
+git worktree add -b <new-branch> "$PARENT/<NAME>" dev
+```
+
+## Copy environment files
+
+Copy `.env` from the root worktree. Falls back to `.env.default` if `.env` doesn't exist.
+
+```bash
+ROOT=$(git rev-parse --show-toplevel)
+TARGET="$(dirname "$ROOT")/<NAME>"
+
+for envpath in autogpt_platform/backend autogpt_platform/frontend autogpt_platform; do
+  if [ -f "$ROOT/$envpath/.env" ]; then
+    cp "$ROOT/$envpath/.env" "$TARGET/$envpath/.env"
+  elif [ -f "$ROOT/$envpath/.env.default" ]; then
+    cp "$ROOT/$envpath/.env.default" "$TARGET/$envpath/.env"
+  fi
+done
+```
+
+## Install dependencies
+
+```bash
+TARGET="$(dirname "$(git rev-parse --show-toplevel)")/<NAME>"
+cd "$TARGET/autogpt_platform/autogpt_libs" && poetry install
+cd "$TARGET/autogpt_platform/backend" && poetry install && poetry run prisma generate
+cd "$TARGET/autogpt_platform/frontend" && pnpm install
+```
+
+Replace `<NAME>` with the actual worktree name (e.g., `AutoGPT7`).
+
+## Running the app (optional)
+
+Backend uses ports: 8001, 8002, 8003, 8005, 8006, 8007, 8008. Free them first if needed:
+
+```bash
+TARGET="$(dirname "$(git rev-parse --show-toplevel)")/<NAME>"
+for port in 8001 8002 8003 8005 8006 8007 8008; do
+  lsof -ti :$port | xargs kill -9 2>/dev/null || true
+done
+cd "$TARGET/autogpt_platform/backend" && poetry run app
+```
+
+## CoPilot testing
+
+SDK mode spawns a Claude subprocess — won't work inside Claude Code. Set `CHAT_USE_CLAUDE_AGENT_SDK=false` in `backend/.env` to use baseline mode.
+
+## Cleanup
+
+```bash
+# Replace <NAME> with the actual worktree name (e.g., AutoGPT7)
+git worktree remove "$(dirname "$(git rev-parse --show-toplevel)")/<NAME>"
+```
+
+## Alternative: Branchlet (optional)
+
+If [branchlet](https://www.npmjs.com/package/branchlet) is installed:
+
+```bash
+branchlet create -n <name> -s <source-branch> -b <new-branch>
+```
--- a/autogpt_platform/CLAUDE.md
+++ b/autogpt_platform/CLAUDE.md
@@ -60,9 +60,12 @@ AutoGPT Platform is a monorepo containing:

 ### Reviewing/Revising Pull Requests

- When the user runs /pr-comments or tries to fetch them, also run gh api /repos/Significant-Gravitas/AutoGPT/pulls/[issuenum]/reviews to get the reviews
- Use gh api /repos/Significant-Gravitas/AutoGPT/pulls/[issuenum]/reviews/[review_id]/comments to get the review contents
- Use gh api /repos/Significant-Gravitas/AutoGPT/issues/9924/comments to get the pr specific comments
+Use `/pr-review` to review a PR or `/pr-address` to address comments.
+
+When fetching comments manually:
+- `gh api repos/Significant-Gravitas/AutoGPT/pulls/{N}/reviews` — top-level reviews
+- `gh api repos/Significant-Gravitas/AutoGPT/pulls/{N}/comments` — inline review comments
+- `gh api repos/Significant-Gravitas/AutoGPT/issues/{N}/comments` — PR conversation comments

 ### Conventional Commits

--- a/autogpt_platform/analytics/queries/auth_activities.sql
+++ b/autogpt_platform/analytics/queries/auth_activities.sql
@@ -0,0 +1,40 @@
+-- =============================================================
+-- View: analytics.auth_activities
+-- Looker source alias: ds49  |  Charts: 1
+-- =============================================================
+-- DESCRIPTION
+--   Tracks authentication events (login, logout, SSO, password
+--   reset, etc.) from Supabase's internal audit log.
+--   Useful for monitoring sign-in patterns and detecting anomalies.
+--
+-- SOURCE TABLES
+--   auth.audit_log_entries  — Supabase internal auth event log
+--
+-- OUTPUT COLUMNS
+--   created_at      TIMESTAMPTZ  When the auth event occurred
+--   actor_id        TEXT         User ID who triggered the event
+--   actor_via_sso   TEXT         Whether the action was via SSO ('true'/'false')
+--   action          TEXT         Event type (e.g. 'login', 'logout', 'token_refreshed')
+--
+-- WINDOW
+--   Rolling 90 days from current date
+--
+-- EXAMPLE QUERIES
+--   -- Daily login counts
+--   SELECT DATE_TRUNC('day', created_at) AS day, COUNT(*) AS logins
+--   FROM analytics.auth_activities
+--   WHERE action = 'login'
+--   GROUP BY 1 ORDER BY 1;
+--
+--   -- SSO vs password login breakdown
+--   SELECT actor_via_sso, COUNT(*) FROM analytics.auth_activities
+--   WHERE action = 'login' GROUP BY 1;
+-- =============================================================
+
+SELECT
+    created_at,
+    payload->>'actor_id'      AS actor_id,
+    payload->>'actor_via_sso' AS actor_via_sso,
+    payload->>'action'        AS action
+FROM auth.audit_log_entries
+WHERE created_at >= NOW() - INTERVAL '90 days'
--- a/autogpt_platform/analytics/queries/graph_execution.sql
+++ b/autogpt_platform/analytics/queries/graph_execution.sql
@@ -0,0 +1,105 @@
+-- =============================================================
+-- View: analytics.graph_execution
+-- Looker source alias: ds16  |  Charts: 21
+-- =============================================================
+-- DESCRIPTION
+--   One row per agent graph execution (last 90 days).
+--   Unpacks the JSONB stats column into individual numeric columns
+--   and normalises the executionStatus — runs that failed due to
+--   insufficient credits are reclassified as 'NO_CREDITS' for
+--   easier filtering.  Error messages are scrubbed of IDs and URLs
+--   to allow safe grouping.
+--
+-- SOURCE TABLES
+--   platform.AgentGraphExecution  — Execution records
+--   platform.AgentGraph           — Agent graph metadata (for name)
+--   platform.LibraryAgent         — To flag possibly-AI (safe-mode) agents
+--
+-- OUTPUT COLUMNS
+--   id                TEXT         Execution UUID
+--   agentGraphId      TEXT         Agent graph UUID
+--   agentGraphVersion INT          Graph version number
+--   executionStatus   TEXT         COMPLETED | FAILED | NO_CREDITS | RUNNING | QUEUED | TERMINATED
+--   createdAt         TIMESTAMPTZ  When the execution was queued
+--   updatedAt         TIMESTAMPTZ  Last status update time
+--   userId            TEXT         Owner user UUID
+--   agentGraphName    TEXT         Human-readable agent name
+--   cputime           DECIMAL      Total CPU seconds consumed
+--   walltime          DECIMAL      Total wall-clock seconds
+--   node_count        DECIMAL      Number of nodes in the graph
+--   nodes_cputime     DECIMAL      CPU time across all nodes
+--   nodes_walltime    DECIMAL      Wall time across all nodes
+--   execution_cost    DECIMAL      Credit cost of this execution
+--   correctness_score FLOAT        AI correctness score (if available)
+--   possibly_ai       BOOLEAN      True if agent has sensitive_action_safe_mode enabled
+--   groupedErrorMessage TEXT       Scrubbed error string (IDs/URLs replaced with wildcards)
+--
+-- WINDOW
+--   Rolling 90 days (createdAt > CURRENT_DATE - 90 days)
+--
+-- EXAMPLE QUERIES
+--   -- Daily execution counts by status
+--   SELECT DATE_TRUNC('day', "createdAt") AS day, "executionStatus", COUNT(*)
+--   FROM analytics.graph_execution
+--   GROUP BY 1, 2 ORDER BY 1;
+--
+--   -- Average cost per execution by agent
+--   SELECT "agentGraphName", AVG("execution_cost") AS avg_cost, COUNT(*) AS runs
+--   FROM analytics.graph_execution
+--   WHERE "executionStatus" = 'COMPLETED'
+--   GROUP BY 1 ORDER BY avg_cost DESC;
+--
+--   -- Top error messages
+--   SELECT "groupedErrorMessage", COUNT(*) AS occurrences
+--   FROM analytics.graph_execution
+--   WHERE "executionStatus" = 'FAILED'
+--   GROUP BY 1 ORDER BY 2 DESC LIMIT 20;
+-- =============================================================
+
+SELECT
+    ge."id"                                                        AS id,
+    ge."agentGraphId"                                              AS agentGraphId,
+    ge."agentGraphVersion"                                         AS agentGraphVersion,
+    CASE
+        WHEN jsonb_exists(ge."stats"::jsonb, 'error')
+         AND (
+               (ge."stats"::jsonb->>'error') ILIKE '%insufficient balance%'
+            OR (ge."stats"::jsonb->>'error') ILIKE '%you have no credits left%'
+             )
+        THEN 'NO_CREDITS'
+        ELSE CAST(ge."executionStatus" AS TEXT)
+    END                                                            AS executionStatus,
+    ge."createdAt"                                                 AS createdAt,
+    ge."updatedAt"                                                 AS updatedAt,
+    ge."userId"                                                    AS userId,
+    g."name"                                                       AS agentGraphName,
+    (ge."stats"::jsonb->>'cputime')::decimal                       AS cputime,
+    (ge."stats"::jsonb->>'walltime')::decimal                      AS walltime,
+    (ge."stats"::jsonb->>'node_count')::decimal                    AS node_count,
+    (ge."stats"::jsonb->>'nodes_cputime')::decimal                 AS nodes_cputime,
+    (ge."stats"::jsonb->>'nodes_walltime')::decimal                AS nodes_walltime,
+    (ge."stats"::jsonb->>'cost')::decimal                          AS execution_cost,
+    (ge."stats"::jsonb->>'correctness_score')::float               AS correctness_score,
+    COALESCE(la.possibly_ai, FALSE)                                AS possibly_ai,
+    REGEXP_REPLACE(
+        REGEXP_REPLACE(
+            TRIM(BOTH '"' FROM ge."stats"::jsonb->>'error'),
+            '(https?://)([A-Za-z0-9.-]+)(:[0-9]+)?(/[^\s]*)?',
+            '\1\2/...', 'gi'
+        ),
+        '[a-zA-Z0-9_:-]*\d[a-zA-Z0-9_:-]*', '*', 'g'
+    )                                                              AS groupedErrorMessage
+FROM platform."AgentGraphExecution" ge
+LEFT JOIN platform."AgentGraph" g
+       ON ge."agentGraphId" = g."id"
+      AND ge."agentGraphVersion" = g."version"
+LEFT JOIN (
+    SELECT DISTINCT ON ("userId", "agentGraphId")
+           "userId", "agentGraphId",
+           ("settings"::jsonb->>'sensitive_action_safe_mode')::boolean AS possibly_ai
+    FROM platform."LibraryAgent"
+    WHERE "isDeleted"  = FALSE
+      AND "isArchived" = FALSE
+    ORDER BY "userId", "agentGraphId", "agentGraphVersion" DESC
+) la ON la."userId" = ge."userId" AND la."agentGraphId" = ge."agentGraphId"
+WHERE ge."createdAt" > CURRENT_DATE - INTERVAL '90 days'
--- a/autogpt_platform/analytics/queries/node_block_execution.sql
+++ b/autogpt_platform/analytics/queries/node_block_execution.sql
@@ -0,0 +1,101 @@
+-- =============================================================
+-- View: analytics.node_block_execution
+-- Looker source alias: ds14  |  Charts: 11
+-- =============================================================
+-- DESCRIPTION
+--   One row per node (block) execution (last 90 days).
+--   Unpacks stats JSONB and joins to identify which block type
+--   was run.  For failed nodes, joins the error output and
+--   scrubs it for safe grouping.
+--
+-- SOURCE TABLES
+--   platform.AgentNodeExecution              — Node execution records
+--   platform.AgentNode                       — Node → block mapping
+--   platform.AgentBlock                      — Block name/ID
+--   platform.AgentNodeExecutionInputOutput   — Error output values
+--
+-- OUTPUT COLUMNS
+--   id                    TEXT         Node execution UUID
+--   agentGraphExecutionId TEXT         Parent graph execution UUID
+--   agentNodeId           TEXT         Node UUID within the graph
+--   executionStatus       TEXT         COMPLETED | FAILED | QUEUED | RUNNING | TERMINATED
+--   addedTime             TIMESTAMPTZ  When the node was queued
+--   queuedTime            TIMESTAMPTZ  When it entered the queue
+--   startedTime           TIMESTAMPTZ  When execution started
+--   endedTime             TIMESTAMPTZ  When execution finished
+--   inputSize             BIGINT       Input payload size in bytes
+--   outputSize            BIGINT       Output payload size in bytes
+--   walltime              NUMERIC      Wall-clock seconds for this node
+--   cputime               NUMERIC      CPU seconds for this node
+--   llmRetryCount         INT          Number of LLM retries
+--   llmCallCount          INT          Number of LLM API calls made
+--   inputTokenCount       BIGINT       LLM input tokens consumed
+--   outputTokenCount      BIGINT       LLM output tokens produced
+--   blockName             TEXT         Human-readable block name (e.g. 'OpenAIBlock')
+--   blockId               TEXT         Block UUID
+--   groupedErrorMessage   TEXT         Scrubbed error (IDs/URLs wildcarded)
+--   errorMessage          TEXT         Raw error output (only set when FAILED)
+--
+-- WINDOW
+--   Rolling 90 days (addedTime > CURRENT_DATE - 90 days)
+--
+-- EXAMPLE QUERIES
+--   -- Most-used blocks by execution count
+--   SELECT "blockName", COUNT(*) AS executions,
+--          COUNT(*) FILTER (WHERE "executionStatus"='FAILED') AS failures
+--   FROM analytics.node_block_execution
+--   GROUP BY 1 ORDER BY executions DESC LIMIT 20;
+--
+--   -- Average LLM token usage per block
+--   SELECT "blockName",
+--          AVG("inputTokenCount") AS avg_input_tokens,
+--          AVG("outputTokenCount") AS avg_output_tokens
+--   FROM analytics.node_block_execution
+--   WHERE "llmCallCount" > 0
+--   GROUP BY 1 ORDER BY avg_input_tokens DESC;
+--
+--   -- Top failure reasons
+--   SELECT "blockName", "groupedErrorMessage", COUNT(*) AS count
+--   FROM analytics.node_block_execution
+--   WHERE "executionStatus" = 'FAILED'
+--   GROUP BY 1, 2 ORDER BY count DESC LIMIT 20;
+-- =============================================================
+
+SELECT
+    ne."id"                                                            AS id,
+    ne."agentGraphExecutionId"                                         AS agentGraphExecutionId,
+    ne."agentNodeId"                                                   AS agentNodeId,
+    CAST(ne."executionStatus" AS TEXT)                                 AS executionStatus,
+    ne."addedTime"                                                     AS addedTime,
+    ne."queuedTime"                                                    AS queuedTime,
+    ne."startedTime"                                                   AS startedTime,
+    ne."endedTime"                                                     AS endedTime,
+    (ne."stats"::jsonb->>'input_size')::bigint                         AS inputSize,
+    (ne."stats"::jsonb->>'output_size')::bigint                        AS outputSize,
+    (ne."stats"::jsonb->>'walltime')::numeric                          AS walltime,
+    (ne."stats"::jsonb->>'cputime')::numeric                           AS cputime,
+    (ne."stats"::jsonb->>'llm_retry_count')::int                       AS llmRetryCount,
+    (ne."stats"::jsonb->>'llm_call_count')::int                        AS llmCallCount,
+    (ne."stats"::jsonb->>'input_token_count')::bigint                  AS inputTokenCount,
+    (ne."stats"::jsonb->>'output_token_count')::bigint                 AS outputTokenCount,
+    b."name"                                                           AS blockName,
+    b."id"                                                             AS blockId,
+    REGEXP_REPLACE(
+        REGEXP_REPLACE(
+            TRIM(BOTH '"' FROM eio."data"::text),
+            '(https?://)([A-Za-z0-9.-]+)(:[0-9]+)?(/[^\s]*)?',
+            '\1\2/...', 'gi'
+        ),
+        '[a-zA-Z0-9_:-]*\d[a-zA-Z0-9_:-]*', '*', 'g'
+    )                                                                  AS groupedErrorMessage,
+    eio."data"                                                         AS errorMessage
+FROM platform."AgentNodeExecution" ne
+LEFT JOIN platform."AgentNode" nd
+       ON ne."agentNodeId" = nd."id"
+LEFT JOIN platform."AgentBlock" b
+       ON nd."agentBlockId" = b."id"
+LEFT JOIN platform."AgentNodeExecutionInputOutput" eio
+       ON eio."referencedByOutputExecId" = ne."id"
+      AND eio."name" = 'error'
+      AND ne."executionStatus" = 'FAILED'
+WHERE ne."addedTime" > CURRENT_DATE - INTERVAL '90 days'
--- a/autogpt_platform/analytics/queries/retention_agent.sql
+++ b/autogpt_platform/analytics/queries/retention_agent.sql
@@ -0,0 +1,97 @@
+-- =============================================================
+-- View: analytics.retention_agent
+-- Looker source alias: ds35  |  Charts: 2
+-- =============================================================
+-- DESCRIPTION
+--   Weekly cohort retention broken down per individual agent.
+--   Cohort = week of a user's first use of THAT specific agent.
+--   Tells you which agents keep users coming back vs. one-shot
+--   use. Only includes cohorts from the last 180 days.
+--
+-- SOURCE TABLES
+--   platform.AgentGraphExecution  — Execution records (user × agent × time)
+--   platform.AgentGraph           — Agent names
+--
+-- OUTPUT COLUMNS
+--   agent_id            TEXT   Agent graph UUID
+--   agent_label         TEXT   'AgentName [first8chars]'
+--   agent_label_n       TEXT   'AgentName [first8chars] (n=total_users)'
+--   cohort_week_start   DATE   Week users first ran this agent
+--   cohort_label        TEXT   ISO week label
+--   cohort_label_n      TEXT   ISO week label with cohort size
+--   user_lifetime_week  INT    Weeks since first use of this agent
+--   cohort_users        BIGINT Users in this cohort for this agent
+--   active_users        BIGINT Users who ran the agent again in week k
+--   retention_rate      FLOAT  active_users / cohort_users
+--   cohort_users_w0     BIGINT cohort_users only at week 0 (safe to SUM)
+--   agent_total_users   BIGINT Total users across all cohorts for this agent
+--
+-- EXAMPLE QUERIES
+--   -- Best-retained agents at week 2
+--   SELECT agent_label, AVG(retention_rate) AS w2_retention
+--   FROM analytics.retention_agent
+--   WHERE user_lifetime_week = 2 AND cohort_users >= 10
+--   GROUP BY 1 ORDER BY w2_retention DESC LIMIT 10;
+--
+--   -- Agents with most unique users
+--   SELECT DISTINCT agent_label, agent_total_users
+--   FROM analytics.retention_agent
+--   ORDER BY agent_total_users DESC LIMIT 20;
+-- =============================================================
+
+WITH params AS (SELECT 12::int AS max_weeks, (CURRENT_DATE - INTERVAL '180 days') AS cohort_start),
+events AS (
+  SELECT e."userId"::text AS user_id, e."agentGraphId" AS agent_id,
+         e."createdAt"::timestamptz AS created_at,
+         DATE_TRUNC('week', e."createdAt")::date AS week_start
+  FROM platform."AgentGraphExecution" e
+),
+first_use AS (
+  SELECT user_id, agent_id, MIN(created_at) AS first_use_at,
+         DATE_TRUNC('week', MIN(created_at))::date AS cohort_week_start
+  FROM events GROUP BY 1,2
+  HAVING MIN(created_at) >= (SELECT cohort_start FROM params)
+),
+activity_weeks AS (SELECT DISTINCT user_id, agent_id, week_start FROM events),
+user_week_age AS (
+  SELECT aw.user_id, aw.agent_id, fu.cohort_week_start,
+         ((aw.week_start - DATE_TRUNC('week',fu.first_use_at)::date)/7)::int AS user_lifetime_week
+  FROM activity_weeks aw JOIN first_use fu USING (user_id, agent_id)
+  WHERE aw.week_start >= DATE_TRUNC('week',fu.first_use_at)::date
+),
+active_counts AS (
+  SELECT agent_id, cohort_week_start, user_lifetime_week, COUNT(DISTINCT user_id) AS active_users
+  FROM user_week_age WHERE user_lifetime_week >= 0 GROUP BY 1,2,3
+),
+cohort_sizes AS (
+  SELECT agent_id, cohort_week_start, COUNT(DISTINCT user_id) AS cohort_users FROM first_use GROUP BY 1,2
+),
+cohort_caps AS (
+  SELECT cs.agent_id, cs.cohort_week_start, cs.cohort_users,
+         LEAST((SELECT max_weeks FROM params),
+               GREATEST(0,((DATE_TRUNC('week',CURRENT_DATE)::date-cs.cohort_week_start)/7)::int)) AS cap_weeks
+  FROM cohort_sizes cs
+),
+grid AS (
+  SELECT cc.agent_id, cc.cohort_week_start, gs AS user_lifetime_week, cc.cohort_users
+  FROM cohort_caps cc CROSS JOIN LATERAL generate_series(0, cc.cap_weeks) gs
+),
+agent_names AS (SELECT DISTINCT ON (g."id") g."id" AS agent_id, g."name" AS agent_name FROM platform."AgentGraph" g ORDER BY g."id", g."version" DESC),
+agent_total_users AS (SELECT agent_id, SUM(cohort_users) AS agent_total_users FROM cohort_sizes GROUP BY 1)
+SELECT
+  g.agent_id,
+  COALESCE(an.agent_name,'(unnamed)')||' ['||LEFT(g.agent_id::text,8)||']'  AS agent_label,
+  COALESCE(an.agent_name,'(unnamed)')||' ['||LEFT(g.agent_id::text,8)||'] (n='||COALESCE(atu.agent_total_users,0)||')' AS agent_label_n,
+  g.cohort_week_start,
+  TO_CHAR(g.cohort_week_start,'IYYY-"W"IW')                               AS cohort_label,
+  TO_CHAR(g.cohort_week_start,'IYYY-"W"IW')||' (n='||g.cohort_users||')'  AS cohort_label_n,
+  g.user_lifetime_week, g.cohort_users,
+  COALESCE(ac.active_users,0)                                              AS active_users,
+  COALESCE(ac.active_users,0)::float / NULLIF(g.cohort_users,0)           AS retention_rate,
+  CASE WHEN g.user_lifetime_week=0 THEN g.cohort_users ELSE 0 END         AS cohort_users_w0,
+  COALESCE(atu.agent_total_users,0)                                        AS agent_total_users
+FROM grid g
+LEFT JOIN active_counts     ac  ON ac.agent_id=g.agent_id AND ac.cohort_week_start=g.cohort_week_start AND ac.user_lifetime_week=g.user_lifetime_week
+LEFT JOIN agent_names       an  ON an.agent_id=g.agent_id
+LEFT JOIN agent_total_users atu ON atu.agent_id=g.agent_id
+ORDER BY agent_label, g.cohort_week_start, g.user_lifetime_week;
--- a/autogpt_platform/analytics/queries/retention_execution_daily.sql
+++ b/autogpt_platform/analytics/queries/retention_execution_daily.sql
@@ -0,0 +1,81 @@
+-- =============================================================
+-- View: analytics.retention_execution_daily
+-- Looker source alias: ds111  |  Charts: 1
+-- =============================================================
+-- DESCRIPTION
+--   Daily cohort retention based on agent executions.
+--   Cohort anchor = day of user's FIRST ever execution.
+--   Only includes cohorts from the last 90 days, up to day 30.
+--   Great for early engagement analysis (did users run another
+--   agent the next day?).
+--
+-- SOURCE TABLES
+--   platform.AgentGraphExecution  — Execution records
+--
+-- OUTPUT COLUMNS
+--   Same pattern as retention_login_daily.
+--   cohort_day_start = day of first execution (not first login)
+--
+-- EXAMPLE QUERIES
+--   -- Day-3 execution retention
+--   SELECT cohort_label, retention_rate_bounded AS d3_retention
+--   FROM analytics.retention_execution_daily
+--   WHERE user_lifetime_day = 3 ORDER BY cohort_day_start;
+-- =============================================================
+
+WITH params AS (SELECT 30::int AS max_days, (CURRENT_DATE - INTERVAL '90 days') AS cohort_start),
+events AS (
+  SELECT e."userId"::text AS user_id, e."createdAt"::timestamptz AS created_at,
+         DATE_TRUNC('day', e."createdAt")::date AS day_start
+  FROM platform."AgentGraphExecution" e WHERE e."userId" IS NOT NULL
+),
+first_exec AS (
+  SELECT user_id, MIN(created_at) AS first_exec_at,
+         DATE_TRUNC('day', MIN(created_at))::date AS cohort_day_start
+  FROM events GROUP BY 1
+  HAVING MIN(created_at) >= (SELECT cohort_start FROM params)
+),
+activity_days AS (SELECT DISTINCT user_id, day_start FROM events),
+user_day_age AS (
+  SELECT ad.user_id, fe.cohort_day_start,
+         (ad.day_start - DATE_TRUNC('day',fe.first_exec_at)::date)::int AS user_lifetime_day
+  FROM activity_days ad JOIN first_exec fe USING (user_id)
+  WHERE ad.day_start >= DATE_TRUNC('day',fe.first_exec_at)::date
+),
+bounded_counts AS (
+  SELECT cohort_day_start, user_lifetime_day, COUNT(DISTINCT user_id) AS active_users_bounded
+  FROM user_day_age WHERE user_lifetime_day >= 0 GROUP BY 1,2
+),
+last_active AS (
+  SELECT cohort_day_start, user_id, MAX(user_lifetime_day) AS last_active_day FROM user_day_age GROUP BY 1,2
+),
+unbounded_counts AS (
+  SELECT la.cohort_day_start, gs AS user_lifetime_day, COUNT(*) AS retained_users_unbounded
+  FROM last_active la
+  CROSS JOIN LATERAL generate_series(0, LEAST(la.last_active_day,(SELECT max_days FROM params))) gs
+  GROUP BY 1,2
+),
+cohort_sizes AS (SELECT cohort_day_start, COUNT(DISTINCT user_id) AS cohort_users FROM first_exec GROUP BY 1),
+cohort_caps AS (
+  SELECT cs.cohort_day_start, cs.cohort_users,
+         LEAST((SELECT max_days FROM params), GREATEST(0,(CURRENT_DATE-cs.cohort_day_start)::int)) AS cap_days
+  FROM cohort_sizes cs
+),
+grid AS (
+  SELECT cc.cohort_day_start, gs AS user_lifetime_day, cc.cohort_users
+  FROM cohort_caps cc CROSS JOIN LATERAL generate_series(0, cc.cap_days) gs
+)
+SELECT
+  g.cohort_day_start,
+  TO_CHAR(g.cohort_day_start,'YYYY-MM-DD')                                AS cohort_label,
+  TO_CHAR(g.cohort_day_start,'YYYY-MM-DD')||' (n='||g.cohort_users||')'   AS cohort_label_n,
+  g.user_lifetime_day, g.cohort_users,
+  COALESCE(b.active_users_bounded,0)     AS active_users_bounded,
+  COALESCE(u.retained_users_unbounded,0) AS retained_users_unbounded,
+  CASE WHEN g.cohort_users>0 THEN COALESCE(b.active_users_bounded,0)::float/g.cohort_users END    AS retention_rate_bounded,
+  CASE WHEN g.cohort_users>0 THEN COALESCE(u.retained_users_unbounded,0)::float/g.cohort_users END AS retention_rate_unbounded,
+  CASE WHEN g.user_lifetime_day=0 THEN g.cohort_users ELSE 0 END          AS cohort_users_d0
+FROM grid g
+LEFT JOIN bounded_counts   b ON b.cohort_day_start=g.cohort_day_start AND b.user_lifetime_day=g.user_lifetime_day
+LEFT JOIN unbounded_counts u ON u.cohort_day_start=g.cohort_day_start AND u.user_lifetime_day=g.user_lifetime_day
+ORDER BY g.cohort_day_start, g.user_lifetime_day;
--- a/autogpt_platform/analytics/queries/retention_execution_weekly.sql
+++ b/autogpt_platform/analytics/queries/retention_execution_weekly.sql
@@ -0,0 +1,81 @@
+-- =============================================================
+-- View: analytics.retention_execution_weekly
+-- Looker source alias: ds92  |  Charts: 2
+-- =============================================================
+-- DESCRIPTION
+--   Weekly cohort retention based on agent executions.
+--   Cohort anchor = week of user's FIRST ever agent execution
+--   (not first login). Only includes cohorts from the last 180 days.
+--   Useful when you care about product engagement, not just visits.
+--
+-- SOURCE TABLES
+--   platform.AgentGraphExecution  — Execution records
+--
+-- OUTPUT COLUMNS
+--   Same pattern as retention_login_weekly.
+--   cohort_week_start = week of first execution (not first login)
+--
+-- EXAMPLE QUERIES
+--   -- Week-2 execution retention
+--   SELECT cohort_label, retention_rate_bounded
+--   FROM analytics.retention_execution_weekly
+--   WHERE user_lifetime_week = 2 ORDER BY cohort_week_start;
+-- =============================================================
+
+WITH params AS (SELECT 12::int AS max_weeks, (CURRENT_DATE - INTERVAL '180 days') AS cohort_start),
+events AS (
+  SELECT e."userId"::text AS user_id, e."createdAt"::timestamptz AS created_at,
+         DATE_TRUNC('week', e."createdAt")::date AS week_start
+  FROM platform."AgentGraphExecution" e WHERE e."userId" IS NOT NULL
+),
+first_exec AS (
+  SELECT user_id, MIN(created_at) AS first_exec_at,
+         DATE_TRUNC('week', MIN(created_at))::date AS cohort_week_start
+  FROM events GROUP BY 1
+  HAVING MIN(created_at) >= (SELECT cohort_start FROM params)
+),
+activity_weeks AS (SELECT DISTINCT user_id, week_start FROM events),
+user_week_age AS (
+  SELECT aw.user_id, fe.cohort_week_start,
+         ((aw.week_start - DATE_TRUNC('week',fe.first_exec_at)::date)/7)::int AS user_lifetime_week
+  FROM activity_weeks aw JOIN first_exec fe USING (user_id)
+  WHERE aw.week_start >= DATE_TRUNC('week',fe.first_exec_at)::date
+),
+bounded_counts AS (
+  SELECT cohort_week_start, user_lifetime_week, COUNT(DISTINCT user_id) AS active_users_bounded
+  FROM user_week_age WHERE user_lifetime_week >= 0 GROUP BY 1,2
+),
+last_active AS (
+  SELECT cohort_week_start, user_id, MAX(user_lifetime_week) AS last_active_week FROM user_week_age GROUP BY 1,2
+),
+unbounded_counts AS (
+  SELECT la.cohort_week_start, gs AS user_lifetime_week, COUNT(*) AS retained_users_unbounded
+  FROM last_active la
+  CROSS JOIN LATERAL generate_series(0, LEAST(la.last_active_week,(SELECT max_weeks FROM params))) gs
+  GROUP BY 1,2
+),
+cohort_sizes AS (SELECT cohort_week_start, COUNT(DISTINCT user_id) AS cohort_users FROM first_exec GROUP BY 1),
+cohort_caps AS (
+  SELECT cs.cohort_week_start, cs.cohort_users,
+         LEAST((SELECT max_weeks FROM params),
+               GREATEST(0,((DATE_TRUNC('week',CURRENT_DATE)::date-cs.cohort_week_start)/7)::int)) AS cap_weeks
+  FROM cohort_sizes cs
+),
+grid AS (
+  SELECT cc.cohort_week_start, gs AS user_lifetime_week, cc.cohort_users
+  FROM cohort_caps cc CROSS JOIN LATERAL generate_series(0, cc.cap_weeks) gs
+)
+SELECT
+  g.cohort_week_start,
+  TO_CHAR(g.cohort_week_start,'IYYY-"W"IW')                               AS cohort_label,
+  TO_CHAR(g.cohort_week_start,'IYYY-"W"IW')||' (n='||g.cohort_users||')'  AS cohort_label_n,
+  g.user_lifetime_week, g.cohort_users,
+  COALESCE(b.active_users_bounded,0)     AS active_users_bounded,
+  COALESCE(u.retained_users_unbounded,0) AS retained_users_unbounded,
+  CASE WHEN g.cohort_users>0 THEN COALESCE(b.active_users_bounded,0)::float/g.cohort_users END    AS retention_rate_bounded,
+  CASE WHEN g.cohort_users>0 THEN COALESCE(u.retained_users_unbounded,0)::float/g.cohort_users END AS retention_rate_unbounded,
+  CASE WHEN g.user_lifetime_week=0 THEN g.cohort_users ELSE 0 END         AS cohort_users_w0
+FROM grid g
+LEFT JOIN bounded_counts   b ON b.cohort_week_start=g.cohort_week_start AND b.user_lifetime_week=g.user_lifetime_week
+LEFT JOIN unbounded_counts u ON u.cohort_week_start=g.cohort_week_start AND u.user_lifetime_week=g.user_lifetime_week
+ORDER BY g.cohort_week_start, g.user_lifetime_week;
--- a/autogpt_platform/analytics/queries/retention_login_daily.sql
+++ b/autogpt_platform/analytics/queries/retention_login_daily.sql
@@ -0,0 +1,94 @@
+-- =============================================================
+-- View: analytics.retention_login_daily
+-- Looker source alias: ds112  |  Charts: 1
+-- =============================================================
+-- DESCRIPTION
+--   Daily cohort retention based on login sessions.
+--   Same logic as retention_login_weekly but at day granularity,
+--   showing up to day 30 for cohorts from the last 90 days.
+--   Useful for analysing early activation (days 1-7) in detail.
+--
+-- SOURCE TABLES
+--   auth.sessions  — Login session records
+--
+-- OUTPUT COLUMNS (same pattern as retention_login_weekly)
+--   cohort_day_start          DATE     First day the cohort logged in
+--   cohort_label              TEXT     Date string (e.g. '2025-03-01')
+--   cohort_label_n            TEXT     Date + cohort size (e.g. '2025-03-01 (n=12)')
+--   user_lifetime_day         INT      Days since first login (0 = signup day)
+--   cohort_users              BIGINT   Total users in cohort
+--   active_users_bounded      BIGINT   Users active on exactly day k
+--   retained_users_unbounded  BIGINT   Users active any time on/after day k
+--   retention_rate_bounded    FLOAT    bounded / cohort_users
+--   retention_rate_unbounded  FLOAT    unbounded / cohort_users
+--   cohort_users_d0           BIGINT   cohort_users only at day 0, else 0 (safe to SUM)
+--
+-- EXAMPLE QUERIES
+--   -- Day-1 retention rate (came back next day)
+--   SELECT cohort_label, retention_rate_bounded AS d1_retention
+--   FROM analytics.retention_login_daily
+--   WHERE user_lifetime_day = 1 ORDER BY cohort_day_start;
+--
+--   -- Average retention curve across all cohorts
+--   SELECT user_lifetime_day,
+--          SUM(active_users_bounded)::float / NULLIF(SUM(cohort_users_d0), 0) AS avg_retention
+--   FROM analytics.retention_login_daily
+--   GROUP BY 1 ORDER BY 1;
+-- =============================================================
+
+WITH params AS (SELECT 30::int AS max_days, (CURRENT_DATE - INTERVAL '90 days')::date AS cohort_start),
+events AS (
+  SELECT s.user_id::text AS user_id, s.created_at::timestamptz AS created_at,
+         DATE_TRUNC('day', s.created_at)::date AS day_start
+  FROM auth.sessions s WHERE s.user_id IS NOT NULL
+),
+first_login AS (
+  SELECT user_id, MIN(created_at) AS first_login_time,
+         DATE_TRUNC('day', MIN(created_at))::date AS cohort_day_start
+  FROM events GROUP BY 1
+  HAVING MIN(created_at) >= (SELECT cohort_start FROM params)
+),
+activity_days AS (SELECT DISTINCT user_id, day_start FROM events),
+user_day_age AS (
+  SELECT ad.user_id, fl.cohort_day_start,
+         (ad.day_start - DATE_TRUNC('day', fl.first_login_time)::date)::int AS user_lifetime_day
+  FROM activity_days ad JOIN first_login fl USING (user_id)
+  WHERE ad.day_start >= DATE_TRUNC('day', fl.first_login_time)::date
+),
+bounded_counts AS (
+  SELECT cohort_day_start, user_lifetime_day, COUNT(DISTINCT user_id) AS active_users_bounded
+  FROM user_day_age WHERE user_lifetime_day >= 0 GROUP BY 1,2
+),
+last_active AS (
+  SELECT cohort_day_start, user_id, MAX(user_lifetime_day) AS last_active_day FROM user_day_age GROUP BY 1,2
+),
+unbounded_counts AS (
+  SELECT la.cohort_day_start, gs AS user_lifetime_day, COUNT(*) AS retained_users_unbounded
+  FROM last_active la
+  CROSS JOIN LATERAL generate_series(0, LEAST(la.last_active_day,(SELECT max_days FROM params))) gs
+  GROUP BY 1,2
+),
+cohort_sizes AS (SELECT cohort_day_start, COUNT(DISTINCT user_id) AS cohort_users FROM first_login GROUP BY 1),
+cohort_caps AS (
+  SELECT cs.cohort_day_start, cs.cohort_users,
+         LEAST((SELECT max_days FROM params), GREATEST(0,(CURRENT_DATE-cs.cohort_day_start)::int)) AS cap_days
+  FROM cohort_sizes cs
+),
+grid AS (
+  SELECT cc.cohort_day_start, gs AS user_lifetime_day, cc.cohort_users
+  FROM cohort_caps cc CROSS JOIN LATERAL generate_series(0, cc.cap_days) gs
+)
+SELECT
+  g.cohort_day_start,
+  TO_CHAR(g.cohort_day_start,'YYYY-MM-DD')                                  AS cohort_label,
+  TO_CHAR(g.cohort_day_start,'YYYY-MM-DD')||' (n='||g.cohort_users||')'     AS cohort_label_n,
+  g.user_lifetime_day, g.cohort_users,
+  COALESCE(b.active_users_bounded,0)     AS active_users_bounded,
+  COALESCE(u.retained_users_unbounded,0) AS retained_users_unbounded,
+  CASE WHEN g.cohort_users>0 THEN COALESCE(b.active_users_bounded,0)::float/g.cohort_users END    AS retention_rate_bounded,
+  CASE WHEN g.cohort_users>0 THEN COALESCE(u.retained_users_unbounded,0)::float/g.cohort_users END AS retention_rate_unbounded,
+  CASE WHEN g.user_lifetime_day=0 THEN g.cohort_users ELSE 0 END            AS cohort_users_d0
+FROM grid g
+LEFT JOIN bounded_counts   b ON b.cohort_day_start=g.cohort_day_start AND b.user_lifetime_day=g.user_lifetime_day
+LEFT JOIN unbounded_counts u ON u.cohort_day_start=g.cohort_day_start AND u.user_lifetime_day=g.user_lifetime_day
+ORDER BY g.cohort_day_start, g.user_lifetime_day;
--- a/autogpt_platform/analytics/queries/retention_login_onboarded_weekly.sql
+++ b/autogpt_platform/analytics/queries/retention_login_onboarded_weekly.sql
@@ -0,0 +1,96 @@
+-- =============================================================
+-- View: analytics.retention_login_onboarded_weekly
+-- Looker source alias: ds101  |  Charts: 2
+-- =============================================================
+-- DESCRIPTION
+--   Weekly cohort retention from login sessions, restricted to
+--   users who "onboarded" — defined as running at least one
+--   agent within 365 days of their first login.
+--   Filters out users who signed up but never activated,
+--   giving a cleaner view of engaged-user retention.
+--
+-- SOURCE TABLES
+--   auth.sessions                  — Login session records
+--   platform.AgentGraphExecution   — Used to identify onboarders
+--
+-- OUTPUT COLUMNS
+--   Same as retention_login_weekly (cohort_week_start, user_lifetime_week,
+--   retention_rate_bounded, retention_rate_unbounded, etc.)
+--   Only difference: cohort is filtered to onboarded users only.
+--
+-- EXAMPLE QUERIES
+--   -- Compare week-4 retention: all users vs onboarded only
+--   SELECT 'all_users' AS segment, AVG(retention_rate_bounded) AS w4_retention
+--   FROM analytics.retention_login_weekly WHERE user_lifetime_week = 4
+--   UNION ALL
+--   SELECT 'onboarded', AVG(retention_rate_bounded)
+--   FROM analytics.retention_login_onboarded_weekly WHERE user_lifetime_week = 4;
+-- =============================================================
+
+WITH params AS (SELECT 12::int AS max_weeks, 365::int AS onboarding_window_days),
+events AS (
+  SELECT s.user_id::text AS user_id, s.created_at::timestamptz AS created_at,
+         DATE_TRUNC('week', s.created_at)::date AS week_start
+  FROM auth.sessions s WHERE s.user_id IS NOT NULL
+),
+first_login_all AS (
+  SELECT user_id, MIN(created_at) AS first_login_time,
+         DATE_TRUNC('week', MIN(created_at))::date AS cohort_week_start
+  FROM events GROUP BY 1
+),
+onboarders AS (
+  SELECT fl.user_id FROM first_login_all fl
+  WHERE EXISTS (
+    SELECT 1 FROM platform."AgentGraphExecution" e
+    WHERE e."userId"::text = fl.user_id
+      AND e."createdAt" >= fl.first_login_time
+      AND e."createdAt" < fl.first_login_time
+          + make_interval(days => (SELECT onboarding_window_days FROM params))
+  )
+),
+first_login AS (SELECT * FROM first_login_all WHERE user_id IN (SELECT user_id FROM onboarders)),
+activity_weeks AS (SELECT DISTINCT user_id, week_start FROM events),
+user_week_age AS (
+  SELECT aw.user_id, fl.cohort_week_start,
+         ((aw.week_start - DATE_TRUNC('week',fl.first_login_time)::date)/7)::int AS user_lifetime_week
+  FROM activity_weeks aw JOIN first_login fl USING (user_id)
+  WHERE aw.week_start >= DATE_TRUNC('week',fl.first_login_time)::date
+),
+bounded_counts AS (
+  SELECT cohort_week_start, user_lifetime_week, COUNT(DISTINCT user_id) AS active_users_bounded
+  FROM user_week_age WHERE user_lifetime_week >= 0 GROUP BY 1,2
+),
+last_active AS (
+  SELECT cohort_week_start, user_id, MAX(user_lifetime_week) AS last_active_week FROM user_week_age GROUP BY 1,2
+),
+unbounded_counts AS (
+  SELECT la.cohort_week_start, gs AS user_lifetime_week, COUNT(*) AS retained_users_unbounded
+  FROM last_active la
+  CROSS JOIN LATERAL generate_series(0, LEAST(la.last_active_week,(SELECT max_weeks FROM params))) gs
+  GROUP BY 1,2
+),
+cohort_sizes AS (SELECT cohort_week_start, COUNT(DISTINCT user_id) AS cohort_users FROM first_login GROUP BY 1),
+cohort_caps AS (
+  SELECT cs.cohort_week_start, cs.cohort_users,
+         LEAST((SELECT max_weeks FROM params),
+               GREATEST(0,((DATE_TRUNC('week',CURRENT_DATE)::date-cs.cohort_week_start)/7)::int)) AS cap_weeks
+  FROM cohort_sizes cs
+),
+grid AS (
+  SELECT cc.cohort_week_start, gs AS user_lifetime_week, cc.cohort_users
+  FROM cohort_caps cc CROSS JOIN LATERAL generate_series(0, cc.cap_weeks) gs
+)
+SELECT
+  g.cohort_week_start,
+  TO_CHAR(g.cohort_week_start,'IYYY-"W"IW')                               AS cohort_label,
+  TO_CHAR(g.cohort_week_start,'IYYY-"W"IW')||' (n='||g.cohort_users||')'  AS cohort_label_n,
+  g.user_lifetime_week, g.cohort_users,
+  COALESCE(b.active_users_bounded,0)     AS active_users_bounded,
+  COALESCE(u.retained_users_unbounded,0) AS retained_users_unbounded,
+  CASE WHEN g.cohort_users>0 THEN COALESCE(b.active_users_bounded,0)::float/g.cohort_users END    AS retention_rate_bounded,
+  CASE WHEN g.cohort_users>0 THEN COALESCE(u.retained_users_unbounded,0)::float/g.cohort_users END AS retention_rate_unbounded,
+  CASE WHEN g.user_lifetime_week=0 THEN g.cohort_users ELSE 0 END         AS cohort_users_w0
+FROM grid g
+LEFT JOIN bounded_counts   b ON b.cohort_week_start=g.cohort_week_start AND b.user_lifetime_week=g.user_lifetime_week
+LEFT JOIN unbounded_counts u ON u.cohort_week_start=g.cohort_week_start AND u.user_lifetime_week=g.user_lifetime_week
+ORDER BY g.cohort_week_start, g.user_lifetime_week;
--- a/autogpt_platform/analytics/queries/retention_login_weekly.sql
+++ b/autogpt_platform/analytics/queries/retention_login_weekly.sql
@@ -0,0 +1,103 @@
+-- =============================================================
+-- View: analytics.retention_login_weekly
+-- Looker source alias: ds83  |  Charts: 2
+-- =============================================================
+-- DESCRIPTION
+--   Weekly cohort retention based on login sessions.
+--   Users are grouped by the ISO week of their first ever login.
+--   For each cohort × lifetime-week combination, outputs both:
+--     - bounded rate: % active in exactly that week
+--     - unbounded rate: % who were ever active on or after that week
+--   Weeks are capped to the cohort's actual age (no future data points).
+--
+-- SOURCE TABLES
+--   auth.sessions  — Login session records
+--
+-- HOW TO READ THE OUTPUT
+--   cohort_week_start   The Monday of the week users first logged in
+--   user_lifetime_week  0 = signup week, 1 = one week later, etc.
+--   retention_rate_bounded   = active_users_bounded / cohort_users
+--   retention_rate_unbounded = retained_users_unbounded / cohort_users
+--
+-- OUTPUT COLUMNS
+--   cohort_week_start         DATE     First day of the cohort's signup week
+--   cohort_label              TEXT     ISO week label (e.g. '2025-W01')
+--   cohort_label_n            TEXT     ISO week label with cohort size (e.g. '2025-W01 (n=42)')
+--   user_lifetime_week        INT      Weeks since first login (0 = signup week)
+--   cohort_users              BIGINT   Total users in this cohort (denominator)
+--   active_users_bounded      BIGINT   Users active in exactly week k
+--   retained_users_unbounded  BIGINT   Users active any time on/after week k
+--   retention_rate_bounded    FLOAT    bounded active / cohort_users
+--   retention_rate_unbounded  FLOAT    unbounded retained / cohort_users
+--   cohort_users_w0           BIGINT   cohort_users only at week 0, else 0 (safe to SUM in pivot tables)
+--
+-- EXAMPLE QUERIES
+--   -- Week-1 retention rate per cohort
+--   SELECT cohort_label, retention_rate_bounded AS w1_retention
+--   FROM analytics.retention_login_weekly
+--   WHERE user_lifetime_week = 1
+--   ORDER BY cohort_week_start;
+--
+--   -- Overall average retention curve (all cohorts combined)
+--   SELECT user_lifetime_week,
+--          SUM(active_users_bounded)::float / NULLIF(SUM(cohort_users_w0), 0) AS avg_retention
+--   FROM analytics.retention_login_weekly
+--   GROUP BY 1 ORDER BY 1;
+-- =============================================================
+
+WITH params AS (SELECT 12::int AS max_weeks),
+events AS (
+  SELECT s.user_id::text AS user_id, s.created_at::timestamptz AS created_at,
+         DATE_TRUNC('week', s.created_at)::date AS week_start
+  FROM auth.sessions s WHERE s.user_id IS NOT NULL
+),
+first_login AS (
+  SELECT user_id, MIN(created_at) AS first_login_time,
+         DATE_TRUNC('week', MIN(created_at))::date AS cohort_week_start
+  FROM events GROUP BY 1
+),
+activity_weeks AS (SELECT DISTINCT user_id, week_start FROM events),
+user_week_age AS (
+  SELECT aw.user_id, fl.cohort_week_start,
+         ((aw.week_start - DATE_TRUNC('week', fl.first_login_time)::date) / 7)::int AS user_lifetime_week
+  FROM activity_weeks aw JOIN first_login fl USING (user_id)
+  WHERE aw.week_start >= DATE_TRUNC('week', fl.first_login_time)::date
+),
+bounded_counts AS (
+  SELECT cohort_week_start, user_lifetime_week, COUNT(DISTINCT user_id) AS active_users_bounded
+  FROM user_week_age WHERE user_lifetime_week >= 0 GROUP BY 1,2
+),
+last_active AS (
+  SELECT cohort_week_start, user_id, MAX(user_lifetime_week) AS last_active_week FROM user_week_age GROUP BY 1,2
+),
+unbounded_counts AS (
+  SELECT la.cohort_week_start, gs AS user_lifetime_week, COUNT(*) AS retained_users_unbounded
+  FROM last_active la
+  CROSS JOIN LATERAL generate_series(0, LEAST(la.last_active_week,(SELECT max_weeks FROM params))) gs
+  GROUP BY 1,2
+),
+cohort_sizes AS (SELECT cohort_week_start, COUNT(DISTINCT user_id) AS cohort_users FROM first_login GROUP BY 1),
+cohort_caps AS (
+  SELECT cs.cohort_week_start, cs.cohort_users,
+         LEAST((SELECT max_weeks FROM params),
+               GREATEST(0,((DATE_TRUNC('week',CURRENT_DATE)::date - cs.cohort_week_start)/7)::int)) AS cap_weeks
+  FROM cohort_sizes cs
+),
+grid AS (
+  SELECT cc.cohort_week_start, gs AS user_lifetime_week, cc.cohort_users
+  FROM cohort_caps cc CROSS JOIN LATERAL generate_series(0, cc.cap_weeks) gs
+)
+SELECT
+  g.cohort_week_start,
+  TO_CHAR(g.cohort_week_start,'IYYY-"W"IW')                                    AS cohort_label,
+  TO_CHAR(g.cohort_week_start,'IYYY-"W"IW')||' (n='||g.cohort_users||')'       AS cohort_label_n,
+  g.user_lifetime_week, g.cohort_users,
+  COALESCE(b.active_users_bounded,0)     AS active_users_bounded,
+  COALESCE(u.retained_users_unbounded,0) AS retained_users_unbounded,
+  CASE WHEN g.cohort_users>0 THEN COALESCE(b.active_users_bounded,0)::float/g.cohort_users END    AS retention_rate_bounded,
+  CASE WHEN g.cohort_users>0 THEN COALESCE(u.retained_users_unbounded,0)::float/g.cohort_users END AS retention_rate_unbounded,
+  CASE WHEN g.user_lifetime_week=0 THEN g.cohort_users ELSE 0 END               AS cohort_users_w0
+FROM grid g
+LEFT JOIN bounded_counts   b ON b.cohort_week_start=g.cohort_week_start AND b.user_lifetime_week=g.user_lifetime_week
+LEFT JOIN unbounded_counts u ON u.cohort_week_start=g.cohort_week_start AND u.user_lifetime_week=g.user_lifetime_week
+ORDER BY g.cohort_week_start, g.user_lifetime_week
--- a/autogpt_platform/analytics/queries/user_block_spending.sql
+++ b/autogpt_platform/analytics/queries/user_block_spending.sql
@@ -0,0 +1,71 @@
+-- =============================================================
+-- View: analytics.user_block_spending
+-- Looker source alias: ds6  |  Charts: 5
+-- =============================================================
+-- DESCRIPTION
+--   One row per credit transaction (last 90 days).
+--   Shows how users spend credits broken down by block type,
+--   LLM provider and model.  Joins node execution stats for
+--   token-level detail.
+--
+-- SOURCE TABLES
+--   platform.CreditTransaction   — Credit debit/credit records
+--   platform.AgentNodeExecution  — Node execution stats (for token counts)
+--
+-- OUTPUT COLUMNS
+--   transactionKey        TEXT         Unique transaction identifier
+--   userId                TEXT         User who was charged
+--   amount                DECIMAL      Credit amount (positive = credit, negative = debit)
+--   negativeAmount        DECIMAL      amount * -1 (convenience for spend charts)
+--   transactionType       TEXT         Transaction type (e.g. 'USAGE', 'REFUND', 'TOP_UP')
+--   transactionTime       TIMESTAMPTZ  When the transaction was recorded
+--   blockId               TEXT         Block UUID that triggered the spend
+--   blockName             TEXT         Human-readable block name
+--   llm_provider          TEXT         LLM provider (e.g. 'openai', 'anthropic')
+--   llm_model             TEXT         Model name (e.g. 'gpt-4o', 'claude-3-5-sonnet')
+--   node_exec_id          TEXT         Linked node execution UUID
+--   llm_call_count        INT          LLM API calls made in that execution
+--   llm_retry_count       INT          LLM retries in that execution
+--   llm_input_token_count INT          Input tokens consumed
+--   llm_output_token_count INT         Output tokens produced
+--
+-- WINDOW
+--   Rolling 90 days (createdAt > CURRENT_DATE - 90 days)
+--
+-- EXAMPLE QUERIES
+--   -- Total spend per user (last 90 days)
+--   SELECT "userId", SUM("negativeAmount") AS total_spent
+--   FROM analytics.user_block_spending
+--   WHERE "transactionType" = 'USAGE'
+--   GROUP BY 1 ORDER BY total_spent DESC;
+--
+--   -- Spend by LLM provider + model
+--   SELECT "llm_provider", "llm_model",
+--          SUM("negativeAmount") AS total_cost,
+--          SUM("llm_input_token_count") AS input_tokens,
+--          SUM("llm_output_token_count") AS output_tokens
+--   FROM analytics.user_block_spending
+--   WHERE "llm_provider" IS NOT NULL
+--   GROUP BY 1, 2 ORDER BY total_cost DESC;
+-- =============================================================
+
+SELECT
+    c."transactionKey"                                        AS transactionKey,
+    c."userId"                                                AS userId,
+    c."amount"                                                AS amount,
+    c."amount" * -1                                           AS negativeAmount,
+    c."type"                                                  AS transactionType,
+    c."createdAt"                                             AS transactionTime,
+    c.metadata->>'block_id'                                   AS blockId,
+    c.metadata->>'block'                                      AS blockName,
+    c.metadata->'input'->'credentials'->>'provider'           AS llm_provider,
+    c.metadata->'input'->>'model'                             AS llm_model,
+    c.metadata->>'node_exec_id'                               AS node_exec_id,
+    (ne."stats"->>'llm_call_count')::int                       AS llm_call_count,
+    (ne."stats"->>'llm_retry_count')::int                      AS llm_retry_count,
+    (ne."stats"->>'input_token_count')::int                    AS llm_input_token_count,
+    (ne."stats"->>'output_token_count')::int                   AS llm_output_token_count
+FROM platform."CreditTransaction" c
+LEFT JOIN platform."AgentNodeExecution" ne
+       ON (c.metadata->>'node_exec_id') = ne."id"::text
+WHERE c."createdAt" > CURRENT_DATE - INTERVAL '90 days'
--- a/autogpt_platform/analytics/queries/user_onboarding.sql
+++ b/autogpt_platform/analytics/queries/user_onboarding.sql
@@ -0,0 +1,45 @@
+-- =============================================================
+-- View: analytics.user_onboarding
+-- Looker source alias: ds68  |  Charts: 3
+-- =============================================================
+-- DESCRIPTION
+--   One row per user onboarding record.  Contains the user's
+--   stated usage reason, selected integrations, completed
+--   onboarding steps and optional first agent selection.
+--   Full history (no date filter) since onboarding happens
+--   once per user.
+--
+-- SOURCE TABLES
+--   platform.UserOnboarding  — Onboarding state per user
+--
+-- OUTPUT COLUMNS
+--   id                            TEXT         Onboarding record UUID
+--   createdAt                     TIMESTAMPTZ  When onboarding started
+--   updatedAt                     TIMESTAMPTZ  Last update to onboarding state
+--   usageReason                   TEXT         Why user signed up (e.g. 'work', 'personal')
+--   integrations                  TEXT[]       Array of integration names the user selected
+--   userId                        TEXT         User UUID
+--   completedSteps                TEXT[]       Array of onboarding step enums completed
+--   selectedStoreListingVersionId TEXT         First marketplace agent the user chose (if any)
+--
+-- EXAMPLE QUERIES
+--   -- Usage reason breakdown
+--   SELECT "usageReason", COUNT(*) FROM analytics.user_onboarding GROUP BY 1;
+--
+--   -- Completion rate per step
+--   SELECT step, COUNT(*) AS users_completed
+--   FROM analytics.user_onboarding
+--   CROSS JOIN LATERAL UNNEST("completedSteps") AS step
+--   GROUP BY 1 ORDER BY users_completed DESC;
+-- =============================================================
+
+SELECT
+    id,
+    "createdAt",
+    "updatedAt",
+    "usageReason",
+    integrations,
+    "userId",
+    "completedSteps",
+    "selectedStoreListingVersionId"
+FROM platform."UserOnboarding"
--- a/autogpt_platform/analytics/queries/user_onboarding_funnel.sql
+++ b/autogpt_platform/analytics/queries/user_onboarding_funnel.sql
@@ -0,0 +1,100 @@
+-- =============================================================
+-- View: analytics.user_onboarding_funnel
+-- Looker source alias: ds74  |  Charts: 1
+-- =============================================================
+-- DESCRIPTION
+--   Pre-aggregated onboarding funnel showing how many users
+--   completed each step and the drop-off percentage from the
+--   previous step.  One row per onboarding step (all 22 steps
+--   always present, even with 0 completions — prevents sparse
+--   gaps from making LAG compare the wrong predecessors).
+--
+-- SOURCE TABLES
+--   platform.UserOnboarding  — Onboarding records with completedSteps array
+--
+-- OUTPUT COLUMNS
+--   step             TEXT     Onboarding step enum name (e.g. 'WELCOME', 'CONGRATS')
+--   step_order       INT      Numeric position in the funnel (1=first, 22=last)
+--   users_completed  BIGINT   Distinct users who completed this step
+--   pct_from_prev    NUMERIC  % of users from the previous step who reached this one
+--
+-- STEP ORDER
+--   1  WELCOME               9  MARKETPLACE_VISIT     17  SCHEDULE_AGENT
+--   2  USAGE_REASON         10  MARKETPLACE_ADD_AGENT  18  RUN_AGENTS
+--   3  INTEGRATIONS         11  MARKETPLACE_RUN_AGENT  19  RUN_3_DAYS
+--   4  AGENT_CHOICE         12  BUILDER_OPEN           20  TRIGGER_WEBHOOK
+--   5  AGENT_NEW_RUN        13  BUILDER_SAVE_AGENT     21  RUN_14_DAYS
+--   6  AGENT_INPUT          14  BUILDER_RUN_AGENT      22  RUN_AGENTS_100
+--   7  CONGRATS             15  VISIT_COPILOT
+--   8  GET_RESULTS          16  RE_RUN_AGENT
+--
+-- WINDOW
+--   Users who started onboarding in the last 90 days
+--
+-- EXAMPLE QUERIES
+--   -- Full funnel
+--   SELECT * FROM analytics.user_onboarding_funnel ORDER BY step_order;
+--
+--   -- Biggest drop-off point
+--   SELECT step, pct_from_prev FROM analytics.user_onboarding_funnel
+--   ORDER BY pct_from_prev ASC LIMIT 3;
+-- =============================================================
+
+WITH all_steps AS (
+  -- Complete ordered grid of all 22 steps so zero-completion steps
+  -- are always present, keeping LAG comparisons correct.
+  SELECT step_name, step_order
+  FROM (VALUES
+    ('WELCOME',               1),
+    ('USAGE_REASON',          2),
+    ('INTEGRATIONS',          3),
+    ('AGENT_CHOICE',          4),
+    ('AGENT_NEW_RUN',         5),
+    ('AGENT_INPUT',           6),
+    ('CONGRATS',              7),
+    ('GET_RESULTS',           8),
+    ('MARKETPLACE_VISIT',     9),
+    ('MARKETPLACE_ADD_AGENT', 10),
+    ('MARKETPLACE_RUN_AGENT', 11),
+    ('BUILDER_OPEN',          12),
+    ('BUILDER_SAVE_AGENT',    13),
+    ('BUILDER_RUN_AGENT',     14),
+    ('VISIT_COPILOT',         15),
+    ('RE_RUN_AGENT',          16),
+    ('SCHEDULE_AGENT',        17),
+    ('RUN_AGENTS',            18),
+    ('RUN_3_DAYS',            19),
+    ('TRIGGER_WEBHOOK',       20),
+    ('RUN_14_DAYS',           21),
+    ('RUN_AGENTS_100',        22)
+  ) AS t(step_name, step_order)
+),
+raw AS (
+  SELECT
+      u."userId",
+      step_txt::text AS step
+  FROM platform."UserOnboarding" u
+  CROSS JOIN LATERAL UNNEST(u."completedSteps") AS step_txt
+  WHERE u."createdAt" >= CURRENT_DATE - INTERVAL '90 days'
+),
+step_counts AS (
+  SELECT step, COUNT(DISTINCT "userId") AS users_completed
+  FROM raw GROUP BY step
+),
+funnel AS (
+  SELECT
+      a.step_name                          AS step,
+      a.step_order,
+      COALESCE(sc.users_completed, 0)      AS users_completed,
+      ROUND(
+        100.0 * COALESCE(sc.users_completed, 0)
+        / NULLIF(
+            LAG(COALESCE(sc.users_completed, 0)) OVER (ORDER BY a.step_order),
+            0
+          ),
+        2
+      )                                    AS pct_from_prev
+  FROM all_steps a
+  LEFT JOIN step_counts sc ON sc.step = a.step_name
+)
+SELECT * FROM funnel ORDER BY step_order
--- a/autogpt_platform/analytics/queries/user_onboarding_integration.sql
+++ b/autogpt_platform/analytics/queries/user_onboarding_integration.sql
@@ -0,0 +1,41 @@
+-- =============================================================
+-- View: analytics.user_onboarding_integration
+-- Looker source alias: ds75  |  Charts: 1
+-- =============================================================
+-- DESCRIPTION
+--   Pre-aggregated count of users who selected each integration
+--   during onboarding.  One row per integration type, sorted
+--   by popularity.
+--
+-- SOURCE TABLES
+--   platform.UserOnboarding  — integrations array column
+--
+-- OUTPUT COLUMNS
+--   integration            TEXT    Integration name (e.g. 'github', 'slack', 'notion')
+--   users_with_integration BIGINT  Distinct users who selected this integration
+--
+-- WINDOW
+--   Users who started onboarding in the last 90 days
+--
+-- EXAMPLE QUERIES
+--   -- Full integration popularity ranking
+--   SELECT * FROM analytics.user_onboarding_integration;
+--
+--   -- Top 5 integrations
+--   SELECT * FROM analytics.user_onboarding_integration LIMIT 5;
+-- =============================================================
+
+WITH exploded AS (
+  SELECT
+      u."userId" AS user_id,
+      UNNEST(u."integrations") AS integration
+  FROM platform."UserOnboarding" u
+  WHERE u."createdAt" >= CURRENT_DATE - INTERVAL '90 days'
+)
+SELECT
+    integration,
+    COUNT(DISTINCT user_id) AS users_with_integration
+FROM exploded
+WHERE integration IS NOT NULL AND integration <> ''
+GROUP BY integration
+ORDER BY users_with_integration DESC
--- a/autogpt_platform/analytics/queries/users_activities.sql
+++ b/autogpt_platform/analytics/queries/users_activities.sql
@@ -0,0 +1,145 @@
+-- =============================================================
+-- View: analytics.users_activities
+-- Looker source alias: ds56  |  Charts: 5
+-- =============================================================
+-- DESCRIPTION
+--   One row per user with lifetime activity summary.
+--   Joins login sessions with agent graphs, executions and
+--   node-level runs to give a full picture of how engaged
+--   each user is.  Includes a convenience flag for 7-day
+--   activation (did the user return at least 7 days after
+--   their first login?).
+--
+-- SOURCE TABLES
+--   auth.sessions                    — Login/session records
+--   platform.AgentGraph              — Graphs (agents) built by the user
+--   platform.AgentGraphExecution     — Agent run history
+--   platform.AgentNodeExecution      — Individual block execution history
+--
+-- PERFORMANCE NOTE
+--   Each CTE aggregates its own table independently by userId.
+--   This avoids the fan-out that occurs when driving every join
+--   from user_logins across the two largest tables
+--   (AgentGraphExecution and AgentNodeExecution).
+--
+-- OUTPUT COLUMNS
+--   user_id                   TEXT         Supabase user UUID
+--   first_login_time          TIMESTAMPTZ  First ever session created_at
+--   last_login_time           TIMESTAMPTZ  Most recent session created_at
+--   last_visit_time           TIMESTAMPTZ  Max of last refresh or login
+--   last_agent_save_time      TIMESTAMPTZ  Last time user saved an agent graph
+--   agent_count               BIGINT       Number of distinct active graphs built (0 if none)
+--   first_agent_run_time      TIMESTAMPTZ  First ever graph execution
+--   last_agent_run_time       TIMESTAMPTZ  Most recent graph execution
+--   unique_agent_runs         BIGINT       Distinct agent graphs ever run (0 if none)
+--   agent_runs                BIGINT       Total graph execution count (0 if none)
+--   node_execution_count      BIGINT       Total node executions across all runs
+--   node_execution_failed     BIGINT       Node executions with FAILED status
+--   node_execution_completed  BIGINT       Node executions with COMPLETED status
+--   node_execution_terminated BIGINT       Node executions with TERMINATED status
+--   node_execution_queued     BIGINT       Node executions with QUEUED status
+--   node_execution_running    BIGINT       Node executions with RUNNING status
+--   is_active_after_7d        INT          1=returned after day 7, 0=did not, NULL=too early to tell
+--   node_execution_incomplete BIGINT       Node executions with INCOMPLETE status
+--   node_execution_review     BIGINT       Node executions with REVIEW status
+--
+-- EXAMPLE QUERIES
+--   -- Users who ran at least one agent and returned after 7 days
+--   SELECT COUNT(*) FROM analytics.users_activities
+--   WHERE agent_runs > 0 AND is_active_after_7d = 1;
+--
+--   -- Top 10 most active users by agent runs
+--   SELECT user_id, agent_runs, node_execution_count
+--   FROM analytics.users_activities
+--   ORDER BY agent_runs DESC LIMIT 10;
+--
+--   -- 7-day activation rate
+--   SELECT
+--     SUM(CASE WHEN is_active_after_7d = 1 THEN 1 ELSE 0 END)::float
+--     / NULLIF(COUNT(CASE WHEN is_active_after_7d IS NOT NULL THEN 1 END), 0)
+--     AS activation_rate
+--   FROM analytics.users_activities;
+-- =============================================================
+
+WITH user_logins AS (
+  SELECT
+    user_id::text                                    AS user_id,
+    MIN(created_at)                                  AS first_login_time,
+    MAX(created_at)                                  AS last_login_time,
+    GREATEST(
+      MAX(refreshed_at)::timestamptz,
+      MAX(created_at)::timestamptz
+    )                                                AS last_visit_time
+  FROM auth.sessions
+  GROUP BY user_id
+),
+user_agents AS (
+  -- Aggregate AgentGraph directly by userId (no fan-out from user_logins)
+  SELECT
+    "userId"::text                AS user_id,
+    MAX("updatedAt")              AS last_agent_save_time,
+    COUNT(DISTINCT "id")          AS agent_count
+  FROM platform."AgentGraph"
+  WHERE "isActive"
+  GROUP BY "userId"
+),
+user_graph_runs AS (
+  -- Aggregate AgentGraphExecution directly by userId
+  SELECT
+    "userId"::text                        AS user_id,
+    MIN("createdAt")                      AS first_agent_run_time,
+    MAX("createdAt")                      AS last_agent_run_time,
+    COUNT(DISTINCT "agentGraphId")        AS unique_agent_runs,
+    COUNT("id")                           AS agent_runs
+  FROM platform."AgentGraphExecution"
+  GROUP BY "userId"
+),
+user_node_runs AS (
+  -- Aggregate AgentNodeExecution directly; resolve userId via a
+  -- single join to AgentGraphExecution instead of fanning out from
+  -- user_logins through both large tables.
+  SELECT
+    g."userId"::text                                                   AS user_id,
+    COUNT(*)                                                           AS node_execution_count,
+    COUNT(*) FILTER (WHERE n."executionStatus" = 'FAILED')             AS node_execution_failed,
+    COUNT(*) FILTER (WHERE n."executionStatus" = 'COMPLETED')          AS node_execution_completed,
+    COUNT(*) FILTER (WHERE n."executionStatus" = 'TERMINATED')         AS node_execution_terminated,
+    COUNT(*) FILTER (WHERE n."executionStatus" = 'QUEUED')             AS node_execution_queued,
+    COUNT(*) FILTER (WHERE n."executionStatus" = 'RUNNING')            AS node_execution_running,
+    COUNT(*) FILTER (WHERE n."executionStatus" = 'INCOMPLETE')         AS node_execution_incomplete,
+    COUNT(*) FILTER (WHERE n."executionStatus" = 'REVIEW')             AS node_execution_review
+  FROM platform."AgentNodeExecution" n
+  JOIN platform."AgentGraphExecution" g
+    ON g."id" = n."agentGraphExecutionId"
+  GROUP BY g."userId"
+)
+SELECT
+  ul.user_id,
+  ul.first_login_time,
+  ul.last_login_time,
+  ul.last_visit_time,
+  ua.last_agent_save_time,
+  COALESCE(ua.agent_count, 0)             AS agent_count,
+  gr.first_agent_run_time,
+  gr.last_agent_run_time,
+  COALESCE(gr.unique_agent_runs, 0)       AS unique_agent_runs,
+  COALESCE(gr.agent_runs, 0)              AS agent_runs,
+  COALESCE(nr.node_execution_count, 0)      AS node_execution_count,
+  COALESCE(nr.node_execution_failed, 0)     AS node_execution_failed,
+  COALESCE(nr.node_execution_completed, 0)  AS node_execution_completed,
+  COALESCE(nr.node_execution_terminated, 0) AS node_execution_terminated,
+  COALESCE(nr.node_execution_queued, 0)     AS node_execution_queued,
+  COALESCE(nr.node_execution_running, 0)    AS node_execution_running,
+  CASE
+    WHEN ul.first_login_time < NOW() - INTERVAL '7 days'
+     AND ul.last_visit_time  >= ul.first_login_time + INTERVAL '7 days' THEN 1
+    WHEN ul.first_login_time < NOW() - INTERVAL '7 days'
+     AND ul.last_visit_time  <  ul.first_login_time + INTERVAL '7 days' THEN 0
+    ELSE NULL
+  END AS is_active_after_7d,
+  COALESCE(nr.node_execution_incomplete, 0) AS node_execution_incomplete,
+  COALESCE(nr.node_execution_review, 0)     AS node_execution_review
+FROM user_logins ul
+LEFT JOIN user_agents     ua ON ul.user_id = ua.user_id
+LEFT JOIN user_graph_runs gr ON ul.user_id = gr.user_id
+LEFT JOIN user_node_runs  nr ON ul.user_id = nr.user_id
--- a/autogpt_platform/backend/.env.default
+++ b/autogpt_platform/backend/.env.default
@@ -37,6 +37,10 @@ JWT_VERIFY_KEY=your-super-secret-jwt-token-with-at-least-32-characters-long
 ENCRYPTION_KEY=dvziYgz0KSK8FENhju0ZYi8-fRTfAdlz6YLhdB_jhNw=
 UNSUBSCRIBE_SECRET_KEY=HlP8ivStJjmbf6NKi78m_3FnOogut0t5ckzjsIqeaio=

+## ===== SIGNUP / INVITE GATE ===== ##
+# Set to true to require an invite before users can sign up
+ENABLE_INVITE_GATE=false
+
 ## ===== IMPORTANT OPTIONAL CONFIGURATION ===== ##
 # Platform URLs (set these for webhooks and OAuth to work)
 PLATFORM_BASE_URL=http://localhost:8000
--- a/autogpt_platform/backend/CLAUDE.md
+++ b/autogpt_platform/backend/CLAUDE.md
@@ -58,10 +58,31 @@ poetry run pytest path/to/test.py --snapshot-update
 - **Authentication**: JWT-based with Supabase integration
 - **Security**: Cache protection middleware prevents sensitive data caching in browsers/proxies

+## Code Style
+
+- **Top-level imports only** — no local/inner imports (lazy imports only for heavy optional deps like `openpyxl`)
+- **No duck typing** — no `hasattr`/`getattr`/`isinstance` for type dispatch; use typed interfaces/unions/protocols
+- **Pydantic models** over dataclass/namedtuple/dict for structured data
+- **No linter suppressors** — no `# type: ignore`, `# noqa`, `# pyright: ignore`; fix the type/code
+- **List comprehensions** over manual loop-and-append
+- **Early return** — guard clauses first, avoid deep nesting
+- **Lazy `%s` logging** — `logger.info("Processing %s items", count)` not `logger.info(f"Processing {count} items")`
+- **Sanitize error paths** — `os.path.basename()` in error messages to avoid leaking directory structure
+- **TOCTOU awareness** — avoid check-then-act patterns for file access and credit charging
+- **`Security()` vs `Depends()`** — use `Security()` for auth deps to get proper OpenAPI security spec
+- **Redis pipelines** — `transaction=True` for atomicity on multi-step operations
+- **`max(0, value)` guards** — for computed values that should never be negative
+- **SSE protocol** — `data:` lines for frontend-parsed events (must match Zod schema), `: comment` lines for heartbeats/status
+- **File length** — keep files under ~300 lines; if a file grows beyond this, split by responsibility (e.g. extract helpers, models, or a sub-module into a new file). Never keep appending to a long file.
+- **Function length** — keep functions under ~40 lines; extract named helpers when a function grows longer. Long functions are a sign of mixed concerns, not complexity.
+
 ## Testing Approach

 - Uses pytest with snapshot testing for API responses
 - Test files are colocated with source files (`*_test.py`)
+- Mock at boundaries — mock where the symbol is **used**, not where it's **defined**
+- After refactoring, update mock targets to match new module paths
+- Use `AsyncMock` for async functions (`from unittest.mock import AsyncMock`)

 ## Database Schema

--- a/autogpt_platform/backend/backend/api/features/admin/model.py
+++ b/autogpt_platform/backend/backend/api/features/admin/model.py
@@ -1,8 +1,19 @@
-from pydantic import BaseModel
+from __future__ import annotations

+from datetime import datetime
+from typing import TYPE_CHECKING, Any, Literal, Optional
+
+import prisma.enums
+from pydantic import BaseModel, EmailStr
+
+from backend.copilot.session_types import ChatSessionStartType
 from backend.data.model import UserTransaction
 from backend.util.models import Pagination

+if TYPE_CHECKING:
+    from backend.data.invited_user import BulkInvitedUsersResult, InvitedUserRecord
+    from backend.data.model import User
+

 class UserHistoryResponse(BaseModel):
    """Response model for listings with version history"""
@@ -14,3 +25,118 @@ class UserHistoryResponse(BaseModel):
 class AddUserCreditsResponse(BaseModel):
    new_balance: int
    transaction_key: str
+
+
+class CreateInvitedUserRequest(BaseModel):
+    email: EmailStr
+    name: Optional[str] = None
+
+
+class InvitedUserResponse(BaseModel):
+    id: str
+    email: str
+    status: prisma.enums.InvitedUserStatus
+    auth_user_id: Optional[str] = None
+    name: Optional[str] = None
+    tally_understanding: Optional[dict[str, Any]] = None
+    tally_status: prisma.enums.TallyComputationStatus
+    tally_computed_at: Optional[datetime] = None
+    tally_error: Optional[str] = None
+    created_at: datetime
+    updated_at: datetime
+
+    @classmethod
+    def from_record(cls, record: InvitedUserRecord) -> InvitedUserResponse:
+        return cls.model_validate(record.model_dump())
+
+
+class InvitedUsersResponse(BaseModel):
+    invited_users: list[InvitedUserResponse]
+    pagination: Pagination
+
+
+class BulkInvitedUserRowResponse(BaseModel):
+    row_number: int
+    email: Optional[str] = None
+    name: Optional[str] = None
+    status: Literal["CREATED", "SKIPPED", "ERROR"]
+    message: str
+    invited_user: Optional[InvitedUserResponse] = None
+
+
+class BulkInvitedUsersResponse(BaseModel):
+    created_count: int
+    skipped_count: int
+    error_count: int
+    results: list[BulkInvitedUserRowResponse]
+
+    @classmethod
+    def from_result(cls, result: BulkInvitedUsersResult) -> BulkInvitedUsersResponse:
+        return cls(
+            created_count=result.created_count,
+            skipped_count=result.skipped_count,
+            error_count=result.error_count,
+            results=[
+                BulkInvitedUserRowResponse(
+                    row_number=row.row_number,
+                    email=row.email,
+                    name=row.name,
+                    status=row.status,
+                    message=row.message,
+                    invited_user=(
+                        InvitedUserResponse.from_record(row.invited_user)
+                        if row.invited_user is not None
+                        else None
+                    ),
+                )
+                for row in result.results
+            ],
+        )
+
+
+class AdminCopilotUserSummary(BaseModel):
+    id: str
+    email: str
+    name: Optional[str] = None
+    timezone: str
+    created_at: datetime
+    updated_at: datetime
+
+    @classmethod
+    def from_user(cls, user: "User") -> "AdminCopilotUserSummary":
+        return cls(
+            id=user.id,
+            email=user.email,
+            name=user.name,
+            timezone=user.timezone,
+            created_at=user.created_at,
+            updated_at=user.updated_at,
+        )
+
+
+class AdminCopilotUsersResponse(BaseModel):
+    users: list[AdminCopilotUserSummary]
+
+
+class TriggerCopilotSessionRequest(BaseModel):
+    user_id: str
+    start_type: ChatSessionStartType
+
+
+class TriggerCopilotSessionResponse(BaseModel):
+    session_id: str
+    start_type: ChatSessionStartType
+
+
+class SendCopilotEmailsRequest(BaseModel):
+    user_id: str
+
+
+class SendCopilotEmailsResponse(BaseModel):
+    candidate_count: int
+    processed_count: int
+    sent_count: int
+    skipped_count: int
+    repair_queued_count: int
+    running_count: int
+    failed_count: int
--- a/autogpt_platform/backend/backend/api/features/admin/user_admin_routes.py
+++ b/autogpt_platform/backend/backend/api/features/admin/user_admin_routes.py
@@ -0,0 +1,240 @@
+import logging
+import math
+
+from autogpt_libs.auth import get_user_id, requires_admin_user
+from fastapi import APIRouter, File, HTTPException, Query, Security, UploadFile
+
+from backend.copilot.autopilot import (
+    send_pending_copilot_emails_for_user,
+    trigger_autopilot_session_for_user,
+)
+from backend.data.invited_user import (
+    bulk_create_invited_users_from_file,
+    create_invited_user,
+    list_invited_users,
+    retry_invited_user_tally,
+    revoke_invited_user,
+)
+from backend.data.tally import mask_email
+from backend.data.user import search_users
+from backend.util.models import Pagination
+
+from .model import (
+    AdminCopilotUsersResponse,
+    AdminCopilotUserSummary,
+    BulkInvitedUsersResponse,
+    CreateInvitedUserRequest,
+    InvitedUserResponse,
+    InvitedUsersResponse,
+    SendCopilotEmailsRequest,
+    SendCopilotEmailsResponse,
+    TriggerCopilotSessionRequest,
+    TriggerCopilotSessionResponse,
+)
+
+logger = logging.getLogger(__name__)
+
+
+router = APIRouter(
+    prefix="/admin",
+    tags=["users", "admin"],
+    dependencies=[Security(requires_admin_user)],
+)
+
+
+@router.get(
+    "/invited-users",
+    response_model=InvitedUsersResponse,
+    summary="List Invited Users",
+)
+async def get_invited_users(
+    admin_user_id: str = Security(get_user_id),
+    page: int = Query(1, ge=1),
+    page_size: int = Query(50, ge=1, le=200),
+) -> InvitedUsersResponse:
+    logger.info("Admin user %s requested invited users", admin_user_id)
+    invited_users, total = await list_invited_users(page=page, page_size=page_size)
+    return InvitedUsersResponse(
+        invited_users=[InvitedUserResponse.from_record(iu) for iu in invited_users],
+        pagination=Pagination(
+            total_items=total,
+            total_pages=max(1, math.ceil(total / page_size)),
+            current_page=page,
+            page_size=page_size,
+        ),
+    )
+
+
+@router.post(
+    "/invited-users",
+    response_model=InvitedUserResponse,
+    summary="Create Invited User",
+)
+async def create_invited_user_route(
+    request: CreateInvitedUserRequest,
+    admin_user_id: str = Security(get_user_id),
+) -> InvitedUserResponse:
+    logger.info(
+        "Admin user %s creating invited user for %s",
+        admin_user_id,
+        mask_email(request.email),
+    )
+    invited_user = await create_invited_user(request.email, request.name)
+    logger.info(
+        "Admin user %s created invited user %s",
+        admin_user_id,
+        invited_user.id,
+    )
+    return InvitedUserResponse.from_record(invited_user)
+
+
+@router.post(
+    "/invited-users/bulk",
+    response_model=BulkInvitedUsersResponse,
+    summary="Bulk Create Invited Users",
+    operation_id="postV2BulkCreateInvitedUsers",
+)
+async def bulk_create_invited_users_route(
+    file: UploadFile = File(...),
+    admin_user_id: str = Security(get_user_id),
+) -> BulkInvitedUsersResponse:
+    logger.info(
+        "Admin user %s bulk invited users from %s",
+        admin_user_id,
+        file.filename or "<unnamed>",
+    )
+    content = await file.read()
+    result = await bulk_create_invited_users_from_file(file.filename, content)
+    return BulkInvitedUsersResponse.from_result(result)
+
+
+@router.post(
+    "/invited-users/{invited_user_id}/revoke",
+    response_model=InvitedUserResponse,
+    summary="Revoke Invited User",
+)
+async def revoke_invited_user_route(
+    invited_user_id: str,
+    admin_user_id: str = Security(get_user_id),
+) -> InvitedUserResponse:
+    logger.info(
+        "Admin user %s revoking invited user %s", admin_user_id, invited_user_id
+    )
+    invited_user = await revoke_invited_user(invited_user_id)
+    logger.info("Admin user %s revoked invited user %s", admin_user_id, invited_user_id)
+    return InvitedUserResponse.from_record(invited_user)
+
+
+@router.post(
+    "/invited-users/{invited_user_id}/retry-tally",
+    response_model=InvitedUserResponse,
+    summary="Retry Invited User Tally",
+)
+async def retry_invited_user_tally_route(
+    invited_user_id: str,
+    admin_user_id: str = Security(get_user_id),
+) -> InvitedUserResponse:
+    logger.info(
+        "Admin user %s retrying Tally seed for invited user %s",
+        admin_user_id,
+        invited_user_id,
+    )
+    invited_user = await retry_invited_user_tally(invited_user_id)
+    logger.info(
+        "Admin user %s retried Tally seed for invited user %s",
+        admin_user_id,
+        invited_user_id,
+    )
+    return InvitedUserResponse.from_record(invited_user)
+
+
+@router.get(
+    "/copilot/users",
+    response_model=AdminCopilotUsersResponse,
+    summary="Search Copilot Users",
+    operation_id="getV2SearchCopilotUsers",
+)
+async def search_copilot_users_route(
+    search: str = Query("", description="Search by email, name, or user ID"),
+    limit: int = Query(20, ge=1, le=50),
+    admin_user_id: str = Security(get_user_id),
+) -> AdminCopilotUsersResponse:
+    logger.info(
+        "Admin user %s searched Copilot users (query_length=%s, limit=%s)",
+        admin_user_id,
+        len(search.strip()),
+        limit,
+    )
+    users = await search_users(search, limit=limit)
+    return AdminCopilotUsersResponse(
+        users=[AdminCopilotUserSummary.from_user(user) for user in users]
+    )
+
+
+@router.post(
+    "/copilot/trigger",
+    response_model=TriggerCopilotSessionResponse,
+    summary="Trigger Copilot Session",
+    operation_id="postV2TriggerCopilotSession",
+)
+async def trigger_copilot_session_route(
+    request: TriggerCopilotSessionRequest,
+    admin_user_id: str = Security(get_user_id),
+) -> TriggerCopilotSessionResponse:
+    logger.info(
+        "Admin user %s manually triggered %s for user %s",
+        admin_user_id,
+        request.start_type,
+        request.user_id,
+    )
+    try:
+        session = await trigger_autopilot_session_for_user(
+            request.user_id,
+            start_type=request.start_type,
+        )
+    except LookupError as exc:
+        raise HTTPException(status_code=404, detail=str(exc)) from exc
+    except ValueError as exc:
+        raise HTTPException(status_code=400, detail=str(exc)) from exc
+
+    logger.info(
+        "Admin user %s created manual Copilot session %s for user %s",
+        admin_user_id,
+        session.session_id,
+        request.user_id,
+    )
+    return TriggerCopilotSessionResponse(
+        session_id=session.session_id,
+        start_type=request.start_type,
+    )
+
+
+@router.post(
+    "/copilot/send-emails",
+    response_model=SendCopilotEmailsResponse,
+    summary="Send Pending Copilot Emails",
+    operation_id="postV2SendPendingCopilotEmails",
+)
+async def send_pending_copilot_emails_route(
+    request: SendCopilotEmailsRequest,
+    admin_user_id: str = Security(get_user_id),
+) -> SendCopilotEmailsResponse:
+    logger.info(
+        "Admin user %s manually triggered pending Copilot emails for user %s",
+        admin_user_id,
+        request.user_id,
+    )
+    result = await send_pending_copilot_emails_for_user(request.user_id)
+    logger.info(
+        "Admin user %s completed pending Copilot email sweep for user %s "
+        "(candidates=%s, sent=%s, skipped=%s, repairs=%s, running=%s, failed=%s)",
+        admin_user_id,
+        request.user_id,
+        result.candidate_count,
+        result.sent_count,
+        result.skipped_count,
+        result.repair_queued_count,
+        result.running_count,
+        result.failed_count,
+    )
+    return SendCopilotEmailsResponse(**result.model_dump())
--- a/autogpt_platform/backend/backend/api/features/admin/user_admin_routes_test.py
+++ b/autogpt_platform/backend/backend/api/features/admin/user_admin_routes_test.py
@@ -0,0 +1,290 @@
+from datetime import datetime, timezone
+from unittest.mock import AsyncMock
+
+import fastapi
+import fastapi.testclient
+import prisma.enums
+import pytest
+import pytest_mock
+from autogpt_libs.auth.jwt_utils import get_jwt_payload
+
+from backend.copilot.autopilot_email import PendingCopilotEmailSweepResult
+from backend.copilot.model import ChatSession
+from backend.copilot.session_types import ChatSessionStartType
+from backend.data.invited_user import (
+    BulkInvitedUserRowResult,
+    BulkInvitedUsersResult,
+    InvitedUserRecord,
+)
+from backend.data.model import User
+
+from .user_admin_routes import router as user_admin_router
+
+app = fastapi.FastAPI()
+app.include_router(user_admin_router)
+
+client = fastapi.testclient.TestClient(app)
+
+
+@pytest.fixture(autouse=True)
+def setup_app_admin_auth(mock_jwt_admin):
+    app.dependency_overrides[get_jwt_payload] = mock_jwt_admin["get_jwt_payload"]
+    yield
+    app.dependency_overrides.clear()
+
+
+def _sample_invited_user() -> InvitedUserRecord:
+    now = datetime.now(timezone.utc)
+    return InvitedUserRecord(
+        id="invite-1",
+        email="invited@example.com",
+        status=prisma.enums.InvitedUserStatus.INVITED,
+        auth_user_id=None,
+        name="Invited User",
+        tally_understanding=None,
+        tally_status=prisma.enums.TallyComputationStatus.PENDING,
+        tally_computed_at=None,
+        tally_error=None,
+        created_at=now,
+        updated_at=now,
+    )
+
+
+def _sample_bulk_invited_users_result() -> BulkInvitedUsersResult:
+    return BulkInvitedUsersResult(
+        created_count=1,
+        skipped_count=1,
+        error_count=0,
+        results=[
+            BulkInvitedUserRowResult(
+                row_number=1,
+                email="invited@example.com",
+                name=None,
+                status="CREATED",
+                message="Invite created",
+                invited_user=_sample_invited_user(),
+            ),
+            BulkInvitedUserRowResult(
+                row_number=2,
+                email="duplicate@example.com",
+                name=None,
+                status="SKIPPED",
+                message="An invited user with this email already exists",
+                invited_user=None,
+            ),
+        ],
+    )
+
+
+def _sample_user() -> User:
+    now = datetime.now(timezone.utc)
+    return User(
+        id="user-1",
+        email="copilot@example.com",
+        name="Copilot User",
+        timezone="Europe/Madrid",
+        created_at=now,
+        updated_at=now,
+        stripe_customer_id=None,
+        top_up_config=None,
+    )
+
+
+def test_get_invited_users(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    mocker.patch(
+        "backend.api.features.admin.user_admin_routes.list_invited_users",
+        AsyncMock(return_value=([_sample_invited_user()], 1)),
+    )
+
+    response = client.get("/admin/invited-users")
+
+    assert response.status_code == 200
+    data = response.json()
+    assert len(data["invited_users"]) == 1
+    assert data["invited_users"][0]["email"] == "invited@example.com"
+    assert data["invited_users"][0]["status"] == "INVITED"
+    assert data["pagination"]["total_items"] == 1
+    assert data["pagination"]["current_page"] == 1
+    assert data["pagination"]["page_size"] == 50
+
+
+def test_create_invited_user(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    mocker.patch(
+        "backend.api.features.admin.user_admin_routes.create_invited_user",
+        AsyncMock(return_value=_sample_invited_user()),
+    )
+
+    response = client.post(
+        "/admin/invited-users",
+        json={"email": "invited@example.com", "name": "Invited User"},
+    )
+
+    assert response.status_code == 200
+    data = response.json()
+    assert data["email"] == "invited@example.com"
+    assert data["name"] == "Invited User"
+
+
+def test_bulk_create_invited_users(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    mocker.patch(
+        "backend.api.features.admin.user_admin_routes.bulk_create_invited_users_from_file",
+        AsyncMock(return_value=_sample_bulk_invited_users_result()),
+    )
+
+    response = client.post(
+        "/admin/invited-users/bulk",
+        files={
+            "file": ("invites.txt", b"invited@example.com\nduplicate@example.com\n")
+        },
+    )
+
+    assert response.status_code == 200
+    data = response.json()
+    assert data["created_count"] == 1
+    assert data["skipped_count"] == 1
+    assert data["results"][0]["status"] == "CREATED"
+    assert data["results"][1]["status"] == "SKIPPED"
+
+
+def test_revoke_invited_user(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    revoked = _sample_invited_user().model_copy(
+        update={"status": prisma.enums.InvitedUserStatus.REVOKED}
+    )
+    mocker.patch(
+        "backend.api.features.admin.user_admin_routes.revoke_invited_user",
+        AsyncMock(return_value=revoked),
+    )
+
+    response = client.post("/admin/invited-users/invite-1/revoke")
+
+    assert response.status_code == 200
+    assert response.json()["status"] == "REVOKED"
+
+
+def test_retry_invited_user_tally(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    retried = _sample_invited_user().model_copy(
+        update={"tally_status": prisma.enums.TallyComputationStatus.RUNNING}
+    )
+    mocker.patch(
+        "backend.api.features.admin.user_admin_routes.retry_invited_user_tally",
+        AsyncMock(return_value=retried),
+    )
+
+    response = client.post("/admin/invited-users/invite-1/retry-tally")
+
+    assert response.status_code == 200
+    assert response.json()["tally_status"] == "RUNNING"
+
+
+def test_search_copilot_users(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    mocker.patch(
+        "backend.api.features.admin.user_admin_routes.search_users",
+        AsyncMock(return_value=[_sample_user()]),
+    )
+
+    response = client.get("/admin/copilot/users", params={"search": "copilot"})
+
+    assert response.status_code == 200
+    data = response.json()
+    assert len(data["users"]) == 1
+    assert data["users"][0]["email"] == "copilot@example.com"
+    assert data["users"][0]["timezone"] == "Europe/Madrid"
+
+
+def test_trigger_copilot_session(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    session = ChatSession.new(
+        "user-1",
+        start_type=ChatSessionStartType.AUTOPILOT_CALLBACK,
+    )
+    trigger = mocker.patch(
+        "backend.api.features.admin.user_admin_routes.trigger_autopilot_session_for_user",
+        AsyncMock(return_value=session),
+    )
+
+    response = client.post(
+        "/admin/copilot/trigger",
+        json={
+            "user_id": "user-1",
+            "start_type": ChatSessionStartType.AUTOPILOT_CALLBACK.value,
+        },
+    )
+
+    assert response.status_code == 200
+    assert response.json()["session_id"] == session.session_id
+    assert response.json()["start_type"] == "AUTOPILOT_CALLBACK"
+    assert trigger.await_args is not None
+    assert trigger.await_args.args[0] == "user-1"
+    assert (
+        trigger.await_args.kwargs["start_type"]
+        == ChatSessionStartType.AUTOPILOT_CALLBACK
+    )
+
+
+def test_trigger_copilot_session_returns_not_found(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    mocker.patch(
+        "backend.api.features.admin.user_admin_routes.trigger_autopilot_session_for_user",
+        AsyncMock(side_effect=LookupError("User not found with ID: missing-user")),
+    )
+
+    response = client.post(
+        "/admin/copilot/trigger",
+        json={
+            "user_id": "missing-user",
+            "start_type": ChatSessionStartType.AUTOPILOT_NIGHTLY.value,
+        },
+    )
+
+    assert response.status_code == 404
+    assert response.json()["detail"] == "User not found with ID: missing-user"
+
+
+def test_send_pending_copilot_emails(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    send_emails = mocker.patch(
+        "backend.api.features.admin.user_admin_routes.send_pending_copilot_emails_for_user",
+        AsyncMock(
+            return_value=PendingCopilotEmailSweepResult(
+                candidate_count=1,
+                processed_count=1,
+                sent_count=1,
+                skipped_count=0,
+                repair_queued_count=0,
+                running_count=0,
+                failed_count=0,
+            )
+        ),
+    )
+
+    response = client.post(
+        "/admin/copilot/send-emails",
+        json={"user_id": "user-1"},
+    )
+
+    assert response.status_code == 200
+    assert response.json() == {
+        "candidate_count": 1,
+        "processed_count": 1,
+        "sent_count": 1,
+        "skipped_count": 0,
+        "repair_queued_count": 0,
+        "running_count": 0,
+        "failed_count": 0,
+    }
+    send_emails.assert_awaited_once_with("user-1")
--- a/autogpt_platform/backend/backend/api/features/chat/routes.py
+++ b/autogpt_platform/backend/backend/api/features/chat/routes.py
@@ -3,18 +3,23 @@
 import asyncio
 import logging
 import re
+import time
 from collections.abc import AsyncGenerator
-from typing import Annotated
+from typing import Annotated, Any, NoReturn
 from uuid import uuid4

 from autogpt_libs import auth
 from fastapi import APIRouter, Depends, HTTPException, Query, Response, Security
 from fastapi.responses import StreamingResponse
-from prisma.models import UserWorkspaceFile
 from pydantic import BaseModel, Field, field_validator

 from backend.copilot import service as chat_service
 from backend.copilot import stream_registry
+from backend.copilot.autopilot import (
+    consume_callback_token,
+    strip_internal_content,
+    unwrap_internal_content,
+)
 from backend.copilot.config import ChatConfig
 from backend.copilot.executor.utils import enqueue_cancel_task, enqueue_copilot_turn
 from backend.copilot.model import (
@@ -27,7 +32,13 @@ from backend.copilot.model import (
    get_user_sessions,
    update_session_title,
 )
-from backend.copilot.response_model import StreamError, StreamFinish, StreamHeartbeat
+from backend.copilot.response_model import (
+    StreamBaseResponse,
+    StreamError,
+    StreamFinish,
+    StreamHeartbeat,
+)
+from backend.copilot.session_types import ChatSessionStartType
 from backend.copilot.tools.e2b_sandbox import kill_sandbox
 from backend.copilot.tools.models import (
    AgentDetailsResponse,
@@ -53,7 +64,9 @@ from backend.copilot.tools.models import (
    UnderstandingUpdatedResponse,
 )
 from backend.copilot.tracking import track_user_message
+from backend.data.db_accessors import workspace_db
 from backend.data.redis_client import get_redis_async
+from backend.data.understanding import get_business_understanding
 from backend.data.workspace import get_or_create_workspace
 from backend.util.exceptions import NotFoundError

@@ -64,6 +77,187 @@ _UUID_RE = re.compile(
 )

 logger = logging.getLogger(__name__)
+STREAM_QUEUE_GET_TIMEOUT_SECONDS = 10.0
+STREAMING_RESPONSE_HEADERS = {
+    "Cache-Control": "no-cache",
+    "Connection": "keep-alive",
+    "X-Accel-Buffering": "no",
+    "x-vercel-ai-ui-message-stream": "v1",
+}
+
+
+def _build_streaming_response(
+    generator: AsyncGenerator[str, None],
+) -> StreamingResponse:
+    return StreamingResponse(
+        generator,
+        media_type="text/event-stream",
+        headers=STREAMING_RESPONSE_HEADERS,
+    )
+
+
+async def _unsubscribe_stream_queue(
+    session_id: str,
+    subscriber_queue: asyncio.Queue[StreamBaseResponse] | None,
+) -> None:
+    if subscriber_queue is None:
+        return
+
+    try:
+        await stream_registry.unsubscribe_from_session(session_id, subscriber_queue)
+    except Exception as unsub_err:
+        logger.error(
+            f"Error unsubscribing from session {session_id}: {unsub_err}",
+            exc_info=True,
+        )
+
+
+async def _stream_subscriber_queue(
+    *,
+    session_id: str,
+    subscriber_queue: asyncio.Queue[StreamBaseResponse],
+    log_meta: dict[str, Any],
+    started_at: float,
+    label: str,
+    surface_errors: bool,
+) -> AsyncGenerator[str, None]:
+    chunk_count = 0
+    first_chunk_type: str | None = None
+
+    try:
+        while True:
+            try:
+                chunk = await asyncio.wait_for(
+                    subscriber_queue.get(),
+                    timeout=STREAM_QUEUE_GET_TIMEOUT_SECONDS,
+                )
+            except asyncio.TimeoutError:
+                yield StreamHeartbeat().to_sse()
+                continue
+
+            chunk_count += 1
+            if first_chunk_type is None:
+                first_chunk_type = type(chunk).__name__
+                elapsed = (time.perf_counter() - started_at) * 1000
+                logger.info(
+                    f"[TIMING] {label} first chunk at {elapsed:.1f}ms, type={first_chunk_type}",
+                    extra={
+                        "json_fields": {
+                            **log_meta,
+                            "chunk_type": first_chunk_type,
+                            "elapsed_ms": elapsed,
+                        }
+                    },
+                )
+
+            yield chunk.to_sse()
+
+            if isinstance(chunk, StreamFinish):
+                total_time = (time.perf_counter() - started_at) * 1000
+                logger.info(
+                    f"[TIMING] {label} received StreamFinish in {total_time:.1f}ms",
+                    extra={
+                        "json_fields": {
+                            **log_meta,
+                            "chunks_yielded": chunk_count,
+                            "total_time_ms": total_time,
+                        }
+                    },
+                )
+                break
+    except GeneratorExit:
+        logger.info(
+            f"[TIMING] {label} client disconnected after {chunk_count} chunks",
+            extra={
+                "json_fields": {
+                    **log_meta,
+                    "chunks_yielded": chunk_count,
+                    "reason": "client_disconnect",
+                }
+            },
+        )
+    except Exception as exc:
+        elapsed = (time.perf_counter() - started_at) * 1000
+        logger.error(
+            f"[TIMING] {label} error after {elapsed:.1f}ms: {exc}",
+            extra={
+                "json_fields": {**log_meta, "elapsed_ms": elapsed, "error": str(exc)}
+            },
+        )
+        if surface_errors:
+            yield StreamError(
+                errorText="An error occurred. Please try again.",
+                code="stream_error",
+            ).to_sse()
+            yield StreamFinish().to_sse()
+    finally:
+        total_time = (time.perf_counter() - started_at) * 1000
+        logger.info(
+            f"[TIMING] {label} finished in {total_time:.1f}ms",
+            extra={
+                "json_fields": {
+                    **log_meta,
+                    "total_time_ms": total_time,
+                    "chunks_yielded": chunk_count,
+                    "first_chunk_type": first_chunk_type,
+                }
+            },
+        )
+        yield "data: [DONE]\n\n"
+
+
+async def _stream_chat_events(
+    *,
+    session_id: str,
+    user_id: str | None,
+    subscribe_from_id: str,
+    turn_id: str,
+    log_meta: dict[str, Any],
+) -> AsyncGenerator[str, None]:
+    started_at = time.perf_counter()
+    subscriber_queue = await stream_registry.subscribe_to_session(
+        session_id=session_id,
+        user_id=user_id,
+        last_message_id=subscribe_from_id,
+    )
+
+    try:
+        if subscriber_queue is None:
+            yield StreamFinish().to_sse()
+            yield "data: [DONE]\n\n"
+            return
+
+        async for chunk in _stream_subscriber_queue(
+            session_id=session_id,
+            subscriber_queue=subscriber_queue,
+            log_meta=log_meta,
+            started_at=started_at,
+            label=f"stream_chat_post[{turn_id}]",
+            surface_errors=True,
+        ):
+            yield chunk
+    finally:
+        await _unsubscribe_stream_queue(session_id, subscriber_queue)
+
+
+async def _resume_stream_events(
+    *,
+    session_id: str,
+    subscriber_queue: asyncio.Queue[StreamBaseResponse],
+) -> AsyncGenerator[str, None]:
+    started_at = time.perf_counter()
+    try:
+        async for chunk in _stream_subscriber_queue(
+            session_id=session_id,
+            subscriber_queue=subscriber_queue,
+            log_meta={"session_id": session_id},
+            started_at=started_at,
+            label=f"resume_stream[{session_id}]",
+            surface_errors=False,
+        ):
+            yield chunk
+    finally:
+        await _unsubscribe_stream_queue(session_id, subscriber_queue)


 async def _validate_and_get_session(
@@ -117,6 +311,8 @@ class SessionDetailResponse(BaseModel):
    created_at: str
    updated_at: str
    user_id: str | None
+    start_type: ChatSessionStartType
+    execution_tag: str | None = None
    messages: list[dict]
    active_stream: ActiveStreamInfo | None = None  # Present if stream is still active

@@ -128,6 +324,8 @@ class SessionSummaryResponse(BaseModel):
    created_at: str
    updated_at: str
    title: str | None = None
+    start_type: ChatSessionStartType
+    execution_tag: str | None = None
    is_processing: bool


@@ -159,6 +357,14 @@ class UpdateSessionTitleRequest(BaseModel):
        return stripped


+class ConsumeCallbackTokenRequest(BaseModel):
+    token: str
+
+
+class ConsumeCallbackTokenResponse(BaseModel):
+    session_id: str
+
+
 # ========== Routes ==========


@@ -170,6 +376,7 @@ async def list_sessions(
    user_id: Annotated[str, Security(auth.get_user_id)],
    limit: int = Query(default=50, ge=1, le=100),
    offset: int = Query(default=0, ge=0),
+    with_auto: bool = Query(default=False),
 ) -> ListSessionsResponse:
    """
    List chat sessions for the authenticated user.
@@ -185,7 +392,12 @@ async def list_sessions(
    Returns:
        ListSessionsResponse: List of session summaries and total count.
    """
-    sessions, total_count = await get_user_sessions(user_id, limit, offset)
+    sessions, total_count = await get_user_sessions(
+        user_id,
+        limit,
+        offset,
+        with_auto=with_auto,
+    )

    # Batch-check Redis for active stream status on each session
    processing_set: set[str] = set()
@@ -216,6 +428,8 @@ async def list_sessions(
                created_at=session.started_at.isoformat(),
                updated_at=session.updated_at.isoformat(),
                title=session.title,
+                start_type=session.start_type,
+                execution_tag=session.execution_tag,
                is_processing=session.session_id in processing_set,
            )
            for session in sessions
@@ -367,12 +581,26 @@ async def get_session(
    if not session:
        raise NotFoundError(f"Session {session_id} not found.")

-    messages = [message.model_dump() for message in session.messages]
+    messages = []
+    for message in session.messages:
+        payload = message.model_dump()
+        if message.role == "user":
+            visible_content = strip_internal_content(message.content)
+            if (
+                visible_content is None
+                and session.start_type != ChatSessionStartType.MANUAL
+            ):
+                visible_content = unwrap_internal_content(message.content)
+            if visible_content is None:
+                continue
+            payload["content"] = visible_content
+        messages.append(payload)

    # Check if there's an active stream for this session
    active_stream_info = None
    active_session, last_message_id = await stream_registry.get_active_session(
-        session_id, user_id
+        session_id,
+        user_id,
    )
    logger.info(
        f"[GET_SESSION] session={session_id}, active_session={active_session is not None}, "
@@ -393,11 +621,28 @@ async def get_session(
        created_at=session.started_at.isoformat(),
        updated_at=session.updated_at.isoformat(),
        user_id=session.user_id or None,
+        start_type=session.start_type,
+        execution_tag=session.execution_tag,
        messages=messages,
        active_stream=active_stream_info,
    )


+@router.post(
+    "/sessions/callback-token/consume",
+    dependencies=[Security(auth.requires_user)],
+)
+async def consume_callback_token_route(
+    request: ConsumeCallbackTokenRequest,
+    user_id: Annotated[str, Security(auth.get_user_id)],
+) -> ConsumeCallbackTokenResponse:
+    try:
+        result = await consume_callback_token(request.token, user_id)
+    except ValueError as exc:
+        raise HTTPException(status_code=404, detail=str(exc)) from exc
+    return ConsumeCallbackTokenResponse(session_id=result.session_id)
+
+
@router.post(
    "/sessions/{session_id}/cancel",
    status_code=200,
@@ -471,9 +716,6 @@ async def stream_chat_post(
        StreamingResponse: SSE-formatted response chunks.

    """
-    import asyncio
-    import time
-
    stream_start_time = time.perf_counter()
    log_meta = {"component": "ChatStream", "session_id": session_id}
    if user_id:
@@ -505,18 +747,14 @@ async def stream_chat_post(

        if valid_ids:
            workspace = await get_or_create_workspace(user_id)
-            # Batch query instead of N+1
-            files = await UserWorkspaceFile.prisma().find_many(
-                where={
-                    "id": {"in": valid_ids},
-                    "workspaceId": workspace.id,
-                    "isDeleted": False,
-                }
+            files = await workspace_db().get_workspace_files_by_ids(
+                workspace_id=workspace.id,
+                file_ids=valid_ids,
            )
            # Only keep IDs that actually exist in the user's workspace
            sanitized_file_ids = [wf.id for wf in files] or None
            file_lines: list[str] = [
-                f"- {wf.name} ({wf.mimeType}, {round(wf.sizeBytes / 1024, 1)} KB), file_id={wf.id}"
+                f"- {wf.name} ({wf.mime_type}, {round(wf.size_bytes / 1024, 1)} KB), file_id={wf.id}"
                for wf in files
            ]
            if file_lines:
@@ -586,141 +824,14 @@ async def stream_chat_post(
        f"[TIMING] Task enqueued to RabbitMQ, setup={setup_time:.1f}ms",
        extra={"json_fields": {**log_meta, "setup_time_ms": setup_time}},
    )
-
-    # SSE endpoint that subscribes to the task's stream
-    async def event_generator() -> AsyncGenerator[str, None]:
-        import time as time_module
-
-        event_gen_start = time_module.perf_counter()
-        logger.info(
-            f"[TIMING] event_generator STARTED, turn={turn_id}, session={session_id}, "
-            f"user={user_id}",
-            extra={"json_fields": log_meta},
+    return _build_streaming_response(
+        _stream_chat_events(
+            session_id=session_id,
+            user_id=user_id,
+            subscribe_from_id=subscribe_from_id,
+            turn_id=turn_id,
+            log_meta=log_meta,
        )
-        subscriber_queue = None
-        first_chunk_yielded = False
-        chunks_yielded = 0
-        try:
-            # Subscribe from the position we captured before enqueuing
-            # This avoids replaying old messages while catching all new ones
-            subscriber_queue = await stream_registry.subscribe_to_session(
-                session_id=session_id,
-                user_id=user_id,
-                last_message_id=subscribe_from_id,
-            )
-
-            if subscriber_queue is None:
-                yield StreamFinish().to_sse()
-                yield "data: [DONE]\n\n"
-                return
-
-            # Read from the subscriber queue and yield to SSE
-            logger.info(
-                "[TIMING] Starting to read from subscriber_queue",
-                extra={"json_fields": log_meta},
-            )
-            while True:
-                try:
-                    chunk = await asyncio.wait_for(subscriber_queue.get(), timeout=10.0)
-                    chunks_yielded += 1
-
-                    if not first_chunk_yielded:
-                        first_chunk_yielded = True
-                        elapsed = time_module.perf_counter() - event_gen_start
-                        logger.info(
-                            f"[TIMING] FIRST CHUNK from queue at {elapsed:.2f}s, "
-                            f"type={type(chunk).__name__}",
-                            extra={
-                                "json_fields": {
-                                    **log_meta,
-                                    "chunk_type": type(chunk).__name__,
-                                    "elapsed_ms": elapsed * 1000,
-                                }
-                            },
-                        )
-
-                    yield chunk.to_sse()
-
-                    # Check for finish signal
-                    if isinstance(chunk, StreamFinish):
-                        total_time = time_module.perf_counter() - event_gen_start
-                        logger.info(
-                            f"[TIMING] StreamFinish received in {total_time:.2f}s; "
-                            f"n_chunks={chunks_yielded}",
-                            extra={
-                                "json_fields": {
-                                    **log_meta,
-                                    "chunks_yielded": chunks_yielded,
-                                    "total_time_ms": total_time * 1000,
-                                }
-                            },
-                        )
-                        break
-                except asyncio.TimeoutError:
-                    yield StreamHeartbeat().to_sse()
-
-        except GeneratorExit:
-            logger.info(
-                f"[TIMING] GeneratorExit (client disconnected), chunks={chunks_yielded}",
-                extra={
-                    "json_fields": {
-                        **log_meta,
-                        "chunks_yielded": chunks_yielded,
-                        "reason": "client_disconnect",
-                    }
-                },
-            )
-            pass  # Client disconnected - background task continues
-        except Exception as e:
-            elapsed = (time_module.perf_counter() - event_gen_start) * 1000
-            logger.error(
-                f"[TIMING] event_generator ERROR after {elapsed:.1f}ms: {e}",
-                extra={
-                    "json_fields": {**log_meta, "elapsed_ms": elapsed, "error": str(e)}
-                },
-            )
-            # Surface error to frontend so it doesn't appear stuck
-            yield StreamError(
-                errorText="An error occurred. Please try again.",
-                code="stream_error",
-            ).to_sse()
-            yield StreamFinish().to_sse()
-        finally:
-            # Unsubscribe when client disconnects or stream ends
-            if subscriber_queue is not None:
-                try:
-                    await stream_registry.unsubscribe_from_session(
-                        session_id, subscriber_queue
-                    )
-                except Exception as unsub_err:
-                    logger.error(
-                        f"Error unsubscribing from session {session_id}: {unsub_err}",
-                        exc_info=True,
-                    )
-            # AI SDK protocol termination - always yield even if unsubscribe fails
-            total_time = time_module.perf_counter() - event_gen_start
-            logger.info(
-                f"[TIMING] event_generator FINISHED in {total_time:.2f}s; "
-                f"turn={turn_id}, session={session_id}, n_chunks={chunks_yielded}",
-                extra={
-                    "json_fields": {
-                        **log_meta,
-                        "total_time_ms": total_time * 1000,
-                        "chunks_yielded": chunks_yielded,
-                    }
-                },
-            )
-            yield "data: [DONE]\n\n"
-
-    return StreamingResponse(
-        event_generator(),
-        media_type="text/event-stream",
-        headers={
-            "Cache-Control": "no-cache",
-            "Connection": "keep-alive",
-            "X-Accel-Buffering": "no",  # Disable nginx buffering
-            "x-vercel-ai-ui-message-stream": "v1",  # AI SDK protocol header
-        },
    )


@@ -746,11 +857,7 @@ async def resume_session_stream(
        StreamingResponse (SSE) when an active stream exists,
        or 204 No Content when there is nothing to resume.
    """
-    import asyncio
-
-    active_session, last_message_id = await stream_registry.get_active_session(
-        session_id, user_id
-    )
+    active_session, _ = await stream_registry.get_active_session(session_id, user_id)

    if not active_session:
        return Response(status_code=204)
@@ -767,64 +874,11 @@ async def resume_session_stream(

    if subscriber_queue is None:
        return Response(status_code=204)
-
-    async def event_generator() -> AsyncGenerator[str, None]:
-        chunk_count = 0
-        first_chunk_type: str | None = None
-        try:
-            while True:
-                try:
-                    chunk = await asyncio.wait_for(subscriber_queue.get(), timeout=10.0)
-                    if chunk_count < 3:
-                        logger.info(
-                            "Resume stream chunk",
-                            extra={
-                                "session_id": session_id,
-                                "chunk_type": str(chunk.type),
-                            },
-                        )
-                    if not first_chunk_type:
-                        first_chunk_type = str(chunk.type)
-                    chunk_count += 1
-                    yield chunk.to_sse()
-
-                    if isinstance(chunk, StreamFinish):
-                        break
-                except asyncio.TimeoutError:
-                    yield StreamHeartbeat().to_sse()
-        except GeneratorExit:
-            pass
-        except Exception as e:
-            logger.error(f"Error in resume stream for session {session_id}: {e}")
-        finally:
-            try:
-                await stream_registry.unsubscribe_from_session(
-                    session_id, subscriber_queue
-                )
-            except Exception as unsub_err:
-                logger.error(
-                    f"Error unsubscribing from session {active_session.session_id}: {unsub_err}",
-                    exc_info=True,
-                )
-            logger.info(
-                "Resume stream completed",
-                extra={
-                    "session_id": session_id,
-                    "n_chunks": chunk_count,
-                    "first_chunk_type": first_chunk_type,
-                },
-            )
-            yield "data: [DONE]\n\n"
-
-    return StreamingResponse(
-        event_generator(),
-        media_type="text/event-stream",
-        headers={
-            "Cache-Control": "no-cache",
-            "Connection": "keep-alive",
-            "X-Accel-Buffering": "no",
-            "x-vercel-ai-ui-message-stream": "v1",
-        },
+    return _build_streaming_response(
+        _resume_stream_events(
+            session_id=session_id,
+            subscriber_queue=subscriber_queue,
+        )
    )


@@ -853,6 +907,36 @@ async def session_assign_user(
    return {"status": "ok"}


+# ========== Suggested Prompts ==========
+
+
+class SuggestedPromptsResponse(BaseModel):
+    """Response model for user-specific suggested prompts."""
+
+    prompts: list[str]
+
+
+@router.get(
+    "/suggested-prompts",
+    dependencies=[Security(auth.requires_user)],
+)
+async def get_suggested_prompts(
+    user_id: Annotated[str, Security(auth.get_user_id)],
+) -> SuggestedPromptsResponse:
+    """
+    Get LLM-generated suggested prompts for the authenticated user.
+
+    Returns personalized quick-action prompts based on the user's
+    business understanding. Returns an empty list if no custom prompts
+    are available.
+    """
+    understanding = await get_business_understanding(user_id)
+    if understanding is None:
+        return SuggestedPromptsResponse(prompts=[])
+
+    return SuggestedPromptsResponse(prompts=understanding.suggested_prompts)
+
+
 # ========== Configuration ==========


@@ -946,6 +1030,6 @@ ToolResponseUnion = (
    description="This endpoint is not meant to be called. It exists solely to "
    "expose tool response models in the OpenAPI schema for frontend codegen.",
 )
-async def _tool_response_schema() -> ToolResponseUnion:  # type: ignore[return]
+async def _tool_response_schema() -> NoReturn:
    """Never called at runtime. Exists only so Orval generates TS types."""
    raise HTTPException(status_code=501, detail="Schema-only endpoint")
--- a/autogpt_platform/backend/backend/api/features/chat/routes_test.py
+++ b/autogpt_platform/backend/backend/api/features/chat/routes_test.py
@@ -1,6 +1,9 @@
-"""Tests for chat API routes: session title update and file attachment validation."""
+"""Tests for chat API routes: session title update, file attachment validation, and suggested prompts."""

-from unittest.mock import AsyncMock
+import asyncio
+from datetime import datetime, timezone
+from types import SimpleNamespace
+from unittest.mock import AsyncMock, MagicMock

 import fastapi
 import fastapi.testclient
@@ -8,6 +11,9 @@ import pytest
 import pytest_mock

 from backend.api.features.chat import routes as chat_routes
+from backend.copilot.model import ChatMessage, ChatSession
+from backend.copilot.response_model import StreamFinish
+from backend.copilot.session_types import ChatSessionStartType

 app = fastapi.FastAPI()
 app.include_router(chat_routes.router)
@@ -115,6 +121,238 @@ def test_update_title_not_found(
    assert response.status_code == 404


+def test_list_sessions_defaults_to_manual_only(
+    mocker: pytest_mock.MockerFixture,
+    test_user_id: str,
+) -> None:
+    started_at = datetime.now(timezone.utc)
+    mock_get_user_sessions = mocker.patch(
+        "backend.api.features.chat.routes.get_user_sessions",
+        new_callable=AsyncMock,
+        return_value=(
+            [
+                SimpleNamespace(
+                    session_id="sess-1",
+                    started_at=started_at,
+                    updated_at=started_at,
+                    title="Nightly check-in",
+                    start_type=chat_routes.ChatSessionStartType.AUTOPILOT_NIGHTLY,
+                    execution_tag="autopilot-nightly:2026-03-13",
+                )
+            ],
+            1,
+        ),
+    )
+
+    pipe = MagicMock()
+    pipe.hget = MagicMock()
+    pipe.execute = AsyncMock(return_value=["running"])
+    redis = MagicMock()
+    redis.pipeline = MagicMock(return_value=pipe)
+    mocker.patch(
+        "backend.api.features.chat.routes.get_redis_async",
+        new_callable=AsyncMock,
+        return_value=redis,
+    )
+
+    response = client.get("/sessions")
+
+    assert response.status_code == 200
+    assert response.json() == {
+        "sessions": [
+            {
+                "id": "sess-1",
+                "created_at": started_at.isoformat(),
+                "updated_at": started_at.isoformat(),
+                "title": "Nightly check-in",
+                "start_type": "AUTOPILOT_NIGHTLY",
+                "execution_tag": "autopilot-nightly:2026-03-13",
+                "is_processing": True,
+            }
+        ],
+        "total": 1,
+    }
+    mock_get_user_sessions.assert_awaited_once_with(
+        test_user_id,
+        50,
+        0,
+        with_auto=False,
+    )
+
+
+def test_list_sessions_can_include_auto_sessions(
+    mocker: pytest_mock.MockerFixture,
+    test_user_id: str,
+) -> None:
+    mock_get_user_sessions = mocker.patch(
+        "backend.api.features.chat.routes.get_user_sessions",
+        new_callable=AsyncMock,
+        return_value=([], 0),
+    )
+
+    response = client.get("/sessions?with_auto=true")
+
+    assert response.status_code == 200
+    assert response.json() == {"sessions": [], "total": 0}
+    mock_get_user_sessions.assert_awaited_once_with(
+        test_user_id,
+        50,
+        0,
+        with_auto=True,
+    )
+
+
+def test_consume_callback_token_route_returns_session_id(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    mock_consume = mocker.patch(
+        "backend.api.features.chat.routes.consume_callback_token",
+        new_callable=AsyncMock,
+        return_value=SimpleNamespace(session_id="sess-2"),
+    )
+
+    response = client.post(
+        "/sessions/callback-token/consume",
+        json={"token": "token-123"},
+    )
+
+    assert response.status_code == 200
+    assert response.json() == {"session_id": "sess-2"}
+    mock_consume.assert_awaited_once_with("token-123", TEST_USER_ID)
+
+
+def test_consume_callback_token_route_returns_404_on_invalid_token(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    mocker.patch(
+        "backend.api.features.chat.routes.consume_callback_token",
+        new_callable=AsyncMock,
+        side_effect=ValueError("Callback token not found"),
+    )
+
+    response = client.post(
+        "/sessions/callback-token/consume",
+        json={"token": "token-123"},
+    )
+
+    assert response.status_code == 404
+    assert response.json() == {"detail": "Callback token not found"}
+
+
+def test_get_session_hides_internal_only_messages_for_manual_sessions(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    session = ChatSession.new(
+        TEST_USER_ID,
+        start_type=ChatSessionStartType.MANUAL,
+    )
+    session.messages = [
+        ChatMessage(role="user", content="<internal>hidden</internal>"),
+        ChatMessage(
+            role="user",
+            content="Visible<internal>hidden</internal> text",
+        ),
+        ChatMessage(role="assistant", content="Public response"),
+    ]
+
+    mocker.patch(
+        "backend.api.features.chat.routes.get_chat_session",
+        new_callable=AsyncMock,
+        return_value=session,
+    )
+    mocker.patch(
+        "backend.api.features.chat.routes.stream_registry.get_active_session",
+        new_callable=AsyncMock,
+        return_value=(None, None),
+    )
+
+    response = client.get(f"/sessions/{session.session_id}")
+
+    assert response.status_code == 200
+    assert response.json()["messages"] == [
+        {
+            "role": "user",
+            "content": "Visible text",
+            "name": None,
+            "tool_call_id": None,
+            "refusal": None,
+            "tool_calls": None,
+            "function_call": None,
+        },
+        {
+            "role": "assistant",
+            "content": "Public response",
+            "name": None,
+            "tool_call_id": None,
+            "refusal": None,
+            "tool_calls": None,
+            "function_call": None,
+        },
+    ]
+
+
+def test_get_session_shows_cleaned_internal_kickoff_for_autopilot_sessions(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    session = ChatSession.new(
+        TEST_USER_ID,
+        start_type=ChatSessionStartType.AUTOPILOT_NIGHTLY,
+        execution_tag="autopilot-nightly:2026-03-13",
+    )
+    session.messages = [
+        ChatMessage(role="user", content="<internal>hidden</internal>"),
+        ChatMessage(
+            role="user",
+            content="Visible<internal>hidden</internal> text",
+        ),
+        ChatMessage(role="assistant", content="Public response"),
+    ]
+
+    mocker.patch(
+        "backend.api.features.chat.routes.get_chat_session",
+        new_callable=AsyncMock,
+        return_value=session,
+    )
+    mocker.patch(
+        "backend.api.features.chat.routes.stream_registry.get_active_session",
+        new_callable=AsyncMock,
+        return_value=(None, None),
+    )
+
+    response = client.get(f"/sessions/{session.session_id}")
+
+    assert response.status_code == 200
+    assert response.json()["messages"] == [
+        {
+            "role": "user",
+            "content": "hidden",
+            "name": None,
+            "tool_call_id": None,
+            "refusal": None,
+            "tool_calls": None,
+            "function_call": None,
+        },
+        {
+            "role": "user",
+            "content": "Visible text",
+            "name": None,
+            "tool_call_id": None,
+            "refusal": None,
+            "tool_calls": None,
+            "function_call": None,
+        },
+        {
+            "role": "assistant",
+            "content": "Public response",
+            "name": None,
+            "tool_call_id": None,
+            "refusal": None,
+            "tool_calls": None,
+            "function_call": None,
+        },
+    ]
+
+
 # ─── file_ids Pydantic validation ─────────────────────────────────────


@@ -142,7 +380,11 @@ def _mock_stream_internals(mocker: pytest_mock.MockFixture):
        return_value=None,
    )
    mock_registry = mocker.MagicMock()
+    subscriber_queue = asyncio.Queue()
+    subscriber_queue.put_nowait(StreamFinish())
    mock_registry.create_session = mocker.AsyncMock(return_value=None)
+    mock_registry.subscribe_to_session = mocker.AsyncMock(return_value=subscriber_queue)
+    mock_registry.unsubscribe_from_session = mocker.AsyncMock(return_value=None)
    mocker.patch(
        "backend.api.features.chat.routes.stream_registry",
        mock_registry,
@@ -165,11 +407,11 @@ def test_stream_chat_accepts_20_file_ids(mocker: pytest_mock.MockFixture):
        "backend.api.features.chat.routes.get_or_create_workspace",
        return_value=type("W", (), {"id": "ws-1"})(),
    )
-    mock_prisma = mocker.MagicMock()
-    mock_prisma.find_many = mocker.AsyncMock(return_value=[])
+    workspace_store = mocker.MagicMock()
+    workspace_store.get_workspace_files_by_ids = mocker.AsyncMock(return_value=[])
    mocker.patch(
-        "prisma.models.UserWorkspaceFile.prisma",
-        return_value=mock_prisma,
+        "backend.api.features.chat.routes.workspace_db",
+        return_value=workspace_store,
    )

    response = client.post(
@@ -195,11 +437,11 @@ def test_file_ids_filters_invalid_uuids(mocker: pytest_mock.MockFixture):
        return_value=type("W", (), {"id": "ws-1"})(),
    )

-    mock_prisma = mocker.MagicMock()
-    mock_prisma.find_many = mocker.AsyncMock(return_value=[])
+    workspace_store = mocker.MagicMock()
+    workspace_store.get_workspace_files_by_ids = mocker.AsyncMock(return_value=[])
    mocker.patch(
-        "prisma.models.UserWorkspaceFile.prisma",
-        return_value=mock_prisma,
+        "backend.api.features.chat.routes.workspace_db",
+        return_value=workspace_store,
    )

    valid_id = "aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee"
@@ -217,9 +459,10 @@ def test_file_ids_filters_invalid_uuids(mocker: pytest_mock.MockFixture):
    )

    # The find_many call should only receive the one valid UUID
-    mock_prisma.find_many.assert_called_once()
-    call_kwargs = mock_prisma.find_many.call_args[1]
-    assert call_kwargs["where"]["id"]["in"] == [valid_id]
+    workspace_store.get_workspace_files_by_ids.assert_called_once_with(
+        workspace_id="ws-1",
+        file_ids=[valid_id],
+    )


 # ─── Cross-workspace file_ids ─────────────────────────────────────────
@@ -233,11 +476,11 @@ def test_file_ids_scoped_to_workspace(mocker: pytest_mock.MockFixture):
        return_value=type("W", (), {"id": "my-workspace-id"})(),
    )

-    mock_prisma = mocker.MagicMock()
-    mock_prisma.find_many = mocker.AsyncMock(return_value=[])
+    workspace_store = mocker.MagicMock()
+    workspace_store.get_workspace_files_by_ids = mocker.AsyncMock(return_value=[])
    mocker.patch(
-        "prisma.models.UserWorkspaceFile.prisma",
-        return_value=mock_prisma,
+        "backend.api.features.chat.routes.workspace_db",
+        return_value=workspace_store,
    )

    fid = "aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee"
@@ -246,6 +489,66 @@ def test_file_ids_scoped_to_workspace(mocker: pytest_mock.MockFixture):
        json={"message": "hi", "file_ids": [fid]},
    )

-    call_kwargs = mock_prisma.find_many.call_args[1]
-    assert call_kwargs["where"]["workspaceId"] == "my-workspace-id"
-    assert call_kwargs["where"]["isDeleted"] is False
+    workspace_store.get_workspace_files_by_ids.assert_called_once_with(
+        workspace_id="my-workspace-id",
+        file_ids=[fid],
+    )
+
+
+# ─── Suggested prompts endpoint ──────────────────────────────────────
+
+
+def _mock_get_business_understanding(
+    mocker: pytest_mock.MockerFixture,
+    *,
+    return_value=None,
+):
+    """Mock get_business_understanding."""
+    return mocker.patch(
+        "backend.api.features.chat.routes.get_business_understanding",
+        new_callable=AsyncMock,
+        return_value=return_value,
+    )
+
+
+def test_suggested_prompts_returns_prompts(
+    mocker: pytest_mock.MockerFixture,
+    test_user_id: str,
+) -> None:
+    """User with understanding and prompts gets them back."""
+    mock_understanding = MagicMock()
+    mock_understanding.suggested_prompts = ["Do X", "Do Y", "Do Z"]
+    _mock_get_business_understanding(mocker, return_value=mock_understanding)
+
+    response = client.get("/suggested-prompts")
+
+    assert response.status_code == 200
+    assert response.json() == {"prompts": ["Do X", "Do Y", "Do Z"]}
+
+
+def test_suggested_prompts_no_understanding(
+    mocker: pytest_mock.MockerFixture,
+    test_user_id: str,
+) -> None:
+    """User with no understanding gets empty list."""
+    _mock_get_business_understanding(mocker, return_value=None)
+
+    response = client.get("/suggested-prompts")
+
+    assert response.status_code == 200
+    assert response.json() == {"prompts": []}
+
+
+def test_suggested_prompts_empty_prompts(
+    mocker: pytest_mock.MockerFixture,
+    test_user_id: str,
+) -> None:
+    """User with understanding but no prompts gets empty list."""
+    mock_understanding = MagicMock()
+    mock_understanding.suggested_prompts = []
+    _mock_get_business_understanding(mocker, return_value=mock_understanding)
+
+    response = client.get("/suggested-prompts")
+
+    assert response.status_code == 200
+    assert response.json() == {"prompts": []}
--- a/autogpt_platform/backend/backend/api/features/v1.py
+++ b/autogpt_platform/backend/backend/api/features/v1.py
@@ -55,6 +55,7 @@ from backend.data.credit import (
    set_auto_top_up,
 )
 from backend.data.graph import GraphSettings
+from backend.data.invited_user import get_or_activate_user
 from backend.data.model import CredentialsMetaInput, UserOnboarding
 from backend.data.notifications import NotificationPreference, NotificationPreferenceDTO
 from backend.data.onboarding import (
@@ -70,7 +71,6 @@ from backend.data.onboarding import (
    update_user_onboarding,
 )
 from backend.data.user import (
-    get_or_create_user,
    get_user_by_id,
    get_user_notification_preference,
    update_user_email,
@@ -136,12 +136,10 @@ _tally_background_tasks: set[asyncio.Task] = set()
    dependencies=[Security(requires_user)],
 )
 async def get_or_create_user_route(user_data: dict = Security(get_jwt_payload)):
-    user = await get_or_create_user(user_data)
+    user = await get_or_activate_user(user_data)

-    # Fire-and-forget: populate business understanding from Tally form.
-    # We use created_at proximity instead of an is_new flag because
-    # get_or_create_user is cached — a separate is_new return value would be
-    # unreliable on repeated calls within the cache TTL.
+    # Fire-and-forget: backfill Tally understanding when invite pre-seeding did
+    # not produce a stored result before first activation.
    age_seconds = (datetime.now(timezone.utc) - user.created_at).total_seconds()
    if age_seconds < 30:
        try:
@@ -165,7 +163,8 @@ async def get_or_create_user_route(user_data: dict = Security(get_jwt_payload)):
    dependencies=[Security(requires_user)],
 )
 async def update_user_email_route(
-    user_id: Annotated[str, Security(get_user_id)], email: str = Body(...)
+    user_id: Annotated[str, Security(get_user_id)],
+    email: str = Body(...),
 ) -> dict[str, str]:
    await update_user_email(user_id, email)

@@ -179,10 +178,16 @@ async def update_user_email_route(
    dependencies=[Security(requires_user)],
 )
 async def get_user_timezone_route(
-    user_data: dict = Security(get_jwt_payload),
+    user_id: Annotated[str, Security(get_user_id)],
 ) -> TimezoneResponse:
    """Get user timezone setting."""
-    user = await get_or_create_user(user_data)
+    try:
+        user = await get_user_by_id(user_id)
+    except ValueError:
+        raise HTTPException(
+            status_code=HTTP_404_NOT_FOUND,
+            detail="User not found. Please complete activation via /auth/user first.",
+        )
    return TimezoneResponse(timezone=user.timezone)


@@ -193,7 +198,8 @@ async def get_user_timezone_route(
    dependencies=[Security(requires_user)],
 )
 async def update_user_timezone_route(
-    user_id: Annotated[str, Security(get_user_id)], request: UpdateTimezoneRequest
+    user_id: Annotated[str, Security(get_user_id)],
+    request: UpdateTimezoneRequest,
 ) -> TimezoneResponse:
    """Update user timezone. The timezone should be a valid IANA timezone identifier."""
    user = await update_user_timezone(user_id, str(request.timezone))
--- a/autogpt_platform/backend/backend/api/features/v1_test.py
+++ b/autogpt_platform/backend/backend/api/features/v1_test.py
@@ -51,7 +51,7 @@ def test_get_or_create_user_route(
    }

    mocker.patch(
-        "backend.api.features.v1.get_or_create_user",
+        "backend.api.features.v1.get_or_activate_user",
        return_value=mock_user,
    )

--- a/autogpt_platform/backend/backend/api/rest_api.py
+++ b/autogpt_platform/backend/backend/api/rest_api.py
@@ -19,6 +19,7 @@ from prisma.errors import PrismaError
 import backend.api.features.admin.credit_admin_routes
 import backend.api.features.admin.execution_analytics_routes
 import backend.api.features.admin.store_admin_routes
+import backend.api.features.admin.user_admin_routes
 import backend.api.features.builder
 import backend.api.features.builder.routes
 import backend.api.features.chat.routes as chat_routes
@@ -311,6 +312,11 @@ app.include_router(
    tags=["v2", "admin"],
    prefix="/api/executions",
 )
+app.include_router(
+    backend.api.features.admin.user_admin_routes.router,
+    tags=["v2", "admin"],
+    prefix="/api/users",
+)
 app.include_router(
    backend.api.features.executions.review.routes.router,
    tags=["v2", "executions", "review"],
--- a/autogpt_platform/backend/backend/copilot/autopilot.py
+++ b/autogpt_platform/backend/backend/copilot/autopilot.py
@@ -0,0 +1,135 @@
+"""Autopilot public API — thin facade re-exporting from sub-modules.
+
+Implementation is split by responsibility:
+- autopilot_prompts: constants, prompt templates, context builders
+- autopilot_dispatch: timezone helpers, session creation, dispatch/scheduling
+- autopilot_completion: completion report extraction, repair, handler
+- autopilot_email: email sending, link building, notification sweep
+"""
+
+from __future__ import annotations
+
+import logging
+from datetime import UTC, datetime
+
+from pydantic import BaseModel
+
+from backend.copilot.autopilot_completion import (  # noqa: F401
+    CompletionReportToolCall,
+    CompletionReportToolCallFunction,
+    ToolOutputEnvelope,
+    _build_completion_report_repair_message,
+    _extract_completion_report_from_session,
+    _get_pending_approval_metadata,
+    _queue_completion_report_repair,
+    handle_non_manual_session_completion,
+)
+from backend.copilot.autopilot_dispatch import (  # noqa: F401
+    _bucket_end_for_now,
+    _create_autopilot_session,
+    _crosses_local_midnight,
+    _enqueue_session_turn,
+    _resolve_timezone_name,
+    _session_exists_for_execution_tag,
+    _try_create_callback_session,
+    _try_create_invite_cta_session,
+    _try_create_nightly_session,
+    _user_has_recent_manual_message,
+    _user_has_session_since,
+    dispatch_nightly_copilot,
+    get_callback_execution_tag,
+    get_graph_exec_id_for_session,
+    get_invite_cta_execution_tag,
+    get_nightly_execution_tag,
+    trigger_autopilot_session_for_user,
+)
+from backend.copilot.autopilot_email import (  # noqa: F401
+    PendingCopilotEmailSweepResult,
+    _build_session_link,
+    _get_completion_email_template_name,
+    _markdown_to_email_html,
+    _send_completion_email,
+    _send_nightly_copilot_emails,
+    send_nightly_copilot_emails,
+    send_pending_copilot_emails_for_user,
+)
+from backend.copilot.autopilot_prompts import (  # noqa: F401
+    AUTOPILOT_CALLBACK_EMAIL_TEMPLATE,
+    AUTOPILOT_CALLBACK_TAG,
+    AUTOPILOT_DISABLED_TOOLS,
+    AUTOPILOT_INVITE_CTA_EMAIL_TEMPLATE,
+    AUTOPILOT_INVITE_CTA_TAG,
+    AUTOPILOT_NIGHTLY_EMAIL_TEMPLATE,
+    AUTOPILOT_NIGHTLY_TAG_PREFIX,
+    DEFAULT_AUTOPILOT_CALLBACK_SYSTEM_PROMPT,
+    DEFAULT_AUTOPILOT_INVITE_CTA_SYSTEM_PROMPT,
+    DEFAULT_AUTOPILOT_NIGHTLY_SYSTEM_PROMPT,
+    INTERNAL_TAG_RE,
+    MAX_COMPLETION_REPORT_REPAIRS,
+    _build_autopilot_system_prompt,
+    _format_start_type_label,
+    _get_recent_manual_session_context,
+    _get_recent_sent_email_context,
+    _get_recent_session_summary_context,
+    strip_internal_content,
+    unwrap_internal_content,
+    wrap_internal_message,
+)
+from backend.copilot.model import ChatMessage, create_chat_session
+from backend.data.db_accessors import chat_db
+
+# -- re-exports from sub-modules (preserves existing import paths) ---------- #
+
+
+logger = logging.getLogger(__name__)
+
+
+class CallbackTokenConsumeResult(BaseModel):
+    session_id: str
+
+
+async def consume_callback_token(
+    token_id: str,
+    user_id: str,
+) -> CallbackTokenConsumeResult:
+    """Consume a callback token and return the resulting session.
+
+    Uses an atomic consume-and-update to prevent the TOCTOU race where two
+    concurrent requests could each see the token as unconsumed and create
+    duplicate sessions.
+    """
+    db = chat_db()
+    token = await db.get_chat_session_callback_token(token_id)
+    if token is None or token.user_id != user_id:
+        raise ValueError("Callback token not found")
+    if token.expires_at <= datetime.now(UTC):
+        raise ValueError("Callback token has expired")
+
+    if token.consumed_session_id:
+        return CallbackTokenConsumeResult(session_id=token.consumed_session_id)
+
+    session = await create_chat_session(
+        user_id,
+        initial_messages=[
+            ChatMessage(role="assistant", content=token.callback_session_message)
+        ],
+    )
+
+    # Atomically mark consumed — if another request already consumed the token
+    # concurrently, the DB will have a non-null consumed_session_id; we re-read
+    # and return the winner's session instead.
+    await db.mark_chat_session_callback_token_consumed(
+        token_id,
+        session.session_id,
+    )
+
+    # Re-read to see if we won the race
+    refreshed = await db.get_chat_session_callback_token(token_id)
+    if (
+        refreshed
+        and refreshed.consumed_session_id
+        and refreshed.consumed_session_id != session.session_id
+    ):
+        return CallbackTokenConsumeResult(session_id=refreshed.consumed_session_id)
+
+    return CallbackTokenConsumeResult(session_id=session.session_id)
--- a/autogpt_platform/backend/backend/copilot/autopilot_completion.py
+++ b/autogpt_platform/backend/backend/copilot/autopilot_completion.py
@@ -0,0 +1,198 @@
+from __future__ import annotations
+
+import logging
+from datetime import UTC, datetime
+
+from pydantic import BaseModel, Field, ValidationError
+
+from backend.copilot.autopilot_dispatch import (
+    _enqueue_session_turn,
+    get_graph_exec_id_for_session,
+)
+from backend.copilot.autopilot_prompts import (
+    MAX_COMPLETION_REPORT_REPAIRS,
+    wrap_internal_message,
+)
+from backend.copilot.model import (
+    ChatMessage,
+    ChatSession,
+    get_chat_session,
+    upsert_chat_session,
+)
+from backend.copilot.session_types import CompletionReportInput, StoredCompletionReport
+from backend.data.db_accessors import review_db
+
+logger = logging.getLogger(__name__)
+
+
+# --------------- models --------------- #
+
+
+class CompletionReportToolCallFunction(BaseModel):
+    name: str | None = None
+    arguments: str | None = None
+
+
+class CompletionReportToolCall(BaseModel):
+    id: str
+    function: CompletionReportToolCallFunction = Field(
+        default_factory=CompletionReportToolCallFunction
+    )
+
+
+class ToolOutputEnvelope(BaseModel):
+    type: str | None = None
+
+
+# --------------- approval metadata --------------- #
+
+
+async def _get_pending_approval_metadata(
+    session: ChatSession,
+) -> tuple[int, str | None]:
+    graph_exec_id = get_graph_exec_id_for_session(session.session_id)
+    pending_count = await review_db().count_pending_reviews_for_graph_exec(
+        graph_exec_id,
+        session.user_id,
+    )
+    return pending_count, graph_exec_id if pending_count > 0 else None
+
+
+# --------------- extraction --------------- #
+
+
+def _extract_completion_report_from_session(
+    session: ChatSession,
+    *,
+    pending_approval_count: int,
+) -> CompletionReportInput | None:
+    tool_outputs = {
+        message.tool_call_id: message.content
+        for message in session.messages
+        if message.role == "tool" and message.tool_call_id
+    }
+
+    latest_report: CompletionReportInput | None = None
+    for message in session.messages:
+        if message.role != "assistant" or not message.tool_calls:
+            continue
+
+        for tool_call in message.tool_calls:
+            try:
+                parsed_tool_call = CompletionReportToolCall.model_validate(tool_call)
+            except ValidationError:
+                continue
+
+            if parsed_tool_call.function.name != "completion_report":
+                continue
+
+            output = tool_outputs.get(parsed_tool_call.id)
+            if not output:
+                continue
+
+            try:
+                output_payload = ToolOutputEnvelope.model_validate_json(output)
+            except ValidationError:
+                output_payload = None
+
+            if output_payload is not None and output_payload.type == "error":
+                continue
+
+            try:
+                raw_arguments = parsed_tool_call.function.arguments or "{}"
+                report = CompletionReportInput.model_validate_json(raw_arguments)
+            except ValidationError:
+                continue
+
+            if pending_approval_count > 0 and not report.approval_summary:
+                continue
+
+            latest_report = report
+
+    return latest_report
+
+
+# --------------- repair --------------- #
+
+
+def _build_completion_report_repair_message(
+    *,
+    attempt: int,
+    pending_approval_count: int,
+) -> str:
+    approval_instruction = ""
+    if pending_approval_count > 0:
+        approval_instruction = (
+            f" There are currently {pending_approval_count} pending approval item(s). "
+            "If they still exist, include approval_summary."
+        )
+
+    return wrap_internal_message(
+        "The session completed without a valid completion_report tool call. "
+        f"This is repair attempt {attempt}. Call completion_report now and do not do any additional user-facing work."
+        + approval_instruction
+    )
+
+
+async def _queue_completion_report_repair(
+    session: ChatSession,
+    *,
+    pending_approval_count: int,
+) -> None:
+    attempt = session.completion_report_repair_count + 1
+    repair_message = _build_completion_report_repair_message(
+        attempt=attempt,
+        pending_approval_count=pending_approval_count,
+    )
+    session.messages.append(ChatMessage(role="user", content=repair_message))
+    session.completion_report_repair_count = attempt
+    session.completion_report_repair_queued_at = datetime.now(UTC)
+    session.completed_at = None
+    session.completion_report = None
+    await upsert_chat_session(session)
+    await _enqueue_session_turn(
+        session,
+        message=repair_message,
+        tool_name="completion_report_repair",
+    )
+
+
+# --------------- handler --------------- #
+
+
+async def handle_non_manual_session_completion(session_id: str) -> None:
+    session = await get_chat_session(session_id)
+    if session is None or session.is_manual:
+        return
+
+    pending_approval_count, graph_exec_id = await _get_pending_approval_metadata(
+        session
+    )
+    report = _extract_completion_report_from_session(
+        session,
+        pending_approval_count=pending_approval_count,
+    )
+
+    if report is not None:
+        session.completion_report = StoredCompletionReport(
+            **report.model_dump(),
+            has_pending_approvals=pending_approval_count > 0,
+            pending_approval_count=pending_approval_count,
+            pending_approval_graph_exec_id=graph_exec_id,
+            saved_at=datetime.now(UTC),
+        )
+        session.completion_report_repair_queued_at = None
+        session.completed_at = datetime.now(UTC)
+        await upsert_chat_session(session)
+        return
+
+    if session.completion_report_repair_count >= MAX_COMPLETION_REPORT_REPAIRS:
+        session.completion_report_repair_queued_at = None
+        session.completed_at = datetime.now(UTC)
+        await upsert_chat_session(session)
+        return
+
+    await _queue_completion_report_repair(
+        session,
+        pending_approval_count=pending_approval_count,
+    )
--- a/autogpt_platform/backend/backend/copilot/autopilot_dispatch.py
+++ b/autogpt_platform/backend/backend/copilot/autopilot_dispatch.py
@@ -0,0 +1,386 @@
+from __future__ import annotations
+
+import logging
+from datetime import UTC, date, datetime, time, timedelta
+from typing import TYPE_CHECKING
+from uuid import uuid4
+from zoneinfo import ZoneInfo
+
+import prisma.enums
+
+from backend.copilot import stream_registry
+from backend.copilot.autopilot_prompts import (
+    AUTOPILOT_CALLBACK_TAG,
+    AUTOPILOT_DISABLED_TOOLS,
+    AUTOPILOT_INVITE_CTA_TAG,
+    AUTOPILOT_NIGHTLY_TAG_PREFIX,
+    _build_autopilot_system_prompt,
+    _render_initial_message,
+)
+from backend.copilot.constants import COPILOT_SESSION_PREFIX
+from backend.copilot.executor.utils import enqueue_copilot_turn
+from backend.copilot.model import ChatMessage, ChatSession, create_chat_session
+from backend.copilot.session_types import ChatSessionConfig, ChatSessionStartType
+from backend.data.db_accessors import chat_db, invited_user_db, user_db
+from backend.data.model import User
+from backend.util.feature_flag import Flag, is_feature_enabled
+from backend.util.settings import Settings
+from backend.util.timezone_utils import get_user_timezone_or_utc
+
+if TYPE_CHECKING:
+    from backend.data.invited_user import InvitedUserRecord
+
+logger = logging.getLogger(__name__)
+settings = Settings()
+
+DISPATCH_BATCH_SIZE = 500
+
+
+# --------------- tag helpers --------------- #
+
+
+def get_graph_exec_id_for_session(session_id: str) -> str:
+    return f"{COPILOT_SESSION_PREFIX}{session_id}"
+
+
+def get_nightly_execution_tag(target_local_date: date) -> str:
+    return f"{AUTOPILOT_NIGHTLY_TAG_PREFIX}{target_local_date.isoformat()}"
+
+
+def get_callback_execution_tag() -> str:
+    return AUTOPILOT_CALLBACK_TAG
+
+
+def get_invite_cta_execution_tag() -> str:
+    return AUTOPILOT_INVITE_CTA_TAG
+
+
+def _get_manual_trigger_execution_tag(start_type: ChatSessionStartType) -> str:
+    timestamp = datetime.now(UTC).strftime("%Y%m%dT%H%M%S%fZ")
+    return f"admin-autopilot:{start_type.value}:{timestamp}:{uuid4()}"
+
+
+# --------------- timezone helpers --------------- #
+
+
+def _bucket_end_for_now(now_utc: datetime) -> datetime:
+    minute = 30 if now_utc.minute >= 30 else 0
+    return now_utc.replace(minute=minute, second=0, microsecond=0)
+
+
+def _resolve_timezone_name(raw_timezone: str | None) -> str:
+    return get_user_timezone_or_utc(raw_timezone)
+
+
+def _crosses_local_midnight(
+    bucket_start_utc: datetime,
+    bucket_end_utc: datetime,
+    timezone_name: str,
+) -> date | None:
+    """Return the new local date if *bucket_end_utc* falls on a different local
+    date than *bucket_start_utc*, taking DST transitions into account.
+
+    During a DST spring-forward the wall clock jumps forward (e.g. 01:59 →
+    03:00).  We use ``fold=0`` on the end instant so that ambiguous/missing
+    times are resolved consistently and a single 30-min UTC bucket can never
+    produce midnight on *two* consecutive calls.
+    """
+    tz = ZoneInfo(timezone_name)
+    start_local = bucket_start_utc.astimezone(tz)
+    end_local = bucket_end_utc.astimezone(tz)
+    # Resolve ambiguous wall-clock times consistently (spring-forward / fall-back)
+    end_local = end_local.replace(fold=0)
+    if start_local.date() == end_local.date():
+        return None
+    return end_local.date()
+
+
+# --------------- thin DB wrappers --------------- #
+
+
+async def _user_has_recent_manual_message(user_id: str, since: datetime) -> bool:
+    return await chat_db().has_recent_manual_message(user_id, since)
+
+
+async def _user_has_session_since(user_id: str, since: datetime) -> bool:
+    return await chat_db().has_session_since(user_id, since)
+
+
+async def _session_exists_for_execution_tag(user_id: str, execution_tag: str) -> bool:
+    return await chat_db().session_exists_for_execution_tag(user_id, execution_tag)
+
+
+# --------------- session creation --------------- #
+
+
+async def _enqueue_session_turn(
+    session: ChatSession,
+    *,
+    message: str,
+    tool_name: str,
+) -> None:
+    turn_id = str(uuid4())
+    await stream_registry.create_session(
+        session_id=session.session_id,
+        user_id=session.user_id,
+        tool_call_id=tool_name,
+        tool_name=tool_name,
+        turn_id=turn_id,
+        blocking=False,
+    )
+    await enqueue_copilot_turn(
+        session_id=session.session_id,
+        user_id=session.user_id,
+        message=message,
+        turn_id=turn_id,
+        is_user_message=True,
+    )
+
+
+async def _create_autopilot_session(
+    user: User,
+    *,
+    start_type: ChatSessionStartType,
+    execution_tag: str,
+    timezone_name: str,
+    target_local_date: date | None = None,
+    invited_user: InvitedUserRecord | None = None,
+) -> ChatSession | None:
+    if await _session_exists_for_execution_tag(user.id, execution_tag):
+        return None
+
+    system_prompt = await _build_autopilot_system_prompt(
+        user,
+        start_type=start_type,
+        timezone_name=timezone_name,
+        target_local_date=target_local_date,
+        invited_user=invited_user,
+    )
+    initial_message = _render_initial_message(
+        start_type,
+        user_name=user.name,
+        invited_user=invited_user,
+    )
+    session_config = ChatSessionConfig(
+        system_prompt_override=system_prompt,
+        initial_user_message=initial_message,
+        extra_tools=["completion_report"],
+        disabled_tools=AUTOPILOT_DISABLED_TOOLS,
+    )
+
+    session = await create_chat_session(
+        user.id,
+        start_type=start_type,
+        execution_tag=execution_tag,
+        session_config=session_config,
+        initial_messages=[ChatMessage(role="user", content=initial_message)],
+    )
+    await _enqueue_session_turn(
+        session,
+        message=initial_message,
+        tool_name="autopilot_dispatch",
+    )
+    return session
+
+
+# --------------- cohort helpers --------------- #
+
+
+async def _try_create_invite_cta_session(
+    user: User,
+    *,
+    invited_user: InvitedUserRecord | None,
+    now_utc: datetime,
+    timezone_name: str,
+    invite_cta_start: date,
+    invite_cta_delay: timedelta,
+) -> bool:
+    if invited_user is None:
+        return False
+    if invited_user.status != prisma.enums.InvitedUserStatus.INVITED:
+        return False
+    if invited_user.created_at.date() < invite_cta_start:
+        return False
+    if invited_user.created_at > now_utc - invite_cta_delay:
+        return False
+    if await _session_exists_for_execution_tag(user.id, get_invite_cta_execution_tag()):
+        return False
+
+    created = await _create_autopilot_session(
+        user,
+        start_type=ChatSessionStartType.AUTOPILOT_INVITE_CTA,
+        execution_tag=get_invite_cta_execution_tag(),
+        timezone_name=timezone_name,
+        invited_user=invited_user,
+    )
+    return created is not None
+
+
+async def _try_create_nightly_session(
+    user: User,
+    *,
+    now_utc: datetime,
+    timezone_name: str,
+    target_local_date: date,
+) -> bool:
+    if not await _user_has_recent_manual_message(
+        user.id,
+        now_utc - timedelta(hours=24),
+    ):
+        return False
+
+    created = await _create_autopilot_session(
+        user,
+        start_type=ChatSessionStartType.AUTOPILOT_NIGHTLY,
+        execution_tag=get_nightly_execution_tag(target_local_date),
+        timezone_name=timezone_name,
+        target_local_date=target_local_date,
+    )
+    return created is not None
+
+
+async def _try_create_callback_session(
+    user: User,
+    *,
+    callback_start: datetime,
+    timezone_name: str,
+) -> bool:
+    if not await _user_has_session_since(user.id, callback_start):
+        return False
+    if await _session_exists_for_execution_tag(user.id, get_callback_execution_tag()):
+        return False
+
+    created = await _create_autopilot_session(
+        user,
+        start_type=ChatSessionStartType.AUTOPILOT_CALLBACK,
+        execution_tag=get_callback_execution_tag(),
+        timezone_name=timezone_name,
+    )
+    return created is not None
+
+
+# --------------- dispatch --------------- #
+
+
+async def _dispatch_nightly_copilot() -> int:
+    now_utc = datetime.now(UTC)
+    bucket_end = _bucket_end_for_now(now_utc)
+    bucket_start = bucket_end - timedelta(minutes=30)
+    callback_start = datetime.combine(
+        settings.config.nightly_copilot_callback_start_date,
+        time.min,
+        tzinfo=UTC,
+    )
+    invite_cta_start = settings.config.nightly_copilot_invite_cta_start_date
+    invite_cta_delay = timedelta(
+        hours=settings.config.nightly_copilot_invite_cta_delay_hours
+    )
+
+    # Paginate user list to avoid loading the entire table into memory.
+    created_count = 0
+    cursor: str | None = None
+    while True:
+        batch = await user_db().list_users(
+            limit=DISPATCH_BATCH_SIZE,
+            cursor=cursor,
+        )
+        if not batch:
+            break
+
+        user_ids = [user.id for user in batch]
+        invites = await invited_user_db().list_invited_users_for_auth_users(user_ids)
+        invites_by_user_id = {
+            invite.auth_user_id: invite for invite in invites if invite.auth_user_id
+        }
+
+        for user in batch:
+            if not await is_feature_enabled(
+                Flag.NIGHTLY_COPILOT, user.id, default=False
+            ):
+                continue
+
+            timezone_name = _resolve_timezone_name(user.timezone)
+            target_local_date = _crosses_local_midnight(
+                bucket_start,
+                bucket_end,
+                timezone_name,
+            )
+            if target_local_date is None:
+                continue
+
+            invited_user = invites_by_user_id.get(user.id)
+            if await _try_create_invite_cta_session(
+                user,
+                invited_user=invited_user,
+                now_utc=now_utc,
+                timezone_name=timezone_name,
+                invite_cta_start=invite_cta_start,
+                invite_cta_delay=invite_cta_delay,
+            ):
+                created_count += 1
+                continue
+
+            if await _try_create_nightly_session(
+                user,
+                now_utc=now_utc,
+                timezone_name=timezone_name,
+                target_local_date=target_local_date,
+            ):
+                created_count += 1
+                continue
+
+            if await _try_create_callback_session(
+                user,
+                callback_start=callback_start,
+                timezone_name=timezone_name,
+            ):
+                created_count += 1
+
+        cursor = batch[-1].id if len(batch) == DISPATCH_BATCH_SIZE else None
+        if cursor is None:
+            break
+
+    return created_count
+
+
+async def dispatch_nightly_copilot() -> int:
+    return await _dispatch_nightly_copilot()
+
+
+async def trigger_autopilot_session_for_user(
+    user_id: str,
+    *,
+    start_type: ChatSessionStartType,
+) -> ChatSession:
+    allowed_start_types = {
+        ChatSessionStartType.AUTOPILOT_INVITE_CTA,
+        ChatSessionStartType.AUTOPILOT_NIGHTLY,
+        ChatSessionStartType.AUTOPILOT_CALLBACK,
+    }
+    if start_type not in allowed_start_types:
+        raise ValueError(f"Unsupported autopilot start type: {start_type}")
+
+    try:
+        user = await user_db().get_user_by_id(user_id)
+    except ValueError as exc:
+        raise LookupError(str(exc)) from exc
+
+    invites = await invited_user_db().list_invited_users_for_auth_users([user_id])
+    invited_user = invites[0] if invites else None
+    timezone_name = _resolve_timezone_name(user.timezone)
+    target_local_date = None
+    if start_type == ChatSessionStartType.AUTOPILOT_NIGHTLY:
+        target_local_date = datetime.now(UTC).astimezone(ZoneInfo(timezone_name)).date()
+
+    session = await _create_autopilot_session(
+        user,
+        start_type=start_type,
+        execution_tag=_get_manual_trigger_execution_tag(start_type),
+        timezone_name=timezone_name,
+        target_local_date=target_local_date,
+        invited_user=invited_user,
+    )
+    if session is None:
+        raise ValueError("Failed to create autopilot session")
+
+    return session
--- a/autogpt_platform/backend/backend/copilot/autopilot_email.py
+++ b/autogpt_platform/backend/backend/copilot/autopilot_email.py
@@ -0,0 +1,297 @@
+from __future__ import annotations
+
+import asyncio
+import logging
+from datetime import UTC, datetime
+
+from markdown_it import MarkdownIt
+from pydantic import BaseModel
+
+from backend.copilot import stream_registry
+from backend.copilot.autopilot_completion import (
+    _get_pending_approval_metadata,
+    _queue_completion_report_repair,
+)
+from backend.copilot.autopilot_prompts import (
+    AUTOPILOT_CALLBACK_EMAIL_TEMPLATE,
+    AUTOPILOT_INVITE_CTA_EMAIL_TEMPLATE,
+    AUTOPILOT_NIGHTLY_EMAIL_TEMPLATE,
+    MAX_COMPLETION_REPORT_REPAIRS,
+)
+from backend.copilot.model import (
+    ChatSession,
+    get_chat_session,
+    update_session_title,
+    upsert_chat_session,
+)
+from backend.copilot.service import _generate_session_title
+from backend.copilot.session_types import ChatSessionStartType
+from backend.data.db_accessors import chat_db, user_db
+from backend.notifications.email import EmailSender
+from backend.util.url import get_frontend_base_url
+
+logger = logging.getLogger(__name__)
+PENDING_NOTIFICATION_SWEEP_LIMIT = 200
+
+_md = MarkdownIt()
+
+_EMAIL_INLINE_STYLES: list[tuple[str, str]] = [
+    (
+        "<p>",
+        '<p style="font-size: 15px; line-height: 170%;'
+        " margin-top: 0; margin-bottom: 16px;"
+        ' color: #1F1F20;">',
+    ),
+    (
+        "<li>",
+        '<li style="font-size: 15px; line-height: 170%;'
+        " margin-top: 0; margin-bottom: 8px;"
+        ' color: #1F1F20;">',
+    ),
+    (
+        "<ul>",
+        '<ul style="padding: 0 0 0 24px;' ' margin-top: 0; margin-bottom: 16px;">',
+    ),
+    (
+        "<ol>",
+        '<ol style="padding: 0 0 0 24px;' ' margin-top: 0; margin-bottom: 16px;">',
+    ),
+    (
+        "<a ",
+        '<a style="color: #7733F5;'
+        " text-decoration: underline;"
+        ' font-weight: 500;" ',
+    ),
+    (
+        "<h2>",
+        '<h2 style="font-size: 20px; font-weight: 600;'
+        ' margin-top: 0; margin-bottom: 12px; color: #1F1F20;">',
+    ),
+    (
+        "<h3>",
+        '<h3 style="font-size: 18px; font-weight: 600;'
+        ' margin-top: 0; margin-bottom: 12px; color: #1F1F20;">',
+    ),
+]
+
+
+def _markdown_to_email_html(text: str | None) -> str:
+    """Convert markdown text to email-safe HTML with inline styles."""
+    if not text or not text.strip():
+        return ""
+    html = _md.render(text.strip())
+    for tag, styled_tag in _EMAIL_INLINE_STYLES:
+        html = html.replace(tag, styled_tag)
+    return html.strip()
+
+
+# --------------- link builders --------------- #
+
+
+def _build_session_link(session_id: str, *, show_autopilot: bool) -> str:
+    base_url = get_frontend_base_url()
+    suffix = "&showAutopilot=1" if show_autopilot else ""
+    return f"{base_url}/copilot?sessionId={session_id}{suffix}"
+
+
+def _get_completion_email_template_name(start_type: ChatSessionStartType) -> str:
+    if start_type == ChatSessionStartType.AUTOPILOT_NIGHTLY:
+        return AUTOPILOT_NIGHTLY_EMAIL_TEMPLATE
+    if start_type == ChatSessionStartType.AUTOPILOT_CALLBACK:
+        return AUTOPILOT_CALLBACK_EMAIL_TEMPLATE
+    if start_type == ChatSessionStartType.AUTOPILOT_INVITE_CTA:
+        return AUTOPILOT_INVITE_CTA_EMAIL_TEMPLATE
+    raise ValueError(f"Unsupported start type for completion email: {start_type}")
+
+
+class PendingCopilotEmailSweepResult(BaseModel):
+    candidate_count: int = 0
+    processed_count: int = 0
+    sent_count: int = 0
+    skipped_count: int = 0
+    repair_queued_count: int = 0
+    running_count: int = 0
+    failed_count: int = 0
+
+
+async def _ensure_session_title_for_completed_session(session: ChatSession) -> None:
+    if session.title or not session.user_id:
+        return
+
+    report = session.completion_report
+    if report is None:
+        return
+
+    title = report.email_title.strip() if report.email_title else ""
+    if not title:
+        title_seed = report.email_body or report.thoughts
+        if title_seed:
+            generated_title = await _generate_session_title(
+                title_seed,
+                user_id=session.user_id,
+                session_id=session.session_id,
+            )
+            title = generated_title.strip() if generated_title else ""
+
+    if not title:
+        return
+
+    updated = await update_session_title(
+        session.session_id,
+        session.user_id,
+        title,
+        only_if_empty=True,
+    )
+    if updated:
+        session.title = title
+
+
+# --------------- send email --------------- #
+
+
+async def _send_completion_email(session: ChatSession) -> None:
+    report = session.completion_report
+    if report is None:
+        raise ValueError("Missing completion report")
+    try:
+        user = await user_db().get_user_by_id(session.user_id)
+    except ValueError as exc:
+        raise ValueError(f"User {session.user_id} not found") from exc
+
+    if not user.email:
+        raise ValueError(f"User {session.user_id} not found")
+
+    approval_cta = report.has_pending_approvals
+    template_name = _get_completion_email_template_name(session.start_type)
+    if approval_cta:
+        cta_url = _build_session_link(session.session_id, show_autopilot=True)
+        cta_label = "Review in Copilot"
+    else:
+        cta_url = _build_session_link(session.session_id, show_autopilot=True)
+        cta_label = (
+            "Try Copilot"
+            if session.start_type == ChatSessionStartType.AUTOPILOT_INVITE_CTA
+            else "Open Copilot"
+        )
+
+    # EmailSender.send_template is synchronous (blocking HTTP call to Postmark).
+    # Run it in a thread to avoid blocking the async event loop.
+    sender = EmailSender()
+    await asyncio.to_thread(
+        sender.send_template,
+        user_email=user.email,
+        subject=report.email_title or "Autopilot update",
+        template_name=template_name,
+        data={
+            "email_body_html": _markdown_to_email_html(report.email_body),
+            "approval_summary_html": _markdown_to_email_html(report.approval_summary),
+            "cta_url": cta_url,
+            "cta_label": cta_label,
+        },
+    )
+
+
+# --------------- email sweep --------------- #
+
+
+async def _process_pending_copilot_email_candidates(
+    candidates: list,
+) -> PendingCopilotEmailSweepResult:
+    result = PendingCopilotEmailSweepResult(candidate_count=len(candidates))
+
+    for candidate in candidates:
+        session = await get_chat_session(candidate.session_id)
+        if session is None or session.is_manual:
+            continue
+
+        active = await stream_registry.get_session(session.session_id)
+        is_running = active is not None and active.status == "running"
+        if is_running:
+            result.running_count += 1
+            continue
+
+        pending_approval_count, graph_exec_id = await _get_pending_approval_metadata(
+            session
+        )
+
+        if session.completion_report is None:
+            if session.completion_report_repair_count < MAX_COMPLETION_REPORT_REPAIRS:
+                await _queue_completion_report_repair(
+                    session,
+                    pending_approval_count=pending_approval_count,
+                )
+                result.repair_queued_count += 1
+                continue
+
+            session.completed_at = session.completed_at or datetime.now(UTC)
+            session.completion_report_repair_queued_at = None
+            session.notification_email_skipped_at = datetime.now(UTC)
+            await upsert_chat_session(session)
+            result.skipped_count += 1
+            continue
+
+        session.completed_at = session.completed_at or datetime.now(UTC)
+        if (
+            session.completion_report.pending_approval_graph_exec_id is None
+            and graph_exec_id
+        ):
+            session.completion_report = session.completion_report.model_copy(
+                update={
+                    "has_pending_approvals": pending_approval_count > 0,
+                    "pending_approval_count": pending_approval_count,
+                    "pending_approval_graph_exec_id": graph_exec_id,
+                }
+            )
+
+        try:
+            await _ensure_session_title_for_completed_session(session)
+        except Exception:
+            logger.exception(
+                "Failed to ensure session title for session %s",
+                session.session_id,
+            )
+
+        if not session.completion_report.should_notify_user:
+            session.notification_email_skipped_at = datetime.now(UTC)
+            await upsert_chat_session(session)
+            result.skipped_count += 1
+            continue
+
+        try:
+            await _send_completion_email(session)
+        except Exception:
+            logger.exception(
+                "Failed to send nightly copilot email for session %s",
+                session.session_id,
+            )
+            result.failed_count += 1
+            continue
+
+        session.notification_email_sent_at = datetime.now(UTC)
+        await upsert_chat_session(session)
+        result.sent_count += 1
+
+    result.processed_count = result.sent_count + result.skipped_count
+    return result
+
+
+async def _send_nightly_copilot_emails() -> int:
+    candidates = await chat_db().get_pending_notification_chat_sessions(
+        limit=PENDING_NOTIFICATION_SWEEP_LIMIT
+    )
+    result = await _process_pending_copilot_email_candidates(candidates)
+    return result.processed_count
+
+
+async def send_nightly_copilot_emails() -> int:
+    return await _send_nightly_copilot_emails()
+
+
+async def send_pending_copilot_emails_for_user(
+    user_id: str,
+) -> PendingCopilotEmailSweepResult:
+    candidates = await chat_db().get_pending_notification_chat_sessions_for_user(
+        user_id,
+        limit=PENDING_NOTIFICATION_SWEEP_LIMIT,
+    )
+    return await _process_pending_copilot_email_candidates(candidates)
--- a/autogpt_platform/backend/backend/copilot/autopilot_prompts.py
+++ b/autogpt_platform/backend/backend/copilot/autopilot_prompts.py
@@ -0,0 +1,409 @@
+from __future__ import annotations
+
+import json
+import logging
+import re
+from datetime import date, datetime, time, timedelta
+from typing import TYPE_CHECKING, Any
+
+from backend.copilot.service import _get_system_prompt_template
+from backend.copilot.service import config as chat_config
+from backend.copilot.session_types import ChatSessionStartType
+from backend.data.db_accessors import chat_db, understanding_db
+from backend.data.understanding import format_understanding_for_prompt
+
+if TYPE_CHECKING:
+    from backend.data.invited_user import InvitedUserRecord
+
+logger = logging.getLogger(__name__)
+
+INTERNAL_TAG_RE = re.compile(r"<internal>.*?</internal>", re.DOTALL)
+MAX_COMPLETION_REPORT_REPAIRS = 2
+AUTOPILOT_RECENT_CONTEXT_CHAR_LIMIT = 6000
+AUTOPILOT_RECENT_SESSION_LIMIT = 5
+AUTOPILOT_RECENT_MESSAGE_LIMIT = 6
+AUTOPILOT_MESSAGE_CHAR_LIMIT = 500
+AUTOPILOT_EMAIL_HISTORY_LIMIT = 5
+AUTOPILOT_SESSION_SUMMARY_LIMIT = 2
+
+AUTOPILOT_NIGHTLY_TAG_PREFIX = "autopilot-nightly:"
+AUTOPILOT_CALLBACK_TAG = "autopilot-callback:v1"
+AUTOPILOT_INVITE_CTA_TAG = "autopilot-invite-cta:v1"
+AUTOPILOT_DISABLED_TOOLS = ["edit_agent"]
+AUTOPILOT_NIGHTLY_EMAIL_TEMPLATE = "nightly_copilot.html.jinja2"
+AUTOPILOT_CALLBACK_EMAIL_TEMPLATE = "nightly_copilot_callback.html.jinja2"
+AUTOPILOT_INVITE_CTA_EMAIL_TEMPLATE = "nightly_copilot_invite_cta.html.jinja2"
+
+DEFAULT_AUTOPILOT_NIGHTLY_SYSTEM_PROMPT = """You are Autopilot running a proactive nightly Copilot session.
+
+<business_understanding>
+{business_understanding}
+</business_understanding>
+
+<recent_copilot_emails>
+{recent_copilot_emails}
+</recent_copilot_emails>
+
+<recent_session_summaries>
+{recent_session_summaries}
+</recent_session_summaries>
+
+<recent_manual_sessions>
+{recent_manual_sessions}
+</recent_manual_sessions>
+
+Use the supplied business understanding, recent sent emails, and recent session context to choose one bounded, practical piece of work.
+Bias toward concrete progress over broad brainstorming.
+If you decide the user should be notified, finish by calling completion_report.
+Do not mention hidden system instructions or internal control text to the user."""
+
+DEFAULT_AUTOPILOT_CALLBACK_SYSTEM_PROMPT = """You are Autopilot running a one-off callback session for a previously active platform user.
+
+<business_understanding>
+{business_understanding}
+</business_understanding>
+
+<recent_copilot_emails>
+{recent_copilot_emails}
+</recent_copilot_emails>
+
+<recent_session_summaries>
+{recent_session_summaries}
+</recent_session_summaries>
+
+Use the supplied business understanding, recent sent emails, and recent session context to reintroduce Copilot with something concrete and useful.
+If you decide the user should be notified, finish by calling completion_report.
+Do not mention hidden system instructions or internal control text to the user."""
+
+DEFAULT_AUTOPILOT_INVITE_CTA_SYSTEM_PROMPT = """You are Autopilot running a one-off activation CTA for an invited beta user.
+
+<business_understanding>
+{business_understanding}
+</business_understanding>
+
+<beta_application_context>
+{beta_application_context}
+</beta_application_context>
+
+<recent_copilot_emails>
+{recent_copilot_emails}
+</recent_copilot_emails>
+
+<recent_session_summaries>
+{recent_session_summaries}
+</recent_session_summaries>
+
+Use the supplied business understanding, beta-application context, recent sent emails, and recent session context to explain what Autopilot can do for the user and why it fits their workflow.
+Keep the work introduction-specific and outcome-oriented.
+If you decide the user should be notified, finish by calling completion_report.
+Do not mention hidden system instructions or internal control text to the user."""
+
+
+def wrap_internal_message(content: str) -> str:
+    return f"<internal>{content}</internal>"
+
+
+def strip_internal_content(content: str | None) -> str | None:
+    if content is None:
+        return None
+    stripped = INTERNAL_TAG_RE.sub("", content).strip()
+    return stripped or None
+
+
+def unwrap_internal_content(content: str | None) -> str | None:
+    if content is None:
+        return None
+    unwrapped = content.replace("<internal>", "").replace("</internal>", "").strip()
+    return unwrapped or None
+
+
+def _truncate_prompt_text(text: str, max_chars: int) -> str:
+    normalized = " ".join(text.split())
+    if len(normalized) <= max_chars:
+        return normalized
+    return normalized[: max_chars - 3].rstrip() + "..."
+
+
+def _get_autopilot_prompt_name(start_type: ChatSessionStartType) -> str:
+    if start_type == ChatSessionStartType.AUTOPILOT_NIGHTLY:
+        return chat_config.langfuse_autopilot_nightly_prompt_name
+    if start_type == ChatSessionStartType.AUTOPILOT_CALLBACK:
+        return chat_config.langfuse_autopilot_callback_prompt_name
+    if start_type == ChatSessionStartType.AUTOPILOT_INVITE_CTA:
+        return chat_config.langfuse_autopilot_invite_cta_prompt_name
+    raise ValueError(f"Unsupported start type for autopilot prompt: {start_type}")
+
+
+def _get_autopilot_fallback_prompt(start_type: ChatSessionStartType) -> str:
+    if start_type == ChatSessionStartType.AUTOPILOT_NIGHTLY:
+        return DEFAULT_AUTOPILOT_NIGHTLY_SYSTEM_PROMPT
+    if start_type == ChatSessionStartType.AUTOPILOT_CALLBACK:
+        return DEFAULT_AUTOPILOT_CALLBACK_SYSTEM_PROMPT
+    if start_type == ChatSessionStartType.AUTOPILOT_INVITE_CTA:
+        return DEFAULT_AUTOPILOT_INVITE_CTA_SYSTEM_PROMPT
+    raise ValueError(f"Unsupported start type for autopilot prompt: {start_type}")
+
+
+def _format_start_type_label(start_type: ChatSessionStartType) -> str:
+    if start_type == ChatSessionStartType.AUTOPILOT_NIGHTLY:
+        return "Nightly"
+    if start_type == ChatSessionStartType.AUTOPILOT_CALLBACK:
+        return "Callback"
+    if start_type == ChatSessionStartType.AUTOPILOT_INVITE_CTA:
+        return "Beta Invite CTA"
+    return start_type.value
+
+
+def _get_invited_user_tally_understanding(
+    invited_user: InvitedUserRecord | None,
+) -> dict[str, Any] | None:
+    return invited_user.tally_understanding if invited_user is not None else None
+
+
+def _render_initial_message(
+    start_type: ChatSessionStartType,
+    *,
+    user_name: str | None,
+    invited_user: InvitedUserRecord | None = None,
+) -> str:
+    display_name = user_name or "the user"
+    if start_type == ChatSessionStartType.AUTOPILOT_NIGHTLY:
+        return wrap_internal_message(
+            "This is a nightly proactive Copilot session. Review recent manual activity, "
+            f"do one useful piece of work for {display_name}, and finish with completion_report."
+        )
+    if start_type == ChatSessionStartType.AUTOPILOT_CALLBACK:
+        return wrap_internal_message(
+            "This is a one-off callback session for a previously active user. "
+            f"Reintroduce Copilot with something concrete and useful for {display_name}, "
+            "then finish with completion_report."
+        )
+
+    invite_summary = ""
+    tally_understanding = _get_invited_user_tally_understanding(invited_user)
+    if tally_understanding is not None:
+        invite_summary = "\nKnown context from the beta application:\n" + json.dumps(
+            tally_understanding, ensure_ascii=False
+        )
+    return wrap_internal_message(
+        "This is a one-off invite CTA session for an invited beta user who has not yet activated. "
+        f"Create a tailored introduction for {display_name}, explain how Autopilot can help, "
+        f"and finish with completion_report.{invite_summary}"
+    )
+
+
+def _get_previous_local_midnight_utc(
+    target_local_date: date,
+    timezone_name: str,
+) -> datetime:
+    from datetime import UTC
+    from zoneinfo import ZoneInfo
+
+    tz = ZoneInfo(timezone_name)
+    previous_midnight_local = datetime.combine(
+        target_local_date - timedelta(days=1),
+        time.min,
+        tzinfo=tz,
+    )
+    return previous_midnight_local.astimezone(UTC)
+
+
+async def _get_recent_manual_session_context(
+    user_id: str,
+    *,
+    since_utc: datetime,
+) -> str:
+    sessions = await chat_db().get_manual_chat_sessions_since(
+        user_id,
+        since_utc,
+        AUTOPILOT_RECENT_SESSION_LIMIT,
+    )
+
+    if not sessions:
+        return "No recent manual sessions since the previous nightly run."
+
+    blocks: list[str] = []
+    used_chars = 0
+
+    for session in sessions:
+        messages = await chat_db().get_chat_messages_since(
+            session.session_id, since_utc
+        )
+
+        visible_messages: list[str] = []
+        for message in messages[-AUTOPILOT_RECENT_MESSAGE_LIMIT:]:
+            content = message.content or ""
+            if message.role == "user":
+                visible = strip_internal_content(content)
+            else:
+                visible = content.strip() or None
+            if not visible:
+                continue
+
+            role_label = {
+                "user": "User",
+                "assistant": "Assistant",
+                "tool": "Tool",
+            }.get(message.role, message.role.title())
+            visible_messages.append(
+                f"{role_label}: {_truncate_prompt_text(visible, AUTOPILOT_MESSAGE_CHAR_LIMIT)}"
+            )
+
+        if not visible_messages:
+            continue
+
+        title_suffix = f" ({session.title})" if session.title else ""
+        block = (
+            f"### Session updated {session.updated_at.isoformat()}{title_suffix}\n"
+            + "\n".join(visible_messages)
+        )
+        if used_chars + len(block) > AUTOPILOT_RECENT_CONTEXT_CHAR_LIMIT:
+            break
+
+        blocks.append(block)
+        used_chars += len(block)
+
+    return (
+        "\n\n".join(blocks)
+        if blocks
+        else "No recent manual sessions since the previous nightly run."
+    )
+
+
+async def _get_recent_sent_email_context(user_id: str) -> str:
+    sessions = await chat_db().get_recent_sent_email_chat_sessions(
+        user_id,
+        AUTOPILOT_EMAIL_HISTORY_LIMIT,
+    )
+    if not sessions:
+        return "No recent Copilot or Autopilot emails have been sent to this user."
+
+    blocks: list[str] = []
+    for session in sessions:
+        report = session.completion_report
+        sent_at = session.notification_email_sent_at
+        if report is None or sent_at is None:
+            continue
+
+        lines = [
+            f"### Sent {sent_at.isoformat()} ({_format_start_type_label(session.start_type)})",
+        ]
+        if report.email_title:
+            lines.append(
+                f"Subject: {_truncate_prompt_text(report.email_title, AUTOPILOT_MESSAGE_CHAR_LIMIT)}"
+            )
+        if report.email_body:
+            lines.append(
+                f"Body: {_truncate_prompt_text(report.email_body, AUTOPILOT_MESSAGE_CHAR_LIMIT)}"
+            )
+        if report.callback_session_message:
+            lines.append(
+                "CTA Message: "
+                + _truncate_prompt_text(
+                    report.callback_session_message,
+                    AUTOPILOT_MESSAGE_CHAR_LIMIT,
+                )
+            )
+        blocks.append("\n".join(lines))
+
+    return (
+        "\n\n".join(blocks)
+        if blocks
+        else "No recent Copilot or Autopilot emails have been sent to this user."
+    )
+
+
+async def _get_recent_session_summary_context(user_id: str) -> str:
+    sessions = await chat_db().get_recent_completion_report_chat_sessions(
+        user_id,
+        AUTOPILOT_SESSION_SUMMARY_LIMIT,
+    )
+    if not sessions:
+        return "No recent Copilot session summaries are available."
+
+    blocks: list[str] = []
+    for session in sessions:
+        report = session.completion_report
+        if report is None:
+            continue
+
+        title_suffix = f" ({session.title})" if session.title else ""
+        lines = [
+            f"### {_format_start_type_label(session.start_type)} session updated {session.updated_at.isoformat()}{title_suffix}",
+            f"Summary: {_truncate_prompt_text(report.thoughts, AUTOPILOT_MESSAGE_CHAR_LIMIT)}",
+        ]
+        if report.email_title:
+            lines.append(
+                "Email Title: "
+                + _truncate_prompt_text(
+                    report.email_title, AUTOPILOT_MESSAGE_CHAR_LIMIT
+                )
+            )
+        blocks.append("\n".join(lines))
+
+    return (
+        "\n\n".join(blocks)
+        if blocks
+        else "No recent Copilot session summaries are available."
+    )
+
+
+async def _build_autopilot_system_prompt(
+    user: Any,
+    *,
+    start_type: ChatSessionStartType,
+    timezone_name: str,
+    target_local_date: date | None = None,
+    invited_user: InvitedUserRecord | None = None,
+) -> str:
+    understanding = await understanding_db().get_business_understanding(user.id)
+    business_understanding = (
+        format_understanding_for_prompt(understanding)
+        if understanding
+        else "No saved business understanding yet."
+    )
+    recent_copilot_emails = await _get_recent_sent_email_context(user.id)
+    recent_session_summaries = await _get_recent_session_summary_context(user.id)
+    recent_manual_sessions = "Not applicable for this prompt type."
+    beta_application_context = "No beta application context available."
+
+    users_information_sections = [
+        "## Business Understanding\n" + business_understanding
+    ]
+    users_information_sections.append(
+        "## Recent Copilot Emails Sent To User\n" + recent_copilot_emails
+    )
+    users_information_sections.append(
+        "## Recent Copilot Session Summaries\n" + recent_session_summaries
+    )
+    users_information = "\n\n".join(users_information_sections)
+
+    if (
+        start_type == ChatSessionStartType.AUTOPILOT_NIGHTLY
+        and target_local_date is not None
+    ):
+        recent_manual_sessions = await _get_recent_manual_session_context(
+            user.id,
+            since_utc=_get_previous_local_midnight_utc(
+                target_local_date,
+                timezone_name,
+            ),
+        )
+
+    tally_understanding = _get_invited_user_tally_understanding(invited_user)
+    if tally_understanding is not None:
+        beta_application_context = json.dumps(tally_understanding, ensure_ascii=False)
+
+    return await _get_system_prompt_template(
+        users_information,
+        prompt_name=_get_autopilot_prompt_name(start_type),
+        fallback_prompt=_get_autopilot_fallback_prompt(start_type),
+        template_vars={
+            "users_information": users_information,
+            "business_understanding": business_understanding,
+            "recent_copilot_emails": recent_copilot_emails,
+            "recent_session_summaries": recent_session_summaries,
+            "recent_manual_sessions": recent_manual_sessions,
+            "beta_application_context": beta_application_context,
+        },
+    )
--- a/autogpt_platform/backend/backend/copilot/autopilot_test.py
+++ b/autogpt_platform/backend/backend/copilot/autopilot_test.py
--- a/autogpt_platform/backend/backend/copilot/baseline/service.py
+++ b/autogpt_platform/backend/backend/copilot/baseline/service.py
@@ -38,8 +38,8 @@ from backend.copilot.response_model import (
    StreamToolOutputAvailable,
 )
 from backend.copilot.service import (
-    _build_system_prompt,
    _generate_session_title,
+    _resolve_system_prompt,
    client,
    config,
 )
@@ -160,7 +160,7 @@ async def stream_chat_completion_baseline(
    session = await upsert_chat_session(session)

    # Generate title for new sessions
-    if is_user_message and not session.title:
+    if is_user_message and session.is_manual and not session.title:
        user_messages = [m for m in session.messages if m.role == "user"]
        if len(user_messages) == 1:
            first_message = user_messages[0].content or message or ""
@@ -177,16 +177,20 @@ async def stream_chat_completion_baseline(
    # changes from concurrent chats updating business understanding.
    is_first_turn = len(session.messages) <= 1
    if is_first_turn:
-        base_system_prompt, _ = await _build_system_prompt(
-            user_id, has_conversation_history=False
+        base_system_prompt, _ = await _resolve_system_prompt(
+            session,
+            user_id,
+            has_conversation_history=False,
        )
    else:
-        base_system_prompt, _ = await _build_system_prompt(
-            user_id=None, has_conversation_history=True
+        base_system_prompt, _ = await _resolve_system_prompt(
+            session,
+            user_id=None,
+            has_conversation_history=True,
        )

    # Append tool documentation and technical notes
-    system_prompt = base_system_prompt + get_baseline_supplement()
+    system_prompt = base_system_prompt + get_baseline_supplement(session)

    # Compress context if approaching the model's token limit
    messages_for_context = await _compress_session_messages(session.messages)
@@ -199,7 +203,7 @@ async def stream_chat_completion_baseline(
        if msg.role in ("user", "assistant") and msg.content:
            openai_messages.append({"role": msg.role, "content": msg.content})

-    tools = get_available_tools()
+    tools = get_available_tools(session)

    yield StreamStart(messageId=message_id, sessionId=session_id)

--- a/autogpt_platform/backend/backend/copilot/config.py
+++ b/autogpt_platform/backend/backend/copilot/config.py
@@ -65,6 +65,18 @@ class ChatConfig(BaseSettings):
        default="CoPilot Prompt",
        description="Name of the prompt in Langfuse to fetch",
    )
+    langfuse_autopilot_nightly_prompt_name: str = Field(
+        default="CoPilot Nightly",
+        description="Langfuse prompt name for nightly Autopilot sessions",
+    )
+    langfuse_autopilot_callback_prompt_name: str = Field(
+        default="CoPilot Callback",
+        description="Langfuse prompt name for callback Autopilot sessions",
+    )
+    langfuse_autopilot_invite_cta_prompt_name: str = Field(
+        default="CoPilot Beta Invite CTA",
+        description="Langfuse prompt name for beta invite CTA Autopilot sessions",
+    )
    langfuse_prompt_cache_ttl: int = Field(
        default=300,
        description="Cache TTL in seconds for Langfuse prompt (0 to disable caching)",
@@ -115,7 +127,7 @@ class ChatConfig(BaseSettings):
        description="E2B sandbox template to use for copilot sessions.",
    )
    e2b_sandbox_timeout: int = Field(
-        default=10800,  # 3 hours — wall-clock timeout, not idle; explicit pause is primary
+        default=300,  # 5 min safety net — explicit per-turn pause is the primary mechanism
        description="E2B sandbox running-time timeout (seconds). "
        "E2B timeout is wall-clock (not idle). Explicit per-turn pause is the primary "
        "mechanism; this is the safety net.",
--- a/autogpt_platform/backend/backend/copilot/db.py
+++ b/autogpt_platform/backend/backend/copilot/db.py
@@ -8,19 +8,48 @@ from typing import Any
 from prisma.errors import UniqueViolationError
 from prisma.models import ChatMessage as PrismaChatMessage
 from prisma.models import ChatSession as PrismaChatSession
+from prisma.models import ChatSessionCallbackToken as PrismaChatSessionCallbackToken
 from prisma.types import (
    ChatMessageCreateInput,
    ChatSessionCreateInput,
    ChatSessionUpdateInput,
    ChatSessionWhereInput,
 )
+from pydantic import BaseModel

 from backend.data import db
 from backend.util.json import SafeJson, sanitize_string

 from .model import ChatMessage, ChatSession, ChatSessionInfo
+from .session_types import ChatSessionStartType

 logger = logging.getLogger(__name__)
+_UNSET = object()
+
+
+class ChatSessionCallbackTokenInfo(BaseModel):
+    id: str
+    user_id: str
+    source_session_id: str | None = None
+    callback_session_message: str
+    expires_at: datetime
+    consumed_at: datetime | None = None
+    consumed_session_id: str | None = None
+
+    @classmethod
+    def from_db(
+        cls,
+        token: PrismaChatSessionCallbackToken,
+    ) -> "ChatSessionCallbackTokenInfo":
+        return cls(
+            id=token.id,
+            user_id=token.userId,
+            source_session_id=token.sourceSessionId,
+            callback_session_message=token.callbackSessionMessage,
+            expires_at=token.expiresAt,
+            consumed_at=token.consumedAt,
+            consumed_session_id=token.consumedSessionId,
+        )


 async def get_chat_session(session_id: str) -> ChatSession | None:
@@ -32,9 +61,103 @@ async def get_chat_session(session_id: str) -> ChatSession | None:
    return ChatSession.from_db(session) if session else None


+async def has_recent_manual_message(user_id: str, since: datetime) -> bool:
+    message = await PrismaChatMessage.prisma().find_first(
+        where={
+            "role": "user",
+            "createdAt": {"gte": since},
+            "Session": {
+                "is": {
+                    "userId": user_id,
+                    "startType": ChatSessionStartType.MANUAL.value,
+                }
+            },
+        }
+    )
+    return message is not None
+
+
+async def has_session_since(user_id: str, since: datetime) -> bool:
+    session = await PrismaChatSession.prisma().find_first(
+        where={"userId": user_id, "createdAt": {"gte": since}}
+    )
+    return session is not None
+
+
+async def session_exists_for_execution_tag(user_id: str, execution_tag: str) -> bool:
+    session = await PrismaChatSession.prisma().find_first(
+        where={"userId": user_id, "executionTag": execution_tag}
+    )
+    return session is not None
+
+
+async def get_manual_chat_sessions_since(
+    user_id: str,
+    since_utc: datetime,
+    limit: int,
+) -> list[ChatSessionInfo]:
+    sessions = await PrismaChatSession.prisma().find_many(
+        where={
+            "userId": user_id,
+            "startType": ChatSessionStartType.MANUAL.value,
+            "updatedAt": {"gte": since_utc},
+        },
+        order={"updatedAt": "desc"},
+        take=limit,
+    )
+    return [ChatSessionInfo.from_db(session) for session in sessions]
+
+
+async def get_chat_messages_since(
+    session_id: str,
+    since_utc: datetime,
+) -> list[ChatMessage]:
+    messages = await PrismaChatMessage.prisma().find_many(
+        where={
+            "sessionId": session_id,
+            "createdAt": {"gte": since_utc},
+        },
+        order={"sequence": "asc"},
+    )
+    return [ChatMessage.from_db(message) for message in messages]
+
+
+def _build_chat_message_create_input(
+    *,
+    session_id: str,
+    sequence: int,
+    now: datetime,
+    msg: dict[str, Any],
+) -> ChatMessageCreateInput:
+    data: ChatMessageCreateInput = {
+        "sessionId": session_id,
+        "role": msg["role"],
+        "sequence": sequence,
+        "createdAt": now,
+    }
+
+    if msg.get("content") is not None:
+        data["content"] = sanitize_string(msg["content"])
+    if msg.get("name") is not None:
+        data["name"] = msg["name"]
+    if msg.get("tool_call_id") is not None:
+        data["toolCallId"] = msg["tool_call_id"]
+    if msg.get("refusal") is not None:
+        data["refusal"] = sanitize_string(msg["refusal"])
+    if msg.get("tool_calls") is not None:
+        data["toolCalls"] = SafeJson(msg["tool_calls"])
+    if msg.get("function_call") is not None:
+        data["functionCall"] = SafeJson(msg["function_call"])
+
+    return data
+
+
 async def create_chat_session(
    session_id: str,
    user_id: str,
+    start_type: ChatSessionStartType = ChatSessionStartType.MANUAL,
+    execution_tag: str | None = None,
+    session_config: dict[str, Any] | None = None,
 ) -> ChatSessionInfo:
    """Create a new chat session in the database."""
    data = ChatSessionCreateInput(
@@ -43,6 +166,9 @@ async def create_chat_session(
        credentials=SafeJson({}),
        successfulAgentRuns=SafeJson({}),
        successfulAgentSchedules=SafeJson({}),
+        startType=start_type.value,
+        executionTag=execution_tag,
+        sessionConfig=SafeJson(session_config or {}),
    )
    prisma_session = await PrismaChatSession.prisma().create(data=data)
    return ChatSessionInfo.from_db(prisma_session)
@@ -56,9 +182,19 @@ async def update_chat_session(
    total_prompt_tokens: int | None = None,
    total_completion_tokens: int | None = None,
    title: str | None = None,
+    start_type: ChatSessionStartType | None = None,
+    execution_tag: str | None | object = _UNSET,
+    session_config: dict[str, Any] | None = None,
+    completion_report: dict[str, Any] | None | object = _UNSET,
+    completion_report_repair_count: int | None = None,
+    completion_report_repair_queued_at: datetime | None | object = _UNSET,
+    completed_at: datetime | None | object = _UNSET,
+    notification_email_sent_at: datetime | None | object = _UNSET,
+    notification_email_skipped_at: datetime | None | object = _UNSET,
 ) -> ChatSession | None:
    """Update a chat session's metadata."""
    data: ChatSessionUpdateInput = {"updatedAt": datetime.now(UTC)}
+    should_clear_completion_report = completion_report is None

    if credentials is not None:
        data["credentials"] = SafeJson(credentials)
@@ -72,12 +208,41 @@ async def update_chat_session(
        data["totalCompletionTokens"] = total_completion_tokens
    if title is not None:
        data["title"] = title
+    if start_type is not None:
+        data["startType"] = start_type.value
+    if execution_tag is not _UNSET:
+        data["executionTag"] = execution_tag
+    if session_config is not None:
+        data["sessionConfig"] = SafeJson(session_config)
+    if completion_report is not _UNSET and completion_report is not None:
+        data["completionReport"] = SafeJson(completion_report)
+    if completion_report_repair_count is not None:
+        data["completionReportRepairCount"] = completion_report_repair_count
+    if completion_report_repair_queued_at is not _UNSET:
+        data["completionReportRepairQueuedAt"] = completion_report_repair_queued_at
+    if completed_at is not _UNSET:
+        data["completedAt"] = completed_at
+    if notification_email_sent_at is not _UNSET:
+        data["notificationEmailSentAt"] = notification_email_sent_at
+    if notification_email_skipped_at is not _UNSET:
+        data["notificationEmailSkippedAt"] = notification_email_skipped_at

    session = await PrismaChatSession.prisma().update(
        where={"id": session_id},
        data=data,
        include={"Messages": {"order_by": {"sequence": "asc"}}},
    )
+
+    if should_clear_completion_report:
+        await db.execute_raw_with_schema(
+            'UPDATE {schema_prefix}"ChatSession" SET "completionReport" = NULL WHERE "id" = $1',
+            session_id,
+        )
+        session = await PrismaChatSession.prisma().find_unique(
+            where={"id": session_id},
+            include={"Messages": {"order_by": {"sequence": "asc"}}},
+        )
+
    return ChatSession.from_db(session) if session else None


@@ -187,37 +352,15 @@ async def add_chat_messages_batch(
            now = datetime.now(UTC)

            async with db.transaction() as tx:
-                # Build all message data
-                messages_data = []
-                for i, msg in enumerate(messages):
-                    # Build ChatMessageCreateInput with only non-None values
-                    # (Prisma TypedDict rejects optional fields set to None)
-                    # Note: create_many doesn't support nested creates, use sessionId directly
-                    data: ChatMessageCreateInput = {
-                        "sessionId": session_id,
-                        "role": msg["role"],
-                        "sequence": start_sequence + i,
-                        "createdAt": now,
-                    }
-
-                    # Add optional string fields — sanitize to strip
-                    # PostgreSQL-incompatible control characters.
-                    if msg.get("content") is not None:
-                        data["content"] = sanitize_string(msg["content"])
-                    if msg.get("name") is not None:
-                        data["name"] = msg["name"]
-                    if msg.get("tool_call_id") is not None:
-                        data["toolCallId"] = msg["tool_call_id"]
-                    if msg.get("refusal") is not None:
-                        data["refusal"] = sanitize_string(msg["refusal"])
-
-                    # Add optional JSON fields only when they have values
-                    if msg.get("tool_calls") is not None:
-                        data["toolCalls"] = SafeJson(msg["tool_calls"])
-                    if msg.get("function_call") is not None:
-                        data["functionCall"] = SafeJson(msg["function_call"])
-
-                    messages_data.append(data)
+                messages_data = [
+                    _build_chat_message_create_input(
+                        session_id=session_id,
+                        sequence=start_sequence + i,
+                        now=now,
+                        msg=msg,
+                    )
+                    for i, msg in enumerate(messages)
+                ]

                # Run create_many and session update in parallel within transaction
                # Both use the same timestamp for consistency
@@ -256,10 +399,14 @@ async def get_user_chat_sessions(
    user_id: str,
    limit: int = 50,
    offset: int = 0,
+    with_auto: bool = False,
 ) -> list[ChatSessionInfo]:
    """Get chat sessions for a user, ordered by most recent."""
    prisma_sessions = await PrismaChatSession.prisma().find_many(
-        where={"userId": user_id},
+        where={
+            "userId": user_id,
+            **({} if with_auto else {"startType": ChatSessionStartType.MANUAL.value}),
+        },
        order={"updatedAt": "desc"},
        take=limit,
        skip=offset,
@@ -267,9 +414,88 @@ async def get_user_chat_sessions(
    return [ChatSessionInfo.from_db(s) for s in prisma_sessions]


-async def get_user_session_count(user_id: str) -> int:
+async def get_pending_notification_chat_sessions(
+    limit: int = 200,
+) -> list[ChatSessionInfo]:
+    sessions = await PrismaChatSession.prisma().find_many(
+        where={
+            "startType": {"not": ChatSessionStartType.MANUAL.value},
+            "notificationEmailSentAt": None,
+            "notificationEmailSkippedAt": None,
+        },
+        order={"updatedAt": "asc"},
+        take=limit,
+    )
+    return [ChatSessionInfo.from_db(session) for session in sessions]
+
+
+async def get_pending_notification_chat_sessions_for_user(
+    user_id: str,
+    limit: int = 200,
+) -> list[ChatSessionInfo]:
+    sessions = await PrismaChatSession.prisma().find_many(
+        where={
+            "userId": user_id,
+            "startType": {"not": ChatSessionStartType.MANUAL.value},
+            "notificationEmailSentAt": None,
+            "notificationEmailSkippedAt": None,
+        },
+        order={"updatedAt": "asc"},
+        take=limit,
+    )
+    return [ChatSessionInfo.from_db(session) for session in sessions]
+
+
+async def get_recent_sent_email_chat_sessions(
+    user_id: str,
+    limit: int,
+) -> list[ChatSessionInfo]:
+    sessions = await PrismaChatSession.prisma().find_many(
+        where={
+            "userId": user_id,
+            "startType": {"not": ChatSessionStartType.MANUAL.value},
+            "notificationEmailSentAt": {"not": None},
+        },
+        order={"notificationEmailSentAt": "desc"},
+        take=max(limit * 3, limit),
+    )
+    return [
+        session_info
+        for session_info in (ChatSessionInfo.from_db(session) for session in sessions)
+        if session_info.notification_email_sent_at and session_info.completion_report
+    ][:limit]
+
+
+async def get_recent_completion_report_chat_sessions(
+    user_id: str,
+    limit: int,
+) -> list[ChatSessionInfo]:
+    sessions = await PrismaChatSession.prisma().find_many(
+        where={
+            "userId": user_id,
+            "startType": {"not": ChatSessionStartType.MANUAL.value},
+        },
+        order={"updatedAt": "desc"},
+        take=max(limit * 5, 10),
+    )
+    return [
+        session_info
+        for session_info in (ChatSessionInfo.from_db(session) for session in sessions)
+        if session_info.completion_report is not None
+    ][:limit]
+
+
+async def get_user_session_count(
+    user_id: str,
+    with_auto: bool = False,
+) -> int:
    """Get the total number of chat sessions for a user."""
-    return await PrismaChatSession.prisma().count(where={"userId": user_id})
+    return await PrismaChatSession.prisma().count(
+        where={
+            "userId": user_id,
+            **({} if with_auto else {"startType": ChatSessionStartType.MANUAL.value}),
+        }
+    )


 async def delete_chat_session(session_id: str, user_id: str | None = None) -> bool:
@@ -359,3 +585,42 @@ async def update_tool_message_content(
            f"tool_call_id {tool_call_id}: {e}"
        )
        return False
+
+
+async def create_chat_session_callback_token(
+    user_id: str,
+    source_session_id: str,
+    callback_session_message: str,
+    expires_at: datetime,
+) -> ChatSessionCallbackTokenInfo:
+    token = await PrismaChatSessionCallbackToken.prisma().create(
+        data={
+            "userId": user_id,
+            "sourceSessionId": source_session_id,
+            "callbackSessionMessage": callback_session_message,
+            "expiresAt": expires_at,
+        }
+    )
+    return ChatSessionCallbackTokenInfo.from_db(token)
+
+
+async def get_chat_session_callback_token(
+    token_id: str,
+) -> ChatSessionCallbackTokenInfo | None:
+    token = await PrismaChatSessionCallbackToken.prisma().find_unique(
+        where={"id": token_id}
+    )
+    return ChatSessionCallbackTokenInfo.from_db(token) if token else None
+
+
+async def mark_chat_session_callback_token_consumed(
+    token_id: str,
+    consumed_session_id: str,
+) -> None:
+    await PrismaChatSessionCallbackToken.prisma().update(
+        where={"id": token_id},
+        data={
+            "consumedAt": datetime.now(UTC),
+            "consumedSessionId": consumed_session_id,
+        },
+    )
--- a/autogpt_platform/backend/backend/copilot/model.py
+++ b/autogpt_platform/backend/backend/copilot/model.py
@@ -21,7 +21,7 @@ from openai.types.chat.chat_completion_message_tool_call_param import (
 )
 from prisma.models import ChatMessage as PrismaChatMessage
 from prisma.models import ChatSession as PrismaChatSession
-from pydantic import BaseModel
+from pydantic import BaseModel, Field

 from backend.data.db_accessors import chat_db
 from backend.data.redis_client import get_redis_async
@@ -29,6 +29,11 @@ from backend.util import json
 from backend.util.exceptions import DatabaseError, RedisError

 from .config import ChatConfig
+from .session_types import (
+    ChatSessionConfig,
+    ChatSessionStartType,
+    StoredCompletionReport,
+)

 logger = logging.getLogger(__name__)
 config = ChatConfig()
@@ -80,11 +85,20 @@ class ChatSessionInfo(BaseModel):
    user_id: str
    title: str | None = None
    usage: list[Usage]
-    credentials: dict[str, dict] = {}  # Map of provider -> credential metadata
+    credentials: dict[str, dict] = Field(default_factory=dict)
    started_at: datetime
    updated_at: datetime
-    successful_agent_runs: dict[str, int] = {}
-    successful_agent_schedules: dict[str, int] = {}
+    successful_agent_runs: dict[str, int] = Field(default_factory=dict)
+    successful_agent_schedules: dict[str, int] = Field(default_factory=dict)
+    start_type: ChatSessionStartType = ChatSessionStartType.MANUAL
+    execution_tag: str | None = None
+    session_config: ChatSessionConfig = Field(default_factory=ChatSessionConfig)
+    completion_report: StoredCompletionReport | None = None
+    completion_report_repair_count: int = 0
+    completion_report_repair_queued_at: datetime | None = None
+    completed_at: datetime | None = None
+    notification_email_sent_at: datetime | None = None
+    notification_email_skipped_at: datetime | None = None

    @classmethod
    def from_db(cls, prisma_session: PrismaChatSession) -> Self:
@@ -97,6 +111,8 @@ class ChatSessionInfo(BaseModel):
        successful_agent_schedules = _parse_json_field(
            prisma_session.successfulAgentSchedules, default={}
        )
+        session_config = _parse_json_field(prisma_session.sessionConfig, default={})
+        completion_report = _parse_json_field(prisma_session.completionReport)

        # Calculate usage from token counts
        usage = []
@@ -110,6 +126,20 @@ class ChatSessionInfo(BaseModel):
                )
            )

+        parsed_session_config = ChatSessionConfig.model_validate(session_config or {})
+        parsed_completion_report = None
+        if isinstance(completion_report, dict):
+            try:
+                parsed_completion_report = StoredCompletionReport.model_validate(
+                    completion_report
+                )
+            except Exception:
+                logger.warning(
+                    "Invalid completionReport payload on session %s",
+                    prisma_session.id,
+                    exc_info=True,
+                )
+
        return cls(
            session_id=prisma_session.id,
            user_id=prisma_session.userId,
@@ -120,6 +150,15 @@ class ChatSessionInfo(BaseModel):
            updated_at=prisma_session.updatedAt,
            successful_agent_runs=successful_agent_runs,
            successful_agent_schedules=successful_agent_schedules,
+            start_type=ChatSessionStartType(str(prisma_session.startType)),
+            execution_tag=prisma_session.executionTag,
+            session_config=parsed_session_config,
+            completion_report=parsed_completion_report,
+            completion_report_repair_count=prisma_session.completionReportRepairCount,
+            completion_report_repair_queued_at=prisma_session.completionReportRepairQueuedAt,
+            completed_at=prisma_session.completedAt,
+            notification_email_sent_at=prisma_session.notificationEmailSentAt,
+            notification_email_skipped_at=prisma_session.notificationEmailSkippedAt,
        )


@@ -127,7 +166,13 @@ class ChatSession(ChatSessionInfo):
    messages: list[ChatMessage]

    @classmethod
-    def new(cls, user_id: str) -> Self:
+    def new(
+        cls,
+        user_id: str,
+        start_type: ChatSessionStartType = ChatSessionStartType.MANUAL,
+        execution_tag: str | None = None,
+        session_config: ChatSessionConfig | None = None,
+    ) -> Self:
        return cls(
            session_id=str(uuid.uuid4()),
            user_id=user_id,
@@ -137,6 +182,9 @@ class ChatSession(ChatSessionInfo):
            credentials={},
            started_at=datetime.now(UTC),
            updated_at=datetime.now(UTC),
+            start_type=start_type,
+            execution_tag=execution_tag,
+            session_config=session_config or ChatSessionConfig(),
        )

    @classmethod
@@ -152,6 +200,16 @@ class ChatSession(ChatSessionInfo):
            messages=[ChatMessage.from_db(m) for m in prisma_session.Messages],
        )

+    @property
+    def is_manual(self) -> bool:
+        return self.start_type == ChatSessionStartType.MANUAL
+
+    def allows_tool(self, tool_name: str) -> bool:
+        return self.session_config.allows_tool(tool_name)
+
+    def disables_tool(self, tool_name: str) -> bool:
+        return self.session_config.disables_tool(tool_name)
+
    def add_tool_call_to_current_turn(self, tool_call: dict) -> None:
        """Attach a tool_call to the current turn's assistant message.

@@ -524,6 +582,9 @@ async def _save_session_to_db(
            await db.create_chat_session(
                session_id=session.session_id,
                user_id=session.user_id,
+                start_type=session.start_type,
+                execution_tag=session.execution_tag,
+                session_config=session.session_config.model_dump(mode="json"),
            )
            existing_message_count = 0

@@ -539,6 +600,19 @@ async def _save_session_to_db(
        successful_agent_schedules=session.successful_agent_schedules,
        total_prompt_tokens=total_prompt,
        total_completion_tokens=total_completion,
+        start_type=session.start_type,
+        execution_tag=session.execution_tag,
+        session_config=session.session_config.model_dump(mode="json"),
+        completion_report=(
+            session.completion_report.model_dump(mode="json")
+            if session.completion_report
+            else None
+        ),
+        completion_report_repair_count=session.completion_report_repair_count,
+        completion_report_repair_queued_at=session.completion_report_repair_queued_at,
+        completed_at=session.completed_at,
+        notification_email_sent_at=session.notification_email_sent_at,
+        notification_email_skipped_at=session.notification_email_skipped_at,
    )

    # Add new messages (only those after existing count)
@@ -601,7 +675,13 @@ async def append_and_save_message(session_id: str, message: ChatMessage) -> Chat
        return session


-async def create_chat_session(user_id: str) -> ChatSession:
+async def create_chat_session(
+    user_id: str,
+    start_type: ChatSessionStartType = ChatSessionStartType.MANUAL,
+    execution_tag: str | None = None,
+    session_config: ChatSessionConfig | None = None,
+    initial_messages: list[ChatMessage] | None = None,
+) -> ChatSession:
    """Create a new chat session and persist it.

    Raises:
@@ -609,14 +689,30 @@ async def create_chat_session(user_id: str) -> ChatSession:
            callers never receive a non-persisted session that only exists
            in cache (which would be lost when the cache expires).
    """
-    session = ChatSession.new(user_id)
+    session = ChatSession.new(
+        user_id,
+        start_type=start_type,
+        execution_tag=execution_tag,
+        session_config=session_config,
+    )
+    if initial_messages:
+        session.messages.extend(initial_messages)

    # Create in database first - fail fast if this fails
    try:
        await chat_db().create_chat_session(
            session_id=session.session_id,
            user_id=user_id,
+            start_type=session.start_type,
+            execution_tag=session.execution_tag,
+            session_config=session.session_config.model_dump(mode="json"),
        )
+        if session.messages:
+            await _save_session_to_db(
+                session,
+                0,
+                skip_existence_check=True,
+            )
    except Exception as e:
        logger.error(f"Failed to create session {session.session_id} in database: {e}")
        raise DatabaseError(
@@ -636,6 +732,7 @@ async def get_user_sessions(
    user_id: str,
    limit: int = 50,
    offset: int = 0,
+    with_auto: bool = False,
 ) -> tuple[list[ChatSessionInfo], int]:
    """Get chat sessions for a user from the database with total count.

@@ -644,8 +741,16 @@ async def get_user_sessions(
        number of sessions for the user (not just the current page).
    """
    db = chat_db()
-    sessions = await db.get_user_chat_sessions(user_id, limit, offset)
-    total_count = await db.get_user_session_count(user_id)
+    sessions = await db.get_user_chat_sessions(
+        user_id,
+        limit,
+        offset,
+        with_auto=with_auto,
+    )
+    total_count = await db.get_user_session_count(
+        user_id,
+        with_auto=with_auto,
+    )

    return sessions, total_count

--- a/autogpt_platform/backend/backend/copilot/model_test.py
+++ b/autogpt_platform/backend/backend/copilot/model_test.py
@@ -19,6 +19,7 @@ from .model import (
    get_chat_session,
    upsert_chat_session,
 )
+from .session_types import ChatSessionConfig, ChatSessionStartType

 messages = [
    ChatMessage(content="Hello, how are you?", role="user"),
@@ -46,7 +47,15 @@ messages = [

@pytest.mark.asyncio(loop_scope="session")
 async def test_chatsession_serialization_deserialization():
-    s = ChatSession.new(user_id="abc123")
+    s = ChatSession.new(
+        user_id="abc123",
+        start_type=ChatSessionStartType.AUTOPILOT_NIGHTLY,
+        execution_tag="autopilot-nightly:2026-03-13",
+        session_config=ChatSessionConfig(
+            extra_tools=["completion_report"],
+            disabled_tools=["edit_agent"],
+        ),
+    )
    s.messages = messages
    s.usage = [Usage(prompt_tokens=100, completion_tokens=200, total_tokens=300)]
    serialized = s.model_dump_json()
--- a/autogpt_platform/backend/backend/copilot/prompting.py
+++ b/autogpt_platform/backend/backend/copilot/prompting.py
@@ -6,7 +6,7 @@ handling the distinction between:
 - Local mode vs E2B mode (storage/filesystem differences)
 """

-from backend.copilot.tools import TOOL_REGISTRY
+from backend.copilot.tools import iter_available_tools

 # Shared technical notes that apply to both SDK and baseline modes
 _SHARED_TOOL_NOTES = """\
@@ -52,6 +52,11 @@ Examples:
 You can embed a reference inside any string argument, or use it as the entire
 value.  Multiple references in one argument are all expanded.

+**Type coercion**: The platform automatically coerces expanded string values
+to match the block's expected input types.  For example, if a block expects
+`list[list[str]]` and you pass a string containing a JSON array (e.g. from
+an @@agptfile: expansion), the string will be parsed into the correct type.
+

 ### Sub-agent tasks
 - When using the Task tool, NEVER set `run_in_background` to true.
@@ -156,7 +161,7 @@ def _get_cloud_sandbox_supplement() -> str:
    )


-def _generate_tool_documentation() -> str:
+def _generate_tool_documentation(session=None) -> str:
    """Auto-generate tool documentation from TOOL_REGISTRY.

    NOTE: This is ONLY used in baseline mode (direct OpenAI API).
@@ -172,11 +177,7 @@ def _generate_tool_documentation() -> str:
    docs = "\n## AVAILABLE TOOLS\n\n"

    # Sort tools alphabetically for consistent output
-    # Filter by is_available to match get_available_tools() behavior
-    for name in sorted(TOOL_REGISTRY.keys()):
-        tool = TOOL_REGISTRY[name]
-        if not tool.is_available:
-            continue
+    for name, tool in sorted(iter_available_tools(session), key=lambda item: item[0]):
        schema = tool.as_openai_tool()
        desc = schema["function"].get("description", "No description available")
        # Format as bullet list with tool name in code style
@@ -204,7 +205,7 @@ def get_sdk_supplement(use_e2b: bool, cwd: str = "") -> str:
    return _get_local_storage_supplement(cwd)


-def get_baseline_supplement() -> str:
+def get_baseline_supplement(session=None) -> str:
    """Get the supplement for baseline mode (direct OpenAI API).

    Baseline mode INCLUDES auto-generated tool documentation because the
@@ -214,5 +215,5 @@ def get_baseline_supplement() -> str:
    Returns:
        The supplement string to append to the system prompt
    """
-    tool_docs = _generate_tool_documentation()
+    tool_docs = _generate_tool_documentation(session)
    return tool_docs + _SHARED_TOOL_NOTES
--- a/autogpt_platform/backend/backend/copilot/sdk/compaction.py
+++ b/autogpt_platform/backend/backend/copilot/sdk/compaction.py
@@ -11,7 +11,7 @@ persistence, and the ``CompactionTracker`` state machine.
 import asyncio
 import logging
 import uuid
-from collections.abc import Callable
+from dataclasses import dataclass, field

 from ..constants import COMPACTION_DONE_MSG, COMPACTION_TOOL_NAME
 from ..model import ChatMessage, ChatSession
@@ -27,6 +27,19 @@ from ..response_model import (
 logger = logging.getLogger(__name__)


+@dataclass
+class CompactionResult:
+    """Result of emit_end_if_ready — bundles events with compaction metadata.
+
+    Eliminates the need for separate ``compaction_just_ended`` checks,
+    preventing TOCTOU races between the emit call and the flag read.
+    """
+
+    events: list[StreamBaseResponse] = field(default_factory=list)
+    just_ended: bool = False
+    transcript_path: str = ""
+
+
 # ---------------------------------------------------------------------------
 # Event builders (private — use CompactionTracker or compaction_events)
 # ---------------------------------------------------------------------------
@@ -177,11 +190,22 @@ class CompactionTracker:
        self._start_emitted = False
        self._done = False
        self._tool_call_id = ""
+        self._transcript_path: str = ""

-    @property
-    def on_compact(self) -> Callable[[], None]:
-        """Callback for the PreCompact hook."""
-        return self._compact_start.set
+    def on_compact(self, transcript_path: str = "") -> None:
+        """Callback for the PreCompact hook. Stores transcript_path."""
+        if (
+            self._transcript_path
+            and transcript_path
+            and self._transcript_path != transcript_path
+        ):
+            logger.warning(
+                "[Compaction] Overwriting transcript_path %s -> %s",
+                self._transcript_path,
+                transcript_path,
+            )
+        self._transcript_path = transcript_path
+        self._compact_start.set()

    # ------------------------------------------------------------------
    # Pre-query compaction
@@ -201,6 +225,7 @@ class CompactionTracker:
        self._done = False
        self._start_emitted = False
        self._tool_call_id = ""
+        self._transcript_path = ""

    def emit_start_if_ready(self) -> list[StreamBaseResponse]:
        """If the PreCompact hook fired, emit start events (spinning tool)."""
@@ -211,15 +236,20 @@ class CompactionTracker:
            return _start_events(self._tool_call_id)
        return []

-    async def emit_end_if_ready(self, session: ChatSession) -> list[StreamBaseResponse]:
-        """If compaction is in progress, emit end events and persist."""
+    async def emit_end_if_ready(self, session: ChatSession) -> CompactionResult:
+        """If compaction is in progress, emit end events and persist.
+
+        Returns a ``CompactionResult`` with ``just_ended=True`` and the
+        captured ``transcript_path`` when a compaction cycle completes.
+        This avoids a separate flag check (TOCTOU-safe).
+        """
        # Yield so pending hook tasks can set compact_start
        await asyncio.sleep(0)

        if self._done:
-            return []
+            return CompactionResult()
        if not self._start_emitted and not self._compact_start.is_set():
-            return []
+            return CompactionResult()

        if self._start_emitted:
            # Close the open spinner
@@ -232,8 +262,12 @@ class CompactionTracker:
                COMPACTION_DONE_MSG, tool_call_id=persist_id
            )

+        transcript_path = self._transcript_path
        self._compact_start.clear()
        self._start_emitted = False
        self._done = True
+        self._transcript_path = ""
        _persist(session, persist_id, COMPACTION_DONE_MSG)
-        return done_events
+        return CompactionResult(
+            events=done_events, just_ended=True, transcript_path=transcript_path
+        )
--- a/autogpt_platform/backend/backend/copilot/sdk/compaction_test.py
+++ b/autogpt_platform/backend/backend/copilot/sdk/compaction_test.py
@@ -195,10 +195,11 @@ class TestCompactionTracker:
        session = _make_session()
        tracker.on_compact()
        tracker.emit_start_if_ready()
-        evts = await tracker.emit_end_if_ready(session)
-        assert len(evts) == 2
-        assert isinstance(evts[0], StreamToolOutputAvailable)
-        assert isinstance(evts[1], StreamFinishStep)
+        result = await tracker.emit_end_if_ready(session)
+        assert result.just_ended is True
+        assert len(result.events) == 2
+        assert isinstance(result.events[0], StreamToolOutputAvailable)
+        assert isinstance(result.events[1], StreamFinishStep)
        # Should persist
        assert len(session.messages) == 2

@@ -210,28 +211,32 @@ class TestCompactionTracker:
        session = _make_session()
        tracker.on_compact()
        # Don't call emit_start_if_ready
-        evts = await tracker.emit_end_if_ready(session)
-        assert len(evts) == 5  # Full self-contained event
-        assert isinstance(evts[0], StreamStartStep)
+        result = await tracker.emit_end_if_ready(session)
+        assert result.just_ended is True
+        assert len(result.events) == 5  # Full self-contained event
+        assert isinstance(result.events[0], StreamStartStep)
        assert len(session.messages) == 2

    @pytest.mark.asyncio
-    async def test_emit_end_no_op_when_done(self):
+    async def test_emit_end_no_op_when_no_new_compaction(self):
        tracker = CompactionTracker()
        session = _make_session()
        tracker.on_compact()
        tracker.emit_start_if_ready()
-        await tracker.emit_end_if_ready(session)
-        # Second call should be no-op
-        evts = await tracker.emit_end_if_ready(session)
-        assert evts == []
+        result1 = await tracker.emit_end_if_ready(session)
+        assert result1.just_ended is True
+        # Second call should be no-op (no new on_compact)
+        result2 = await tracker.emit_end_if_ready(session)
+        assert result2.just_ended is False
+        assert result2.events == []

    @pytest.mark.asyncio
    async def test_emit_end_no_op_when_nothing_happened(self):
        tracker = CompactionTracker()
        session = _make_session()
-        evts = await tracker.emit_end_if_ready(session)
-        assert evts == []
+        result = await tracker.emit_end_if_ready(session)
+        assert result.just_ended is False
+        assert result.events == []

    def test_emit_pre_query(self):
        tracker = CompactionTracker()
@@ -246,20 +251,29 @@ class TestCompactionTracker:
        tracker._done = True
        tracker._start_emitted = True
        tracker._tool_call_id = "old"
+        tracker._transcript_path = "/some/path"
        tracker.reset_for_query()
        assert tracker._done is False
        assert tracker._start_emitted is False
        assert tracker._tool_call_id == ""
+        assert tracker._transcript_path == ""

    @pytest.mark.asyncio
-    async def test_pre_query_blocks_sdk_compaction(self):
-        """After pre-query compaction, SDK compaction events are suppressed."""
+    async def test_pre_query_blocks_sdk_compaction_until_reset(self):
+        """After pre-query compaction, SDK compaction is blocked until
+        reset_for_query is called."""
        tracker = CompactionTracker()
        session = _make_session()
        tracker.emit_pre_query(session)
        tracker.on_compact()
+        # _done is True so emit_start_if_ready is blocked
        evts = tracker.emit_start_if_ready()
-        assert evts == []  # _done blocks it
+        assert evts == []
+        # Reset clears _done, allowing subsequent compaction
+        tracker.reset_for_query()
+        tracker.on_compact()
+        evts = tracker.emit_start_if_ready()
+        assert len(evts) == 3

    @pytest.mark.asyncio
    async def test_reset_allows_new_compaction(self):
@@ -279,9 +293,9 @@ class TestCompactionTracker:
        session = _make_session()
        tracker.on_compact()
        start_evts = tracker.emit_start_if_ready()
-        end_evts = await tracker.emit_end_if_ready(session)
+        result = await tracker.emit_end_if_ready(session)
        start_evt = start_evts[1]
-        end_evt = end_evts[0]
+        end_evt = result.events[0]
        assert isinstance(start_evt, StreamToolInputStart)
        assert isinstance(end_evt, StreamToolOutputAvailable)
        assert start_evt.toolCallId == end_evt.toolCallId
@@ -289,3 +303,105 @@ class TestCompactionTracker:
        tool_calls = session.messages[0].tool_calls
        assert tool_calls is not None
        assert tool_calls[0]["id"] == start_evt.toolCallId
+
+    @pytest.mark.asyncio
+    async def test_multiple_compactions_within_query(self):
+        """Two mid-stream compactions within a single query both trigger."""
+        tracker = CompactionTracker()
+        session = _make_session()
+
+        # First compaction cycle
+        tracker.on_compact("/path/1")
+        tracker.emit_start_if_ready()
+        result1 = await tracker.emit_end_if_ready(session)
+        assert result1.just_ended is True
+        assert len(result1.events) == 2
+        assert result1.transcript_path == "/path/1"
+
+        # Second compaction cycle (should NOT be blocked — _done resets
+        # because emit_end_if_ready sets it True, but the next on_compact
+        # + emit_start_if_ready checks !_done which IS True now.
+        # So we need reset_for_query between queries, but within a single
+        # query multiple compactions work because _done blocks emit_start
+        # until the next message arrives, at which point emit_end detects it)
+        #
+        # Actually: _done=True blocks emit_start_if_ready, so we need
+        # the stream loop to reset. In practice service.py doesn't call
+        # reset between compactions within the same query — let's verify
+        # the actual behavior.
+        tracker.on_compact("/path/2")
+        # _done is True from first compaction, so start is blocked
+        start_evts = tracker.emit_start_if_ready()
+        assert start_evts == []
+        # But emit_end returns no-op because _done is True
+        result2 = await tracker.emit_end_if_ready(session)
+        assert result2.just_ended is False
+
+    @pytest.mark.asyncio
+    async def test_multiple_compactions_with_intervening_message(self):
+        """Multiple compactions work when the stream loop processes messages between them.
+
+        In the real service.py flow:
+        1. PreCompact fires → on_compact()
+        2. emit_start shows spinner
+        3. Next message arrives → emit_end completes compaction (_done=True)
+        4. Stream continues processing messages...
+        5. If a second PreCompact fires, _done=True blocks emit_start
+        6. But the next message triggers emit_end, which sees _done=True → no-op
+        7. The stream loop needs to detect this and handle accordingly
+
+        The actual flow for multiple compactions within a query requires
+        _done to be cleared between them. The service.py code uses
+        CompactionResult.just_ended to trigger replace_entries, and _done
+        stays True until reset_for_query.
+        """
+        tracker = CompactionTracker()
+        session = _make_session()
+
+        # First compaction
+        tracker.on_compact("/path/1")
+        tracker.emit_start_if_ready()
+        result1 = await tracker.emit_end_if_ready(session)
+        assert result1.just_ended is True
+        assert result1.transcript_path == "/path/1"
+
+        # Simulate reset between queries
+        tracker.reset_for_query()
+
+        # Second compaction in new query
+        tracker.on_compact("/path/2")
+        start_evts = tracker.emit_start_if_ready()
+        assert len(start_evts) == 3
+        result2 = await tracker.emit_end_if_ready(session)
+        assert result2.just_ended is True
+        assert result2.transcript_path == "/path/2"
+
+    def test_on_compact_stores_transcript_path(self):
+        tracker = CompactionTracker()
+        tracker.on_compact("/some/path.jsonl")
+        assert tracker._transcript_path == "/some/path.jsonl"
+
+    @pytest.mark.asyncio
+    async def test_emit_end_returns_transcript_path(self):
+        """CompactionResult includes the transcript_path from on_compact."""
+        tracker = CompactionTracker()
+        session = _make_session()
+        tracker.on_compact("/my/session.jsonl")
+        tracker.emit_start_if_ready()
+        result = await tracker.emit_end_if_ready(session)
+        assert result.just_ended is True
+        assert result.transcript_path == "/my/session.jsonl"
+        # transcript_path is cleared after emit_end
+        assert tracker._transcript_path == ""
+
+    @pytest.mark.asyncio
+    async def test_emit_end_clears_transcript_path(self):
+        """After emit_end, _transcript_path is reset so it doesn't leak to
+        subsequent non-compaction emit_end calls."""
+        tracker = CompactionTracker()
+        session = _make_session()
+        tracker.on_compact("/first/path.jsonl")
+        tracker.emit_start_if_ready()
+        await tracker.emit_end_if_ready(session)
+        # After compaction, _transcript_path is cleared
+        assert tracker._transcript_path == ""
--- a/autogpt_platform/backend/backend/copilot/sdk/e2e_compaction_test.py
+++ b/autogpt_platform/backend/backend/copilot/sdk/e2e_compaction_test.py
@@ -0,0 +1,531 @@
+"""End-to-end compaction flow test.
+
+Simulates the full service.py compaction lifecycle using real-format
+JSONL session files — no SDK subprocess needed. Exercises:
+
+  1. TranscriptBuilder loads a "downloaded" transcript
+  2. User query appended, assistant response streamed
+  3. PreCompact hook fires → CompactionTracker.on_compact()
+  4. Next message → emit_start_if_ready() yields spinner events
+  5. Message after that → emit_end_if_ready() returns CompactionResult
+  6. read_compacted_entries() reads the CLI session file
+  7. TranscriptBuilder.replace_entries() syncs state
+  8. More messages appended post-compaction
+  9. to_jsonl() exports full state for upload
+  10. Fresh builder loads the export — roundtrip verified
+"""
+
+import asyncio
+
+from backend.copilot.model import ChatSession
+from backend.copilot.response_model import (
+    StreamFinishStep,
+    StreamStartStep,
+    StreamToolInputAvailable,
+    StreamToolInputStart,
+    StreamToolOutputAvailable,
+)
+from backend.copilot.sdk.compaction import CompactionTracker
+from backend.copilot.sdk.transcript import (
+    read_compacted_entries,
+    strip_progress_entries,
+)
+from backend.copilot.sdk.transcript_builder import TranscriptBuilder
+from backend.util import json
+
+
+def _make_jsonl(*entries: dict) -> str:
+    return "\n".join(json.dumps(e) for e in entries) + "\n"
+
+
+def _run(coro):
+    """Run an async coroutine synchronously."""
+    return asyncio.run(coro)
+
+
+# ---------------------------------------------------------------------------
+# Fixtures: realistic CLI session file content
+# ---------------------------------------------------------------------------
+
+# Pre-compaction conversation
+USER_1 = {
+    "type": "user",
+    "uuid": "u1",
+    "message": {"role": "user", "content": "What files are in this project?"},
+}
+ASST_1_THINKING = {
+    "type": "assistant",
+    "uuid": "a1-think",
+    "parentUuid": "u1",
+    "message": {
+        "role": "assistant",
+        "id": "msg_sdk_aaa",
+        "type": "message",
+        "content": [{"type": "thinking", "thinking": "Let me look at the files..."}],
+        "stop_reason": None,
+        "stop_sequence": None,
+    },
+}
+ASST_1_TOOL = {
+    "type": "assistant",
+    "uuid": "a1-tool",
+    "parentUuid": "u1",
+    "message": {
+        "role": "assistant",
+        "id": "msg_sdk_aaa",
+        "type": "message",
+        "content": [
+            {
+                "type": "tool_use",
+                "id": "tu1",
+                "name": "Bash",
+                "input": {"command": "ls"},
+            }
+        ],
+        "stop_reason": "tool_use",
+        "stop_sequence": None,
+    },
+}
+TOOL_RESULT_1 = {
+    "type": "user",
+    "uuid": "tr1",
+    "parentUuid": "a1-tool",
+    "message": {
+        "role": "user",
+        "content": [
+            {
+                "type": "tool_result",
+                "tool_use_id": "tu1",
+                "content": "file1.py\nfile2.py",
+            }
+        ],
+    },
+}
+ASST_1_TEXT = {
+    "type": "assistant",
+    "uuid": "a1-text",
+    "parentUuid": "tr1",
+    "message": {
+        "role": "assistant",
+        "id": "msg_sdk_bbb",
+        "type": "message",
+        "content": [{"type": "text", "text": "I found file1.py and file2.py."}],
+        "stop_reason": "end_turn",
+        "stop_sequence": None,
+    },
+}
+# Progress entries (should be stripped during upload)
+PROGRESS_1 = {
+    "type": "progress",
+    "uuid": "prog1",
+    "parentUuid": "a1-tool",
+    "data": {"type": "bash_progress", "stdout": "running ls..."},
+}
+# Second user message
+USER_2 = {
+    "type": "user",
+    "uuid": "u2",
+    "parentUuid": "a1-text",
+    "message": {"role": "user", "content": "Show me file1.py"},
+}
+ASST_2 = {
+    "type": "assistant",
+    "uuid": "a2",
+    "parentUuid": "u2",
+    "message": {
+        "role": "assistant",
+        "id": "msg_sdk_ccc",
+        "type": "message",
+        "content": [{"type": "text", "text": "Here is file1.py content..."}],
+        "stop_reason": "end_turn",
+        "stop_sequence": None,
+    },
+}
+
+# --- Compaction summary (written by CLI after context compaction) ---
+COMPACT_SUMMARY = {
+    "type": "summary",
+    "uuid": "cs1",
+    "isCompactSummary": True,
+    "message": {
+        "role": "user",
+        "content": (
+            "Summary: User asked about project files. Found file1.py and file2.py. "
+            "User then asked to see file1.py."
+        ),
+    },
+}
+
+# Post-compaction assistant response
+POST_COMPACT_ASST = {
+    "type": "assistant",
+    "uuid": "a3",
+    "parentUuid": "cs1",
+    "message": {
+        "role": "assistant",
+        "id": "msg_sdk_ddd",
+        "type": "message",
+        "content": [{"type": "text", "text": "Here is the content of file1.py..."}],
+        "stop_reason": "end_turn",
+        "stop_sequence": None,
+    },
+}
+
+# Post-compaction user follow-up
+USER_3 = {
+    "type": "user",
+    "uuid": "u3",
+    "parentUuid": "a3",
+    "message": {"role": "user", "content": "Now show file2.py"},
+}
+ASST_3 = {
+    "type": "assistant",
+    "uuid": "a4",
+    "parentUuid": "u3",
+    "message": {
+        "role": "assistant",
+        "id": "msg_sdk_eee",
+        "type": "message",
+        "content": [{"type": "text", "text": "Here is file2.py..."}],
+        "stop_reason": "end_turn",
+        "stop_sequence": None,
+    },
+}
+
+
+# ---------------------------------------------------------------------------
+# E2E test
+# ---------------------------------------------------------------------------
+
+
+class TestCompactionE2E:
+    def _write_session_file(self, session_dir, entries):
+        """Write a CLI session JSONL file."""
+        path = session_dir / "session.jsonl"
+        path.write_text(_make_jsonl(*entries))
+        return path
+
+    def test_full_compaction_lifecycle(self, tmp_path, monkeypatch):
+        """Simulate the complete service.py compaction flow.
+
+        Timeline:
+        1. Previous turn uploaded transcript with [USER_1, ASST_1, USER_2, ASST_2]
+        2. Current turn: download → load_previous
+        3. User sends "Now show file2.py" → append_user
+        4. SDK starts streaming response
+        5. Mid-stream: PreCompact hook fires (context too large)
+        6. CLI writes compaction summary to session file
+        7. Next SDK message → emit_start (spinner)
+        8. Following message → emit_end (CompactionResult)
+        9. read_compacted_entries reads the session file
+        10. replace_entries syncs TranscriptBuilder
+        11. More assistant messages appended
+        12. Export → upload → next turn downloads it
+        """
+        # --- Setup CLI projects directory ---
+        config_dir = tmp_path / "config"
+        projects_dir = config_dir / "projects"
+        session_dir = projects_dir / "proj"
+        session_dir.mkdir(parents=True)
+        monkeypatch.setenv("CLAUDE_CONFIG_DIR", str(config_dir))
+
+        # --- Step 1-2: Load "downloaded" transcript from previous turn ---
+        previous_transcript = _make_jsonl(
+            USER_1,
+            ASST_1_THINKING,
+            ASST_1_TOOL,
+            TOOL_RESULT_1,
+            ASST_1_TEXT,
+            USER_2,
+            ASST_2,
+        )
+        builder = TranscriptBuilder()
+        builder.load_previous(previous_transcript)
+        assert builder.entry_count == 7
+
+        # --- Step 3: User sends new query ---
+        builder.append_user("Now show file2.py")
+        assert builder.entry_count == 8
+
+        # --- Step 4: SDK starts streaming ---
+        builder.append_assistant(
+            [{"type": "thinking", "thinking": "Let me read file2.py..."}],
+            model="claude-sonnet-4-20250514",
+        )
+        assert builder.entry_count == 9
+
+        # --- Step 5-6: PreCompact fires, CLI writes session file ---
+        session_file = self._write_session_file(
+            session_dir,
+            [
+                USER_1,
+                ASST_1_THINKING,
+                ASST_1_TOOL,
+                PROGRESS_1,
+                TOOL_RESULT_1,
+                ASST_1_TEXT,
+                USER_2,
+                ASST_2,
+                COMPACT_SUMMARY,
+                POST_COMPACT_ASST,
+                USER_3,
+                ASST_3,
+            ],
+        )
+
+        # --- Step 7: CompactionTracker receives PreCompact hook ---
+        tracker = CompactionTracker()
+        session = ChatSession.new(user_id="test-user")
+        tracker.on_compact(str(session_file))
+
+        # --- Step 8: Next SDK message arrives → emit_start ---
+        start_events = tracker.emit_start_if_ready()
+        assert len(start_events) == 3
+        assert isinstance(start_events[0], StreamStartStep)
+        assert isinstance(start_events[1], StreamToolInputStart)
+        assert isinstance(start_events[2], StreamToolInputAvailable)
+
+        # Verify tool_call_id is set
+        tool_call_id = start_events[1].toolCallId
+        assert tool_call_id.startswith("compaction-")
+
+        # --- Step 9: Following message → emit_end ---
+        result = _run(tracker.emit_end_if_ready(session))
+        assert result.just_ended is True
+        assert result.transcript_path == str(session_file)
+        assert len(result.events) == 2
+        assert isinstance(result.events[0], StreamToolOutputAvailable)
+        assert isinstance(result.events[1], StreamFinishStep)
+        # Verify same tool_call_id
+        assert result.events[0].toolCallId == tool_call_id
+
+        # Session should have compaction messages persisted
+        assert len(session.messages) == 2
+        assert session.messages[0].role == "assistant"
+        assert session.messages[1].role == "tool"
+
+        # --- Step 10: read_compacted_entries + replace_entries ---
+        compacted = read_compacted_entries(str(session_file))
+        assert compacted is not None
+        # Should have: COMPACT_SUMMARY + POST_COMPACT_ASST + USER_3 + ASST_3
+        assert len(compacted) == 4
+        assert compacted[0]["uuid"] == "cs1"
+        assert compacted[0]["isCompactSummary"] is True
+
+        # Replace builder state with compacted entries
+        old_count = builder.entry_count
+        builder.replace_entries(compacted)
+        assert builder.entry_count == 4  # Only compacted entries
+        assert builder.entry_count < old_count  # Compaction reduced entries
+
+        # --- Step 11: More assistant messages after compaction ---
+        builder.append_assistant(
+            [{"type": "text", "text": "Here is file2.py:\n\ndef hello():\n    pass"}],
+            model="claude-sonnet-4-20250514",
+            stop_reason="end_turn",
+        )
+        assert builder.entry_count == 5
+
+        # --- Step 12: Export for upload ---
+        output = builder.to_jsonl()
+        assert output  # Not empty
+        output_entries = [json.loads(line) for line in output.strip().split("\n")]
+        assert len(output_entries) == 5
+
+        # Verify structure:
+        # [COMPACT_SUMMARY, POST_COMPACT_ASST, USER_3, ASST_3, new_assistant]
+        assert output_entries[0]["type"] == "summary"
+        assert output_entries[0].get("isCompactSummary") is True
+        assert output_entries[0]["uuid"] == "cs1"
+        assert output_entries[1]["uuid"] == "a3"
+        assert output_entries[2]["uuid"] == "u3"
+        assert output_entries[3]["uuid"] == "a4"
+        assert output_entries[4]["type"] == "assistant"
+
+        # Verify parent chain is intact
+        assert output_entries[1]["parentUuid"] == "cs1"  # a3 → cs1
+        assert output_entries[2]["parentUuid"] == "a3"  # u3 → a3
+        assert output_entries[3]["parentUuid"] == "u3"  # a4 → u3
+        assert output_entries[4]["parentUuid"] == "a4"  # new → a4
+
+        # --- Step 13: Roundtrip — next turn loads this export ---
+        builder2 = TranscriptBuilder()
+        builder2.load_previous(output)
+        assert builder2.entry_count == 5
+
+        # isCompactSummary survives roundtrip
+        output2 = builder2.to_jsonl()
+        first_entry = json.loads(output2.strip().split("\n")[0])
+        assert first_entry.get("isCompactSummary") is True
+
+        # Can append more messages
+        builder2.append_user("What about file3.py?")
+        assert builder2.entry_count == 6
+        final_output = builder2.to_jsonl()
+        last_entry = json.loads(final_output.strip().split("\n")[-1])
+        assert last_entry["type"] == "user"
+        # Parented to the last entry from previous turn
+        assert last_entry["parentUuid"] == output_entries[-1]["uuid"]
+
+    def test_double_compaction_within_session(self, tmp_path, monkeypatch):
+        """Two compactions in the same session (across reset_for_query)."""
+        config_dir = tmp_path / "config"
+        projects_dir = config_dir / "projects"
+        session_dir = projects_dir / "proj"
+        session_dir.mkdir(parents=True)
+        monkeypatch.setenv("CLAUDE_CONFIG_DIR", str(config_dir))
+
+        tracker = CompactionTracker()
+        session = ChatSession.new(user_id="test")
+        builder = TranscriptBuilder()
+
+        # --- First query with compaction ---
+        builder.append_user("first question")
+        builder.append_assistant([{"type": "text", "text": "first answer"}])
+
+        # Write session file for first compaction
+        first_summary = {
+            "type": "summary",
+            "uuid": "cs-first",
+            "isCompactSummary": True,
+            "message": {"role": "user", "content": "First compaction summary"},
+        }
+        first_post = {
+            "type": "assistant",
+            "uuid": "a-first",
+            "parentUuid": "cs-first",
+            "message": {"role": "assistant", "content": "first post-compact"},
+        }
+        file1 = session_dir / "session1.jsonl"
+        file1.write_text(_make_jsonl(first_summary, first_post))
+
+        tracker.on_compact(str(file1))
+        tracker.emit_start_if_ready()
+        result1 = _run(tracker.emit_end_if_ready(session))
+        assert result1.just_ended is True
+
+        compacted1 = read_compacted_entries(str(file1))
+        assert compacted1 is not None
+        builder.replace_entries(compacted1)
+        assert builder.entry_count == 2
+
+        # --- Reset for second query ---
+        tracker.reset_for_query()
+
+        # --- Second query with compaction ---
+        builder.append_user("second question")
+        builder.append_assistant([{"type": "text", "text": "second answer"}])
+
+        second_summary = {
+            "type": "summary",
+            "uuid": "cs-second",
+            "isCompactSummary": True,
+            "message": {"role": "user", "content": "Second compaction summary"},
+        }
+        second_post = {
+            "type": "assistant",
+            "uuid": "a-second",
+            "parentUuid": "cs-second",
+            "message": {"role": "assistant", "content": "second post-compact"},
+        }
+        file2 = session_dir / "session2.jsonl"
+        file2.write_text(_make_jsonl(second_summary, second_post))
+
+        tracker.on_compact(str(file2))
+        tracker.emit_start_if_ready()
+        result2 = _run(tracker.emit_end_if_ready(session))
+        assert result2.just_ended is True
+
+        compacted2 = read_compacted_entries(str(file2))
+        assert compacted2 is not None
+        builder.replace_entries(compacted2)
+        assert builder.entry_count == 2  # Only second compaction entries
+
+        # Export and verify
+        output = builder.to_jsonl()
+        entries = [json.loads(line) for line in output.strip().split("\n")]
+        assert entries[0]["uuid"] == "cs-second"
+        assert entries[0].get("isCompactSummary") is True
+
+    def test_strip_progress_then_load_then_compact_roundtrip(
+        self, tmp_path, monkeypatch
+    ):
+        """Full pipeline: strip → load → compact → replace → export → reload.
+
+        This tests the exact sequence that happens across two turns:
+        Turn 1: SDK produces transcript with progress entries
+        Upload: strip_progress_entries removes progress, upload to cloud
+        Turn 2: Download → load_previous → compaction fires → replace → export
+        Turn 3: Download the Turn 2 export → load_previous (roundtrip)
+        """
+        config_dir = tmp_path / "config"
+        projects_dir = config_dir / "projects"
+        session_dir = projects_dir / "proj"
+        session_dir.mkdir(parents=True)
+        monkeypatch.setenv("CLAUDE_CONFIG_DIR", str(config_dir))
+
+        # --- Turn 1: SDK produces raw transcript ---
+        raw_content = _make_jsonl(
+            USER_1,
+            ASST_1_THINKING,
+            ASST_1_TOOL,
+            PROGRESS_1,
+            TOOL_RESULT_1,
+            ASST_1_TEXT,
+            USER_2,
+            ASST_2,
+        )
+
+        # Strip progress for upload
+        stripped = strip_progress_entries(raw_content)
+        stripped_entries = [
+            json.loads(line) for line in stripped.strip().split("\n") if line.strip()
+        ]
+        # Progress should be gone
+        assert not any(e.get("type") == "progress" for e in stripped_entries)
+        assert len(stripped_entries) == 7  # 8 - 1 progress
+
+        # --- Turn 2: Download stripped, load, compaction happens ---
+        builder = TranscriptBuilder()
+        builder.load_previous(stripped)
+        assert builder.entry_count == 7
+
+        builder.append_user("Now show file2.py")
+        builder.append_assistant(
+            [{"type": "text", "text": "Reading file2.py..."}],
+            model="claude-sonnet-4-20250514",
+        )
+
+        # CLI writes session file with compaction
+        session_file = self._write_session_file(
+            session_dir,
+            [
+                USER_1,
+                ASST_1_TOOL,
+                TOOL_RESULT_1,
+                ASST_1_TEXT,
+                USER_2,
+                ASST_2,
+                COMPACT_SUMMARY,
+                POST_COMPACT_ASST,
+            ],
+        )
+
+        compacted = read_compacted_entries(str(session_file))
+        assert compacted is not None
+        builder.replace_entries(compacted)
+
+        # Append post-compaction message
+        builder.append_user("Thanks!")
+        output = builder.to_jsonl()
+
+        # --- Turn 3: Fresh load of Turn 2 export ---
+        builder3 = TranscriptBuilder()
+        builder3.load_previous(output)
+        # Should have: compact_summary + post_compact_asst + "Thanks!"
+        assert builder3.entry_count == 3
+
+        # Compact summary survived the full pipeline
+        first = json.loads(builder3.to_jsonl().strip().split("\n")[0])
+        assert first.get("isCompactSummary") is True
+        assert first["type"] == "summary"
--- a/autogpt_platform/backend/backend/copilot/sdk/mcp_tool_guide.md
+++ b/autogpt_platform/backend/backend/copilot/sdk/mcp_tool_guide.md
@@ -20,7 +20,24 @@ Use these URLs directly without asking the user:
 | Cloudflare | `https://mcp.cloudflare.com/mcp` |
 | Atlassian / Jira | `https://mcp.atlassian.com/mcp` |

-For other services, search the MCP registry at https://registry.modelcontextprotocol.io/.
+For other services, search the MCP registry API:
+```http
+GET https://registry.modelcontextprotocol.io/v0/servers?q=<search_term>
+```
+Each result includes a `remotes` array with the exact server URL to use.
+
+### Important: Check blocks first
+
+Before using `run_mcp_tool`, always check if the platform already has blocks for the service
+using `find_block`. The platform has hundreds of built-in blocks (Google Sheets, Google Docs,
+Google Calendar, Gmail, etc.) that work without MCP setup.
+
+Only use `run_mcp_tool` when:
+- The service is in the known hosted MCP servers list above, OR
+- You searched `find_block` first and found no matching blocks
+
+**Never guess or construct MCP server URLs.** Only use URLs from the known servers list above
+or from the `remotes[].url` field in MCP registry search results.

 ### Authentication

--- a/autogpt_platform/backend/backend/copilot/sdk/security_hooks.py
+++ b/autogpt_platform/backend/backend/copilot/sdk/security_hooks.py
@@ -127,7 +127,7 @@ def create_security_hooks(
    user_id: str | None,
    sdk_cwd: str | None = None,
    max_subtasks: int = 3,
-    on_compact: Callable[[], None] | None = None,
+    on_compact: Callable[[str], None] | None = None,
 ) -> dict[str, Any]:
    """Create the security hooks configuration for Claude Agent SDK.

@@ -142,6 +142,7 @@ def create_security_hooks(
        sdk_cwd: SDK working directory for workspace-scoped tool validation
        max_subtasks: Maximum concurrent Task (sub-agent) spawns allowed per session
        on_compact: Callback invoked when SDK starts compacting context.
+            Receives the transcript_path from the hook input.

    Returns:
        Hooks configuration dict for ClaudeAgentOptions
@@ -301,11 +302,21 @@ def create_security_hooks(
            """
            _ = context, tool_use_id
            trigger = input_data.get("trigger", "auto")
+            # Sanitize untrusted input before logging to prevent log injection
+            transcript_path = (
+                str(input_data.get("transcript_path", ""))
+                .replace("\n", "")
+                .replace("\r", "")
+            )
            logger.info(
-                f"[SDK] Context compaction triggered: {trigger}, user={user_id}"
+                "[SDK] Context compaction triggered: %s, user=%s, "
+                "transcript_path=%s",
+                trigger,
+                user_id,
+                transcript_path,
            )
            if on_compact is not None:
-                on_compact()
+                on_compact(transcript_path)
            return cast(SyncHookJSONOutput, {})

        hooks: dict[str, Any] = {
--- a/autogpt_platform/backend/backend/copilot/sdk/service.py
+++ b/autogpt_platform/backend/backend/copilot/sdk/service.py
@@ -12,7 +12,7 @@ import subprocess
 import sys
 import uuid
 from collections.abc import AsyncGenerator
-from typing import Any, cast
+from typing import Any, Protocol, cast

 import openai
 from claude_agent_sdk import (
@@ -56,9 +56,9 @@ from ..response_model import (
    StreamToolOutputAvailable,
 )
 from ..service import (
-    _build_system_prompt,
    _generate_session_title,
    _is_langfuse_configured,
+    _resolve_system_prompt,
 )
 from ..tools.e2b_sandbox import get_or_create_sandbox, pause_sandbox_direct
 from ..tools.sandbox import WORKSPACE_PREFIX, make_session_path
@@ -77,6 +77,7 @@ from .tool_adapter import (
 from .transcript import (
    cleanup_cli_project_dir,
    download_transcript,
+    read_compacted_entries,
    upload_transcript,
    validate_transcript,
    write_transcript_to_tempfile,
@@ -87,6 +88,10 @@ logger = logging.getLogger(__name__)
 config = ChatConfig()


+class _ClaudeSDKTransport(Protocol):
+    async def write(self, data: str) -> None: ...
+
+
 def _setup_langfuse_otel() -> None:
    """Configure OTEL tracing for the Claude Agent SDK → Langfuse.

@@ -136,6 +141,16 @@ def _setup_langfuse_otel() -> None:
 _setup_langfuse_otel()


+async def _write_multimodal_query(
+    client: ClaudeSDKClient,
+    user_message: dict[str, Any],
+) -> None:
+    transport = cast(_ClaudeSDKTransport | None, getattr(client, "_transport", None))
+    if transport is None:
+        raise RuntimeError("Claude SDK transport is unavailable for multimodal input")
+    await transport.write(json.dumps(user_message) + "\n")
+
+
 # Set to hold background tasks to prevent garbage collection
 _background_tasks: set[asyncio.Task[Any]] = set()

@@ -689,7 +704,7 @@ async def stream_chat_completion_sdk(
    session = await upsert_chat_session(session)

    # Generate title for new sessions (first user message)
-    if is_user_message and not session.title:
+    if is_user_message and session.is_manual and not session.title:
        user_messages = [m for m in session.messages if m.role == "user"]
        if len(user_messages) == 1:
            first_message = user_messages[0].content or message or ""
@@ -804,7 +819,11 @@ async def stream_chat_completion_sdk(

        e2b_sandbox, (base_system_prompt, _), dl = await asyncio.gather(
            _setup_e2b(),
-            _build_system_prompt(user_id, has_conversation_history=has_history),
+            _resolve_system_prompt(
+                session,
+                user_id,
+                has_conversation_history=has_history,
+            ),
            _fetch_transcript(),
        )

@@ -861,7 +880,7 @@ async def stream_chat_completion_sdk(
                "Claude Code CLI subscription (requires `claude login`)."
            )

-        mcp_server = create_copilot_mcp_server(use_e2b=use_e2b)
+        mcp_server = create_copilot_mcp_server(session, use_e2b=use_e2b)

        sdk_model = _resolve_sdk_model()

@@ -875,7 +894,7 @@ async def stream_chat_completion_sdk(
            on_compact=compaction.on_compact,
        )

-        allowed = get_copilot_tool_names(use_e2b=use_e2b)
+        allowed = get_copilot_tool_names(session, use_e2b=use_e2b)
        disallowed = get_sdk_disallowed_tools(use_e2b=use_e2b)

        def _on_stderr(line: str) -> None:
@@ -976,10 +995,7 @@ async def stream_chat_completion_sdk(
                    "parent_tool_use_id": None,
                    "session_id": session_id,
                }
-                assert client._transport is not None  # noqa: SLF001
-                await client._transport.write(  # noqa: SLF001
-                    json.dumps(user_msg) + "\n"
-                )
+                await _write_multimodal_query(client, user_msg)
                # Capture user message in transcript (multimodal)
                transcript_builder.append_user(content=content_blocks)
            else:
@@ -1045,6 +1061,7 @@ async def stream_chat_completion_sdk(
                            exc_info=True,
                        )
                        ended_with_stream_error = True
+
                        yield StreamError(
                            errorText=f"SDK stream error: {stream_err}",
                            code="sdk_stream_error",
@@ -1129,9 +1146,26 @@ async def stream_chat_completion_sdk(
                                sdk_msg.result or "(no error message provided)",
                            )

-                    # Emit compaction end if SDK finished compacting
-                    for ev in await compaction.emit_end_if_ready(session):
+                    # Emit compaction end if SDK finished compacting.
+                    # When compaction ends, sync TranscriptBuilder with the
+                    # CLI's active context so they stay identical.
+                    compact_result = await compaction.emit_end_if_ready(session)
+                    for ev in compact_result.events:
                        yield ev
+                    # After replace_entries, skip append_assistant for this
+                    # sdk_msg — the CLI session file already contains it,
+                    # so appending again would create a duplicate.
+                    entries_replaced = False
+                    if compact_result.just_ended:
+                        compacted = await asyncio.to_thread(
+                            read_compacted_entries,
+                            compact_result.transcript_path,
+                        )
+                        if compacted is not None:
+                            transcript_builder.replace_entries(
+                                compacted, log_prefix=log_prefix
+                            )
+                            entries_replaced = True

                    for response in adapter.convert_message(sdk_msg):
                        if isinstance(response, StreamStart):
@@ -1218,10 +1252,11 @@ async def stream_chat_completion_sdk(
                                    tool_call_id=response.toolCallId,
                                )
                            )
-                            transcript_builder.append_tool_result(
-                                tool_use_id=response.toolCallId,
-                                content=content,
-                            )
+                            if not entries_replaced:
+                                transcript_builder.append_tool_result(
+                                    tool_use_id=response.toolCallId,
+                                    content=content,
+                                )
                            has_tool_results = True

                        elif isinstance(response, StreamFinish):
@@ -1231,7 +1266,9 @@ async def stream_chat_completion_sdk(
                    # any stashed tool results from the previous turn are
                    # recorded first, preserving the required API order:
                    # assistant(tool_use) → tool_result → assistant(text).
-                    if isinstance(sdk_msg, AssistantMessage):
+                    # Skip if replace_entries just ran — the CLI session
+                    # file already contains this message.
+                    if isinstance(sdk_msg, AssistantMessage) and not entries_replaced:
                        transcript_builder.append_assistant(
                            content_blocks=_format_sdk_content_blocks(sdk_msg.content),
                            model=sdk_msg.model,
@@ -1422,13 +1459,13 @@ async def stream_chat_completion_sdk(
            task.add_done_callback(_background_tasks.discard)

        # --- Upload transcript for next-turn --resume ---
-        # This MUST run in finally so the transcript is uploaded even when
-        # the streaming loop raises an exception.
-        # The transcript represents the COMPLETE active context (atomic).
+        # TranscriptBuilder is the single source of truth.  It mirrors the
+        # CLI's active context: on compaction, replace_entries() syncs it
+        # with the compacted session file.  No CLI file read needed here.
        if config.claude_agent_use_resume and user_id and session is not None:
            try:
-                # Build complete transcript from captured SDK messages
                transcript_content = transcript_builder.to_jsonl()
+                entry_count = transcript_builder.entry_count

                if not transcript_content:
                    logger.warning(
@@ -1438,18 +1475,15 @@ async def stream_chat_completion_sdk(
                    logger.warning(
                        "%s Transcript invalid, skipping upload (entries=%d)",
                        log_prefix,
-                        transcript_builder.entry_count,
+                        entry_count,
                    )
                else:
                    logger.info(
-                        "%s Uploading complete transcript (entries=%d, bytes=%d)",
+                        "%s Uploading transcript (entries=%d, bytes=%d)",
                        log_prefix,
-                        transcript_builder.entry_count,
+                        entry_count,
                        len(transcript_content),
                    )
-                    # Shield upload from cancellation - let it complete even if
-                    # the finally block is interrupted. No timeout to avoid race
-                    # conditions where backgrounded uploads overwrite newer transcripts.
                    await asyncio.shield(
                        upload_transcript(
                            user_id=user_id,
--- a/autogpt_platform/backend/backend/copilot/sdk/service_test.py
+++ b/autogpt_platform/backend/backend/copilot/sdk/service_test.py
@@ -205,6 +205,29 @@ class TestPromptSupplement:
        ):
            assert "`browser_navigate`" in docs

+    def test_baseline_supplement_respects_session_disabled_tools(self):
+        """Session-specific docs should hide disabled tools and include added session tools."""
+        from backend.copilot.model import ChatSession
+        from backend.copilot.prompting import get_baseline_supplement
+        from backend.copilot.session_types import (
+            ChatSessionConfig,
+            ChatSessionStartType,
+        )
+
+        session = ChatSession.new(
+            "user-1",
+            start_type=ChatSessionStartType.AUTOPILOT_NIGHTLY,
+            session_config=ChatSessionConfig(
+                extra_tools=["completion_report"],
+                disabled_tools=["edit_agent"],
+            ),
+        )
+
+        docs = get_baseline_supplement(session)
+
+        assert "`completion_report`" in docs
+        assert "`edit_agent`" not in docs
+
    def test_baseline_supplement_includes_workflows(self):
        """Baseline supplement should include workflow guidance in tool descriptions."""
        from backend.copilot.prompting import get_baseline_supplement
@@ -219,15 +242,13 @@ class TestPromptSupplement:
    def test_baseline_supplement_completeness(self):
        """All available tools from TOOL_REGISTRY should appear in baseline supplement."""
        from backend.copilot.prompting import get_baseline_supplement
-        from backend.copilot.tools import TOOL_REGISTRY
+        from backend.copilot.tools import iter_available_tools

        docs = get_baseline_supplement()

        # Verify each available registered tool is documented
-        # (matches _generate_tool_documentation which filters by is_available)
-        for tool_name, tool in TOOL_REGISTRY.items():
-            if not tool.is_available:
-                continue
+        # (matches _generate_tool_documentation which filters with iter_available_tools)
+        for tool_name, _ in iter_available_tools():
            assert (
                f"`{tool_name}`" in docs
            ), f"Tool '{tool_name}' missing from baseline supplement"
@@ -277,14 +298,12 @@ class TestPromptSupplement:
    def test_baseline_supplement_no_duplicate_tools(self):
        """No tool should appear multiple times in baseline supplement."""
        from backend.copilot.prompting import get_baseline_supplement
-        from backend.copilot.tools import TOOL_REGISTRY
+        from backend.copilot.tools import iter_available_tools

        docs = get_baseline_supplement()

        # Count occurrences of each available tool in the entire supplement
-        for tool_name, tool in TOOL_REGISTRY.items():
-            if not tool.is_available:
-                continue
+        for tool_name, _ in iter_available_tools():
            # Count how many times this tool appears as a bullet point
            count = docs.count(f"- **`{tool_name}`**")
            assert count == 1, f"Tool '{tool_name}' appears {count} times (should be 1)"
--- a/autogpt_platform/backend/backend/copilot/sdk/tool_adapter.py
+++ b/autogpt_platform/backend/backend/copilot/sdk/tool_adapter.py
@@ -32,7 +32,7 @@ from backend.copilot.sdk.file_ref import (
    expand_file_refs_in_args,
    read_file_bytes,
 )
-from backend.copilot.tools import TOOL_REGISTRY
+from backend.copilot.tools import iter_available_tools
 from backend.copilot.tools.base import BaseTool
 from backend.util.truncate import truncate

@@ -338,7 +338,11 @@ def _text_from_mcp_result(result: dict[str, Any]) -> str:
    )


-def create_copilot_mcp_server(*, use_e2b: bool = False):
+def create_copilot_mcp_server(
+    session: ChatSession,
+    *,
+    use_e2b: bool = False,
+):
    """Create an in-process MCP server configuration for CoPilot tools.

    When *use_e2b* is True, five additional MCP file tools are registered
@@ -387,7 +391,7 @@ def create_copilot_mcp_server(*, use_e2b: bool = False):

    sdk_tools = []

-    for tool_name, base_tool in TOOL_REGISTRY.items():
+    for tool_name, base_tool in iter_available_tools(session):
        handler = create_tool_handler(base_tool)
        decorated = tool(
            tool_name,
@@ -475,25 +479,30 @@ DANGEROUS_PATTERNS = [
    r"subprocess",
 ]

-# Static tool name list for the non-E2B case (backward compatibility).
-COPILOT_TOOL_NAMES = [
-    *[f"{MCP_TOOL_PREFIX}{name}" for name in TOOL_REGISTRY.keys()],
-    f"{MCP_TOOL_PREFIX}{_READ_TOOL_NAME}",
-    *_SDK_BUILTIN_TOOLS,
-]

-
-def get_copilot_tool_names(*, use_e2b: bool = False) -> list[str]:
+def get_copilot_tool_names(
+    session: ChatSession,
+    *,
+    use_e2b: bool = False,
+) -> list[str]:
    """Build the ``allowed_tools`` list for :class:`ClaudeAgentOptions`.

    When *use_e2b* is True the SDK built-in file tools are replaced by MCP
    equivalents that route to the E2B sandbox.
    """
+    tool_names = [
+        f"{MCP_TOOL_PREFIX}{name}" for name, _ in iter_available_tools(session)
+    ]
+
    if not use_e2b:
-        return list(COPILOT_TOOL_NAMES)
+        return [
+            *tool_names,
+            f"{MCP_TOOL_PREFIX}{_READ_TOOL_NAME}",
+            *_SDK_BUILTIN_TOOLS,
+        ]

    return [
-        *[f"{MCP_TOOL_PREFIX}{name}" for name in TOOL_REGISTRY.keys()],
+        *tool_names,
        f"{MCP_TOOL_PREFIX}{_READ_TOOL_NAME}",
        *[f"{MCP_TOOL_PREFIX}{name}" for name in E2B_FILE_TOOL_NAMES],
        *_SDK_BUILTIN_ALWAYS,
--- a/autogpt_platform/backend/backend/copilot/sdk/tool_adapter_test.py
+++ b/autogpt_platform/backend/backend/copilot/sdk/tool_adapter_test.py
@@ -3,11 +3,14 @@
 import pytest

 from backend.copilot.context import get_sdk_cwd
+from backend.copilot.model import ChatSession
+from backend.copilot.session_types import ChatSessionConfig, ChatSessionStartType
 from backend.util.truncate import truncate

 from .tool_adapter import (
    _MCP_MAX_CHARS,
    _text_from_mcp_result,
+    get_copilot_tool_names,
    pop_pending_tool_output,
    set_execution_context,
    stash_pending_tool_output,
@@ -168,3 +171,20 @@ class TestTruncationAndStashIntegration:
        text = _text_from_mcp_result(truncated)
        assert len(text) < len(big_text)
        assert len(str(truncated)) <= _MCP_MAX_CHARS
+
+
+class TestSessionToolFiltering:
+    def test_disabled_tools_are_removed_from_sdk_allowed_tools(self):
+        session = ChatSession.new(
+            "user-1",
+            start_type=ChatSessionStartType.AUTOPILOT_NIGHTLY,
+            session_config=ChatSessionConfig(
+                extra_tools=["completion_report"],
+                disabled_tools=["edit_agent"],
+            ),
+        )
+
+        tool_names = get_copilot_tool_names(session)
+
+        assert "mcp__copilot__completion_report" in tool_names
+        assert "mcp__copilot__edit_agent" not in tool_names
--- a/autogpt_platform/backend/backend/copilot/sdk/transcript.py
+++ b/autogpt_platform/backend/backend/copilot/sdk/transcript.py
@@ -13,8 +13,10 @@ filesystem for self-hosted) — no DB column needed.
 import logging
 import os
 import re
+import shutil
 import time
 from dataclasses import dataclass
+from pathlib import Path

 from backend.util import json

@@ -82,7 +84,11 @@ def strip_progress_entries(content: str) -> str:
        parent = entry.get("parentUuid", "")
        if uid:
            uuid_to_parent[uid] = parent
-        if entry.get("type", "") in STRIPPABLE_TYPES and uid:
+        if (
+            entry.get("type", "") in STRIPPABLE_TYPES
+            and uid
+            and not entry.get("isCompactSummary")
+        ):
            stripped_uuids.add(uid)

    # Second pass: keep non-stripped entries, reparenting where needed.
@@ -106,7 +112,9 @@ def strip_progress_entries(content: str) -> str:
        if not isinstance(entry, dict):
            result_lines.append(line)
            continue
-        if entry.get("type", "") in STRIPPABLE_TYPES:
+        if entry.get("type", "") in STRIPPABLE_TYPES and not entry.get(
+            "isCompactSummary"
+        ):
            continue
        uid = entry.get("uuid", "")
        if uid in reparented:
@@ -137,6 +145,155 @@ def _sanitize_id(raw_id: str, max_len: int = 36) -> str:
 _SAFE_CWD_PREFIX = os.path.realpath("/tmp/copilot-")


+def _projects_base() -> str:
+    """Return the resolved path to the CLI's projects directory."""
+    config_dir = os.environ.get("CLAUDE_CONFIG_DIR") or os.path.expanduser("~/.claude")
+    return os.path.realpath(os.path.join(config_dir, "projects"))
+
+
+def _cli_project_dir(sdk_cwd: str) -> str | None:
+    """Return the CLI's project directory for a given working directory.
+
+    Returns ``None`` if the path would escape the projects base.
+    """
+    cwd_encoded = re.sub(r"[^a-zA-Z0-9]", "-", os.path.realpath(sdk_cwd))
+    projects_base = _projects_base()
+    project_dir = os.path.realpath(os.path.join(projects_base, cwd_encoded))
+
+    if not project_dir.startswith(projects_base + os.sep):
+        logger.warning(
+            "[Transcript] Project dir escaped projects base: %s", project_dir
+        )
+        return None
+    return project_dir
+
+
+def _safe_glob_jsonl(project_dir: str) -> list[Path]:
+    """Glob ``*.jsonl`` files, filtering out symlinks that escape the directory."""
+    try:
+        resolved_base = Path(project_dir).resolve()
+    except OSError as e:
+        logger.warning("[Transcript] Failed to resolve project dir: %s", e)
+        return []
+
+    result: list[Path] = []
+    for candidate in Path(project_dir).glob("*.jsonl"):
+        try:
+            resolved = candidate.resolve()
+            if resolved.is_relative_to(resolved_base):
+                result.append(resolved)
+        except (OSError, RuntimeError) as e:
+            logger.debug(
+                "[Transcript] Skipping invalid CLI session candidate %s: %s",
+                candidate,
+                e,
+            )
+    return result
+
+
+def read_compacted_entries(transcript_path: str) -> list[dict] | None:
+    """Read compacted entries from the CLI session file after compaction.
+
+    Parses the JSONL file line-by-line, finds the ``isCompactSummary: true``
+    entry, and returns it plus all entries after it.
+
+    The CLI writes the compaction summary BEFORE sending the next message,
+    so the file is guaranteed to be flushed by the time we read it.
+
+    Returns a list of parsed dicts, or ``None`` if the file cannot be read
+    or no compaction summary is found.
+    """
+    if not transcript_path:
+        return None
+
+    projects_base = _projects_base()
+    real_path = os.path.realpath(transcript_path)
+    if not real_path.startswith(projects_base + os.sep):
+        logger.warning(
+            "[Transcript] transcript_path outside projects base: %s", transcript_path
+        )
+        return None
+
+    try:
+        content = Path(real_path).read_text()
+    except OSError as e:
+        logger.warning(
+            "[Transcript] Failed to read session file %s: %s", transcript_path, e
+        )
+        return None
+
+    lines = content.strip().split("\n")
+    compact_idx: int | None = None
+
+    for idx, line in enumerate(lines):
+        if not line.strip():
+            continue
+        entry = json.loads(line, fallback=None)
+        if not isinstance(entry, dict):
+            continue
+        if entry.get("isCompactSummary"):
+            compact_idx = idx  # don't break — find the LAST summary
+
+    if compact_idx is None:
+        logger.debug("[Transcript] No compaction summary found in %s", transcript_path)
+        return None
+
+    entries: list[dict] = []
+    for line in lines[compact_idx:]:
+        if not line.strip():
+            continue
+        entry = json.loads(line, fallback=None)
+        if isinstance(entry, dict):
+            entries.append(entry)
+
+    logger.info(
+        "[Transcript] Read %d compacted entries from %s (summary at line %d)",
+        len(entries),
+        transcript_path,
+        compact_idx + 1,
+    )
+    return entries
+
+
+def read_cli_session_file(sdk_cwd: str) -> str | None:
+    """Read the CLI's own session file, which reflects any compaction.
+
+    The CLI writes its session transcript to
+    ``~/.claude/projects/<encoded_cwd>/<session_id>.jsonl``.
+    Since each SDK turn uses a unique ``sdk_cwd``, there should be
+    exactly one ``.jsonl`` file in that directory.
+
+    Returns the file content, or ``None`` if not found.
+    """
+    project_dir = _cli_project_dir(sdk_cwd)
+    if not project_dir or not os.path.isdir(project_dir):
+        return None
+
+    jsonl_files = _safe_glob_jsonl(project_dir)
+    if not jsonl_files:
+        logger.debug("[Transcript] No CLI session file found in %s", project_dir)
+        return None
+
+    # Pick the most recently modified file (should be only one per turn).
+    try:
+        session_file = max(jsonl_files, key=lambda p: p.stat().st_mtime)
+    except OSError as e:
+        logger.warning("[Transcript] Failed to inspect CLI session files: %s", e)
+        return None
+
+    try:
+        content = session_file.read_text()
+        logger.info(
+            "[Transcript] Read CLI session file: %s (%d bytes)",
+            session_file,
+            len(content),
+        )
+        return content
+    except OSError as e:
+        logger.warning("[Transcript] Failed to read CLI session file: %s", e)
+        return None
+
+
 def cleanup_cli_project_dir(sdk_cwd: str) -> None:
    """Remove the CLI's project directory for a specific working directory.

@@ -144,25 +301,15 @@ def cleanup_cli_project_dir(sdk_cwd: str) -> None:
    Each SDK turn uses a unique ``sdk_cwd``, so the project directory is
    safe to remove entirely after the transcript has been uploaded.
    """
-    import shutil
-
-    # Encode cwd the same way CLI does (replaces non-alphanumeric with -)
-    cwd_encoded = re.sub(r"[^a-zA-Z0-9]", "-", os.path.realpath(sdk_cwd))
-    config_dir = os.environ.get("CLAUDE_CONFIG_DIR") or os.path.expanduser("~/.claude")
-    projects_base = os.path.realpath(os.path.join(config_dir, "projects"))
-    project_dir = os.path.realpath(os.path.join(projects_base, cwd_encoded))
-
-    if not project_dir.startswith(projects_base + os.sep):
-        logger.warning(
-            f"[Transcript] Cleanup path escaped projects base: {project_dir}"
-        )
+    project_dir = _cli_project_dir(sdk_cwd)
+    if not project_dir:
        return

    if os.path.isdir(project_dir):
        shutil.rmtree(project_dir, ignore_errors=True)
-        logger.debug(f"[Transcript] Cleaned up CLI project dir: {project_dir}")
+        logger.debug("[Transcript] Cleaned up CLI project dir: %s", project_dir)
    else:
-        logger.debug(f"[Transcript] Project dir not found: {project_dir}")
+        logger.debug("[Transcript] Project dir not found: %s", project_dir)


 def write_transcript_to_tempfile(
@@ -259,24 +406,27 @@ def _meta_storage_path_parts(user_id: str, session_id: str) -> tuple[str, str, s
    )


-def _build_storage_path(user_id: str, session_id: str, backend: object) -> str:
-    """Build the full storage path string that ``retrieve()`` expects.
-
-    ``store()`` returns a path like ``gcs://bucket/workspaces/...`` or
-    ``local://workspace_id/file_id/filename``.  Since we use deterministic
-    arguments we can reconstruct the same path for download/delete without
-    having stored the return value.
-    """
+def _build_path_from_parts(parts: tuple[str, str, str], backend: object) -> str:
+    """Build a full storage path from (workspace_id, file_id, filename) parts."""
    from backend.util.workspace_storage import GCSWorkspaceStorage

-    wid, fid, fname = _storage_path_parts(user_id, session_id)
-
+    wid, fid, fname = parts
    if isinstance(backend, GCSWorkspaceStorage):
        blob = f"workspaces/{wid}/{fid}/{fname}"
        return f"gcs://{backend.bucket_name}/{blob}"
-    else:
-        # LocalWorkspaceStorage returns local://{relative_path}
-        return f"local://{wid}/{fid}/{fname}"
+    return f"local://{wid}/{fid}/{fname}"
+
+
+def _build_storage_path(user_id: str, session_id: str, backend: object) -> str:
+    """Build the full storage path string that ``retrieve()`` expects."""
+    return _build_path_from_parts(_storage_path_parts(user_id, session_id), backend)
+
+
+def _build_meta_storage_path(user_id: str, session_id: str, backend: object) -> str:
+    """Build the full storage path for the companion .meta.json file."""
+    return _build_path_from_parts(
+        _meta_storage_path_parts(user_id, session_id), backend
+    )


 async def upload_transcript(
@@ -381,15 +531,7 @@ async def download_transcript(
    message_count = 0
    uploaded_at = 0.0
    try:
-        from backend.util.workspace_storage import GCSWorkspaceStorage
-
-        mwid, mfid, mfname = _meta_storage_path_parts(user_id, session_id)
-        if isinstance(storage, GCSWorkspaceStorage):
-            blob = f"workspaces/{mwid}/{mfid}/{mfname}"
-            meta_path = f"gcs://{storage.bucket_name}/{blob}"
-        else:
-            meta_path = f"local://{mwid}/{mfid}/{mfname}"
-
+        meta_path = _build_meta_storage_path(user_id, session_id, storage)
        meta_data = await storage.retrieve(meta_path)
        meta = json.loads(meta_data.decode("utf-8"), fallback={})
        message_count = meta.get("message_count", 0)
@@ -406,7 +548,11 @@ async def download_transcript(


 async def delete_transcript(user_id: str, session_id: str) -> None:
-    """Delete transcript from bucket storage (e.g. after resume failure)."""
+    """Delete transcript and its metadata from bucket storage.
+
+    Removes both the ``.jsonl`` transcript and the companion ``.meta.json``
+    so stale ``message_count`` watermarks cannot corrupt gap-fill logic.
+    """
    from backend.util.workspace_storage import get_workspace_storage

    storage = await get_workspace_storage()
@@ -414,6 +560,14 @@ async def delete_transcript(user_id: str, session_id: str) -> None:

    try:
        await storage.delete(path)
-        logger.info(f"[Transcript] Deleted transcript for session {session_id}")
+        logger.info("[Transcript] Deleted transcript for session %s", session_id)
    except Exception as e:
-        logger.warning(f"[Transcript] Failed to delete transcript: {e}")
+        logger.warning("[Transcript] Failed to delete transcript: %s", e)
+
+    # Also delete the companion .meta.json to avoid orphaned metadata.
+    try:
+        meta_path = _build_meta_storage_path(user_id, session_id, storage)
+        await storage.delete(meta_path)
+        logger.info("[Transcript] Deleted metadata for session %s", session_id)
+    except Exception as e:
+        logger.warning("[Transcript] Failed to delete metadata: %s", e)
--- a/autogpt_platform/backend/backend/copilot/sdk/transcript_builder.py
+++ b/autogpt_platform/backend/backend/copilot/sdk/transcript_builder.py
@@ -30,6 +30,7 @@ class TranscriptEntry(BaseModel):
    type: str
    uuid: str
    parentUuid: str | None
+    isCompactSummary: bool | None = None
    message: dict[str, Any]


@@ -53,6 +54,24 @@ class TranscriptBuilder:
            return self._entries[-1].message.get("id", "")
        return ""

+    @staticmethod
+    def _parse_entry(data: dict) -> TranscriptEntry | None:
+        """Parse a single transcript entry, filtering strippable types.
+
+        Returns ``None`` for entries that should be skipped (strippable types
+        that are not compaction summaries).
+        """
+        entry_type = data.get("type", "")
+        if entry_type in STRIPPABLE_TYPES and not data.get("isCompactSummary"):
+            return None
+        return TranscriptEntry(
+            type=entry_type,
+            uuid=data.get("uuid") or str(uuid4()),
+            parentUuid=data.get("parentUuid"),
+            isCompactSummary=data.get("isCompactSummary") or None,
+            message=data.get("message", {}),
+        )
+
    def load_previous(self, content: str, log_prefix: str = "[Transcript]") -> None:
        """Load complete previous transcript.

@@ -78,18 +97,9 @@ class TranscriptBuilder:
                )
                continue

-            # Load all non-strippable entries (user/assistant/system/etc.)
-            # Skip only STRIPPABLE_TYPES to match strip_progress_entries() behavior
-            entry_type = data.get("type", "")
-            if entry_type in STRIPPABLE_TYPES:
+            entry = self._parse_entry(data)
+            if entry is None:
                continue
-
-            entry = TranscriptEntry(
-                type=data["type"],
-                uuid=data.get("uuid") or str(uuid4()),
-                parentUuid=data.get("parentUuid"),
-                message=data.get("message", {}),
-            )
            self._entries.append(entry)
            self._last_uuid = entry.uuid

@@ -162,6 +172,43 @@ class TranscriptBuilder:
        )
        self._last_uuid = msg_uuid

+    def replace_entries(
+        self, compacted_entries: list[dict], log_prefix: str = "[Transcript]"
+    ) -> None:
+        """Replace all entries with compacted entries from the CLI session file.
+
+        Called after mid-stream compaction so TranscriptBuilder mirrors the
+        CLI's active context (compaction summary + post-compaction entries).
+
+        Builds the new list first and validates it's non-empty before swapping,
+        so corrupt input cannot wipe the conversation history.
+        """
+        new_entries: list[TranscriptEntry] = []
+        for data in compacted_entries:
+            entry = self._parse_entry(data)
+            if entry is not None:
+                new_entries.append(entry)
+
+        if not new_entries:
+            logger.warning(
+                "%s replace_entries produced 0 entries from %d inputs, keeping old (%d entries)",
+                log_prefix,
+                len(compacted_entries),
+                len(self._entries),
+            )
+            return
+
+        old_count = len(self._entries)
+        self._entries = new_entries
+        self._last_uuid = new_entries[-1].uuid
+
+        logger.info(
+            "%s TranscriptBuilder compacted: %d entries -> %d entries",
+            log_prefix,
+            old_count,
+            len(self._entries),
+        )
+
    def to_jsonl(self) -> str:
        """Export complete context as JSONL.

--- a/autogpt_platform/backend/backend/copilot/sdk/transcript_test.py
+++ b/autogpt_platform/backend/backend/copilot/sdk/transcript_test.py
@@ -1,15 +1,23 @@
 """Unit tests for JSONL transcript management utilities."""

 import os
+from unittest.mock import AsyncMock, patch
+
+import pytest

 from backend.util import json

 from .transcript import (
    STRIPPABLE_TYPES,
+    _cli_project_dir,
+    delete_transcript,
+    read_cli_session_file,
+    read_compacted_entries,
    strip_progress_entries,
    validate_transcript,
    write_transcript_to_tempfile,
 )
+from .transcript_builder import TranscriptBuilder


 def _make_jsonl(*entries: dict) -> str:
@@ -282,3 +290,610 @@ class TestStripProgressEntries:
        lines = result.strip().split("\n")
        asst_entry = json.loads(lines[-1])
        assert asst_entry["parentUuid"] == "u1"  # reparented
+
+
+# --- read_cli_session_file ---
+
+
+class TestReadCliSessionFile:
+    def test_no_matching_files_returns_none(self, tmp_path, monkeypatch):
+        """read_cli_session_file returns None when no .jsonl files exist."""
+        # Create a project dir with no jsonl files
+        project_dir = tmp_path / "projects" / "encoded-cwd"
+        project_dir.mkdir(parents=True)
+        monkeypatch.setattr(
+            "backend.copilot.sdk.transcript._cli_project_dir",
+            lambda sdk_cwd: str(project_dir),
+        )
+        assert read_cli_session_file("/fake/cwd") is None
+
+    def test_one_jsonl_file_returns_content(self, tmp_path, monkeypatch):
+        """read_cli_session_file returns the content of a single .jsonl file."""
+        project_dir = tmp_path / "projects" / "encoded-cwd"
+        project_dir.mkdir(parents=True)
+        jsonl_file = project_dir / "session.jsonl"
+        jsonl_file.write_text("line1\nline2\n")
+        monkeypatch.setattr(
+            "backend.copilot.sdk.transcript._cli_project_dir",
+            lambda sdk_cwd: str(project_dir),
+        )
+        result = read_cli_session_file("/fake/cwd")
+        assert result == "line1\nline2\n"
+
+    def test_symlink_escaping_project_dir_is_skipped(self, tmp_path, monkeypatch):
+        """read_cli_session_file skips symlinks that escape the project dir."""
+        project_dir = tmp_path / "projects" / "encoded-cwd"
+        project_dir.mkdir(parents=True)
+
+        # Create a file outside the project dir
+        outside = tmp_path / "outside"
+        outside.mkdir()
+        outside_file = outside / "evil.jsonl"
+        outside_file.write_text("should not be read\n")
+
+        # Symlink from inside project_dir to outside file
+        symlink = project_dir / "evil.jsonl"
+        symlink.symlink_to(outside_file)
+
+        monkeypatch.setattr(
+            "backend.copilot.sdk.transcript._cli_project_dir",
+            lambda sdk_cwd: str(project_dir),
+        )
+        # The symlink target resolves outside project_dir, so it should be skipped
+        result = read_cli_session_file("/fake/cwd")
+        assert result is None
+
+
+# --- _cli_project_dir ---
+
+
+class TestCliProjectDir:
+    def test_returns_none_for_path_traversal(self, tmp_path, monkeypatch):
+        """_cli_project_dir returns None when the project dir symlink escapes projects base."""
+        config_dir = tmp_path / "config"
+        config_dir.mkdir()
+        projects_dir = config_dir / "projects"
+        projects_dir.mkdir()
+
+        monkeypatch.setenv("CLAUDE_CONFIG_DIR", str(config_dir))
+
+        # Create a symlink inside projects/ that points outside of it.
+        # _cli_project_dir encodes the cwd as all-alnum-hyphens, so use a
+        # cwd whose encoded form matches the symlink name we create.
+        evil_target = tmp_path / "escaped"
+        evil_target.mkdir()
+
+        # The encoded form of "/evil/cwd" is "-evil-cwd"
+        symlink_path = projects_dir / "-evil-cwd"
+        symlink_path.symlink_to(evil_target)
+
+        result = _cli_project_dir("/evil/cwd")
+        assert result is None
+
+
+# --- delete_transcript ---
+
+
+class TestDeleteTranscript:
+    @pytest.mark.asyncio
+    async def test_deletes_both_jsonl_and_meta(self):
+        """delete_transcript removes both the .jsonl and .meta.json files."""
+        mock_storage = AsyncMock()
+        mock_storage.delete = AsyncMock()
+
+        with patch(
+            "backend.util.workspace_storage.get_workspace_storage",
+            new_callable=AsyncMock,
+            return_value=mock_storage,
+        ):
+            await delete_transcript("user-123", "session-456")
+
+        assert mock_storage.delete.call_count == 2
+        paths = [call.args[0] for call in mock_storage.delete.call_args_list]
+        assert any(p.endswith(".jsonl") for p in paths)
+        assert any(p.endswith(".meta.json") for p in paths)
+
+    @pytest.mark.asyncio
+    async def test_continues_on_jsonl_delete_failure(self):
+        """If .jsonl delete fails, .meta.json delete is still attempted."""
+        mock_storage = AsyncMock()
+        mock_storage.delete = AsyncMock(
+            side_effect=[Exception("jsonl delete failed"), None]
+        )
+
+        with patch(
+            "backend.util.workspace_storage.get_workspace_storage",
+            new_callable=AsyncMock,
+            return_value=mock_storage,
+        ):
+            # Should not raise
+            await delete_transcript("user-123", "session-456")
+
+        assert mock_storage.delete.call_count == 2
+
+    @pytest.mark.asyncio
+    async def test_handles_meta_delete_failure(self):
+        """If .meta.json delete fails, no exception propagates."""
+        mock_storage = AsyncMock()
+        mock_storage.delete = AsyncMock(
+            side_effect=[None, Exception("meta delete failed")]
+        )
+
+        with patch(
+            "backend.util.workspace_storage.get_workspace_storage",
+            new_callable=AsyncMock,
+            return_value=mock_storage,
+        ):
+            # Should not raise
+            await delete_transcript("user-123", "session-456")
+
+
+# --- read_compacted_entries ---
+
+
+COMPACT_SUMMARY = {
+    "type": "summary",
+    "uuid": "cs1",
+    "isCompactSummary": True,
+    "message": {"role": "assistant", "content": "compacted context"},
+}
+POST_COMPACT_ASST = {
+    "type": "assistant",
+    "uuid": "a2",
+    "parentUuid": "cs1",
+    "message": {"role": "assistant", "content": "response after compaction"},
+}
+
+
+class TestReadCompactedEntries:
+    def test_returns_summary_and_entries_after(self, tmp_path, monkeypatch):
+        """File with isCompactSummary entry returns summary + entries after."""
+        config_dir = tmp_path / "config"
+        projects_dir = config_dir / "projects"
+        session_dir = projects_dir / "proj"
+        session_dir.mkdir(parents=True)
+        monkeypatch.setenv("CLAUDE_CONFIG_DIR", str(config_dir))
+
+        pre_compact = {"type": "user", "uuid": "u1", "message": {"role": "user"}}
+        path = session_dir / "session.jsonl"
+        path.write_text(_make_jsonl(pre_compact, COMPACT_SUMMARY, POST_COMPACT_ASST))
+
+        result = read_compacted_entries(str(path))
+        assert result is not None
+        assert len(result) == 2
+        assert result[0]["isCompactSummary"] is True
+        assert result[1]["uuid"] == "a2"
+
+    def test_no_compact_summary_returns_none(self, tmp_path, monkeypatch):
+        """File without isCompactSummary returns None."""
+        config_dir = tmp_path / "config"
+        projects_dir = config_dir / "projects"
+        session_dir = projects_dir / "proj"
+        session_dir.mkdir(parents=True)
+        monkeypatch.setenv("CLAUDE_CONFIG_DIR", str(config_dir))
+
+        path = session_dir / "session.jsonl"
+        path.write_text(_make_jsonl(USER_MSG, ASST_MSG))
+
+        result = read_compacted_entries(str(path))
+        assert result is None
+
+    def test_file_not_found_returns_none(self, tmp_path, monkeypatch):
+        """Non-existent file returns None."""
+        config_dir = tmp_path / "config"
+        projects_dir = config_dir / "projects"
+        projects_dir.mkdir(parents=True)
+        monkeypatch.setenv("CLAUDE_CONFIG_DIR", str(config_dir))
+
+        result = read_compacted_entries(str(projects_dir / "missing.jsonl"))
+        assert result is None
+
+    def test_empty_path_returns_none(self):
+        """Empty string path returns None."""
+        result = read_compacted_entries("")
+        assert result is None
+
+    def test_malformed_json_lines_skipped(self, tmp_path, monkeypatch):
+        """Malformed JSON lines are skipped gracefully."""
+        config_dir = tmp_path / "config"
+        projects_dir = config_dir / "projects"
+        session_dir = projects_dir / "proj"
+        session_dir.mkdir(parents=True)
+        monkeypatch.setenv("CLAUDE_CONFIG_DIR", str(config_dir))
+
+        path = session_dir / "session.jsonl"
+        content = "not valid json\n" + json.dumps(COMPACT_SUMMARY) + "\n"
+        content += "also bad\n" + json.dumps(POST_COMPACT_ASST) + "\n"
+        path.write_text(content)
+
+        result = read_compacted_entries(str(path))
+        assert result is not None
+        assert len(result) == 2  # summary + post-compact assistant
+
+    def test_multiple_compact_summaries_uses_last(self, tmp_path, monkeypatch):
+        """When multiple isCompactSummary entries exist, uses the last one
+        (most recent compaction)."""
+        config_dir = tmp_path / "config"
+        projects_dir = config_dir / "projects"
+        session_dir = projects_dir / "proj"
+        session_dir.mkdir(parents=True)
+        monkeypatch.setenv("CLAUDE_CONFIG_DIR", str(config_dir))
+
+        second_summary = {
+            "type": "summary",
+            "uuid": "cs2",
+            "isCompactSummary": True,
+            "message": {"role": "assistant", "content": "second summary"},
+        }
+        path = session_dir / "session.jsonl"
+        path.write_text(_make_jsonl(COMPACT_SUMMARY, POST_COMPACT_ASST, second_summary))
+
+        result = read_compacted_entries(str(path))
+        assert result is not None
+        # Last summary found, so only cs2 returned
+        assert len(result) == 1
+        assert result[0]["uuid"] == "cs2"
+
+    def test_path_outside_projects_base_returns_none(self, tmp_path, monkeypatch):
+        """Transcript path outside the projects directory is rejected."""
+        config_dir = tmp_path / "config"
+        (config_dir / "projects").mkdir(parents=True)
+        monkeypatch.setenv("CLAUDE_CONFIG_DIR", str(config_dir))
+
+        evil_file = tmp_path / "evil.jsonl"
+        evil_file.write_text(_make_jsonl(COMPACT_SUMMARY))
+
+        result = read_compacted_entries(str(evil_file))
+        assert result is None
+
+
+# --- TranscriptBuilder.replace_entries ---
+
+
+class TestTranscriptBuilderReplaceEntries:
+    def test_replaces_existing_entries(self):
+        """replace_entries replaces all entries with compacted ones."""
+        builder = TranscriptBuilder()
+        builder.append_user("hello")
+        builder.append_assistant([{"type": "text", "text": "world"}])
+        assert builder.entry_count == 2
+
+        compacted = [
+            {
+                "type": "user",
+                "uuid": "cs1",
+                "isCompactSummary": True,
+                "message": {"role": "user", "content": "compacted summary"},
+            },
+            {
+                "type": "assistant",
+                "uuid": "a1",
+                "parentUuid": "cs1",
+                "message": {"role": "assistant", "content": "response"},
+            },
+        ]
+        builder.replace_entries(compacted)
+        assert builder.entry_count == 2
+        output = builder.to_jsonl()
+        entries = [json.loads(line) for line in output.strip().split("\n")]
+        assert entries[0]["uuid"] == "cs1"
+        assert entries[1]["uuid"] == "a1"
+
+    def test_filters_strippable_types(self):
+        """Strippable types are filtered out during replace."""
+        builder = TranscriptBuilder()
+        compacted = [
+            {
+                "type": "user",
+                "uuid": "cs1",
+                "message": {"role": "user", "content": "compacted summary"},
+            },
+            {"type": "progress", "uuid": "p1", "message": {}},
+            {"type": "summary", "uuid": "s1", "message": {}},
+            {
+                "type": "assistant",
+                "uuid": "a1",
+                "parentUuid": "cs1",
+                "message": {"role": "assistant", "content": "hi"},
+            },
+        ]
+        builder.replace_entries(compacted)
+        assert builder.entry_count == 2  # progress and summary were filtered
+
+    def test_maintains_last_uuid_chain(self):
+        """After replace, _last_uuid is the last entry's uuid."""
+        builder = TranscriptBuilder()
+        compacted = [
+            {
+                "type": "user",
+                "uuid": "cs1",
+                "message": {"role": "user", "content": "compacted summary"},
+            },
+            {
+                "type": "assistant",
+                "uuid": "a1",
+                "parentUuid": "cs1",
+                "message": {"role": "assistant", "content": "hi"},
+            },
+        ]
+        builder.replace_entries(compacted)
+        # Appending a new user message should chain to a1
+        builder.append_user("next question")
+        output = builder.to_jsonl()
+        entries = [json.loads(line) for line in output.strip().split("\n")]
+        assert entries[-1]["parentUuid"] == "a1"
+
+    def test_empty_entries_list_keeps_existing(self):
+        """Replacing with empty list keeps existing entries (safety check)."""
+        builder = TranscriptBuilder()
+        builder.append_user("hello")
+        builder.replace_entries([])
+        # Empty input is treated as corrupt — existing entries preserved
+        assert builder.entry_count == 1
+        assert not builder.is_empty
+
+
+# --- TranscriptBuilder.load_previous with compacted content ---
+
+
+class TestTranscriptBuilderLoadPreviousCompacted:
+    def test_preserves_compact_summary_entry(self):
+        """load_previous preserves isCompactSummary entries even though
+        their type is 'summary' (which is in STRIPPABLE_TYPES)."""
+        compacted_content = _make_jsonl(COMPACT_SUMMARY, POST_COMPACT_ASST)
+        builder = TranscriptBuilder()
+        builder.load_previous(compacted_content)
+        assert builder.entry_count == 2
+        output = builder.to_jsonl()
+        entries = [json.loads(line) for line in output.strip().split("\n")]
+        assert entries[0]["type"] == "summary"
+        assert entries[0]["uuid"] == "cs1"
+        assert entries[1]["uuid"] == "a2"
+
+    def test_strips_regular_summary_entries(self):
+        """Regular summary entries (without isCompactSummary) are still stripped."""
+        regular_summary = {"type": "summary", "uuid": "s1", "message": {"content": "x"}}
+        content = _make_jsonl(regular_summary, POST_COMPACT_ASST)
+        builder = TranscriptBuilder()
+        builder.load_previous(content)
+        assert builder.entry_count == 1  # Only the assistant entry
+
+
+# --- End-to-end compaction flow (simulates service.py) ---
+
+
+class TestCompactionFlowIntegration:
+    """Simulate the full compaction flow as it happens in service.py:
+
+    1. TranscriptBuilder loads a previous transcript (download)
+    2. New messages are appended (user query + assistant response)
+    3. CompactionTracker fires (PreCompact hook → emit_start → emit_end)
+    4. read_compacted_entries reads the CLI session file
+    5. TranscriptBuilder.replace_entries syncs with CLI state
+    6. Final to_jsonl() produces the correct output (upload)
+    """
+
+    def test_full_compaction_roundtrip(self, tmp_path, monkeypatch):
+        """Full roundtrip: load → append → compact → replace → export."""
+        # Setup: create a CLI session file with pre-compact + compaction entries
+        config_dir = tmp_path / "config"
+        projects_dir = config_dir / "projects"
+        session_dir = projects_dir / "proj"
+        session_dir.mkdir(parents=True)
+        monkeypatch.setenv("CLAUDE_CONFIG_DIR", str(config_dir))
+
+        # Simulate a transcript with old messages, then a compaction summary
+        old_user = {
+            "type": "user",
+            "uuid": "u1",
+            "message": {"role": "user", "content": "old question"},
+        }
+        old_asst = {
+            "type": "assistant",
+            "uuid": "a1",
+            "parentUuid": "u1",
+            "message": {"role": "assistant", "content": "old answer"},
+        }
+        compact_summary = {
+            "type": "summary",
+            "uuid": "cs1",
+            "isCompactSummary": True,
+            "message": {"role": "user", "content": "compacted summary of conversation"},
+        }
+        post_compact_asst = {
+            "type": "assistant",
+            "uuid": "a2",
+            "parentUuid": "cs1",
+            "message": {"role": "assistant", "content": "response after compaction"},
+        }
+        session_file = session_dir / "session.jsonl"
+        session_file.write_text(
+            _make_jsonl(old_user, old_asst, compact_summary, post_compact_asst)
+        )
+
+        # Step 1: TranscriptBuilder loads previous transcript (simulates download)
+        # The previous transcript would have the OLD entries (pre-compaction)
+        previous_transcript = _make_jsonl(old_user, old_asst)
+        builder = TranscriptBuilder()
+        builder.load_previous(previous_transcript)
+        assert builder.entry_count == 2
+
+        # Step 2: New messages appended during the current query
+        builder.append_user("new question")
+        builder.append_assistant([{"type": "text", "text": "new answer"}])
+        assert builder.entry_count == 4
+
+        # Step 3: read_compacted_entries reads the CLI session file
+        compacted = read_compacted_entries(str(session_file))
+        assert compacted is not None
+        assert len(compacted) == 2  # compact_summary + post_compact_asst
+        assert compacted[0]["isCompactSummary"] is True
+
+        # Step 4: replace_entries syncs builder with CLI state
+        builder.replace_entries(compacted)
+        assert builder.entry_count == 2  # Only compacted entries now
+
+        # Step 5: Append post-compaction messages (continuing the stream)
+        builder.append_user("follow-up question")
+        assert builder.entry_count == 3
+
+        # Step 6: Export and verify
+        output = builder.to_jsonl()
+        entries = [json.loads(line) for line in output.strip().split("\n")]
+        assert len(entries) == 3
+        # First entry is the compaction summary
+        assert entries[0]["type"] == "summary"
+        assert entries[0]["uuid"] == "cs1"
+        # Second is the post-compact assistant
+        assert entries[1]["uuid"] == "a2"
+        # Third is our follow-up, parented to the last compacted entry
+        assert entries[2]["type"] == "user"
+        assert entries[2]["parentUuid"] == "a2"
+
+    def test_compaction_preserves_chain_across_multiple_compactions(
+        self, tmp_path, monkeypatch
+    ):
+        """Two compactions: first compacts old history, second compacts the first."""
+        config_dir = tmp_path / "config"
+        projects_dir = config_dir / "projects"
+        session_dir = projects_dir / "proj"
+        session_dir.mkdir(parents=True)
+        monkeypatch.setenv("CLAUDE_CONFIG_DIR", str(config_dir))
+
+        # First compaction
+        first_summary = {
+            "type": "summary",
+            "uuid": "cs1",
+            "isCompactSummary": True,
+            "message": {"role": "user", "content": "first summary"},
+        }
+        mid_asst = {
+            "type": "assistant",
+            "uuid": "a1",
+            "parentUuid": "cs1",
+            "message": {"role": "assistant", "content": "mid response"},
+        }
+        # Second compaction (compacts the first summary + mid_asst)
+        second_summary = {
+            "type": "summary",
+            "uuid": "cs2",
+            "isCompactSummary": True,
+            "message": {"role": "user", "content": "second summary"},
+        }
+        final_asst = {
+            "type": "assistant",
+            "uuid": "a2",
+            "parentUuid": "cs2",
+            "message": {"role": "assistant", "content": "final response"},
+        }
+
+        session_file = session_dir / "session.jsonl"
+        session_file.write_text(
+            _make_jsonl(first_summary, mid_asst, second_summary, final_asst)
+        )
+
+        # read_compacted_entries should find the LAST summary
+        compacted = read_compacted_entries(str(session_file))
+        assert compacted is not None
+        assert len(compacted) == 2  # second_summary + final_asst
+        assert compacted[0]["uuid"] == "cs2"
+
+        # Apply to builder
+        builder = TranscriptBuilder()
+        builder.append_user("old stuff")
+        builder.append_assistant([{"type": "text", "text": "old response"}])
+        builder.replace_entries(compacted)
+        assert builder.entry_count == 2
+
+        # New message chains correctly
+        builder.append_user("after second compaction")
+        output = builder.to_jsonl()
+        entries = [json.loads(line) for line in output.strip().split("\n")]
+        assert entries[-1]["parentUuid"] == "a2"
+
+    def test_strip_progress_preserves_compact_summaries(self):
+        """strip_progress_entries doesn't strip isCompactSummary entries
+        even though their type is 'summary' (in STRIPPABLE_TYPES)."""
+        compact_summary = {
+            "type": "summary",
+            "uuid": "cs1",
+            "isCompactSummary": True,
+            "message": {"role": "user", "content": "compacted"},
+        }
+        regular_summary = {"type": "summary", "uuid": "s1", "message": {"content": "x"}}
+        progress = {"type": "progress", "uuid": "p1", "data": {"stdout": "..."}}
+        user = {
+            "type": "user",
+            "uuid": "u1",
+            "message": {"role": "user", "content": "hi"},
+        }
+
+        content = _make_jsonl(compact_summary, regular_summary, progress, user)
+        stripped = strip_progress_entries(content)
+        stripped_entries = [
+            json.loads(line) for line in stripped.strip().split("\n") if line.strip()
+        ]
+
+        uuids = [e.get("uuid") for e in stripped_entries]
+        # compact_summary kept, regular_summary stripped, progress stripped, user kept
+        assert "cs1" in uuids  # compact summary preserved
+        assert "s1" not in uuids  # regular summary stripped
+        assert "p1" not in uuids  # progress stripped
+        assert "u1" in uuids  # user kept
+
+    def test_builder_load_then_replace_then_export_roundtrip(self):
+        """Load a compacted transcript, replace with new compaction, export.
+        Simulates two consecutive turns with compaction each time."""
+        # Turn 1: load compacted transcript
+        compact1 = {
+            "type": "summary",
+            "uuid": "cs1",
+            "isCompactSummary": True,
+            "message": {"role": "user", "content": "summary v1"},
+        }
+        asst1 = {
+            "type": "assistant",
+            "uuid": "a1",
+            "parentUuid": "cs1",
+            "message": {"role": "assistant", "content": "response 1"},
+        }
+        builder = TranscriptBuilder()
+        builder.load_previous(_make_jsonl(compact1, asst1))
+        assert builder.entry_count == 2
+
+        # Turn 1: append new messages
+        builder.append_user("question")
+        builder.append_assistant([{"type": "text", "text": "answer"}])
+        assert builder.entry_count == 4
+
+        # Turn 1: compaction fires — replace with new compacted state
+        compact2 = {
+            "type": "summary",
+            "uuid": "cs2",
+            "isCompactSummary": True,
+            "message": {"role": "user", "content": "summary v2"},
+        }
+        asst2 = {
+            "type": "assistant",
+            "uuid": "a2",
+            "parentUuid": "cs2",
+            "message": {"role": "assistant", "content": "continuing"},
+        }
+        builder.replace_entries([compact2, asst2])
+        assert builder.entry_count == 2
+
+        # Export (this goes to cloud storage for next turn's download)
+        output = builder.to_jsonl()
+        lines = [json.loads(line) for line in output.strip().split("\n")]
+        assert lines[0]["uuid"] == "cs2"
+        assert lines[0]["type"] == "summary"
+        assert lines[1]["uuid"] == "a2"
+
+        # Turn 2: fresh builder loads the exported transcript
+        builder2 = TranscriptBuilder()
+        builder2.load_previous(output)
+        assert builder2.entry_count == 2
+        builder2.append_user("turn 2 question")
+        output2 = builder2.to_jsonl()
+        lines2 = [json.loads(line) for line in output2.strip().split("\n")]
+        assert lines2[-1]["parentUuid"] == "a2"
--- a/autogpt_platform/backend/backend/copilot/service.py
+++ b/autogpt_platform/backend/backend/copilot/service.py
@@ -22,7 +22,7 @@ from backend.util.exceptions import NotAuthorizedError, NotFoundError
 from backend.util.settings import AppEnvironment, Settings

 from .config import ChatConfig
-from .model import ChatSessionInfo, get_chat_session, upsert_chat_session
+from .model import ChatSession, ChatSessionInfo, get_chat_session, upsert_chat_session

 logger = logging.getLogger(__name__)

@@ -64,7 +64,13 @@ def _is_langfuse_configured() -> bool:
    )


-async def _get_system_prompt_template(context: str) -> str:
+async def _get_system_prompt_template(
+    context: str,
+    *,
+    prompt_name: str | None = None,
+    fallback_prompt: str | None = None,
+    template_vars: dict[str, str] | None = None,
+) -> str:
    """Get the system prompt, trying Langfuse first with fallback to default.

    Args:
@@ -73,6 +79,11 @@ async def _get_system_prompt_template(context: str) -> str:
    Returns:
        The compiled system prompt string.
    """
+    resolved_prompt_name = prompt_name or config.langfuse_prompt_name
+    resolved_template_vars = {
+        "users_information": context,
+        **(template_vars or {}),
+    }
    if _is_langfuse_configured():
        try:
            # Use asyncio.to_thread to avoid blocking the event loop
@@ -85,16 +96,16 @@ async def _get_system_prompt_template(context: str) -> str:
            )
            prompt = await asyncio.to_thread(
                langfuse.get_prompt,
-                config.langfuse_prompt_name,
+                resolved_prompt_name,
                label=label,
                cache_ttl_seconds=config.langfuse_prompt_cache_ttl,
            )
-            return prompt.compile(users_information=context)
+            return prompt.compile(**resolved_template_vars)
        except Exception as e:
            logger.warning(f"Failed to fetch prompt from Langfuse, using default: {e}")

    # Fallback to default prompt
-    return DEFAULT_SYSTEM_PROMPT.format(users_information=context)
+    return (fallback_prompt or DEFAULT_SYSTEM_PROMPT).format(**resolved_template_vars)


 async def _build_system_prompt(
@@ -131,6 +142,21 @@ async def _build_system_prompt(
    return compiled, understanding


+async def _resolve_system_prompt(
+    session: ChatSession,
+    user_id: str | None,
+    *,
+    has_conversation_history: bool = False,
+) -> tuple[str, Any]:
+    override = session.session_config.system_prompt_override
+    if override:
+        return override, None
+    return await _build_system_prompt(
+        user_id,
+        has_conversation_history=has_conversation_history,
+    )
+
+
 async def _generate_session_title(
    message: str,
    user_id: str | None = None,
--- a/autogpt_platform/backend/backend/copilot/session_types.py
+++ b/autogpt_platform/backend/backend/copilot/session_types.py
@@ -0,0 +1,60 @@
+from __future__ import annotations
+
+from datetime import datetime
+from enum import Enum
+
+from pydantic import BaseModel, Field, model_validator
+
+
+class ChatSessionStartType(str, Enum):
+    MANUAL = "MANUAL"
+    AUTOPILOT_NIGHTLY = "AUTOPILOT_NIGHTLY"
+    AUTOPILOT_CALLBACK = "AUTOPILOT_CALLBACK"
+    AUTOPILOT_INVITE_CTA = "AUTOPILOT_INVITE_CTA"
+
+
+class ChatSessionConfig(BaseModel):
+    system_prompt_override: str | None = None
+    initial_user_message: str | None = None
+    initial_assistant_message: str | None = None
+    extra_tools: list[str] = Field(default_factory=list)
+    disabled_tools: list[str] = Field(default_factory=list)
+
+    def allows_tool(self, tool_name: str) -> bool:
+        return tool_name in self.extra_tools
+
+    def disables_tool(self, tool_name: str) -> bool:
+        return tool_name in self.disabled_tools
+
+
+class CompletionReportInput(BaseModel):
+    thoughts: str
+    should_notify_user: bool
+    email_title: str | None = None
+    email_body: str | None = None
+    callback_session_message: str | None = None
+    approval_summary: str | None = None
+
+    @model_validator(mode="after")
+    def validate_notification_fields(self) -> "CompletionReportInput":
+        if self.should_notify_user:
+            required_fields = {
+                "email_title": self.email_title,
+                "email_body": self.email_body,
+                "callback_session_message": self.callback_session_message,
+            }
+            missing = [
+                field_name for field_name, value in required_fields.items() if not value
+            ]
+            if missing:
+                raise ValueError(
+                    "Missing required notification fields: " + ", ".join(missing)
+                )
+        return self
+
+
+class StoredCompletionReport(CompletionReportInput):
+    has_pending_approvals: bool
+    pending_approval_count: int
+    pending_approval_graph_exec_id: str | None = None
+    saved_at: datetime
--- a/autogpt_platform/backend/backend/copilot/stream_registry.py
+++ b/autogpt_platform/backend/backend/copilot/stream_registry.py
@@ -17,11 +17,12 @@ Subscribers:
 import asyncio
 import logging
 import time
-from dataclasses import dataclass, field
+from collections.abc import Awaitable
 from datetime import datetime, timezone
-from typing import Any, Literal
+from typing import Any, Literal, cast

 import orjson
+from pydantic import BaseModel, ConfigDict, Field

 from backend.api.model import CopilotCompletionPayload
 from backend.data.notification_bus import (
@@ -55,6 +56,12 @@ _listener_sessions: dict[int, tuple[str, asyncio.Task]] = {}
 # Timeout for putting chunks into subscriber queues (seconds)
 # If the queue is full and doesn't drain within this time, send an overflow error
 QUEUE_PUT_TIMEOUT = 5.0
+SESSION_LOOKUP_RETRY_SECONDS = 0.05
+STREAM_REPLAY_COUNT = 1000
+STREAM_XREAD_BLOCK_MS = 5000
+STREAM_XREAD_COUNT = 100
+STALE_SESSION_BUFFER_SECONDS = 300
+UNSUBSCRIBE_TIMEOUT_SECONDS = 5.0

 # Lua script for atomic compare-and-swap status update (idempotent completion)
 # Returns 1 if status was updated, 0 if already completed/failed
@@ -68,19 +75,24 @@ return 0
 """


-@dataclass
-class ActiveSession:
+SessionStatus = Literal["running", "completed", "failed"]
+RedisHash = dict[str, str]
+RedisStreamMessages = list[tuple[str, list[tuple[str, RedisHash]]]]
+
+
+class ActiveSession(BaseModel):
    """Represents an active streaming session (metadata only, no in-memory queues)."""

+    model_config = ConfigDict(frozen=True)
+
    session_id: str
    user_id: str | None
    tool_call_id: str
    tool_name: str
    turn_id: str = ""
    blocking: bool = False  # If True, HTTP request is waiting for completion
-    status: Literal["running", "completed", "failed"] = "running"
-    created_at: datetime = field(default_factory=lambda: datetime.now(timezone.utc))
-    asyncio_task: asyncio.Task | None = None
+    status: SessionStatus = "running"
+    created_at: datetime = Field(default_factory=lambda: datetime.now(timezone.utc))


 def _get_session_meta_key(session_id: str) -> str:
@@ -93,7 +105,54 @@ def _get_turn_stream_key(turn_id: str) -> str:
    return f"{config.turn_stream_prefix}{turn_id}"


-def _parse_session_meta(meta: dict[Any, Any], session_id: str = "") -> ActiveSession:
+async def _redis_hset_mapping(redis: Any, key: str, mapping: RedisHash) -> int:
+    return await cast(Awaitable[int], redis.hset(key, mapping=mapping))
+
+
+async def _redis_hgetall(redis: Any, key: str) -> RedisHash:
+    return cast(
+        RedisHash,
+        await cast(Awaitable[dict[str, str]], redis.hgetall(key)),
+    )
+
+
+async def _redis_hget(redis: Any, key: str, field: str) -> str | None:
+    return cast(
+        str | None,
+        await cast(Awaitable[str | None], redis.hget(key, field)),
+    )
+
+
+async def _redis_xread(
+    redis: Any,
+    streams: dict[str, str],
+    *,
+    count: int,
+    block: int | None,
+) -> RedisStreamMessages:
+    return cast(
+        RedisStreamMessages,
+        await cast(
+            Awaitable[RedisStreamMessages],
+            redis.xread(streams, count=count, block=block),
+        ),
+    )
+
+
+async def _redis_complete_session(
+    redis: Any,
+    meta_key: str,
+    status: SessionStatus,
+) -> int:
+    return int(
+        await cast(
+            Awaitable[int | str],
+            redis.eval(COMPLETE_SESSION_SCRIPT, 1, meta_key, status),
+        )
+    )
+
+
+def _parse_session_meta(meta: RedisHash, session_id: str = "") -> ActiveSession:
    """Parse a raw Redis hash into a typed ActiveSession.

    Centralises the ``meta.get(...)`` boilerplate so callers don't repeat it.
@@ -107,7 +166,7 @@ def _parse_session_meta(meta: dict[Any, Any], session_id: str = "") -> ActiveSes
        tool_name=meta.get("tool_name", ""),
        turn_id=meta.get("turn_id", "") or session_id,
        blocking=meta.get("blocking") == "1",
-        status=meta.get("status", "running"),  # type: ignore[arg-type]
+        status=cast(SessionStatus, meta.get("status", "running")),
    )


@@ -170,7 +229,8 @@ async def create_session(
    # No need to delete old stream — each turn_id is a fresh UUID

    hset_start = time.perf_counter()
-    await redis.hset(  # type: ignore[misc]
+    await _redis_hset_mapping(
+        redis,
        meta_key,
        mapping={
            "session_id": session_id,
@@ -280,6 +340,108 @@ async def publish_chunk(
    return message_id


+def _decode_stream_chunk(msg_data: RedisHash) -> StreamBaseResponse | None:
+    raw_data = msg_data.get("data")
+    if raw_data is None:
+        return None
+
+    chunk_data = orjson.loads(raw_data)
+    return _reconstruct_chunk(chunk_data)
+
+
+async def _replay_messages(
+    messages: RedisStreamMessages,
+    subscriber_queue: asyncio.Queue[StreamBaseResponse],
+    *,
+    last_message_id: str,
+) -> tuple[int, str]:
+    replayed_count = 0
+    replay_last_id = last_message_id
+
+    for _stream_name, stream_messages in messages:
+        for msg_id, msg_data in stream_messages:
+            replay_last_id = msg_id
+            try:
+                chunk = _decode_stream_chunk(msg_data)
+                if chunk is None:
+                    continue
+                await subscriber_queue.put(chunk)
+                replayed_count += 1
+            except Exception as exc:
+                logger.warning("Failed to replay message: %s", exc)
+
+    return replayed_count, replay_last_id
+
+
+async def _deliver_message_to_queue(
+    session_id: str,
+    subscriber_queue: asyncio.Queue[StreamBaseResponse],
+    chunk: StreamBaseResponse,
+    *,
+    last_delivered_id: str,
+    log_meta: dict[str, Any],
+) -> bool:
+    try:
+        await asyncio.wait_for(
+            subscriber_queue.put(chunk),
+            timeout=QUEUE_PUT_TIMEOUT,
+        )
+        return True
+    except asyncio.TimeoutError:
+        logger.warning(
+            f"[TIMING] Subscriber queue full, delivery timed out after {QUEUE_PUT_TIMEOUT}s",
+            extra={
+                "json_fields": {
+                    **log_meta,
+                    "timeout_s": QUEUE_PUT_TIMEOUT,
+                    "reason": "queue_full",
+                }
+            },
+        )
+        try:
+            overflow_error = StreamError(
+                errorText="Message delivery timeout - some messages may have been missed",
+                code="QUEUE_OVERFLOW",
+                details={
+                    "last_delivered_id": last_delivered_id,
+                    "recovery_hint": f"Reconnect with last_message_id={last_delivered_id}",
+                },
+            )
+            subscriber_queue.put_nowait(overflow_error)
+        except asyncio.QueueFull:
+            logger.error(
+                f"Cannot deliver overflow error for session {session_id}, queue completely blocked"
+            )
+        return False
+
+
+async def _handle_xread_timeout(
+    redis: Any,
+    session_id: str,
+    subscriber_queue: asyncio.Queue[StreamBaseResponse],
+) -> bool:
+    meta_key = _get_session_meta_key(session_id)
+    status = await _redis_hget(redis, meta_key, "status")
+    if status != "running":
+        try:
+            await asyncio.wait_for(
+                subscriber_queue.put(StreamFinish()),
+                timeout=QUEUE_PUT_TIMEOUT,
+            )
+        except asyncio.TimeoutError:
+            logger.warning(f"Timeout delivering finish event for session {session_id}")
+        return False
+
+    try:
+        await asyncio.wait_for(
+            subscriber_queue.put(StreamHeartbeat()),
+            timeout=QUEUE_PUT_TIMEOUT,
+        )
+    except asyncio.TimeoutError:
+        logger.warning(f"Timeout delivering heartbeat for session {session_id}")
+    return True
+
+
 async def subscribe_to_session(
    session_id: str,
    user_id: str | None,
@@ -313,7 +475,7 @@ async def subscribe_to_session(
    redis_start = time.perf_counter()
    redis = await get_redis_async()
    meta_key = _get_session_meta_key(session_id)
-    meta: dict[Any, Any] = await redis.hgetall(meta_key)  # type: ignore[misc]
+    meta = await _redis_hgetall(redis, meta_key)
    hgetall_time = (time.perf_counter() - redis_start) * 1000
    logger.info(
        f"[TIMING] Redis hgetall took {hgetall_time:.1f}ms",
@@ -328,8 +490,8 @@ async def subscribe_to_session(
            "[TIMING] Session not found on first attempt, retrying after 50ms delay",
            extra={"json_fields": {**log_meta}},
        )
-        await asyncio.sleep(0.05)  # 50ms
-        meta = await redis.hgetall(meta_key)  # type: ignore[misc]
+        await asyncio.sleep(SESSION_LOOKUP_RETRY_SECONDS)
+        meta = await _redis_hgetall(redis, meta_key)
        if not meta:
            elapsed = (time.perf_counter() - start_time) * 1000
            logger.info(
@@ -374,7 +536,12 @@ async def subscribe_to_session(

    # Step 1: Replay messages from Redis Stream
    xread_start = time.perf_counter()
-    messages = await redis.xread({stream_key: last_message_id}, block=None, count=1000)
+    messages = await _redis_xread(
+        redis,
+        {stream_key: last_message_id},
+        block=None,
+        count=STREAM_REPLAY_COUNT,
+    )
    xread_time = (time.perf_counter() - xread_start) * 1000
    logger.info(
        f"[TIMING] Redis xread (replay) took {xread_time:.1f}ms, status={session_status}",
@@ -387,22 +554,11 @@ async def subscribe_to_session(
        },
    )

-    replayed_count = 0
-    replay_last_id = last_message_id
-    if messages:
-        for _stream_name, stream_messages in messages:
-            for msg_id, msg_data in stream_messages:
-                replay_last_id = msg_id if isinstance(msg_id, str) else msg_id.decode()
-                # Note: Redis client uses decode_responses=True, so keys are strings
-                if "data" in msg_data:
-                    try:
-                        chunk_data = orjson.loads(msg_data["data"])
-                        chunk = _reconstruct_chunk(chunk_data)
-                        if chunk:
-                            await subscriber_queue.put(chunk)
-                            replayed_count += 1
-                    except Exception as e:
-                        logger.warning(f"Failed to replay message: {e}")
+    replayed_count, replay_last_id = await _replay_messages(
+        messages,
+        subscriber_queue,
+        last_message_id=last_message_id,
+    )

    logger.info(
        f"[TIMING] Replayed {replayed_count} messages, last_id={replay_last_id}",
@@ -455,7 +611,7 @@ async def _stream_listener(
    session_id: str,
    subscriber_queue: asyncio.Queue[StreamBaseResponse],
    last_replayed_id: str,
-    log_meta: dict | None = None,
+    log_meta: dict[str, Any] | None = None,
    turn_id: str = "",
 ) -> None:
    """Listen to Redis Stream for new messages using blocking XREAD.
@@ -499,8 +655,11 @@ async def _stream_listener(
            # Short timeout prevents frontend timeout (12s) while waiting for heartbeats (15s)
            xread_start = time.perf_counter()
            xread_count += 1
-            messages = await redis.xread(
-                {stream_key: current_id}, block=5000, count=100
+            messages = await _redis_xread(
+                redis,
+                {stream_key: current_id},
+                block=STREAM_XREAD_BLOCK_MS,
+                count=STREAM_XREAD_COUNT,
            )
            xread_time = (time.perf_counter() - xread_start) * 1000

@@ -532,114 +691,66 @@ async def _stream_listener(
                )

            if not messages:
-                # Timeout - check if session is still running
-                meta_key = _get_session_meta_key(session_id)
-                status = await redis.hget(meta_key, "status")  # type: ignore[misc]
-                # Stop if session metadata is gone (TTL expired) or status is not "running"
-                if status != "running":
-                    try:
-                        await asyncio.wait_for(
-                            subscriber_queue.put(StreamFinish()),
-                            timeout=QUEUE_PUT_TIMEOUT,
-                        )
-                    except asyncio.TimeoutError:
-                        logger.warning(
-                            f"Timeout delivering finish event for session {session_id}"
-                        )
+                if not await _handle_xread_timeout(
+                    redis,
+                    session_id,
+                    subscriber_queue,
+                ):
                    break
-                # Session still running - send heartbeat to keep connection alive
-                # This prevents frontend timeout (12s) during long-running operations
-                try:
-                    await asyncio.wait_for(
-                        subscriber_queue.put(StreamHeartbeat()),
-                        timeout=QUEUE_PUT_TIMEOUT,
-                    )
-                except asyncio.TimeoutError:
-                    logger.warning(
-                        f"Timeout delivering heartbeat for session {session_id}"
-                    )
                continue

            for _stream_name, stream_messages in messages:
                for msg_id, msg_data in stream_messages:
-                    current_id = msg_id if isinstance(msg_id, str) else msg_id.decode()
-
-                    if "data" not in msg_data:
-                        continue
-
+                    current_id = msg_id
                    try:
-                        chunk_data = orjson.loads(msg_data["data"])
-                        chunk = _reconstruct_chunk(chunk_data)
-                        if chunk:
-                            try:
-                                await asyncio.wait_for(
-                                    subscriber_queue.put(chunk),
-                                    timeout=QUEUE_PUT_TIMEOUT,
-                                )
-                                # Update last delivered ID on successful delivery
-                                last_delivered_id = current_id
-                                messages_delivered += 1
-                                if first_message_time is None:
-                                    first_message_time = time.perf_counter()
-                                    elapsed = (first_message_time - start_time) * 1000
-                                    logger.info(
-                                        f"[TIMING] FIRST live message at {elapsed:.1f}ms, type={type(chunk).__name__}",
-                                        extra={
-                                            "json_fields": {
-                                                **log_meta,
-                                                "elapsed_ms": elapsed,
-                                                "chunk_type": type(chunk).__name__,
-                                            }
-                                        },
-                                    )
-                            except asyncio.TimeoutError:
-                                logger.warning(
-                                    f"[TIMING] Subscriber queue full, delivery timed out after {QUEUE_PUT_TIMEOUT}s",
-                                    extra={
-                                        "json_fields": {
-                                            **log_meta,
-                                            "timeout_s": QUEUE_PUT_TIMEOUT,
-                                            "reason": "queue_full",
-                                        }
-                                    },
-                                )
-                                # Send overflow error with recovery info
-                                try:
-                                    overflow_error = StreamError(
-                                        errorText="Message delivery timeout - some messages may have been missed",
-                                        code="QUEUE_OVERFLOW",
-                                        details={
-                                            "last_delivered_id": last_delivered_id,
-                                            "recovery_hint": f"Reconnect with last_message_id={last_delivered_id}",
-                                        },
-                                    )
-                                    subscriber_queue.put_nowait(overflow_error)
-                                except asyncio.QueueFull:
-                                    # Queue is completely stuck, nothing more we can do
-                                    logger.error(
-                                        f"Cannot deliver overflow error for session {session_id}, "
-                                        "queue completely blocked"
-                                    )
-
-                            # Stop listening on finish
-                            if isinstance(chunk, StreamFinish):
-                                total_time = (time.perf_counter() - start_time) * 1000
-                                logger.info(
-                                    f"[TIMING] StreamFinish received in {total_time / 1000:.1f}s; delivered={messages_delivered}",
-                                    extra={
-                                        "json_fields": {
-                                            **log_meta,
-                                            "total_time_ms": total_time,
-                                            "messages_delivered": messages_delivered,
-                                        }
-                                    },
-                                )
-                                return
+                        chunk = _decode_stream_chunk(msg_data)
                    except Exception as e:
                        logger.warning(
                            f"Error processing stream message: {e}",
                            extra={"json_fields": {**log_meta, "error": str(e)}},
                        )
+                        continue
+
+                    if chunk is None:
+                        continue
+
+                    delivered = await _deliver_message_to_queue(
+                        session_id,
+                        subscriber_queue,
+                        chunk,
+                        last_delivered_id=last_delivered_id,
+                        log_meta=log_meta,
+                    )
+                    if delivered:
+                        last_delivered_id = current_id
+                        messages_delivered += 1
+                        if first_message_time is None:
+                            first_message_time = time.perf_counter()
+                            elapsed = (first_message_time - start_time) * 1000
+                            logger.info(
+                                f"[TIMING] FIRST live message at {elapsed:.1f}ms, type={type(chunk).__name__}",
+                                extra={
+                                    "json_fields": {
+                                        **log_meta,
+                                        "elapsed_ms": elapsed,
+                                        "chunk_type": type(chunk).__name__,
+                                    }
+                                },
+                            )
+
+                    if isinstance(chunk, StreamFinish):
+                        total_time = (time.perf_counter() - start_time) * 1000
+                        logger.info(
+                            f"[TIMING] StreamFinish received in {total_time / 1000:.1f}s; delivered={messages_delivered}",
+                            extra={
+                                "json_fields": {
+                                    **log_meta,
+                                    "total_time_ms": total_time,
+                                    "messages_delivered": messages_delivered,
+                                }
+                            },
+                        )
+                        return

    except asyncio.CancelledError:
        elapsed = (time.perf_counter() - start_time) * 1000
@@ -712,16 +823,16 @@ async def mark_session_completed(
    Returns:
        True if session was newly marked completed, False if already completed/failed
    """
-    status: Literal["completed", "failed"] = "failed" if error_message else "completed"
+    status: SessionStatus = "failed" if error_message else "completed"
    redis = await get_redis_async()
    meta_key = _get_session_meta_key(session_id)

    # Resolve turn_id for publishing to the correct stream
-    meta: dict[Any, Any] = await redis.hgetall(meta_key)  # type: ignore[misc]
+    meta = await _redis_hgetall(redis, meta_key)
    turn_id = _parse_session_meta(meta, session_id).turn_id if meta else session_id

    # Atomic compare-and-swap: only update if status is "running"
-    result = await redis.eval(COMPLETE_SESSION_SCRIPT, 1, meta_key, status)  # type: ignore[misc]
+    result = await _redis_complete_session(redis, meta_key, status)

    if result == 0:
        logger.debug(f"Session {session_id} already completed/failed, skipping")
@@ -774,6 +885,18 @@ async def mark_session_completed(
                    f"for session {session_id}: {e}"
                )

+    try:
+        from backend.copilot.autopilot import handle_non_manual_session_completion
+
+        await handle_non_manual_session_completion(session_id)
+    except Exception as e:
+        logger.warning(
+            "Failed to process non-manual completion for session %s: %s",
+            session_id,
+            e,
+            exc_info=True,
+        )
+
    return True


@@ -788,7 +911,7 @@ async def get_session(session_id: str) -> ActiveSession | None:
    """
    redis = await get_redis_async()
    meta_key = _get_session_meta_key(session_id)
-    meta: dict[Any, Any] = await redis.hgetall(meta_key)  # type: ignore[misc]
+    meta = await _redis_hgetall(redis, meta_key)

    if not meta:
        return None
@@ -815,7 +938,7 @@ async def get_session_with_expiry_info(
    redis = await get_redis_async()
    meta_key = _get_session_meta_key(session_id)

-    meta: dict[Any, Any] = await redis.hgetall(meta_key)  # type: ignore[misc]
+    meta = await _redis_hgetall(redis, meta_key)

    if not meta:
        # Metadata expired — we can't resolve turn_id, so check using
@@ -847,7 +970,7 @@ async def get_active_session(

    redis = await get_redis_async()
    meta_key = _get_session_meta_key(session_id)
-    meta: dict[Any, Any] = await redis.hgetall(meta_key)  # type: ignore[misc]
+    meta = await _redis_hgetall(redis, meta_key)

    if not meta:
        return None, "0-0"
@@ -871,7 +994,9 @@ async def get_active_session(
        try:
            created_at = datetime.fromisoformat(created_at_str)
            age_seconds = (datetime.now(timezone.utc) - created_at).total_seconds()
-            stale_threshold = COPILOT_CONSUMER_TIMEOUT_SECONDS + 300  # + 5min buffer
+            stale_threshold = (
+                COPILOT_CONSUMER_TIMEOUT_SECONDS + STALE_SESSION_BUFFER_SECONDS
+            )
            if age_seconds > stale_threshold:
                logger.warning(
                    f"[STALE_SESSION] Auto-completing stale session {session_id[:8]}... "
@@ -946,7 +1071,11 @@ def _reconstruct_chunk(chunk_data: dict) -> StreamBaseResponse | None:
    }

    chunk_type = chunk_data.get("type")
-    chunk_class = type_to_class.get(chunk_type)  # type: ignore[arg-type]
+    if not isinstance(chunk_type, str):
+        logger.warning(f"Unknown chunk type: {chunk_type}")
+        return None
+
+    chunk_class = type_to_class.get(chunk_type)

    if chunk_class is None:
        logger.warning(f"Unknown chunk type: {chunk_type}")
@@ -1011,7 +1140,7 @@ async def unsubscribe_from_session(

    try:
        # Wait for the task to be cancelled with a timeout
-        await asyncio.wait_for(listener_task, timeout=5.0)
+        await asyncio.wait_for(listener_task, timeout=UNSUBSCRIBE_TIMEOUT_SECONDS)
    except asyncio.CancelledError:
        # Expected - the task was successfully cancelled
        pass
--- a/autogpt_platform/backend/backend/copilot/tools/init.py
+++ b/autogpt_platform/backend/backend/copilot/tools/init.py
@@ -12,6 +12,7 @@ from .agent_browser import BrowserActTool, BrowserNavigateTool, BrowserScreensho
 from .agent_output import AgentOutputTool
 from .base import BaseTool
 from .bash_exec import BashExecTool
+from .completion_report import CompletionReportTool
 from .continue_run_block import ContinueRunBlockTool
 from .create_agent import CreateAgentTool
 from .customize_agent import CustomizeAgentTool
@@ -50,10 +51,12 @@ if TYPE_CHECKING:
    from backend.copilot.response_model import StreamToolOutputAvailable

 logger = logging.getLogger(__name__)
+SESSION_SCOPED_TOOL_NAMES = {"completion_report"}

 # Single source of truth for all tools
 TOOL_REGISTRY: dict[str, BaseTool] = {
    "add_understanding": AddUnderstandingTool(),
+    "completion_report": CompletionReportTool(),
    "create_agent": CreateAgentTool(),
    "customize_agent": CustomizeAgentTool(),
    "edit_agent": EditAgentTool(),
@@ -103,16 +106,38 @@ find_agent_tool = TOOL_REGISTRY["find_agent"]
 run_agent_tool = TOOL_REGISTRY["run_agent"]


-def get_available_tools() -> list[ChatCompletionToolParam]:
+def is_tool_enabled(tool_name: str, session: "ChatSession | None" = None) -> bool:
+    if tool_name not in TOOL_REGISTRY:
+        return False
+    if session is not None and session.disables_tool(tool_name):
+        return False
+    if tool_name not in SESSION_SCOPED_TOOL_NAMES:
+        return True
+    if session is None:
+        return False
+    return session.allows_tool(tool_name)
+
+
+def iter_available_tools(
+    session: "ChatSession | None" = None,
+) -> list[tuple[str, BaseTool]]:
+    return [
+        (tool_name, tool)
+        for tool_name, tool in TOOL_REGISTRY.items()
+        if tool.is_available and is_tool_enabled(tool_name, session)
+    ]
+
+
+def get_available_tools(
+    session: "ChatSession | None" = None,
+) -> list[ChatCompletionToolParam]:
    """Return OpenAI tool schemas for tools available in the current environment.

    Called per-request so that env-var or binary availability is evaluated
    fresh each time (e.g. browser_* tools are excluded when agent-browser
    CLI is not installed).
    """
-    return [
-        tool.as_openai_tool() for tool in TOOL_REGISTRY.values() if tool.is_available
-    ]
+    return [tool.as_openai_tool() for _, tool in iter_available_tools(session)]


 def get_tool(tool_name: str) -> BaseTool | None:
@@ -128,6 +153,9 @@ async def execute_tool(
    tool_call_id: str,
 ) -> "StreamToolOutputAvailable":
    """Execute a tool by name."""
+    if not is_tool_enabled(tool_name, session):
+        raise ValueError(f"Tool {tool_name} is not enabled for this session")
+
    tool = get_tool(tool_name)
    if not tool:
        raise ValueError(f"Tool {tool_name} not found")
--- a/autogpt_platform/backend/backend/copilot/tools/agent_generator/validator.py
+++ b/autogpt_platform/backend/backend/copilot/tools/agent_generator/validator.py
@@ -935,5 +935,5 @@ class AgentValidator:
            for i, error in enumerate(self.errors, 1):
                error_message += f"{i}. {error}\n"

-            logger.error(f"Agent validation failed: {error_message}")
+            logger.warning(f"Agent validation failed: {error_message}")
            return False, error_message
--- a/autogpt_platform/backend/backend/copilot/tools/completion_report.py
+++ b/autogpt_platform/backend/backend/copilot/tools/completion_report.py
@@ -0,0 +1,83 @@
+"""Tool for finalizing non-manual Copilot sessions."""
+
+from typing import Any
+
+from backend.copilot.constants import COPILOT_SESSION_PREFIX
+from backend.copilot.model import ChatSession
+from backend.copilot.session_types import CompletionReportInput
+from backend.data.db_accessors import review_db
+
+from .base import BaseTool
+from .models import CompletionReportSavedResponse, ErrorResponse, ToolResponseBase
+
+
+class CompletionReportTool(BaseTool):
+    @property
+    def name(self) -> str:
+        return "completion_report"
+
+    @property
+    def description(self) -> str:
+        return (
+            "Finalize a non-manual session after you have finished the work. "
+            "Use this exactly once at the end of the flow. "
+            "Summarize what you did, state whether the user should be notified, "
+            "and provide any email/callback content that should be used."
+        )
+
+    @property
+    def parameters(self) -> dict[str, Any]:
+        schema = CompletionReportInput.model_json_schema()
+        return {
+            "type": "object",
+            "properties": schema.get("properties", {}),
+            "required": [
+                "thoughts",
+                "should_notify_user",
+                "email_title",
+                "email_body",
+                "callback_session_message",
+                "approval_summary",
+            ],
+        }
+
+    async def _execute(
+        self,
+        user_id: str | None,
+        session: ChatSession,
+        **kwargs,
+    ) -> ToolResponseBase:
+        if session.is_manual:
+            return ErrorResponse(
+                message="completion_report is only available in non-manual sessions.",
+                session_id=session.session_id,
+            )
+
+        try:
+            report = CompletionReportInput.model_validate(kwargs)
+        except Exception as exc:
+            return ErrorResponse(
+                message="completion_report arguments are invalid.",
+                error=str(exc),
+                session_id=session.session_id,
+            )
+
+        pending_approval_count = await review_db().count_pending_reviews_for_graph_exec(
+            f"{COPILOT_SESSION_PREFIX}{session.session_id}",
+            session.user_id,
+        )
+
+        if pending_approval_count > 0 and not report.approval_summary:
+            return ErrorResponse(
+                message=(
+                    "approval_summary is required because this session has pending approvals."
+                ),
+                session_id=session.session_id,
+            )
+
+        return CompletionReportSavedResponse(
+            message="Completion report recorded successfully.",
+            session_id=session.session_id,
+            has_pending_approvals=pending_approval_count > 0,
+            pending_approval_count=pending_approval_count,
+        )
--- a/autogpt_platform/backend/backend/copilot/tools/completion_report_test.py
+++ b/autogpt_platform/backend/backend/copilot/tools/completion_report_test.py
@@ -0,0 +1,95 @@
+from typing import cast
+from unittest.mock import AsyncMock, Mock
+
+import pytest
+
+from backend.copilot.model import ChatSession
+from backend.copilot.session_types import ChatSessionStartType
+from backend.copilot.tools.completion_report import CompletionReportTool
+from backend.copilot.tools.models import CompletionReportSavedResponse, ResponseType
+
+
+@pytest.mark.asyncio
+async def test_completion_report_rejects_manual_sessions() -> None:
+    tool = CompletionReportTool()
+    session = ChatSession.new("user-1")
+
+    response = await tool._execute(
+        user_id="user-1",
+        session=session,
+        thoughts="Wrapped up the session.",
+        should_notify_user=False,
+        email_title=None,
+        email_body=None,
+        callback_session_message=None,
+        approval_summary=None,
+    )
+
+    assert response.type == ResponseType.ERROR
+    assert "non-manual sessions" in response.message
+
+
+@pytest.mark.asyncio
+async def test_completion_report_requires_approval_summary_when_pending(
+    mocker,
+) -> None:
+    tool = CompletionReportTool()
+    session = ChatSession.new(
+        "user-1",
+        start_type=ChatSessionStartType.AUTOPILOT_NIGHTLY,
+    )
+
+    review_store = Mock()
+    review_store.count_pending_reviews_for_graph_exec = AsyncMock(return_value=2)
+    mocker.patch(
+        "backend.copilot.tools.completion_report.review_db",
+        return_value=review_store,
+    )
+
+    response = await tool._execute(
+        user_id="user-1",
+        session=session,
+        thoughts="Prepared a recommendation for the user.",
+        should_notify_user=True,
+        email_title="Your nightly update",
+        email_body="I found something worth reviewing.",
+        callback_session_message="Let's review the next step together.",
+        approval_summary=None,
+    )
+
+    assert response.type == ResponseType.ERROR
+    assert "approval_summary is required" in response.message
+
+
+@pytest.mark.asyncio
+async def test_completion_report_succeeds_without_pending_approvals(
+    mocker,
+) -> None:
+    tool = CompletionReportTool()
+    session = ChatSession.new(
+        "user-1",
+        start_type=ChatSessionStartType.AUTOPILOT_CALLBACK,
+    )
+
+    review_store = Mock()
+    review_store.count_pending_reviews_for_graph_exec = AsyncMock(return_value=0)
+    mocker.patch(
+        "backend.copilot.tools.completion_report.review_db",
+        return_value=review_store,
+    )
+
+    response = await tool._execute(
+        user_id="user-1",
+        session=session,
+        thoughts="Reviewed the account and prepared a useful follow-up.",
+        should_notify_user=True,
+        email_title="Autopilot found something useful",
+        email_body="I put together a recommendation for you.",
+        callback_session_message="Open this chat and I will walk you through it.",
+        approval_summary=None,
+    )
+
+    assert response.type == ResponseType.COMPLETION_REPORT_SAVED
+    response = cast(CompletionReportSavedResponse, response)
+    assert response.has_pending_approvals is False
+    assert response.pending_approval_count == 0
--- a/autogpt_platform/backend/backend/copilot/tools/e2b_sandbox.py
+++ b/autogpt_platform/backend/backend/copilot/tools/e2b_sandbox.py
@@ -21,9 +21,11 @@ Lifecycle
 Cost control
 ------------
 Sandboxes are created with a configurable ``on_timeout`` lifecycle action
-(default: ``"pause"``).  The explicit per-turn ``pause_sandbox()`` call is the
-primary mechanism; the lifecycle setting is a safety net.  Paused sandboxes are
-free.
+(default: ``"pause"``) and ``auto_resume`` (default: ``True``).  The explicit
+per-turn ``pause_sandbox()`` call is the primary mechanism; the lifecycle
+timeout is a safety net (default: 5 min).  ``auto_resume`` ensures that paused
+sandboxes wake transparently on SDK activity, making the aggressive safety-net
+timeout safe.  Paused sandboxes are free.

 The sandbox_id is stored in Redis.  The same key doubles as a creation lock:
 a ``"creating"`` sentinel value is written with a short TTL while a new sandbox
@@ -40,6 +42,7 @@ import logging
 from typing import Any, Awaitable, Callable, Literal

 from e2b import AsyncSandbox
+from e2b.sandbox.sandbox_api import SandboxLifecycle

 from backend.data.redis_client import get_redis_async

@@ -116,9 +119,10 @@ async def get_or_create_sandbox(
    removes the need for a separate lock key.

    *timeout* controls how long the e2b sandbox may run continuously before
-    the ``on_timeout`` lifecycle rule fires (default: 3 h).
+    the ``on_timeout`` lifecycle rule fires (default: 5 min).
    *on_timeout* controls what happens on timeout: ``"pause"`` (default, free)
-    or ``"kill"``.
+    or ``"kill"``.  When ``"pause"``, ``auto_resume`` is enabled so paused
+    sandboxes wake transparently on SDK activity.
    """
    redis = await get_redis_async()
    key = _sandbox_key(session_id)
@@ -156,11 +160,15 @@ async def get_or_create_sandbox(

        # We hold the slot — create the sandbox.
        try:
+            lifecycle = SandboxLifecycle(
+                on_timeout=on_timeout,
+                auto_resume=on_timeout == "pause",
+            )
            sandbox = await AsyncSandbox.create(
                template=template,
                api_key=api_key,
                timeout=timeout,
-                lifecycle={"on_timeout": on_timeout},
+                lifecycle=lifecycle,
            )
            try:
                await _set_stored_sandbox_id(session_id, sandbox.sandbox_id)
--- a/autogpt_platform/backend/backend/copilot/tools/e2b_sandbox_test.py
+++ b/autogpt_platform/backend/backend/copilot/tools/e2b_sandbox_test.py
@@ -157,14 +157,17 @@ class TestGetOrCreateSandbox:

        assert result is new_sb
        mock_cls.create.assert_awaited_once()
-        # Verify lifecycle param is set
+        # Verify lifecycle: pause + auto_resume enabled
        _, kwargs = mock_cls.create.call_args
-        assert kwargs.get("lifecycle") == {"on_timeout": "pause"}
+        assert kwargs.get("lifecycle") == {
+            "on_timeout": "pause",
+            "auto_resume": True,
+        }
        # sandbox_id should be saved to Redis
        redis.set.assert_awaited()

    def test_create_with_on_timeout_kill(self):
-        """on_timeout='kill' is passed through to AsyncSandbox.create."""
+        """on_timeout='kill' disables auto_resume automatically."""
        new_sb = _mock_sandbox("sb-new")
        redis = _mock_redis(set_nx_result=True, stored_sandbox_id=None)
        with (
@@ -179,7 +182,10 @@ class TestGetOrCreateSandbox:
            )

        _, kwargs = mock_cls.create.call_args
-        assert kwargs.get("lifecycle") == {"on_timeout": "kill"}
+        assert kwargs.get("lifecycle") == {
+            "on_timeout": "kill",
+            "auto_resume": False,
+        }

    def test_create_failure_releases_slot(self):
        """If sandbox creation fails, the Redis creation slot is deleted."""
--- a/autogpt_platform/backend/backend/copilot/tools/helpers.py
+++ b/autogpt_platform/backend/backend/copilot/tools/helpers.py
@@ -13,6 +13,7 @@ from backend.data.execution import ExecutionContext
 from backend.data.model import CredentialsFieldInfo, CredentialsMetaInput
 from backend.integrations.creds_manager import IntegrationCredentialsManager
 from backend.util.exceptions import BlockError
+from backend.util.type import coerce_inputs_to_schema

 from .models import BlockOutputResponse, ErrorResponse, ToolResponseBase
 from .utils import match_credentials_to_requirements
@@ -111,6 +112,9 @@ async def execute_block(
                    session_id=session_id,
                )

+        # Coerce non-matching data types to the expected input schema.
+        coerce_inputs_to_schema(input_data, block.input_schema)
+
        # Execute the block and collect outputs
        outputs: dict[str, list[Any]] = defaultdict(list)
        async for output_name, output_data in block.execute(
--- a/autogpt_platform/backend/backend/copilot/tools/helpers_test.py
+++ b/autogpt_platform/backend/backend/copilot/tools/helpers_test.py
@@ -0,0 +1,333 @@
+"""Tests for execute_block type coercion in helpers.py.
+
+Verifies that execute_block() coerces string input values to match the block's
+expected input types, mirroring the executor's validate_exec() logic.
+This is critical for @@agptfile: expansion, where file content is always a string
+but the block may expect structured types (e.g. list[list[str]]).
+"""
+
+from typing import Any
+from unittest.mock import AsyncMock, MagicMock, patch
+
+import pytest
+
+from backend.copilot.tools.helpers import execute_block
+from backend.copilot.tools.models import BlockOutputResponse
+
+
+def _make_block_schema(annotations: dict[str, Any]) -> MagicMock:
+    """Create a mock input_schema with model_fields matching the given annotations."""
+    schema = MagicMock()
+    # coerce_inputs_to_schema uses model_fields (Pydantic v2 API)
+    model_fields = {}
+    for name, ann in annotations.items():
+        field = MagicMock()
+        field.annotation = ann
+        model_fields[name] = field
+    schema.model_fields = model_fields
+    return schema
+
+
+def _make_block(
+    block_id: str,
+    name: str,
+    annotations: dict[str, Any],
+    outputs: dict[str, list[Any]] | None = None,
+) -> MagicMock:
+    """Create a mock block with typed annotations and a simple execute method."""
+    block = MagicMock()
+    block.id = block_id
+    block.name = name
+    block.input_schema = _make_block_schema(annotations)
+
+    captured_inputs: dict[str, Any] = {}
+
+    async def mock_execute(input_data: dict, **_kwargs: Any):
+        captured_inputs.update(input_data)
+        for output_name, values in (outputs or {"result": ["ok"]}).items():
+            for v in values:
+                yield output_name, v
+
+    block.execute = mock_execute
+    block._captured_inputs = captured_inputs
+    return block
+
+
+_TEST_SESSION_ID = "test-session-coerce"
+_TEST_USER_ID = "test-user-coerce"
+
+
+@pytest.mark.asyncio(loop_scope="session")
+async def test_coerce_json_string_to_nested_list():
+    """JSON string → list[list[str]] (Google Sheets CSV import case)."""
+    block = _make_block(
+        "sheets-write",
+        "Google Sheets Write",
+        {"values": list[list[str]], "spreadsheet_id": str},
+    )
+
+    mock_workspace_db = MagicMock()
+    mock_workspace_db.get_or_create_workspace = AsyncMock(
+        return_value=MagicMock(id="ws-1")
+    )
+
+    with patch(
+        "backend.copilot.tools.helpers.workspace_db",
+        return_value=mock_workspace_db,
+    ):
+        response = await execute_block(
+            block=block,
+            block_id="sheets-write",
+            input_data={
+                "values": '[["Name","Score"],["Alice","90"],["Bob","85"]]',
+                "spreadsheet_id": "abc123",
+            },
+            user_id=_TEST_USER_ID,
+            session_id=_TEST_SESSION_ID,
+            node_exec_id="exec-1",
+            matched_credentials={},
+        )
+
+    assert isinstance(response, BlockOutputResponse)
+    assert response.success is True
+    # Verify the input was coerced from string to list[list[str]]
+    assert block._captured_inputs["values"] == [
+        ["Name", "Score"],
+        ["Alice", "90"],
+        ["Bob", "85"],
+    ]
+    assert isinstance(block._captured_inputs["values"], list)
+    assert isinstance(block._captured_inputs["values"][0], list)
+
+
+@pytest.mark.asyncio(loop_scope="session")
+async def test_coerce_json_string_to_list():
+    """JSON string → list[str]."""
+    block = _make_block(
+        "list-block",
+        "List Block",
+        {"items": list[str]},
+    )
+
+    mock_workspace_db = MagicMock()
+    mock_workspace_db.get_or_create_workspace = AsyncMock(
+        return_value=MagicMock(id="ws-1")
+    )
+
+    with patch(
+        "backend.copilot.tools.helpers.workspace_db",
+        return_value=mock_workspace_db,
+    ):
+        response = await execute_block(
+            block=block,
+            block_id="list-block",
+            input_data={"items": '["a","b","c"]'},
+            user_id=_TEST_USER_ID,
+            session_id=_TEST_SESSION_ID,
+            node_exec_id="exec-2",
+            matched_credentials={},
+        )
+
+    assert isinstance(response, BlockOutputResponse)
+    assert block._captured_inputs["items"] == ["a", "b", "c"]
+
+
+@pytest.mark.asyncio(loop_scope="session")
+async def test_coerce_json_string_to_dict():
+    """JSON string → dict[str, str]."""
+    block = _make_block(
+        "dict-block",
+        "Dict Block",
+        {"config": dict[str, str]},
+    )
+
+    mock_workspace_db = MagicMock()
+    mock_workspace_db.get_or_create_workspace = AsyncMock(
+        return_value=MagicMock(id="ws-1")
+    )
+
+    with patch(
+        "backend.copilot.tools.helpers.workspace_db",
+        return_value=mock_workspace_db,
+    ):
+        response = await execute_block(
+            block=block,
+            block_id="dict-block",
+            input_data={"config": '{"key": "value", "foo": "bar"}'},
+            user_id=_TEST_USER_ID,
+            session_id=_TEST_SESSION_ID,
+            node_exec_id="exec-3",
+            matched_credentials={},
+        )
+
+    assert isinstance(response, BlockOutputResponse)
+    assert block._captured_inputs["config"] == {"key": "value", "foo": "bar"}
+
+
+@pytest.mark.asyncio(loop_scope="session")
+async def test_no_coercion_when_type_matches():
+    """Already-correct types pass through without coercion."""
+    block = _make_block(
+        "pass-through",
+        "Pass Through",
+        {"values": list[list[str]], "name": str},
+    )
+
+    original_values = [["a", "b"], ["c", "d"]]
+    mock_workspace_db = MagicMock()
+    mock_workspace_db.get_or_create_workspace = AsyncMock(
+        return_value=MagicMock(id="ws-1")
+    )
+
+    with patch(
+        "backend.copilot.tools.helpers.workspace_db",
+        return_value=mock_workspace_db,
+    ):
+        response = await execute_block(
+            block=block,
+            block_id="pass-through",
+            input_data={"values": original_values, "name": "test"},
+            user_id=_TEST_USER_ID,
+            session_id=_TEST_SESSION_ID,
+            node_exec_id="exec-4",
+            matched_credentials={},
+        )
+
+    assert isinstance(response, BlockOutputResponse)
+    assert block._captured_inputs["values"] == original_values
+    assert block._captured_inputs["name"] == "test"
+
+
+@pytest.mark.asyncio(loop_scope="session")
+async def test_coerce_string_to_int():
+    """String number → int."""
+    block = _make_block(
+        "int-block",
+        "Int Block",
+        {"count": int},
+    )
+
+    mock_workspace_db = MagicMock()
+    mock_workspace_db.get_or_create_workspace = AsyncMock(
+        return_value=MagicMock(id="ws-1")
+    )
+
+    with patch(
+        "backend.copilot.tools.helpers.workspace_db",
+        return_value=mock_workspace_db,
+    ):
+        response = await execute_block(
+            block=block,
+            block_id="int-block",
+            input_data={"count": "42"},
+            user_id=_TEST_USER_ID,
+            session_id=_TEST_SESSION_ID,
+            node_exec_id="exec-5",
+            matched_credentials={},
+        )
+
+    assert isinstance(response, BlockOutputResponse)
+    assert block._captured_inputs["count"] == 42
+    assert isinstance(block._captured_inputs["count"], int)
+
+
+@pytest.mark.asyncio(loop_scope="session")
+async def test_coerce_skips_none_values():
+    """None values are not coerced (they may be optional fields)."""
+    block = _make_block(
+        "optional-block",
+        "Optional Block",
+        {"data": list[str], "label": str},
+    )
+
+    mock_workspace_db = MagicMock()
+    mock_workspace_db.get_or_create_workspace = AsyncMock(
+        return_value=MagicMock(id="ws-1")
+    )
+
+    with patch(
+        "backend.copilot.tools.helpers.workspace_db",
+        return_value=mock_workspace_db,
+    ):
+        response = await execute_block(
+            block=block,
+            block_id="optional-block",
+            input_data={"label": "test"},
+            user_id=_TEST_USER_ID,
+            session_id=_TEST_SESSION_ID,
+            node_exec_id="exec-6",
+            matched_credentials={},
+        )
+
+    assert isinstance(response, BlockOutputResponse)
+    # 'data' was not provided, so it should not appear in captured inputs
+    assert "data" not in block._captured_inputs
+
+
+@pytest.mark.asyncio(loop_scope="session")
+async def test_coerce_union_type_preserves_valid_member():
+    """Union-typed fields should not be coerced when the value matches a member."""
+    block = _make_block(
+        "union-block",
+        "Union Block",
+        {"content": str | list[str]},
+    )
+
+    mock_workspace_db = MagicMock()
+    mock_workspace_db.get_or_create_workspace = AsyncMock(
+        return_value=MagicMock(id="ws-1")
+    )
+
+    with patch(
+        "backend.copilot.tools.helpers.workspace_db",
+        return_value=mock_workspace_db,
+    ):
+        response = await execute_block(
+            block=block,
+            block_id="union-block",
+            input_data={"content": ["a", "b"]},
+            user_id=_TEST_USER_ID,
+            session_id=_TEST_SESSION_ID,
+            node_exec_id="exec-7",
+            matched_credentials={},
+        )
+
+    assert isinstance(response, BlockOutputResponse)
+    # list[str] should NOT be stringified to '["a", "b"]'
+    assert block._captured_inputs["content"] == ["a", "b"]
+    assert isinstance(block._captured_inputs["content"], list)
+
+
+@pytest.mark.asyncio(loop_scope="session")
+async def test_coerce_inner_elements_of_generic():
+    """Inner elements of generic containers are recursively coerced."""
+    block = _make_block(
+        "inner-coerce",
+        "Inner Coerce",
+        {"values": list[str]},
+    )
+
+    mock_workspace_db = MagicMock()
+    mock_workspace_db.get_or_create_workspace = AsyncMock(
+        return_value=MagicMock(id="ws-1")
+    )
+
+    with patch(
+        "backend.copilot.tools.helpers.workspace_db",
+        return_value=mock_workspace_db,
+    ):
+        response = await execute_block(
+            block=block,
+            block_id="inner-coerce",
+            # Inner elements are ints, but target is list[str]
+            input_data={"values": [1, 2, 3]},
+            user_id=_TEST_USER_ID,
+            session_id=_TEST_SESSION_ID,
+            node_exec_id="exec-8",
+            matched_credentials={},
+        )
+
+    assert isinstance(response, BlockOutputResponse)
+    # Inner elements should be coerced from int to str
+    assert block._captured_inputs["values"] == ["1", "2", "3"]
+    assert all(isinstance(v, str) for v in block._captured_inputs["values"])
--- a/autogpt_platform/backend/backend/copilot/tools/models.py
+++ b/autogpt_platform/backend/backend/copilot/tools/models.py
@@ -16,6 +16,7 @@ class ResponseType(str, Enum):
    ERROR = "error"
    NO_RESULTS = "no_results"
    NEED_LOGIN = "need_login"
+    COMPLETION_REPORT_SAVED = "completion_report_saved"

    # Agent discovery & execution
    AGENTS_FOUND = "agents_found"
@@ -138,7 +139,7 @@ class NoResultsResponse(ToolResponseBase):
    """Response when no agents found."""

    type: ResponseType = ResponseType.NO_RESULTS
-    suggestions: list[str] = []
+    suggestions: list[str] = Field(default_factory=list)
    name: str = "no_results"


@@ -170,8 +171,8 @@ class AgentDetails(BaseModel):
    name: str
    description: str
    in_library: bool = False
-    inputs: dict[str, Any] = {}
-    credentials: list[CredentialsMetaInput] = []
+    inputs: dict[str, Any] = Field(default_factory=dict)
+    credentials: list[CredentialsMetaInput] = Field(default_factory=list)
    execution_options: ExecutionOptions = Field(default_factory=ExecutionOptions)
    trigger_info: dict[str, Any] | None = None

@@ -191,7 +192,7 @@ class UserReadiness(BaseModel):
    """User readiness status."""

    has_all_credentials: bool = False
-    missing_credentials: dict[str, Any] = {}
+    missing_credentials: dict[str, Any] = Field(default_factory=dict)
    ready_to_run: bool = False


@@ -248,6 +249,14 @@ class ErrorResponse(ToolResponseBase):
    details: dict[str, Any] | None = None


+class CompletionReportSavedResponse(ToolResponseBase):
+    """Response for completion_report."""
+
+    type: ResponseType = ResponseType.COMPLETION_REPORT_SAVED
+    has_pending_approvals: bool = False
+    pending_approval_count: int = 0
+
+
 class InputValidationErrorResponse(ToolResponseBase):
    """Response when run_agent receives unknown input fields."""

@@ -436,9 +445,9 @@ class BlockDetails(BaseModel):
    id: str
    name: str
    description: str
-    inputs: dict[str, Any] = {}
-    outputs: dict[str, Any] = {}
-    credentials: list[CredentialsMetaInput] = []
+    inputs: dict[str, Any] = Field(default_factory=dict)
+    outputs: dict[str, Any] = Field(default_factory=dict)
+    credentials: list[CredentialsMetaInput] = Field(default_factory=list)


 class BlockDetailsResponse(ToolResponseBase):
@@ -631,7 +640,7 @@ class FolderInfo(BaseModel):
 class FolderTreeInfo(FolderInfo):
    """Folder with nested children for tree display."""

-    children: list["FolderTreeInfo"] = []
+    children: list["FolderTreeInfo"] = Field(default_factory=list)


 class FolderCreatedResponse(ToolResponseBase):
@@ -678,6 +687,6 @@ class AgentsMovedToFolderResponse(ToolResponseBase):

    type: ResponseType = ResponseType.AGENTS_MOVED_TO_FOLDER
    agent_ids: list[str]
-    agent_names: list[str] = []
+    agent_names: list[str] = Field(default_factory=list)
    folder_id: str | None = None
    count: int = 0
--- a/autogpt_platform/backend/backend/copilot/tools/run_mcp_tool.py
+++ b/autogpt_platform/backend/backend/copilot/tools/run_mcp_tool.py
@@ -184,10 +184,12 @@ class RunMCPToolTool(BaseTool):
            if e.status_code in _AUTH_STATUS_CODES and not creds:
                # Server requires auth and user has no stored credentials
                return self._build_setup_requirements(server_url, session_id)
-            logger.warning("MCP HTTP error for %s: %s", server_host(server_url), e)
+            host = server_host(server_url)
+            logger.warning("MCP HTTP error for %s: status=%s", host, e.status_code)
            return ErrorResponse(
-                message=f"MCP server returned HTTP {e.status_code}: {e}",
+                message=(f"MCP request to {host} failed with HTTP {e.status_code}."),
                session_id=session_id,
+                error=f"HTTP {e.status_code}: {str(e)[:300]}",
            )

        except MCPClientError as e:
--- a/autogpt_platform/backend/backend/copilot/tools/test_run_mcp_tool.py
+++ b/autogpt_platform/backend/backend/copilot/tools/test_run_mcp_tool.py
@@ -580,6 +580,49 @@ async def test_auth_error_with_existing_creds_returns_error():
    assert "403" in response.message


+@pytest.mark.asyncio(loop_scope="session")
+async def test_http_error_returns_clean_message_with_collapsible_detail():
+    """Non-auth HTTP errors return a clean message with raw detail in the `error` field."""
+    from backend.util.request import HTTPClientError
+
+    tool = RunMCPToolTool()
+    session = make_session(_USER_ID)
+
+    with patch(
+        "backend.copilot.tools.run_mcp_tool.validate_url_host", new_callable=AsyncMock
+    ):
+        with patch(
+            "backend.copilot.tools.run_mcp_tool.auto_lookup_mcp_credential",
+            new_callable=AsyncMock,
+            return_value=None,
+        ):
+            mock_client = AsyncMock()
+            mock_client.initialize = AsyncMock(
+                side_effect=HTTPClientError(
+                    "<!doctype html><html><body>Not Found</body></html>",
+                    status_code=404,
+                )
+            )
+            with patch(
+                "backend.copilot.tools.run_mcp_tool.MCPClient",
+                return_value=mock_client,
+            ):
+                response = await tool._execute(
+                    user_id=_USER_ID,
+                    session=session,
+                    server_url=_SERVER_URL,
+                )
+
+    assert isinstance(response, ErrorResponse)
+    assert "404" in response.message
+    # Raw HTML body must NOT leak into the user-facing message
+    assert "<!doctype" not in response.message
+    # Raw detail (including original body) goes in the collapsible `error` field
+    assert response.error is not None
+    assert "404" in response.error
+    assert "<!doctype" in response.error.lower()
+
+
@pytest.mark.asyncio(loop_scope="session")
 async def test_mcp_client_error_returns_error_response():
    """MCPClientError (protocol-level) maps to a clean ErrorResponse."""
--- a/autogpt_platform/backend/backend/data/db_accessors.py
+++ b/autogpt_platform/backend/backend/data/db_accessors.py
@@ -92,6 +92,19 @@ def user_db():
    return user_db


+def invited_user_db():
+    if db.is_connected():
+        from backend.data import invited_user as _invited_user_db
+
+        invited_user_db = _invited_user_db
+    else:
+        from backend.util.clients import get_database_manager_async_client
+
+        invited_user_db = get_database_manager_async_client()
+
+    return invited_user_db
+
+
 def understanding_db():
    if db.is_connected():
        from backend.data import understanding as _understanding_db
--- a/autogpt_platform/backend/backend/data/db_manager.py
+++ b/autogpt_platform/backend/backend/data/db_manager.py
@@ -79,6 +79,7 @@ from backend.data.graph import (
 from backend.data.human_review import (
    cancel_pending_reviews_for_execution,
    check_approval,
+    count_pending_reviews_for_graph_exec,
    delete_review_by_node_exec_id,
    get_or_create_human_review,
    get_pending_reviews_for_execution,
@@ -86,6 +87,7 @@ from backend.data.human_review import (
    has_pending_reviews_for_graph_exec,
    update_review_processed_status,
 )
+from backend.data.invited_user import list_invited_users_for_auth_users
 from backend.data.notifications import (
    clear_all_user_notification_batches,
    create_or_add_to_user_notification_batch,
@@ -107,6 +109,7 @@ from backend.data.user import (
    get_user_email_verification,
    get_user_integrations,
    get_user_notification_preference,
+    list_users,
    update_user_integrations,
 )
 from backend.data.workspace import (
@@ -115,6 +118,7 @@ from backend.data.workspace import (
    get_or_create_workspace,
    get_workspace_file,
    get_workspace_file_by_path,
+    get_workspace_files_by_ids,
    list_workspace_files,
    soft_delete_workspace_file,
 )
@@ -237,6 +241,7 @@ class DatabaseManager(AppService):

    # ============ User + Integrations ============ #
    get_user_by_id = _(get_user_by_id)
+    list_users = _(list_users)
    get_user_integrations = _(get_user_integrations)
    update_user_integrations = _(update_user_integrations)

@@ -249,6 +254,7 @@ class DatabaseManager(AppService):
    # ============ Human In The Loop ============ #
    cancel_pending_reviews_for_execution = _(cancel_pending_reviews_for_execution)
    check_approval = _(check_approval)
+    count_pending_reviews_for_graph_exec = _(count_pending_reviews_for_graph_exec)
    delete_review_by_node_exec_id = _(delete_review_by_node_exec_id)
    get_or_create_human_review = _(get_or_create_human_review)
    get_pending_reviews_for_execution = _(get_pending_reviews_for_execution)
@@ -313,12 +319,16 @@ class DatabaseManager(AppService):
    # ============ Workspace ============ #
    count_workspace_files = _(count_workspace_files)
    create_workspace_file = _(create_workspace_file)
+    get_workspace_files_by_ids = _(get_workspace_files_by_ids)
    get_or_create_workspace = _(get_or_create_workspace)
    get_workspace_file = _(get_workspace_file)
    get_workspace_file_by_path = _(get_workspace_file_by_path)
    list_workspace_files = _(list_workspace_files)
    soft_delete_workspace_file = _(soft_delete_workspace_file)

+    # ============ Invited Users ============ #
+    list_invited_users_for_auth_users = _(list_invited_users_for_auth_users)
+
    # ============ Understanding ============ #
    get_business_understanding = _(get_business_understanding)
    upsert_business_understanding = _(upsert_business_understanding)
@@ -328,8 +338,28 @@ class DatabaseManager(AppService):
    update_block_optimized_description = _(update_block_optimized_description)

    # ============ CoPilot Chat Sessions ============ #
+    get_chat_messages_since = _(chat_db.get_chat_messages_since)
+    get_chat_session_callback_token = _(chat_db.get_chat_session_callback_token)
    get_chat_session = _(chat_db.get_chat_session)
+    create_chat_session_callback_token = _(chat_db.create_chat_session_callback_token)
    create_chat_session = _(chat_db.create_chat_session)
+    get_manual_chat_sessions_since = _(chat_db.get_manual_chat_sessions_since)
+    get_pending_notification_chat_sessions = _(
+        chat_db.get_pending_notification_chat_sessions
+    )
+    get_pending_notification_chat_sessions_for_user = _(
+        chat_db.get_pending_notification_chat_sessions_for_user
+    )
+    get_recent_completion_report_chat_sessions = _(
+        chat_db.get_recent_completion_report_chat_sessions
+    )
+    get_recent_sent_email_chat_sessions = _(chat_db.get_recent_sent_email_chat_sessions)
+    has_recent_manual_message = _(chat_db.has_recent_manual_message)
+    has_session_since = _(chat_db.has_session_since)
+    mark_chat_session_callback_token_consumed = _(
+        chat_db.mark_chat_session_callback_token_consumed
+    )
+    session_exists_for_execution_tag = _(chat_db.session_exists_for_execution_tag)
    update_chat_session = _(chat_db.update_chat_session)
    add_chat_message = _(chat_db.add_chat_message)
    add_chat_messages_batch = _(chat_db.add_chat_messages_batch)
@@ -374,10 +404,18 @@ class DatabaseManagerClient(AppServiceClient):
    get_marketplace_graphs_for_monitoring = _(d.get_marketplace_graphs_for_monitoring)

    # Human In The Loop
+    count_pending_reviews_for_graph_exec = _(d.count_pending_reviews_for_graph_exec)
    has_pending_reviews_for_graph_exec = _(d.has_pending_reviews_for_graph_exec)

    # User Emails
    get_user_email_by_id = _(d.get_user_email_by_id)
+    list_users = _(d.list_users)
+
+    # CoPilot Chat Sessions
+    get_recent_completion_report_chat_sessions = _(
+        d.get_recent_completion_report_chat_sessions
+    )
+    get_recent_sent_email_chat_sessions = _(d.get_recent_sent_email_chat_sessions)

    # Library
    list_library_agents = _(d.list_library_agents)
@@ -433,12 +471,14 @@ class DatabaseManagerAsyncClient(AppServiceClient):

    # ============ User + Integrations ============ #
    get_user_by_id = d.get_user_by_id
+    list_users = d.list_users
    get_user_integrations = d.get_user_integrations
    update_user_integrations = d.update_user_integrations

    # ============ Human In The Loop ============ #
    cancel_pending_reviews_for_execution = d.cancel_pending_reviews_for_execution
    check_approval = d.check_approval
+    count_pending_reviews_for_graph_exec = d.count_pending_reviews_for_graph_exec
    delete_review_by_node_exec_id = d.delete_review_by_node_exec_id
    get_or_create_human_review = d.get_or_create_human_review
    get_pending_reviews_for_execution = d.get_pending_reviews_for_execution
@@ -506,12 +546,16 @@ class DatabaseManagerAsyncClient(AppServiceClient):
    # ============ Workspace ============ #
    count_workspace_files = d.count_workspace_files
    create_workspace_file = d.create_workspace_file
+    get_workspace_files_by_ids = d.get_workspace_files_by_ids
    get_or_create_workspace = d.get_or_create_workspace
    get_workspace_file = d.get_workspace_file
    get_workspace_file_by_path = d.get_workspace_file_by_path
    list_workspace_files = d.list_workspace_files
    soft_delete_workspace_file = d.soft_delete_workspace_file

+    # ============ Invited Users ============ #
+    list_invited_users_for_auth_users = d.list_invited_users_for_auth_users
+
    # ============ Understanding ============ #
    get_business_understanding = d.get_business_understanding
    upsert_business_understanding = d.upsert_business_understanding
@@ -520,8 +564,26 @@ class DatabaseManagerAsyncClient(AppServiceClient):
    get_blocks_needing_optimization = d.get_blocks_needing_optimization

    # ============ CoPilot Chat Sessions ============ #
+    get_chat_messages_since = d.get_chat_messages_since
+    get_chat_session_callback_token = d.get_chat_session_callback_token
    get_chat_session = d.get_chat_session
+    create_chat_session_callback_token = d.create_chat_session_callback_token
    create_chat_session = d.create_chat_session
+    get_manual_chat_sessions_since = d.get_manual_chat_sessions_since
+    get_pending_notification_chat_sessions = d.get_pending_notification_chat_sessions
+    get_pending_notification_chat_sessions_for_user = (
+        d.get_pending_notification_chat_sessions_for_user
+    )
+    get_recent_completion_report_chat_sessions = (
+        d.get_recent_completion_report_chat_sessions
+    )
+    get_recent_sent_email_chat_sessions = d.get_recent_sent_email_chat_sessions
+    has_recent_manual_message = d.has_recent_manual_message
+    has_session_since = d.has_session_since
+    mark_chat_session_callback_token_consumed = (
+        d.mark_chat_session_callback_token_consumed
+    )
+    session_exists_for_execution_tag = d.session_exists_for_execution_tag
    update_chat_session = d.update_chat_session
    add_chat_message = d.add_chat_message
    add_chat_messages_batch = d.add_chat_messages_batch
--- a/autogpt_platform/backend/backend/data/human_review.py
+++ b/autogpt_platform/backend/backend/data/human_review.py
@@ -342,6 +342,19 @@ async def has_pending_reviews_for_graph_exec(graph_exec_id: str) -> bool:
    return count > 0


+async def count_pending_reviews_for_graph_exec(
+    graph_exec_id: str,
+    user_id: str,
+) -> int:
+    return await PendingHumanReview.prisma().count(
+        where={
+            "userId": user_id,
+            "graphExecId": graph_exec_id,
+            "status": ReviewStatus.WAITING,
+        }
+    )
+
+
 async def _resolve_node_id(node_exec_id: str, get_node_execution) -> str:
    """Resolve node_id from a node_exec_id.

--- a/autogpt_platform/backend/backend/data/invited_user.py
+++ b/autogpt_platform/backend/backend/data/invited_user.py
@@ -0,0 +1,754 @@
+import asyncio
+import csv
+import io
+import logging
+import os
+import re
+import socket
+from dataclasses import dataclass
+from datetime import datetime, timezone
+from typing import Any, Literal, Optional
+from uuid import uuid4
+
+import prisma.enums
+import prisma.models
+import prisma.types
+from prisma.errors import UniqueViolationError
+from pydantic import BaseModel, EmailStr, TypeAdapter, ValidationError
+
+from backend.data.db import transaction
+from backend.data.model import User
+from backend.data.redis_client import get_redis_async
+from backend.data.tally import get_business_understanding_input_from_tally, mask_email
+from backend.data.understanding import (
+    merge_business_understanding_data,
+    parse_business_understanding_input,
+)
+from backend.data.user import get_user_by_email, get_user_by_id
+from backend.executor.cluster_lock import AsyncClusterLock
+from backend.util.exceptions import (
+    NotAuthorizedError,
+    NotFoundError,
+    PreconditionFailed,
+)
+from backend.util.json import SafeJson
+from backend.util.settings import Settings
+
+logger = logging.getLogger(__name__)
+_settings = Settings()
+
+_WORKER_ID = f"{socket.gethostname()}:{os.getpid()}"
+
+_tally_seed_tasks: dict[str, asyncio.Task] = {}
+_TALLY_STALE_SECONDS = 300
+_MAX_TALLY_ERROR_LENGTH = 200
+_email_adapter = TypeAdapter(EmailStr)
+
+MAX_BULK_INVITE_FILE_BYTES = 1024 * 1024
+MAX_BULK_INVITE_ROWS = 500
+
+
+class InvitedUserRecord(BaseModel):
+    id: str
+    email: str
+    status: prisma.enums.InvitedUserStatus
+    auth_user_id: Optional[str] = None
+    name: Optional[str] = None
+    tally_understanding: Optional[dict[str, Any]] = None
+    tally_status: prisma.enums.TallyComputationStatus
+    tally_computed_at: Optional[datetime] = None
+    tally_error: Optional[str] = None
+    created_at: datetime
+    updated_at: datetime
+
+    @classmethod
+    def from_db(cls, invited_user: "prisma.models.InvitedUser") -> "InvitedUserRecord":
+        payload = parse_business_understanding_input(invited_user.tallyUnderstanding)
+        return cls(
+            id=invited_user.id,
+            email=invited_user.email,
+            status=invited_user.status,
+            auth_user_id=invited_user.authUserId,
+            name=invited_user.name,
+            tally_understanding=(
+                payload.model_dump(mode="json") if payload is not None else None
+            ),
+            tally_status=invited_user.tallyStatus,
+            tally_computed_at=invited_user.tallyComputedAt,
+            tally_error=invited_user.tallyError,
+            created_at=invited_user.createdAt,
+            updated_at=invited_user.updatedAt,
+        )
+
+
+class BulkInvitedUserRowResult(BaseModel):
+    row_number: int
+    email: Optional[str] = None
+    name: Optional[str] = None
+    status: Literal["CREATED", "SKIPPED", "ERROR"]
+    message: str
+    invited_user: Optional[InvitedUserRecord] = None
+
+
+class BulkInvitedUsersResult(BaseModel):
+    created_count: int
+    skipped_count: int
+    error_count: int
+    results: list[BulkInvitedUserRowResult]
+
+
+@dataclass(frozen=True)
+class _ParsedInviteRow:
+    row_number: int
+    email: str
+    name: Optional[str]
+
+
+def normalize_email(email: str) -> str:
+    return email.strip().lower()
+
+
+def _normalize_name(name: Optional[str]) -> Optional[str]:
+    if name is None:
+        return None
+    normalized = name.strip()
+    return normalized or None
+
+
+def _default_profile_name(email: str, preferred_name: Optional[str]) -> str:
+    if preferred_name:
+        return preferred_name
+    local_part = email.split("@", 1)[0].strip()
+    return local_part or "user"
+
+
+def _sanitize_username_base(email: str) -> str:
+    local_part = email.split("@", 1)[0].lower()
+    sanitized = re.sub(r"[^a-z0-9-]", "", local_part)
+    sanitized = sanitized.strip("-")
+    return sanitized[:40] or "user"
+
+
+async def _generate_unique_profile_username(email: str, tx) -> str:
+    base = _sanitize_username_base(email)
+
+    for _ in range(2):
+        candidate = f"{base}-{uuid4().hex[:6]}"
+        existing = await prisma.models.Profile.prisma(tx).find_unique(
+            where={"username": candidate}
+        )
+        if existing is None:
+            return candidate
+
+    raise RuntimeError(f"Unable to generate unique username for {email}")
+
+
+async def _ensure_default_profile(
+    user_id: str,
+    email: str,
+    preferred_name: Optional[str],
+    tx,
+) -> None:
+    existing_profile = await prisma.models.Profile.prisma(tx).find_unique(
+        where={"userId": user_id}
+    )
+    if existing_profile is not None:
+        return
+
+    username = await _generate_unique_profile_username(email, tx)
+    await prisma.models.Profile.prisma(tx).create(
+        data=prisma.types.ProfileCreateInput(
+            userId=user_id,
+            name=_default_profile_name(email, preferred_name),
+            username=username,
+            description="I'm new here",
+            links=[],
+            avatarUrl="",
+        )
+    )
+
+
+async def _ensure_default_onboarding(user_id: str, tx) -> None:
+    await prisma.models.UserOnboarding.prisma(tx).upsert(
+        where={"userId": user_id},
+        data={
+            "create": prisma.types.UserOnboardingCreateInput(userId=user_id),
+            "update": {},
+        },
+    )
+
+
+async def _apply_tally_understanding(
+    user_id: str,
+    invited_user: "prisma.models.InvitedUser",
+    tx,
+) -> None:
+    input_data = parse_business_understanding_input(invited_user.tallyUnderstanding)
+    if input_data is None:
+        if invited_user.tallyUnderstanding is not None:
+            logger.warning(
+                "Malformed tallyUnderstanding for invited user %s; skipping",
+                invited_user.id,
+            )
+        return
+
+    payload = merge_business_understanding_data({}, input_data)
+    await prisma.models.CoPilotUnderstanding.prisma(tx).upsert(
+        where={"userId": user_id},
+        data={
+            "create": {"userId": user_id, "data": SafeJson(payload)},
+            "update": {"data": SafeJson(payload)},
+        },
+    )
+
+
+async def list_invited_users(
+    page: int = 1,
+    page_size: int = 50,
+) -> tuple[list[InvitedUserRecord], int]:
+    total = await prisma.models.InvitedUser.prisma().count()
+    invited_users = await prisma.models.InvitedUser.prisma().find_many(
+        order={"createdAt": "desc"},
+        skip=(page - 1) * page_size,
+        take=page_size,
+    )
+    return [InvitedUserRecord.from_db(iu) for iu in invited_users], total
+
+
+async def list_invited_users_for_auth_users(
+    auth_user_ids: list[str],
+) -> list[InvitedUserRecord]:
+    if not auth_user_ids:
+        return []
+
+    invited_users = await prisma.models.InvitedUser.prisma().find_many(
+        where={"authUserId": {"in": auth_user_ids}}
+    )
+    return [InvitedUserRecord.from_db(invited_user) for invited_user in invited_users]
+
+
+async def create_invited_user(
+    email: str, name: Optional[str] = None
+) -> InvitedUserRecord:
+    normalized_email = normalize_email(email)
+    normalized_name = _normalize_name(name)
+
+    existing_user = await prisma.models.User.prisma().find_unique(
+        where={"email": normalized_email}
+    )
+    if existing_user is not None:
+        raise PreconditionFailed("An active user with this email already exists")
+
+    existing_invited_user = await prisma.models.InvitedUser.prisma().find_unique(
+        where={"email": normalized_email}
+    )
+    if existing_invited_user is not None:
+        raise PreconditionFailed("An invited user with this email already exists")
+
+    try:
+        invited_user = await prisma.models.InvitedUser.prisma().create(
+            data={
+                "email": normalized_email,
+                "name": normalized_name,
+                "status": prisma.enums.InvitedUserStatus.INVITED,
+                "tallyStatus": prisma.enums.TallyComputationStatus.PENDING,
+            }
+        )
+    except UniqueViolationError:
+        raise PreconditionFailed("An invited user with this email already exists")
+    schedule_invited_user_tally_precompute(invited_user.id)
+    return InvitedUserRecord.from_db(invited_user)
+
+
+async def revoke_invited_user(invited_user_id: str) -> InvitedUserRecord:
+    invited_user = await prisma.models.InvitedUser.prisma().find_unique(
+        where={"id": invited_user_id}
+    )
+    if invited_user is None:
+        raise NotFoundError(f"Invited user {invited_user_id} not found")
+
+    if invited_user.status == prisma.enums.InvitedUserStatus.CLAIMED:
+        raise PreconditionFailed("Claimed invited users cannot be revoked")
+
+    if invited_user.status == prisma.enums.InvitedUserStatus.REVOKED:
+        return InvitedUserRecord.from_db(invited_user)
+
+    revoked_user = await prisma.models.InvitedUser.prisma().update(
+        where={"id": invited_user_id},
+        data={"status": prisma.enums.InvitedUserStatus.REVOKED},
+    )
+    if revoked_user is None:
+        raise NotFoundError(f"Invited user {invited_user_id} not found")
+    return InvitedUserRecord.from_db(revoked_user)
+
+
+async def retry_invited_user_tally(invited_user_id: str) -> InvitedUserRecord:
+    invited_user = await prisma.models.InvitedUser.prisma().find_unique(
+        where={"id": invited_user_id}
+    )
+    if invited_user is None:
+        raise NotFoundError(f"Invited user {invited_user_id} not found")
+
+    if invited_user.status == prisma.enums.InvitedUserStatus.REVOKED:
+        raise PreconditionFailed("Revoked invited users cannot retry Tally seeding")
+
+    refreshed_user = await prisma.models.InvitedUser.prisma().update(
+        where={"id": invited_user_id},
+        data={
+            "tallyUnderstanding": None,
+            "tallyStatus": prisma.enums.TallyComputationStatus.PENDING,
+            "tallyComputedAt": None,
+            "tallyError": None,
+        },
+    )
+    if refreshed_user is None:
+        raise NotFoundError(f"Invited user {invited_user_id} not found")
+    schedule_invited_user_tally_precompute(invited_user_id)
+    return InvitedUserRecord.from_db(refreshed_user)
+
+
+def _decode_bulk_invite_file(content: bytes) -> str:
+    if len(content) > MAX_BULK_INVITE_FILE_BYTES:
+        raise ValueError("Invite file exceeds the maximum size of 1 MB")
+
+    try:
+        return content.decode("utf-8-sig")
+    except UnicodeDecodeError as exc:
+        raise ValueError("Invite file must be UTF-8 encoded") from exc
+
+
+def _parse_bulk_invite_csv(text: str) -> list[_ParsedInviteRow]:
+    indexed_rows: list[tuple[int, list[str]]] = []
+
+    for row_number, row in enumerate(csv.reader(io.StringIO(text)), start=1):
+        normalized_row = [cell.strip() for cell in row]
+        if any(normalized_row):
+            indexed_rows.append((row_number, normalized_row))
+
+    if not indexed_rows:
+        return []
+
+    header = [cell.lower() for cell in indexed_rows[0][1]]
+    has_header = "email" in header
+    email_index = header.index("email") if has_header else 0
+    name_index: Optional[int] = (
+        header.index("name")
+        if has_header and "name" in header
+        else (1 if not has_header else None)
+    )
+    data_rows = indexed_rows[1:] if has_header else indexed_rows
+
+    parsed_rows: list[_ParsedInviteRow] = []
+    for row_number, row in data_rows:
+        if len(parsed_rows) >= MAX_BULK_INVITE_ROWS:
+            break
+        email = row[email_index].strip() if len(row) > email_index else ""
+        name = (
+            row[name_index].strip()
+            if name_index is not None and len(row) > name_index
+            else ""
+        )
+        parsed_rows.append(
+            _ParsedInviteRow(
+                row_number=row_number,
+                email=email,
+                name=name or None,
+            )
+        )
+
+    return parsed_rows
+
+
+def _parse_bulk_invite_text(text: str) -> list[_ParsedInviteRow]:
+    parsed_rows: list[_ParsedInviteRow] = []
+
+    for row_number, raw_line in enumerate(text.splitlines(), start=1):
+        if len(parsed_rows) >= MAX_BULK_INVITE_ROWS:
+            break
+        line = raw_line.strip()
+        if not line or line.startswith("#"):
+            continue
+
+        parsed_rows.append(
+            _ParsedInviteRow(
+                row_number=row_number,
+                email=line,
+                name=None,
+            )
+        )
+
+    return parsed_rows
+
+
+def _parse_bulk_invite_file(
+    filename: Optional[str],
+    content: bytes,
+) -> list[_ParsedInviteRow]:
+    text = _decode_bulk_invite_file(content)
+    file_name = filename.lower() if filename else ""
+    parsed_rows = (
+        _parse_bulk_invite_csv(text)
+        if file_name.endswith(".csv")
+        else _parse_bulk_invite_text(text)
+    )
+
+    if not parsed_rows:
+        raise ValueError("Invite file did not contain any emails")
+
+    return parsed_rows
+
+
+async def bulk_create_invited_users_from_file(
+    filename: Optional[str],
+    content: bytes,
+) -> BulkInvitedUsersResult:
+    parsed_rows = _parse_bulk_invite_file(filename, content)
+
+    created_count = 0
+    skipped_count = 0
+    error_count = 0
+    results: list[BulkInvitedUserRowResult] = []
+    seen_emails: set[str] = set()
+
+    for row in parsed_rows:
+        row_name = _normalize_name(row.name)
+
+        try:
+            validated_email = _email_adapter.validate_python(row.email)
+        except ValidationError:
+            error_count += 1
+            results.append(
+                BulkInvitedUserRowResult(
+                    row_number=row.row_number,
+                    email=row.email or None,
+                    name=row_name,
+                    status="ERROR",
+                    message="Invalid email address",
+                )
+            )
+            continue
+
+        normalized_email = normalize_email(str(validated_email))
+        if normalized_email in seen_emails:
+            skipped_count += 1
+            results.append(
+                BulkInvitedUserRowResult(
+                    row_number=row.row_number,
+                    email=normalized_email,
+                    name=row_name,
+                    status="SKIPPED",
+                    message="Duplicate email in upload file",
+                )
+            )
+            continue
+
+        seen_emails.add(normalized_email)
+
+        try:
+            invited_user = await create_invited_user(normalized_email, row_name)
+        except PreconditionFailed as exc:
+            skipped_count += 1
+            results.append(
+                BulkInvitedUserRowResult(
+                    row_number=row.row_number,
+                    email=normalized_email,
+                    name=row_name,
+                    status="SKIPPED",
+                    message=str(exc),
+                )
+            )
+        except Exception:
+            masked = mask_email(normalized_email)
+            logger.exception(
+                "Failed to create bulk invite for row %s (%s)",
+                row.row_number,
+                masked,
+            )
+            error_count += 1
+            results.append(
+                BulkInvitedUserRowResult(
+                    row_number=row.row_number,
+                    email=normalized_email,
+                    name=row_name,
+                    status="ERROR",
+                    message="Unexpected error creating invite",
+                )
+            )
+        else:
+            created_count += 1
+            results.append(
+                BulkInvitedUserRowResult(
+                    row_number=row.row_number,
+                    email=normalized_email,
+                    name=row_name,
+                    status="CREATED",
+                    message="Invite created",
+                    invited_user=invited_user,
+                )
+            )
+
+    return BulkInvitedUsersResult(
+        created_count=created_count,
+        skipped_count=skipped_count,
+        error_count=error_count,
+        results=results,
+    )
+
+
+async def _compute_invited_user_tally_seed(invited_user_id: str) -> None:
+    invited_user = await prisma.models.InvitedUser.prisma().find_unique(
+        where={"id": invited_user_id}
+    )
+    if invited_user is None:
+        return
+
+    if invited_user.status == prisma.enums.InvitedUserStatus.REVOKED:
+        return
+
+    try:
+        r = await get_redis_async()
+    except Exception:
+        r = None
+
+    lock: AsyncClusterLock | None = None
+
+    if r is not None:
+        lock = AsyncClusterLock(
+            redis=r,
+            key=f"tally_seed:{invited_user_id}",
+            owner_id=_WORKER_ID,
+            timeout=_TALLY_STALE_SECONDS,
+        )
+        current_owner = await lock.try_acquire()
+
+        if current_owner is None:
+            logger.warn("Redis unvailable for tally lock - skipping tally enrichement")
+            return
+        elif current_owner != _WORKER_ID:
+            logger.debug(
+                "Tally seed for %s already locked by %s, skipping",
+                invited_user_id,
+                current_owner,
+            )
+            return
+    if (
+        invited_user.tallyStatus == prisma.enums.TallyComputationStatus.RUNNING
+        and invited_user.updatedAt is not None
+    ):
+        age = (datetime.now(timezone.utc) - invited_user.updatedAt).total_seconds()
+        if age < _TALLY_STALE_SECONDS:
+            logger.debug(
+                "Tally task for %s still RUNNING (age=%ds), skipping",
+                invited_user_id,
+                int(age),
+            )
+            return
+        logger.info(
+            "Tally task for %s is stale (age=%ds), re-running",
+            invited_user_id,
+            int(age),
+        )
+
+    await prisma.models.InvitedUser.prisma().update(
+        where={"id": invited_user_id},
+        data={
+            "tallyStatus": prisma.enums.TallyComputationStatus.RUNNING,
+            "tallyError": None,
+        },
+    )
+
+    try:
+        input_data = await get_business_understanding_input_from_tally(
+            invited_user.email,
+            require_api_key=True,
+        )
+        payload = (
+            SafeJson(input_data.model_dump(exclude_none=True))
+            if input_data is not None
+            else None
+        )
+        await prisma.models.InvitedUser.prisma().update(
+            where={"id": invited_user_id},
+            data={
+                "tallyUnderstanding": payload,
+                "tallyStatus": prisma.enums.TallyComputationStatus.READY,
+                "tallyComputedAt": datetime.now(timezone.utc),
+                "tallyError": None,
+            },
+        )
+    except Exception as exc:
+        logger.exception(
+            "Failed to compute Tally understanding for invited user %s",
+            invited_user_id,
+        )
+        sanitized_error = re.sub(
+            r"https?://\S+", "<url>", f"{type(exc).__name__}: {exc}"
+        )[:_MAX_TALLY_ERROR_LENGTH]
+        await prisma.models.InvitedUser.prisma().update(
+            where={"id": invited_user_id},
+            data={
+                "tallyStatus": prisma.enums.TallyComputationStatus.FAILED,
+                "tallyError": sanitized_error,
+            },
+        )
+
+
+def schedule_invited_user_tally_precompute(invited_user_id: str) -> None:
+    existing = _tally_seed_tasks.get(invited_user_id)
+    if existing is not None and not existing.done():
+        logger.debug("Tally task already running for %s, skipping", invited_user_id)
+        return
+
+    task = asyncio.create_task(_compute_invited_user_tally_seed(invited_user_id))
+    _tally_seed_tasks[invited_user_id] = task
+
+    def _on_done(t: asyncio.Task, _id: str = invited_user_id) -> None:
+        if _tally_seed_tasks.get(_id) is t:
+            del _tally_seed_tasks[_id]
+
+    task.add_done_callback(_on_done)
+
+
+async def _open_signup_create_user(
+    auth_user_id: str,
+    normalized_email: str,
+    metadata_name: Optional[str],
+) -> User:
+    """Create a user without requiring an invite (open signup mode)."""
+    preferred_name = _normalize_name(metadata_name)
+    try:
+        async with transaction() as tx:
+            user = await prisma.models.User.prisma(tx).create(
+                data=prisma.types.UserCreateInput(
+                    id=auth_user_id,
+                    email=normalized_email,
+                    name=preferred_name,
+                )
+            )
+            await _ensure_default_profile(
+                auth_user_id, normalized_email, preferred_name, tx
+            )
+            await _ensure_default_onboarding(auth_user_id, tx)
+    except UniqueViolationError:
+        existing = await prisma.models.User.prisma().find_unique(
+            where={"id": auth_user_id}
+        )
+        if existing is not None:
+            return User.from_db(existing)
+        raise
+
+    return User.from_db(user)
+
+
+# TODO: We need to change this functions logic before going live
+async def get_or_activate_user(user_data: dict) -> User:
+    auth_user_id = user_data.get("sub")
+    if not auth_user_id:
+        raise NotAuthorizedError("User ID not found in token")
+
+    auth_email = user_data.get("email")
+    if not auth_email:
+        raise NotAuthorizedError("Email not found in token")
+
+    normalized_email = normalize_email(auth_email)
+    user_metadata = user_data.get("user_metadata")
+    metadata_name = (
+        user_metadata.get("name") if isinstance(user_metadata, dict) else None
+    )
+
+    existing_user = None
+    try:
+        existing_user = await get_user_by_id(auth_user_id)
+    except ValueError:
+        existing_user = None
+    except Exception:
+        logger.exception("Error on get user by id during tally enrichment process")
+        raise
+
+    if existing_user is not None:
+        return existing_user
+
+    if not _settings.config.enable_invite_gate or normalized_email.endswith("@agpt.co"):
+        return await _open_signup_create_user(
+            auth_user_id, normalized_email, metadata_name
+        )
+
+    invited_user = await prisma.models.InvitedUser.prisma().find_unique(
+        where={"email": normalized_email}
+    )
+    if invited_user is None:
+        raise NotAuthorizedError("Your email is not allowed to access the platform")
+
+    if invited_user.status != prisma.enums.InvitedUserStatus.INVITED:
+        raise NotAuthorizedError("Your invitation is no longer active")
+
+    try:
+        async with transaction() as tx:
+            current_user = await prisma.models.User.prisma(tx).find_unique(
+                where={"id": auth_user_id}
+            )
+            if current_user is not None:
+                return User.from_db(current_user)
+
+            current_invited_user = await prisma.models.InvitedUser.prisma(
+                tx
+            ).find_unique(where={"email": normalized_email})
+            if current_invited_user is None:
+                raise NotAuthorizedError(
+                    "Your email is not allowed to access the platform"
+                )
+
+            if current_invited_user.status != prisma.enums.InvitedUserStatus.INVITED:
+                raise NotAuthorizedError("Your invitation is no longer active")
+
+            if current_invited_user.authUserId not in (None, auth_user_id):
+                raise NotAuthorizedError("Your invitation has already been claimed")
+
+            preferred_name = current_invited_user.name or _normalize_name(metadata_name)
+            await prisma.models.User.prisma(tx).create(
+                data=prisma.types.UserCreateInput(
+                    id=auth_user_id,
+                    email=normalized_email,
+                    name=preferred_name,
+                )
+            )
+
+            await prisma.models.InvitedUser.prisma(tx).update(
+                where={"id": current_invited_user.id},
+                data={
+                    "status": prisma.enums.InvitedUserStatus.CLAIMED,
+                    "authUserId": auth_user_id,
+                },
+            )
+
+            await _ensure_default_profile(
+                auth_user_id,
+                normalized_email,
+                preferred_name,
+                tx,
+            )
+            await _ensure_default_onboarding(auth_user_id, tx)
+            await _apply_tally_understanding(auth_user_id, current_invited_user, tx)
+    except UniqueViolationError:
+        logger.info("Concurrent activation for user %s; re-fetching", auth_user_id)
+        already_created = await prisma.models.User.prisma().find_unique(
+            where={"id": auth_user_id}
+        )
+        if already_created is not None:
+            return User.from_db(already_created)
+        raise RuntimeError(
+            f"UniqueViolationError during activation but user {auth_user_id} not found"
+        )
+
+    get_user_by_id.cache_delete(auth_user_id)
+    get_user_by_email.cache_delete(normalized_email)
+
+    activated_user = await prisma.models.User.prisma().find_unique(
+        where={"id": auth_user_id}
+    )
+    if activated_user is None:
+        raise RuntimeError(
+            f"Activated user {auth_user_id} was not found after creation"
+        )
+
+    return User.from_db(activated_user)
--- a/autogpt_platform/backend/backend/data/invited_user_test.py
+++ b/autogpt_platform/backend/backend/data/invited_user_test.py
@@ -0,0 +1,335 @@
+from contextlib import asynccontextmanager
+from datetime import datetime, timezone
+from types import SimpleNamespace
+from typing import cast
+from unittest.mock import AsyncMock, Mock
+
+import prisma.enums
+import prisma.models
+import pytest
+import pytest_mock
+
+from backend.util.exceptions import NotAuthorizedError, PreconditionFailed
+
+from .invited_user import (
+    InvitedUserRecord,
+    bulk_create_invited_users_from_file,
+    create_invited_user,
+    get_or_activate_user,
+    retry_invited_user_tally,
+)
+
+
+def _invited_user_db_record(
+    *,
+    status: prisma.enums.InvitedUserStatus = prisma.enums.InvitedUserStatus.INVITED,
+    tally_understanding: dict | None = None,
+):
+    now = datetime.now(timezone.utc)
+    return SimpleNamespace(
+        id="invite-1",
+        email="invited@example.com",
+        status=status,
+        authUserId=None,
+        name="Invited User",
+        tallyUnderstanding=tally_understanding,
+        tallyStatus=prisma.enums.TallyComputationStatus.PENDING,
+        tallyComputedAt=None,
+        tallyError=None,
+        createdAt=now,
+        updatedAt=now,
+    )
+
+
+def _invited_user_record(
+    *,
+    status: prisma.enums.InvitedUserStatus = prisma.enums.InvitedUserStatus.INVITED,
+    tally_understanding: dict | None = None,
+):
+    return InvitedUserRecord.from_db(
+        cast(
+            prisma.models.InvitedUser,
+            _invited_user_db_record(
+                status=status,
+                tally_understanding=tally_understanding,
+            ),
+        )
+    )
+
+
+def _user_db_record():
+    now = datetime.now(timezone.utc)
+    return SimpleNamespace(
+        id="auth-user-1",
+        email="invited@example.com",
+        emailVerified=True,
+        name="Invited User",
+        createdAt=now,
+        updatedAt=now,
+        metadata={},
+        integrations="",
+        stripeCustomerId=None,
+        topUpConfig=None,
+        maxEmailsPerDay=3,
+        notifyOnAgentRun=True,
+        notifyOnZeroBalance=True,
+        notifyOnLowBalance=True,
+        notifyOnBlockExecutionFailed=True,
+        notifyOnContinuousAgentError=True,
+        notifyOnDailySummary=True,
+        notifyOnWeeklySummary=True,
+        notifyOnMonthlySummary=True,
+        notifyOnAgentApproved=True,
+        notifyOnAgentRejected=True,
+        timezone="not-set",
+    )
+
+
+@pytest.mark.asyncio
+async def test_create_invited_user_rejects_existing_active_user(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    user_repo = Mock()
+    user_repo.find_unique = AsyncMock(return_value=_user_db_record())
+    invited_user_repo = Mock()
+    invited_user_repo.find_unique = AsyncMock()
+
+    mocker.patch(
+        "backend.data.invited_user.prisma.models.User.prisma", return_value=user_repo
+    )
+    mocker.patch(
+        "backend.data.invited_user.prisma.models.InvitedUser.prisma",
+        return_value=invited_user_repo,
+    )
+
+    with pytest.raises(PreconditionFailed):
+        await create_invited_user("Invited@example.com")
+
+
+@pytest.mark.asyncio
+async def test_create_invited_user_schedules_tally_seed(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    user_repo = Mock()
+    user_repo.find_unique = AsyncMock(return_value=None)
+    invited_user_repo = Mock()
+    invited_user_repo.find_unique = AsyncMock(return_value=None)
+    invited_user_repo.create = AsyncMock(return_value=_invited_user_db_record())
+    schedule = mocker.patch(
+        "backend.data.invited_user.schedule_invited_user_tally_precompute"
+    )
+
+    mocker.patch(
+        "backend.data.invited_user.prisma.models.User.prisma", return_value=user_repo
+    )
+    mocker.patch(
+        "backend.data.invited_user.prisma.models.InvitedUser.prisma",
+        return_value=invited_user_repo,
+    )
+
+    invited_user = await create_invited_user("Invited@example.com", "Invited User")
+
+    assert invited_user.email == "invited@example.com"
+    invited_user_repo.create.assert_awaited_once()
+    schedule.assert_called_once_with("invite-1")
+
+
+@pytest.mark.asyncio
+async def test_retry_invited_user_tally_resets_state_and_schedules(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    invited_user_repo = Mock()
+    invited_user_repo.find_unique = AsyncMock(return_value=_invited_user_db_record())
+    invited_user_repo.update = AsyncMock(return_value=_invited_user_db_record())
+    schedule = mocker.patch(
+        "backend.data.invited_user.schedule_invited_user_tally_precompute"
+    )
+
+    mocker.patch(
+        "backend.data.invited_user.prisma.models.InvitedUser.prisma",
+        return_value=invited_user_repo,
+    )
+
+    invited_user = await retry_invited_user_tally("invite-1")
+
+    assert invited_user.id == "invite-1"
+    invited_user_repo.update.assert_awaited_once()
+    schedule.assert_called_once_with("invite-1")
+
+
+@pytest.mark.asyncio
+async def test_get_or_activate_user_requires_invite(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    invited_user_repo = Mock()
+    invited_user_repo.find_unique = AsyncMock(return_value=None)
+
+    mock_get_user_by_id = AsyncMock(side_effect=ValueError("User not found"))
+    mock_get_user_by_id.cache_delete = Mock()
+    mocker.patch(
+        "backend.data.invited_user.get_user_by_id",
+        mock_get_user_by_id,
+    )
+    mocker.patch(
+        "backend.data.invited_user._settings.config.enable_invite_gate",
+        True,
+    )
+    mocker.patch(
+        "backend.data.invited_user.prisma.models.InvitedUser.prisma",
+        return_value=invited_user_repo,
+    )
+
+    with pytest.raises(NotAuthorizedError):
+        await get_or_activate_user(
+            {"sub": "auth-user-1", "email": "invited@example.com"}
+        )
+
+
+@pytest.mark.asyncio
+async def test_get_or_activate_user_creates_user_from_invite(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    tx = object()
+    invited_user = _invited_user_db_record(
+        tally_understanding={"user_name": "Invited User", "industry": "Automation"}
+    )
+    created_user = _user_db_record()
+
+    outside_user_repo = Mock()
+    # Only called once at post-transaction verification (line 741);
+    # get_user_by_id (line 657) uses prisma.user.find_unique, not this mock.
+    outside_user_repo.find_unique = AsyncMock(return_value=created_user)
+
+    inside_user_repo = Mock()
+    inside_user_repo.find_unique = AsyncMock(return_value=None)
+    inside_user_repo.create = AsyncMock(return_value=created_user)
+
+    outside_invited_repo = Mock()
+    outside_invited_repo.find_unique = AsyncMock(return_value=invited_user)
+
+    inside_invited_repo = Mock()
+    inside_invited_repo.find_unique = AsyncMock(return_value=invited_user)
+    inside_invited_repo.update = AsyncMock(return_value=invited_user)
+
+    def user_prisma(client=None):
+        return inside_user_repo if client is tx else outside_user_repo
+
+    def invited_user_prisma(client=None):
+        return inside_invited_repo if client is tx else outside_invited_repo
+
+    @asynccontextmanager
+    async def fake_transaction():
+        yield tx
+
+    # Mock get_user_by_id since it uses prisma.user.find_unique (global client),
+    # not prisma.models.User.prisma().find_unique which we mock above.
+    mock_get_user_by_id = AsyncMock(side_effect=ValueError("User not found"))
+    mock_get_user_by_id.cache_delete = Mock()
+    mocker.patch(
+        "backend.data.invited_user.get_user_by_id",
+        mock_get_user_by_id,
+    )
+    mock_get_user_by_email = AsyncMock()
+    mock_get_user_by_email.cache_delete = Mock()
+    mocker.patch(
+        "backend.data.invited_user.get_user_by_email",
+        mock_get_user_by_email,
+    )
+    ensure_profile = mocker.patch(
+        "backend.data.invited_user._ensure_default_profile",
+        AsyncMock(),
+    )
+    ensure_onboarding = mocker.patch(
+        "backend.data.invited_user._ensure_default_onboarding",
+        AsyncMock(),
+    )
+    apply_tally = mocker.patch(
+        "backend.data.invited_user._apply_tally_understanding",
+        AsyncMock(),
+    )
+    mocker.patch("backend.data.invited_user.transaction", fake_transaction)
+    mocker.patch(
+        "backend.data.invited_user.prisma.models.User.prisma", side_effect=user_prisma
+    )
+    mocker.patch(
+        "backend.data.invited_user.prisma.models.InvitedUser.prisma",
+        side_effect=invited_user_prisma,
+    )
+
+    user = await get_or_activate_user(
+        {
+            "sub": "auth-user-1",
+            "email": "Invited@example.com",
+            "user_metadata": {"name": "Invited User"},
+        }
+    )
+
+    assert user.id == "auth-user-1"
+    inside_user_repo.create.assert_awaited_once()
+    inside_invited_repo.update.assert_awaited_once()
+    ensure_profile.assert_awaited_once()
+    ensure_onboarding.assert_awaited_once_with("auth-user-1", tx)
+    apply_tally.assert_awaited_once_with("auth-user-1", invited_user, tx)
+
+
+@pytest.mark.asyncio
+async def test_bulk_create_invited_users_from_text_file(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    create_invited = mocker.patch(
+        "backend.data.invited_user.create_invited_user",
+        AsyncMock(
+            side_effect=[
+                _invited_user_record(),
+                _invited_user_record(),
+            ]
+        ),
+    )
+
+    result = await bulk_create_invited_users_from_file(
+        "invites.txt",
+        b"Invited@example.com\nsecond@example.com\n",
+    )
+
+    assert result.created_count == 2
+    assert result.skipped_count == 0
+    assert result.error_count == 0
+    assert [row.status for row in result.results] == ["CREATED", "CREATED"]
+    assert create_invited.await_count == 2
+
+
+@pytest.mark.asyncio
+async def test_bulk_create_invited_users_handles_csv_duplicates_and_invalid_rows(
+    mocker: pytest_mock.MockerFixture,
+) -> None:
+    create_invited = mocker.patch(
+        "backend.data.invited_user.create_invited_user",
+        AsyncMock(
+            side_effect=[
+                _invited_user_record(),
+                PreconditionFailed("An invited user with this email already exists"),
+            ]
+        ),
+    )
+
+    result = await bulk_create_invited_users_from_file(
+        "invites.csv",
+        (
+            "email,name\n"
+            "valid@example.com,Valid User\n"
+            "not-an-email,Bad Row\n"
+            "valid@example.com,Duplicate In File\n"
+            "existing@example.com,Existing User\n"
+        ).encode("utf-8"),
+    )
+
+    assert result.created_count == 1
+    assert result.skipped_count == 2
+    assert result.error_count == 1
+    assert [row.status for row in result.results] == [
+        "CREATED",
+        "ERROR",
+        "SKIPPED",
+        "SKIPPED",
+    ]
+    assert create_invited.await_count == 2
--- a/autogpt_platform/backend/backend/data/tally.py
+++ b/autogpt_platform/backend/backend/data/tally.py
@@ -41,7 +41,7 @@ _MAX_PAGES = 100
 _LLM_TIMEOUT = 30


-def _mask_email(email: str) -> str:
+def mask_email(email: str) -> str:
    """Mask an email for safe logging: 'alice@example.com' -> 'a***e@example.com'."""
    try:
        local, domain = email.rsplit("@", 1)
@@ -196,8 +196,7 @@ async def _refresh_cache(form_id: str) -> tuple[dict, list]:

    Returns (email_index, questions).
    """
-    settings = Settings()
-    client = _make_tally_client(settings.secrets.tally_api_key)
+    client = _make_tally_client(_settings.secrets.tally_api_key)

    redis = await get_redis_async()
    last_fetch_key = _LAST_FETCH_KEY.format(form_id=form_id)
@@ -332,6 +331,9 @@ Fields:
 - current_software (list of strings): software/tools currently used
 - existing_automation (list of strings): existing automations
 - additional_notes (string): any additional context
+- suggested_prompts (list of 5 strings): short action prompts (each under 20 words) that would help \
+this person get started with automating their work. Should be specific to their industry, role, and \
+pain points; actionable and conversational in tone; focused on automation opportunities.

 Form data:
 """
@@ -339,21 +341,21 @@ Form data:
 _EXTRACTION_SUFFIX = "\n\nReturn ONLY valid JSON."


-async def extract_business_understanding(
+async def extract_business_understanding_from_tally(
    formatted_text: str,
 ) -> BusinessUnderstandingInput:
-    """Use an LLM to extract structured business understanding from form text.
+    """
+    Use an LLM to extract structured business understanding from form text.

    Raises on timeout or unparseable response so the caller can handle it.
    """
-    settings = Settings()
-    api_key = settings.secrets.open_router_api_key
+    api_key = _settings.secrets.open_router_api_key
    client = AsyncOpenAI(api_key=api_key, base_url=OPENROUTER_BASE_URL)

    try:
        response = await asyncio.wait_for(
            client.chat.completions.create(
-                model="openai/gpt-4o-mini",
+                model=_settings.config.tally_extraction_llm_model,
                messages=[
                    {
                        "role": "user",
@@ -378,9 +380,57 @@ async def extract_business_understanding(

    # Filter out null values before constructing
    cleaned = {k: v for k, v in data.items() if v is not None}
+
+    # Validate suggested_prompts: filter >20 words, keep top 3
+    raw_prompts = cleaned.get("suggested_prompts", [])
+    if isinstance(raw_prompts, list):
+        valid = [
+            p.strip()
+            for p in raw_prompts
+            if isinstance(p, str) and len(p.strip().split()) <= 20
+        ]
+        # This will keep up to 3 suggestions
+        short_prompts = valid[:3] if valid else None
+        if short_prompts:
+            cleaned["suggested_prompts"] = short_prompts
+        else:
+            # We dont want to add a None value suggested_prompts field
+            cleaned.pop("suggested_prompts", None)
+    else:
+        # suggested_prompts must be a list - removing it as its not here
+        cleaned.pop("suggested_prompts", None)
+
    return BusinessUnderstandingInput(**cleaned)


+async def get_business_understanding_input_from_tally(
+    email: str,
+    *,
+    require_api_key: bool = False,
+) -> Optional[BusinessUnderstandingInput]:
+    if not _settings.secrets.tally_api_key:
+        if require_api_key:
+            raise RuntimeError("Tally API key is not configured")
+        logger.debug("Tally: no API key configured, skipping")
+        return None
+
+    masked = mask_email(email)
+    result = await find_submission_by_email(TALLY_FORM_ID, email)
+    if result is None:
+        logger.debug(f"Tally: no submission found for {masked}")
+        return None
+
+    submission, questions = result
+    logger.info(f"Tally: found submission for {masked}, extracting understanding")
+
+    formatted = format_submission_for_llm(submission, questions)
+    if not formatted.strip():
+        logger.warning("Tally: formatted submission was empty, skipping")
+        return None
+
+    return await extract_business_understanding_from_tally(formatted)
+
+
 async def populate_understanding_from_tally(user_id: str, email: str) -> None:
    """Main orchestrator: check Tally for a matching submission and populate understanding.

@@ -395,30 +445,10 @@ async def populate_understanding_from_tally(user_id: str, email: str) -> None:
            )
            return

-        # Check API key is configured
-        settings = Settings()
-        if not settings.secrets.tally_api_key:
-            logger.debug("Tally: no API key configured, skipping")
+        understanding_input = await get_business_understanding_input_from_tally(email)
+        if understanding_input is None:
            return

-        # Look up submission by email
-        masked = _mask_email(email)
-        result = await find_submission_by_email(TALLY_FORM_ID, email)
-        if result is None:
-            logger.debug(f"Tally: no submission found for {masked}")
-            return
-
-        submission, questions = result
-        logger.info(f"Tally: found submission for {masked}, extracting understanding")
-
-        # Format and extract
-        formatted = format_submission_for_llm(submission, questions)
-        if not formatted.strip():
-            logger.warning("Tally: formatted submission was empty, skipping")
-            return
-
-        understanding_input = await extract_business_understanding(formatted)
-
        # Upsert into database
        await upsert_business_understanding(user_id, understanding_input)
        logger.info(f"Tally: successfully populated understanding for user {user_id}")
--- a/autogpt_platform/backend/backend/data/tally_test.py
+++ b/autogpt_platform/backend/backend/data/tally_test.py
@@ -12,11 +12,11 @@ from backend.data.tally import (
    _build_email_index,
    _format_answer,
    _make_tally_client,
-    _mask_email,
    _refresh_cache,
-    extract_business_understanding,
+    extract_business_understanding_from_tally,
    find_submission_by_email,
    format_submission_for_llm,
+    mask_email,
    populate_understanding_from_tally,
 )

@@ -248,7 +248,7 @@ async def test_populate_understanding_skips_no_api_key():
            new_callable=AsyncMock,
            return_value=None,
        ),
-        patch("backend.data.tally.Settings", return_value=mock_settings),
+        patch("backend.data.tally._settings", mock_settings),
        patch(
            "backend.data.tally.find_submission_by_email",
            new_callable=AsyncMock,
@@ -284,6 +284,7 @@ async def test_populate_understanding_full_flow():
        ],
    }
    mock_input = MagicMock()
+    mock_input.suggested_prompts = ["Prompt 1", "Prompt 2", "Prompt 3"]

    with (
        patch(
@@ -291,14 +292,14 @@ async def test_populate_understanding_full_flow():
            new_callable=AsyncMock,
            return_value=None,
        ),
-        patch("backend.data.tally.Settings", return_value=mock_settings),
+        patch("backend.data.tally._settings", mock_settings),
        patch(
            "backend.data.tally.find_submission_by_email",
            new_callable=AsyncMock,
            return_value=(submission, SAMPLE_QUESTIONS),
        ),
        patch(
-            "backend.data.tally.extract_business_understanding",
+            "backend.data.tally.extract_business_understanding_from_tally",
            new_callable=AsyncMock,
            return_value=mock_input,
        ) as mock_extract,
@@ -331,14 +332,14 @@ async def test_populate_understanding_handles_llm_timeout():
            new_callable=AsyncMock,
            return_value=None,
        ),
-        patch("backend.data.tally.Settings", return_value=mock_settings),
+        patch("backend.data.tally._settings", mock_settings),
        patch(
            "backend.data.tally.find_submission_by_email",
            new_callable=AsyncMock,
            return_value=(submission, SAMPLE_QUESTIONS),
        ),
        patch(
-            "backend.data.tally.extract_business_understanding",
+            "backend.data.tally.extract_business_understanding_from_tally",
            new_callable=AsyncMock,
            side_effect=asyncio.TimeoutError(),
        ),
@@ -356,13 +357,13 @@ async def test_populate_understanding_handles_llm_timeout():


 def test_mask_email():
-    assert _mask_email("alice@example.com") == "a***e@example.com"
-    assert _mask_email("ab@example.com") == "a***@example.com"
-    assert _mask_email("a@example.com") == "a***@example.com"
+    assert mask_email("alice@example.com") == "a***e@example.com"
+    assert mask_email("ab@example.com") == "a***@example.com"
+    assert mask_email("a@example.com") == "a***@example.com"


 def test_mask_email_invalid():
-    assert _mask_email("no-at-sign") == "***"
+    assert mask_email("no-at-sign") == "***"


 # ── Prompt construction (curly-brace safety) ─────────────────────────────────
@@ -393,11 +394,11 @@ def test_extraction_prompt_no_format_placeholders():
    assert single_braces == [], f"Found format placeholders: {single_braces}"


-# ── extract_business_understanding ────────────────────────────────────────────
+# ── extract_business_understanding_from_tally ────────────────────────────────────────────


@pytest.mark.asyncio
-async def test_extract_business_understanding_success():
+async def test_extract_business_understanding_from_tally_success():
    """Happy path: LLM returns valid JSON that maps to BusinessUnderstandingInput."""
    mock_choice = MagicMock()
    mock_choice.message.content = json.dumps(
@@ -406,6 +407,13 @@ async def test_extract_business_understanding_success():
            "business_name": "Acme Corp",
            "industry": "Technology",
            "pain_points": ["manual reporting"],
+            "suggested_prompts": [
+                "Automate weekly reports",
+                "Set up invoice processing",
+                "Create a customer onboarding flow",
+                "Track project deadlines automatically",
+                "Send follow-up emails after meetings",
+            ],
        }
    )
    mock_response = MagicMock()
@@ -415,16 +423,56 @@ async def test_extract_business_understanding_success():
    mock_client.chat.completions.create.return_value = mock_response

    with patch("backend.data.tally.AsyncOpenAI", return_value=mock_client):
-        result = await extract_business_understanding("Q: Name?\nA: Alice")
+        result = await extract_business_understanding_from_tally("Q: Name?\nA: Alice")

    assert result.user_name == "Alice"
    assert result.business_name == "Acme Corp"
    assert result.industry == "Technology"
    assert result.pain_points == ["manual reporting"]
+    # suggested_prompts validated and sliced to top 3
+    assert result.suggested_prompts == [
+        "Automate weekly reports",
+        "Set up invoice processing",
+        "Create a customer onboarding flow",
+    ]


@pytest.mark.asyncio
-async def test_extract_business_understanding_filters_nulls():
+async def test_extract_business_understanding_from_tally_filters_long_prompts():
+    """Prompts exceeding 20 words are excluded and only top 3 are kept."""
+    long_prompt = " ".join(["word"] * 21)
+    mock_choice = MagicMock()
+    mock_choice.message.content = json.dumps(
+        {
+            "user_name": "Alice",
+            "suggested_prompts": [
+                long_prompt,
+                "Short prompt one",
+                long_prompt,
+                "Short prompt two",
+                "Short prompt three",
+                "Short prompt four",
+            ],
+        }
+    )
+    mock_response = MagicMock()
+    mock_response.choices = [mock_choice]
+
+    mock_client = AsyncMock()
+    mock_client.chat.completions.create.return_value = mock_response
+
+    with patch("backend.data.tally.AsyncOpenAI", return_value=mock_client):
+        result = await extract_business_understanding_from_tally("Q: Name?\nA: Alice")
+
+    assert result.suggested_prompts == [
+        "Short prompt one",
+        "Short prompt two",
+        "Short prompt three",
+    ]
+
+
+@pytest.mark.asyncio
+async def test_extract_business_understanding_from_tally_filters_nulls():
    """Null values from LLM should be excluded from the result."""
    mock_choice = MagicMock()
    mock_choice.message.content = json.dumps(
@@ -437,7 +485,7 @@ async def test_extract_business_understanding_filters_nulls():
    mock_client.chat.completions.create.return_value = mock_response

    with patch("backend.data.tally.AsyncOpenAI", return_value=mock_client):
-        result = await extract_business_understanding("Q: Name?\nA: Alice")
+        result = await extract_business_understanding_from_tally("Q: Name?\nA: Alice")

    assert result.user_name == "Alice"
    assert result.business_name is None
@@ -445,7 +493,7 @@ async def test_extract_business_understanding_filters_nulls():


@pytest.mark.asyncio
-async def test_extract_business_understanding_invalid_json():
+async def test_extract_business_understanding_from_tally_invalid_json():
    """Invalid JSON from LLM should raise JSONDecodeError."""
    mock_choice = MagicMock()
    mock_choice.message.content = "not valid json {"
@@ -459,11 +507,11 @@ async def test_extract_business_understanding_invalid_json():
        patch("backend.data.tally.AsyncOpenAI", return_value=mock_client),
        pytest.raises(json.JSONDecodeError),
    ):
-        await extract_business_understanding("Q: Name?\nA: Alice")
+        await extract_business_understanding_from_tally("Q: Name?\nA: Alice")


@pytest.mark.asyncio
-async def test_extract_business_understanding_timeout():
+async def test_extract_business_understanding_from_tally_timeout():
    """LLM timeout should propagate as asyncio.TimeoutError."""
    mock_client = AsyncMock()
    mock_client.chat.completions.create.side_effect = asyncio.TimeoutError()
@@ -473,7 +521,7 @@ async def test_extract_business_understanding_timeout():
        patch("backend.data.tally._LLM_TIMEOUT", 0.001),
        pytest.raises(asyncio.TimeoutError),
    ):
-        await extract_business_understanding("Q: Name?\nA: Alice")
+        await extract_business_understanding_from_tally("Q: Name?\nA: Alice")


 # ── _refresh_cache ───────────────────────────────────────────────────────────
@@ -492,7 +540,7 @@ async def test_refresh_cache_full_fetch():
    submissions = SAMPLE_SUBMISSIONS

    with (
-        patch("backend.data.tally.Settings", return_value=mock_settings),
+        patch("backend.data.tally._settings", mock_settings),
        patch(
            "backend.data.tally.get_redis_async",
            new_callable=AsyncMock,
@@ -540,7 +588,7 @@ async def test_refresh_cache_incremental_fetch():
    new_submissions = [SAMPLE_SUBMISSIONS[0]]  # Just Alice

    with (
-        patch("backend.data.tally.Settings", return_value=mock_settings),
+        patch("backend.data.tally._settings", mock_settings),
        patch(
            "backend.data.tally.get_redis_async",
            new_callable=AsyncMock,
--- a/autogpt_platform/backend/backend/data/understanding.py
+++ b/autogpt_platform/backend/backend/data/understanding.py
@@ -31,6 +31,18 @@ def _json_to_list(value: Any) -> list[str]:
    return []


+def parse_business_understanding_input(
+    payload: Any,
+) -> "BusinessUnderstandingInput | None":
+    if payload is None:
+        return None
+
+    try:
+        return BusinessUnderstandingInput.model_validate(payload)
+    except pydantic.ValidationError:
+        return None
+
+
 class BusinessUnderstandingInput(pydantic.BaseModel):
    """Input model for updating business understanding - all fields optional for incremental updates."""

@@ -86,6 +98,11 @@ class BusinessUnderstandingInput(pydantic.BaseModel):
        None, description="Any additional context"
    )

+    # Suggested prompts (UI-only, not included in system prompt)
+    suggested_prompts: Optional[list[str]] = pydantic.Field(
+        None, description="LLM-generated suggested prompts based on business context"
+    )
+

 class BusinessUnderstanding(pydantic.BaseModel):
    """Full business understanding model returned from database."""
@@ -122,6 +139,9 @@ class BusinessUnderstanding(pydantic.BaseModel):
    # Additional context
    additional_notes: Optional[str] = None

+    # Suggested prompts (UI-only, not included in system prompt)
+    suggested_prompts: list[str] = pydantic.Field(default_factory=list)
+
    @classmethod
    def from_db(cls, db_record: CoPilotUnderstanding) -> "BusinessUnderstanding":
        """Convert database record to Pydantic model."""
@@ -149,6 +169,7 @@ class BusinessUnderstanding(pydantic.BaseModel):
            current_software=_json_to_list(business.get("current_software")),
            existing_automation=_json_to_list(business.get("existing_automation")),
            additional_notes=business.get("additional_notes"),
+            suggested_prompts=_json_to_list(data.get("suggested_prompts")),
        )


@@ -166,6 +187,62 @@ def _merge_lists(existing: list | None, new: list | None) -> list | None:
    return merged


+def merge_business_understanding_data(
+    existing_data: dict[str, Any],
+    input_data: BusinessUnderstandingInput,
+) -> dict[str, Any]:
+    merged_data = dict(existing_data)
+
+    merged_business: dict[str, Any] = {}
+    if isinstance(merged_data.get("business"), dict):
+        merged_business = dict(merged_data["business"])
+
+    business_string_fields = [
+        "job_title",
+        "business_name",
+        "industry",
+        "business_size",
+        "user_role",
+        "additional_notes",
+    ]
+    business_list_fields = [
+        "key_workflows",
+        "daily_activities",
+        "pain_points",
+        "bottlenecks",
+        "manual_tasks",
+        "automation_goals",
+        "current_software",
+        "existing_automation",
+    ]
+
+    if input_data.user_name is not None:
+        merged_data["name"] = input_data.user_name
+
+    for field in business_string_fields:
+        value = getattr(input_data, field)
+        if value is not None:
+            merged_business[field] = value
+
+    for field in business_list_fields:
+        value = getattr(input_data, field)
+        if value is not None:
+            existing_list = _json_to_list(merged_business.get(field))
+            merged_list = _merge_lists(existing_list, value)
+            merged_business[field] = merged_list
+
+    merged_business["version"] = 1
+    merged_data["business"] = merged_business
+
+    # suggested_prompts lives at the top level (not under `business`) because
+    # it's a UI-only artifact consumed by the frontend, not business understanding
+    # data. The `business` sub-dict feeds the system prompt.
+    if input_data.suggested_prompts is not None:
+        merged_data["suggested_prompts"] = input_data.suggested_prompts
+
+    return merged_data
+
+
 async def _get_from_cache(user_id: str) -> Optional[BusinessUnderstanding]:
    """Get business understanding from Redis cache."""
    try:
@@ -245,63 +322,18 @@ async def upsert_business_understanding(
        where={"userId": user_id}
    )

-    # Get existing data structure or start fresh
    existing_data: dict[str, Any] = {}
    if existing and isinstance(existing.data, dict):
        existing_data = dict(existing.data)

-    existing_business: dict[str, Any] = {}
-    if isinstance(existing_data.get("business"), dict):
-        existing_business = dict(existing_data["business"])
-
-    # Business fields (stored inside business object)
-    business_string_fields = [
-        "job_title",
-        "business_name",
-        "industry",
-        "business_size",
-        "user_role",
-        "additional_notes",
-    ]
-    business_list_fields = [
-        "key_workflows",
-        "daily_activities",
-        "pain_points",
-        "bottlenecks",
-        "manual_tasks",
-        "automation_goals",
-        "current_software",
-        "existing_automation",
-    ]
-
-    # Handle top-level name field
-    if input_data.user_name is not None:
-        existing_data["name"] = input_data.user_name
-
-    # Business string fields - overwrite if provided
-    for field in business_string_fields:
-        value = getattr(input_data, field)
-        if value is not None:
-            existing_business[field] = value
-
-    # Business list fields - merge with existing
-    for field in business_list_fields:
-        value = getattr(input_data, field)
-        if value is not None:
-            existing_list = _json_to_list(existing_business.get(field))
-            merged = _merge_lists(existing_list, value)
-            existing_business[field] = merged
-
-    # Set version and nest business data
-    existing_business["version"] = 1
-    existing_data["business"] = existing_business
+    merged_data = merge_business_understanding_data(existing_data, input_data)

    # Upsert with the merged data
    record = await CoPilotUnderstanding.prisma().upsert(
        where={"userId": user_id},
        data={
-            "create": {"userId": user_id, "data": SafeJson(existing_data)},
-            "update": {"data": SafeJson(existing_data)},
+            "create": {"userId": user_id, "data": SafeJson(merged_data)},
+            "update": {"data": SafeJson(merged_data)},
        },
    )

--- a/autogpt_platform/backend/backend/data/understanding_test.py
+++ b/autogpt_platform/backend/backend/data/understanding_test.py
@@ -0,0 +1,102 @@
+"""Tests for business understanding merge and format logic."""
+
+from datetime import datetime, timezone
+from typing import Any
+
+from backend.data.understanding import (
+    BusinessUnderstanding,
+    BusinessUnderstandingInput,
+    format_understanding_for_prompt,
+    merge_business_understanding_data,
+)
+
+
+def _make_input(**kwargs: Any) -> BusinessUnderstandingInput:
+    """Create a BusinessUnderstandingInput with only the specified fields."""
+    return BusinessUnderstandingInput.model_validate(kwargs)
+
+
+# ─── merge_business_understanding_data: suggested_prompts ─────────────
+
+
+def test_merge_suggested_prompts_overwrites_existing():
+    """New suggested_prompts should fully replace existing ones (not append)."""
+    existing = {
+        "name": "Alice",
+        "business": {"industry": "Tech", "version": 1},
+        "suggested_prompts": ["Old prompt 1", "Old prompt 2"],
+    }
+    input_data = _make_input(
+        suggested_prompts=["New prompt A", "New prompt B", "New prompt C"],
+    )
+
+    result = merge_business_understanding_data(existing, input_data)
+
+    assert result["suggested_prompts"] == [
+        "New prompt A",
+        "New prompt B",
+        "New prompt C",
+    ]
+
+
+def test_merge_suggested_prompts_none_preserves_existing():
+    """When input has suggested_prompts=None, existing prompts are preserved."""
+    existing = {
+        "name": "Alice",
+        "business": {"industry": "Tech", "version": 1},
+        "suggested_prompts": ["Keep me"],
+    }
+    input_data = _make_input(industry="Finance")
+
+    result = merge_business_understanding_data(existing, input_data)
+
+    assert result["suggested_prompts"] == ["Keep me"]
+    assert result["business"]["industry"] == "Finance"
+
+
+def test_merge_suggested_prompts_added_to_empty_data():
+    """Suggested prompts are set at top level even when starting from empty data."""
+    existing: dict[str, Any] = {}
+    input_data = _make_input(suggested_prompts=["Prompt 1"])
+
+    result = merge_business_understanding_data(existing, input_data)
+
+    assert result["suggested_prompts"] == ["Prompt 1"]
+
+
+def test_merge_suggested_prompts_empty_list_overwrites():
+    """An explicit empty list should overwrite existing prompts."""
+    existing: dict[str, Any] = {
+        "suggested_prompts": ["Old prompt"],
+        "business": {"version": 1},
+    }
+    input_data = _make_input(suggested_prompts=[])
+
+    result = merge_business_understanding_data(existing, input_data)
+
+    assert result["suggested_prompts"] == []
+
+
+# ─── format_understanding_for_prompt: excludes suggested_prompts ──────
+
+
+def test_format_understanding_excludes_suggested_prompts():
+    """suggested_prompts is UI-only and must NOT appear in the system prompt."""
+    understanding = BusinessUnderstanding(
+        id="test-id",
+        user_id="user-1",
+        created_at=datetime.now(tz=timezone.utc),
+        updated_at=datetime.now(tz=timezone.utc),
+        user_name="Alice",
+        industry="Technology",
+        suggested_prompts=["Automate reports", "Set up alerts", "Track KPIs"],
+    )
+
+    formatted = format_understanding_for_prompt(understanding)
+
+    assert "Alice" in formatted
+    assert "Technology" in formatted
+    assert "suggested_prompts" not in formatted
+    assert "Automate reports" not in formatted
+    assert "Set up alerts" not in formatted
+    assert "Track KPIs" not in formatted
--- a/autogpt_platform/backend/backend/data/user.py
+++ b/autogpt_platform/backend/backend/data/user.py
@@ -62,6 +62,61 @@ async def get_user_by_id(user_id: str) -> User:
    return User.from_db(user)


+async def list_users(
+    limit: int = 500,
+    cursor: str | None = None,
+) -> list[User]:
+    try:
+        kwargs: dict = {
+            "take": limit,
+            "order": {"id": "asc"},
+        }
+        if cursor is not None:
+            kwargs["cursor"] = {"id": cursor}
+            kwargs["skip"] = 1
+        users = await PrismaUser.prisma().find_many(**kwargs)
+        return [User.from_db(user) for user in users]
+    except Exception as e:
+        raise DatabaseError(f"Failed to list users: {e}") from e
+
+
+async def search_users(query: str, limit: int = 20) -> list[User]:
+    normalized_query = query.strip()
+    if not normalized_query:
+        return []
+
+    try:
+        users = await PrismaUser.prisma().find_many(
+            where={
+                "OR": [
+                    {
+                        "email": {
+                            "contains": normalized_query,
+                            "mode": "insensitive",
+                        }
+                    },
+                    {
+                        "name": {
+                            "contains": normalized_query,
+                            "mode": "insensitive",
+                        }
+                    },
+                    {
+                        "id": {
+                            "contains": normalized_query,
+                            "mode": "insensitive",
+                        }
+                    },
+                ]
+            },
+            order={"updatedAt": "desc"},
+            take=limit,
+        )
+        return [User.from_db(user) for user in users]
+    except Exception as e:
+        raise DatabaseError(f"Failed to search users for query {query!r}: {e}") from e
+
+
 async def get_user_email_by_id(user_id: str) -> Optional[str]:
    try:
        user = await prisma.user.find_unique(where={"id": user_id})
--- a/autogpt_platform/backend/backend/data/workspace.py
+++ b/autogpt_platform/backend/backend/data/workspace.py
@@ -254,6 +254,23 @@ async def list_workspace_files(
    return [WorkspaceFile.from_db(f) for f in files]


+async def get_workspace_files_by_ids(
+    workspace_id: str,
+    file_ids: list[str],
+) -> list[WorkspaceFile]:
+    if not file_ids:
+        return []
+
+    files = await UserWorkspaceFile.prisma().find_many(
+        where={
+            "id": {"in": file_ids},
+            "workspaceId": workspace_id,
+            "isDeleted": False,
+        }
+    )
+    return [WorkspaceFile.from_db(file) for file in files]
+
+
 async def count_workspace_files(
    workspace_id: str,
    path_prefix: Optional[str] = None,
--- a/autogpt_platform/backend/backend/executor/manager.py
+++ b/autogpt_platform/backend/backend/executor/manager.py
@@ -61,7 +61,12 @@ from backend.util.decorator import (
    error_logged,
    time_measured,
 )
-from backend.util.exceptions import InsufficientBalanceError, ModerationError
+from backend.util.exceptions import (
+    GraphNotFoundError,
+    InsufficientBalanceError,
+    ModerationError,
+    NotFoundError,
+)
 from backend.util.file import clean_exec_files
 from backend.util.logging import TruncatedLogger, configure_logging
 from backend.util.metrics import DiscordChannel
@@ -375,9 +380,16 @@ async def execute_node(
            log_metadata.debug("Node produced output", **{output_name: output_data})
            yield output_name, output_data
    except Exception as ex:
-        # Capture exception WITH context still set before restoring scope
-        sentry_sdk.capture_exception(error=ex, scope=scope)
-        sentry_sdk.flush()  # Ensure it's sent before we restore scope
+        # Only capture unexpected errors to Sentry, not user-caused ones.
+        # Most ValueError subclasses here are expected (BlockExecutionError,
+        # InsufficientBalanceError, plain ValueError for auth/disabled blocks, etc.)
+        # but NotFoundError/GraphNotFoundError could indicate real platform issues.
+        is_expected = isinstance(ex, ValueError) and not isinstance(
+            ex, (NotFoundError, GraphNotFoundError)
+        )
+        if not is_expected:
+            sentry_sdk.capture_exception(error=ex, scope=scope)
+            sentry_sdk.flush()
        # Re-raise to maintain normal error flow
        raise
    finally:
@@ -1478,7 +1490,7 @@ class ExecutionProcessor:
                    alert_message, DiscordChannel.PRODUCT
                )
            except Exception as e:
-                logger.error(f"Failed to send low balance Discord alert: {e}")
+                logger.warning(f"Failed to send low balance Discord alert: {e}")


 class ExecutionManager(AppProcess):
@@ -1900,17 +1912,16 @@ class ExecutionManager(AppProcess):
            channel = client.get_channel()
            channel.connection.add_callback_threadsafe(lambda: channel.stop_consuming())

-            try:
-                thread.join(timeout=300)
-            except TimeoutError:
-                logger.error(
+            thread.join(timeout=300)
+            if thread.is_alive():
+                logger.warning(
                    f"{prefix} ⚠️ Run thread did not finish in time, forcing disconnect"
                )

            client.disconnect()
            logger.info(f"{prefix} ✅ Run client disconnected")
        except Exception as e:
-            logger.error(f"{prefix} ⚠️ Error disconnecting run client: {type(e)} {e}")
+            logger.warning(f"{prefix} ⚠️ Error disconnecting run client: {type(e)} {e}")

    def cleanup(self):
        """Override cleanup to implement graceful shutdown with active execution waiting."""
@@ -1926,7 +1937,9 @@ class ExecutionManager(AppProcess):
            )
            logger.info(f"{prefix} ✅ Exec consumer has been signaled to stop")
        except Exception as e:
-            logger.error(f"{prefix} ⚠️ Error signaling consumer to stop: {type(e)} {e}")
+            logger.warning(
+                f"{prefix} ⚠️ Error signaling consumer to stop: {type(e)} {e}"
+            )

        # Wait for active executions to complete
        if self.active_graph_runs:
@@ -1957,7 +1970,7 @@ class ExecutionManager(AppProcess):
                waited += wait_interval

            if self.active_graph_runs:
-                logger.error(
+                logger.warning(
                    f"{prefix} ⚠️ {len(self.active_graph_runs)} executions still running after {max_wait}s"
                )
            else:
@@ -1968,7 +1981,7 @@ class ExecutionManager(AppProcess):
            self.executor.shutdown(cancel_futures=True, wait=False)
            logger.info(f"{prefix} ✅ Executor shutdown completed")
        except Exception as e:
-            logger.error(f"{prefix} ⚠️ Error during executor shutdown: {type(e)} {e}")
+            logger.warning(f"{prefix} ⚠️ Error during executor shutdown: {type(e)} {e}")

        # Release remaining execution locks
        try:
--- a/autogpt_platform/backend/backend/executor/scheduler.py
+++ b/autogpt_platform/backend/backend/executor/scheduler.py
@@ -24,6 +24,12 @@ from dotenv import load_dotenv
 from pydantic import BaseModel, Field, ValidationError
 from sqlalchemy import MetaData, create_engine

+from backend.copilot.autopilot import (
+    dispatch_nightly_copilot as dispatch_nightly_copilot_async,
+)
+from backend.copilot.autopilot import (
+    send_nightly_copilot_emails as send_nightly_copilot_emails_async,
+)
 from backend.copilot.optimize_blocks import optimize_block_descriptions
 from backend.data.execution import GraphExecutionWithNodes
 from backend.data.model import CredentialsMetaInput, GraphInput
@@ -94,7 +100,7 @@ SCHEDULER_OPERATION_TIMEOUT_SECONDS = 300  # 5 minutes for scheduler operations
 def job_listener(event):
    """Logs job execution outcomes for better monitoring."""
    if event.exception:
-        logger.error(
+        logger.warning(
            f"Job {event.job_id} failed: {type(event.exception).__name__}: {event.exception}"
        )
    else:
@@ -137,7 +143,7 @@ def run_async(coro, timeout: float = SCHEDULER_OPERATION_TIMEOUT_SECONDS):
    try:
        return future.result(timeout=timeout)
    except Exception as e:
-        logger.error(f"Async operation failed: {type(e).__name__}: {e}")
+        logger.warning(f"Async operation failed: {type(e).__name__}: {e}")
        raise


@@ -186,7 +192,7 @@ async def _execute_graph(**kwargs):


 async def _handle_graph_validation_error(args: "GraphExecutionJobArgs") -> None:
-    logger.error(
+    logger.warning(
        f"Scheduled Graph {args.graph_id} failed validation. Unscheduling graph"
    )
    if args.schedule_id:
@@ -196,8 +202,9 @@ async def _handle_graph_validation_error(args: "GraphExecutionJobArgs") -> None:
            user_id=args.user_id,
        )
    else:
-        logger.error(
-            f"Unable to unschedule graph: {args.graph_id} as this is an old job with no associated schedule_id please remove manually"
+        logger.warning(
+            f"Unable to unschedule graph: {args.graph_id} as this is an old job "
+            f"with no associated schedule_id please remove manually"
        )


@@ -258,6 +265,16 @@ def cleanup_oauth_tokens():
    run_async(_cleanup())


+def dispatch_nightly_copilot():
+    """Dispatch proactive nightly copilot sessions."""
+    return run_async(dispatch_nightly_copilot_async())
+
+
+def send_nightly_copilot_emails():
+    """Send emails for completed non-manual copilot sessions."""
+    return run_async(send_nightly_copilot_emails_async())
+
+
 def execution_accuracy_alerts():
    """Check execution accuracy and send alerts if drops are detected."""
    return report_execution_accuracy_alerts()
@@ -403,7 +420,7 @@ class GraphExecutionJobInfo(GraphExecutionJobArgs):
    ) -> "GraphExecutionJobInfo":
        # Extract timezone from the trigger if it's a CronTrigger
        timezone_str = "UTC"
-        if hasattr(job_obj.trigger, "timezone"):
+        if isinstance(job_obj.trigger, CronTrigger):
            timezone_str = str(job_obj.trigger.timezone)

        return GraphExecutionJobInfo(
@@ -618,6 +635,24 @@ class Scheduler(AppService):
                jobstore=Jobstores.EXECUTION.value,
            )

+            self.scheduler.add_job(
+                dispatch_nightly_copilot,
+                id="dispatch_nightly_copilot",
+                trigger=CronTrigger(minute="0,30", timezone=ZoneInfo("UTC")),
+                replace_existing=True,
+                max_instances=1,
+                jobstore=Jobstores.EXECUTION.value,
+            )
+
+            self.scheduler.add_job(
+                send_nightly_copilot_emails,
+                id="send_nightly_copilot_emails",
+                trigger=CronTrigger(minute="15,45", timezone=ZoneInfo("UTC")),
+                replace_existing=True,
+                max_instances=1,
+                jobstore=Jobstores.EXECUTION.value,
+            )
+
        self.scheduler.add_listener(job_listener, EVENT_JOB_EXECUTED | EVENT_JOB_ERROR)
        self.scheduler.add_listener(job_missed_listener, EVENT_JOB_MISSED)
        self.scheduler.add_listener(job_max_instances_listener, EVENT_JOB_MAX_INSTANCES)
@@ -792,6 +827,14 @@ class Scheduler(AppService):
        """Manually trigger embedding backfill for approved store agents."""
        return ensure_embeddings_coverage()

+    @expose
+    def execute_dispatch_nightly_copilot(self):
+        return dispatch_nightly_copilot()
+
+    @expose
+    def execute_send_nightly_copilot_emails(self):
+        return send_nightly_copilot_emails()
+

 class SchedulerClient(AppServiceClient):
    @classmethod
--- a/autogpt_platform/backend/backend/executor/utils.py
+++ b/autogpt_platform/backend/backend/executor/utils.py
@@ -46,7 +46,7 @@ from backend.util.exceptions import (
 )
 from backend.util.logging import TruncatedLogger, is_structured_logging_enabled
 from backend.util.settings import Config
-from backend.util.type import convert
+from backend.util.type import coerce_inputs_to_schema

 config = Config()
 logger = TruncatedLogger(logging.getLogger(__name__), prefix="[GraphExecutorUtil]")
@@ -213,11 +213,8 @@ def validate_exec(
    if resolve_input:
        data = merge_execution_input(data)

-    # Convert non-matching data types to the expected input schema.
-    for name, data_type in schema.__annotations__.items():
-        value = data.get(name)
-        if (value is not None) and (type(value) is not data_type):
-            data[name] = convert(value, data_type)
+    # Coerce non-matching data types to the expected input schema.
+    coerce_inputs_to_schema(data, schema)

    # Input data post-merge should contain all required fields from the schema.
    if missing_input := schema.get_missing_input(data):
--- a/autogpt_platform/backend/backend/notifications/email.py
+++ b/autogpt_platform/backend/backend/notifications/email.py
@@ -1,5 +1,6 @@
 import logging
 import pathlib
+from typing import Any

 from postmarker.core import PostmarkClient
 from postmarker.models.emails import EmailManager
@@ -7,12 +8,14 @@ from prisma.enums import NotificationType
 from pydantic import BaseModel

 from backend.data.notifications import (
+    AgentRunData,
    NotificationDataType_co,
    NotificationEventModel,
    NotificationTypeOverride,
 )
 from backend.util.settings import Settings
 from backend.util.text import TextFormatter
+from backend.util.url import get_frontend_base_url

 logger = logging.getLogger(__name__)
 settings = Settings()
@@ -46,6 +49,102 @@ class EmailSender:

    MAX_EMAIL_CHARS = 5_000_000  # ~5MB buffer

+    def _get_unsubscribe_link(self, user_unsubscribe_link: str | None) -> str:
+        return user_unsubscribe_link or f"{get_frontend_base_url()}/profile/settings"
+
+    def _format_template_email(
+        self,
+        *,
+        subject_template: str,
+        content_template: str,
+        data: Any,
+        unsubscribe_link: str,
+    ) -> tuple[str, str]:
+        return self.formatter.format_email(
+            base_template=self._read_template("templates/base.html.jinja2"),
+            subject_template=subject_template,
+            content_template=content_template,
+            data=data,
+            unsubscribe_link=unsubscribe_link,
+        )
+
+    def _build_large_output_summary(
+        self,
+        data: (
+            NotificationEventModel[NotificationDataType_co]
+            | list[NotificationEventModel[NotificationDataType_co]]
+        ),
+        *,
+        email_size: int,
+        base_url: str,
+    ) -> str:
+        if isinstance(data, list):
+            if not data:
+                return (
+                    "⚠️ A notification generated a very large output "
+                    f"({email_size / 1_000_000:.2f} MB)."
+                )
+            event = data[0]
+        else:
+            event = data
+        execution_url = (
+            f"{base_url}/executions/{event.id}" if event.id is not None else None
+        )
+
+        if isinstance(event.data, AgentRunData):
+            lines = [
+                f"⚠️ Your agent '{event.data.agent_name}' generated a very large output ({email_size / 1_000_000:.2f} MB).",
+                "",
+                f"Execution time: {event.data.execution_time}",
+                f"Credits used: {event.data.credits_used}",
+            ]
+            if execution_url is not None:
+                lines.append(f"View full results: {execution_url}")
+            return "\n".join(lines)
+
+        lines = [
+            f"⚠️ A notification generated a very large output ({email_size / 1_000_000:.2f} MB).",
+        ]
+        if execution_url is not None:
+            lines.extend(["", f"View full results: {execution_url}"])
+        return "\n".join(lines)
+
+    def send_template(
+        self,
+        *,
+        user_email: str,
+        subject: str,
+        template_name: str,
+        data: dict[str, Any] | None = None,
+        user_unsubscribe_link: str | None = None,
+    ) -> None:
+        """Send an email using a named Jinja2 template file.
+
+        Unlike ``send_templated`` (which resolves templates via
+        ``NotificationType``), this method accepts a template filename
+        directly.  Both delegate to the shared ``_format_template_email``
+        + ``_send_email`` pipeline.
+        """
+        if not self.postmark:
+            logger.warning("Postmark client not initialized, email not sent")
+            return
+
+        unsubscribe_link = self._get_unsubscribe_link(user_unsubscribe_link)
+
+        _, full_message = self._format_template_email(
+            subject_template="{{ subject }}",
+            content_template=self._read_template(f"templates/{template_name}"),
+            data={"subject": subject, **(data or {})},
+            unsubscribe_link=unsubscribe_link,
+        )
+
+        self._send_email(
+            user_email=user_email,
+            subject=subject,
+            body=full_message,
+            user_unsubscribe_link=unsubscribe_link,
+        )
+
    def send_templated(
        self,
        notification: NotificationType,
@@ -62,21 +161,18 @@ class EmailSender:
            return

        template = self._get_template(notification)
-
-        base_url = (
-            settings.config.frontend_base_url or settings.config.platform_base_url
-        )
+        base_url = get_frontend_base_url()
+        unsubscribe_link = self._get_unsubscribe_link(user_unsub_link)

        # Normalize data
        template_data = {"notifications": data} if isinstance(data, list) else data

        try:
-            subject, full_message = self.formatter.format_email(
-                base_template=template.base_template,
+            subject, full_message = self._format_template_email(
                subject_template=template.subject_template,
                content_template=template.body_template,
                data=template_data,
-                unsubscribe_link=f"{base_url}/profile/settings",
+                unsubscribe_link=unsubscribe_link,
            )
        except Exception as e:
            logger.error(f"Error formatting full message: {e}")
@@ -90,20 +186,17 @@ class EmailSender:
                "Sending summary email instead."
            )

-            # Create lightweight summary
-            summary_message = (
-                f"⚠️ Your agent '{getattr(data, 'agent_name', 'Unknown')}' "
-                f"generated a very large output ({email_size / 1_000_000:.2f} MB).\n\n"
-                f"Execution time: {getattr(data, 'execution_time', 'N/A')}\n"
-                f"Credits used: {getattr(data, 'credits_used', 'N/A')}\n"
-                f"View full results: {base_url}/executions/{getattr(data, 'id', 'N/A')}"
+            summary_message = self._build_large_output_summary(
+                data,
+                email_size=email_size,
+                base_url=base_url,
            )

            self._send_email(
                user_email=user_email,
                subject=f"{subject} (Output Too Large)",
                body=summary_message,
-                user_unsubscribe_link=user_unsub_link,
+                user_unsubscribe_link=unsubscribe_link,
            )
            return  # Skip sending full email

@@ -112,7 +205,7 @@ class EmailSender:
            user_email=user_email,
            subject=subject,
            body=full_message,
-            user_unsubscribe_link=user_unsub_link,
+            user_unsubscribe_link=unsubscribe_link,
        )

    def _get_template(self, notification: NotificationType):
@@ -123,17 +216,18 @@ class EmailSender:
        logger.debug(
            f"Template full path: {pathlib.Path(__file__).parent / template_path}"
        )
-        base_template_path = "templates/base.html.jinja2"
-        with open(pathlib.Path(__file__).parent / base_template_path, "r") as file:
-            base_template = file.read()
-        with open(pathlib.Path(__file__).parent / template_path, "r") as file:
-            template = file.read()
+        base_template = self._read_template("templates/base.html.jinja2")
+        template = self._read_template(template_path)
        return Template(
            subject_template=notification_type_override.subject,
            body_template=template,
            base_template=base_template,
        )

+    def _read_template(self, template_path: str) -> str:
+        with open(pathlib.Path(__file__).parent / template_path, "r") as file:
+            return file.read()
+
    def _send_email(
        self,
        user_email: str,
@@ -144,18 +238,33 @@ class EmailSender:
        if not self.postmark:
            logger.warning("Email tried to send without postmark configured")
            return
+        sender_email = settings.config.postmark_sender_email
+        if not sender_email:
+            logger.warning("postmark_sender_email not configured, email not sent")
+            return
+        unsubscribe_link = self._get_unsubscribe_link(user_unsubscribe_link)
        logger.debug(f"Sending email to {user_email} with subject {subject}")
        self.postmark.emails.send(
-            From=settings.config.postmark_sender_email,
+            From=sender_email,
            To=user_email,
            Subject=subject,
            HtmlBody=body,
-            Headers=(
-                {
-                    "List-Unsubscribe-Post": "List-Unsubscribe=One-Click",
-                    "List-Unsubscribe": f"<{user_unsubscribe_link}>",
-                }
-                if user_unsubscribe_link
-                else None
-            ),
+            Headers={
+                "List-Unsubscribe-Post": "List-Unsubscribe=One-Click",
+                "List-Unsubscribe": f"<{unsubscribe_link}>",
+            },
+        )
+
+    def send_html(
+        self,
+        user_email: str,
+        subject: str,
+        body: str,
+        user_unsubscribe_link: str | None = None,
+    ) -> None:
+        self._send_email(
+            user_email=user_email,
+            subject=subject,
+            body=body,
+            user_unsubscribe_link=user_unsubscribe_link,
        )
--- a/autogpt_platform/backend/backend/notifications/email_test.py
+++ b/autogpt_platform/backend/backend/notifications/email_test.py
@@ -0,0 +1,241 @@
+from types import SimpleNamespace
+from typing import Any, cast
+
+from backend.api.test_helpers import override_config
+from backend.copilot.autopilot_email import _markdown_to_email_html
+from backend.notifications.email import EmailSender, settings
+from backend.util.settings import AppEnvironment
+
+
+def test_markdown_to_email_html_renders_bold_and_italic() -> None:
+    html = _markdown_to_email_html("**bold** and *italic*")
+    assert "<strong>bold</strong>" in html
+    assert "<em>italic</em>" in html
+    assert 'style="' in html
+
+
+def test_markdown_to_email_html_renders_links() -> None:
+    html = _markdown_to_email_html("[click here](https://example.com)")
+    assert 'href="https://example.com"' in html
+    assert "click here" in html
+    assert "color: #7733F5" in html
+
+
+def test_markdown_to_email_html_renders_bullet_list() -> None:
+    html = _markdown_to_email_html("- item one\n- item two")
+    assert "<ul" in html
+    assert "<li" in html
+    assert "item one" in html
+    assert "item two" in html
+
+
+def test_markdown_to_email_html_handles_empty_input() -> None:
+    assert _markdown_to_email_html(None) == ""
+    assert _markdown_to_email_html("") == ""
+    assert _markdown_to_email_html("   ") == ""
+
+
+def test_send_template_renders_nightly_copilot_email(mocker) -> None:
+    sender = EmailSender()
+    sender.postmark = cast(Any, object())
+    send_email = mocker.patch.object(sender, "_send_email")
+
+    sender.send_template(
+        user_email="user@example.com",
+        subject="Autopilot update",
+        template_name="nightly_copilot.html.jinja2",
+        data={
+            "email_body_html": _markdown_to_email_html(
+                "I found something useful for you.\n\n"
+                "Open Copilot and I will walk you through it."
+            ),
+            "cta_url": "https://example.com/copilot?callbackToken=token-1",
+            "cta_label": "Open Copilot",
+        },
+    )
+
+    body = send_email.call_args.kwargs["body"]
+
+    assert "I found something useful for you." in body
+    assert "Open Copilot" in body
+    assert "Approval needed" not in body
+    assert send_email.call_args.kwargs["user_unsubscribe_link"].endswith(
+        "/profile/settings"
+    )
+
+
+def test_send_template_renders_nightly_copilot_approval_block(mocker) -> None:
+    sender = EmailSender()
+    sender.postmark = cast(Any, object())
+    send_email = mocker.patch.object(sender, "_send_email")
+
+    sender.send_template(
+        user_email="user@example.com",
+        subject="Autopilot update",
+        template_name="nightly_copilot.html.jinja2",
+        data={
+            "email_body_html": _markdown_to_email_html(
+                "I prepared a change worth reviewing."
+            ),
+            "approval_summary_html": _markdown_to_email_html(
+                "I drafted a follow-up because it matches your recent activity."
+            ),
+            "cta_url": "https://example.com/copilot?sessionId=session-1&showAutopilot=1",
+            "cta_label": "Review in Copilot",
+        },
+    )
+
+    body = send_email.call_args.kwargs["body"]
+
+    assert "Approval needed" in body
+    assert "If you want it to happen, please hit approve." in body
+    assert "Review in Copilot" in body
+
+
+def test_send_template_renders_nightly_copilot_callback_email(mocker) -> None:
+    sender = EmailSender()
+    sender.postmark = cast(Any, object())
+    send_email = mocker.patch.object(sender, "_send_email")
+
+    sender.send_template(
+        user_email="user@example.com",
+        subject="Autopilot update",
+        template_name="nightly_copilot_callback.html.jinja2",
+        data={
+            "email_body_html": _markdown_to_email_html(
+                "I prepared a follow-up based on your recent work."
+            ),
+            "cta_url": "https://example.com/copilot?callbackToken=token-1",
+            "cta_label": "Open Copilot",
+        },
+    )
+
+    body = send_email.call_args.kwargs["body"]
+
+    assert "Autopilot picked up where you left off" in body
+    assert "I prepared a follow-up based on your recent work." in body
+
+
+def test_send_template_renders_nightly_copilot_callback_approval_block(mocker) -> None:
+    sender = EmailSender()
+    sender.postmark = cast(Any, object())
+    send_email = mocker.patch.object(sender, "_send_email")
+
+    sender.send_template(
+        user_email="user@example.com",
+        subject="Autopilot update",
+        template_name="nightly_copilot_callback.html.jinja2",
+        data={
+            "email_body_html": _markdown_to_email_html(
+                "I prepared a follow-up based on your recent work."
+            ),
+            "approval_summary_html": _markdown_to_email_html(
+                "I want your approval before I apply the next step."
+            ),
+            "cta_url": "https://example.com/copilot?sessionId=session-1&showAutopilot=1",
+            "cta_label": "Review in Copilot",
+        },
+    )
+
+    body = send_email.call_args.kwargs["body"]
+
+    assert "Approval needed" in body
+    assert "I want your approval before I apply the next step." in body
+
+
+def test_send_template_renders_nightly_copilot_invite_cta_email(mocker) -> None:
+    sender = EmailSender()
+    sender.postmark = cast(Any, object())
+    send_email = mocker.patch.object(sender, "_send_email")
+
+    sender.send_template(
+        user_email="user@example.com",
+        subject="Autopilot update",
+        template_name="nightly_copilot_invite_cta.html.jinja2",
+        data={
+            "email_body_html": _markdown_to_email_html(
+                "I put together an example of how Autopilot could help you."
+            ),
+            "cta_url": "https://example.com/copilot?callbackToken=token-1",
+            "cta_label": "Try Copilot",
+        },
+    )
+
+    body = send_email.call_args.kwargs["body"]
+
+    assert "Your Autopilot beta access is waiting" in body
+    assert "I put together an example of how Autopilot could help you." in body
+    assert "Try Copilot" in body
+
+
+def test_send_template_renders_nightly_copilot_invite_cta_approval_block(
+    mocker,
+) -> None:
+    sender = EmailSender()
+    sender.postmark = cast(Any, object())
+    send_email = mocker.patch.object(sender, "_send_email")
+
+    sender.send_template(
+        user_email="user@example.com",
+        subject="Autopilot update",
+        template_name="nightly_copilot_invite_cta.html.jinja2",
+        data={
+            "email_body_html": _markdown_to_email_html(
+                "I put together an example of how Autopilot could help you."
+            ),
+            "approval_summary_html": _markdown_to_email_html(
+                "If this looks useful, approve the next step to try it."
+            ),
+            "cta_url": "https://example.com/copilot?sessionId=session-1&showAutopilot=1",
+            "cta_label": "Review in Copilot",
+        },
+    )
+
+    body = send_email.call_args.kwargs["body"]
+
+    assert "Approval needed" in body
+    assert "If this looks useful, approve the next step to try it." in body
+
+
+def test_send_template_still_sends_in_production(mocker) -> None:
+    sender = EmailSender()
+    sender.postmark = cast(Any, object())
+    send_email = mocker.patch.object(sender, "_send_email")
+
+    with override_config(settings, "app_env", AppEnvironment.PRODUCTION):
+        sender.send_template(
+            user_email="user@example.com",
+            subject="Autopilot update",
+            template_name="nightly_copilot.html.jinja2",
+            data={
+                "email_body_html": _markdown_to_email_html(
+                    "I found something useful for you."
+                ),
+                "cta_url": "https://example.com/copilot?callbackToken=token-1",
+                "cta_label": "Open Copilot",
+            },
+        )
+
+    send_email.assert_called_once()
+
+
+def test_send_html_uses_default_unsubscribe_link(mocker) -> None:
+    sender = EmailSender()
+    send = mocker.Mock()
+    sender.postmark = cast(Any, SimpleNamespace(emails=SimpleNamespace(send=send)))
+
+    mocker.patch(
+        "backend.notifications.email.get_frontend_base_url",
+        return_value="https://example.com",
+    )
+    with override_config(settings, "postmark_sender_email", "test@example.com"):
+        sender.send_html(
+            user_email="user@example.com",
+            subject="Autopilot update",
+            body="<p>Hello</p>",
+        )
+
+    headers = send.call_args.kwargs["Headers"]
+
+    assert headers["List-Unsubscribe-Post"] == "List-Unsubscribe=One-Click"
+    assert headers["List-Unsubscribe"] == "<https://example.com/profile/settings>"
--- a/autogpt_platform/backend/backend/notifications/notifications.py
+++ b/autogpt_platform/backend/backend/notifications/notifications.py
@@ -303,9 +303,9 @@ class NotificationManager(AppService):
                    )

                    if not oldest_message:
-                        # this should never happen
-                        logger.error(
-                            f"Batch for user {batch.user_id} and type {notification_type} has no oldest message whichshould never happen!!!!!!!!!!!!!!!!"
+                        logger.warning(
+                            f"Batch for user {batch.user_id} and type {notification_type} "
+                            f"has no oldest message — batch may have been cleared concurrently"
                        )
                        continue

@@ -318,7 +318,7 @@ class NotificationManager(AppService):
                        ).get_user_email_by_id(batch.user_id)

                        if not recipient_email:
-                            logger.error(
+                            logger.warning(
                                f"User email not found for user {batch.user_id}"
                            )
                            continue
@@ -344,7 +344,7 @@ class NotificationManager(AppService):
                        ).get_user_notification_batch(batch.user_id, notification_type)

                        if not batch_data or not batch_data.notifications:
-                            logger.error(
+                            logger.warning(
                                f"Batch data not found for user {batch.user_id}"
                            )
                            # Clear the batch
@@ -372,7 +372,7 @@ class NotificationManager(AppService):
                                    )
                                )
                            except Exception as e:
-                                logger.error(
+                                logger.warning(
                                    f"Error parsing notification event: {e=}, {db_event=}"
                                )
                                continue
@@ -415,7 +415,10 @@ class NotificationManager(AppService):
    async def discord_system_alert(
        self, content: str, channel: DiscordChannel = DiscordChannel.PLATFORM
    ):
-        await discord_send_alert(content, channel)
+        try:
+            await discord_send_alert(content, channel)
+        except Exception as e:
+            logger.warning(f"Failed to send Discord system alert: {e}")

    async def _queue_scheduled_notification(self, event: SummaryParamsEventModel):
        """Queue a scheduled notification - exposed method for other services to call"""
@@ -516,7 +519,7 @@ class NotificationManager(AppService):
                raise ValueError("Invalid event type or params")

        except Exception as e:
-            logger.error(f"Failed to gather summary data: {e}")
+            logger.warning(f"Failed to gather summary data: {e}")
            # Return sensible defaults in case of error
            if event_type == NotificationType.DAILY_SUMMARY and isinstance(
                params, DailySummaryParams
@@ -562,8 +565,9 @@ class NotificationManager(AppService):
            should_retry=False
        ).get_user_notification_oldest_message_in_batch(user_id, event_type)
        if not oldest_message:
-            logger.error(
-                f"Batch for user {user_id} and type {event_type} has no oldest message whichshould never happen!!!!!!!!!!!!!!!!"
+            logger.warning(
+                f"Batch for user {user_id} and type {event_type} "
+                f"has no oldest message — batch may have been cleared concurrently"
            )
            return False
        oldest_age = oldest_message.created_at
@@ -585,7 +589,7 @@ class NotificationManager(AppService):
                get_notif_data_type(event.type)
            ].model_validate_json(message)
        except Exception as e:
-            logger.error(f"Error parsing message due to non matching schema {e}")
+            logger.warning(f"Error parsing message due to non matching schema {e}")
            return None

    async def _process_admin_message(self, message: str) -> bool:
@@ -614,7 +618,7 @@ class NotificationManager(AppService):
                should_retry=False
            ).get_user_email_by_id(event.user_id)
            if not recipient_email:
-                logger.error(f"User email not found for user {event.user_id}")
+                logger.warning(f"User email not found for user {event.user_id}")
                return False

            should_send = await self._should_email_user_based_on_preference(
@@ -651,7 +655,7 @@ class NotificationManager(AppService):
                should_retry=False
            ).get_user_email_by_id(event.user_id)
            if not recipient_email:
-                logger.error(f"User email not found for user {event.user_id}")
+                logger.warning(f"User email not found for user {event.user_id}")
                return False

            should_send = await self._should_email_user_based_on_preference(
@@ -672,7 +676,7 @@ class NotificationManager(AppService):
                should_retry=False
            ).get_user_notification_batch(event.user_id, event.type)
            if not batch or not batch.notifications:
-                logger.error(f"Batch not found for user {event.user_id}")
+                logger.warning(f"Batch not found for user {event.user_id}")
                return False
            unsub_link = generate_unsubscribe_link(event.user_id)

@@ -745,7 +749,7 @@ class NotificationManager(AppService):
                                        f"Removed {len(chunk_ids)} sent notifications from batch"
                                    )
                                except Exception as e:
-                                    logger.error(
+                                    logger.warning(
                                        f"Failed to remove sent notifications: {e}"
                                    )
                                    # Continue anyway - better to risk duplicates than lose emails
@@ -770,7 +774,7 @@ class NotificationManager(AppService):
                        else:
                            # Message is too large even after size reduction
                            if attempt_size == 1:
-                                logger.error(
+                                logger.warning(
                                    f"Failed to send notification at index {i}: "
                                    f"Single notification exceeds email size limit "
                                    f"({len(test_message):,} chars > {MAX_EMAIL_SIZE:,} chars). "
@@ -789,7 +793,7 @@ class NotificationManager(AppService):
                                            f"Removed oversized notification {chunk_ids[0]} from batch permanently"
                                        )
                                    except Exception as e:
-                                        logger.error(
+                                        logger.warning(
                                            f"Failed to remove oversized notification: {e}"
                                        )

@@ -823,7 +827,7 @@ class NotificationManager(AppService):
                                        f"Set email verification to false for user {event.user_id}"
                                    )
                                except Exception as deactivation_error:
-                                    logger.error(
+                                    logger.warning(
                                        f"Failed to deactivate email for user {event.user_id}: "
                                        f"{deactivation_error}"
                                    )
@@ -835,7 +839,7 @@ class NotificationManager(AppService):
                                        f"Disabled all notification preferences for user {event.user_id}"
                                    )
                                except Exception as disable_error:
-                                    logger.error(
+                                    logger.warning(
                                        f"Failed to disable notification preferences: {disable_error}"
                                    )

@@ -848,7 +852,7 @@ class NotificationManager(AppService):
                                        f"Cleared ALL notification batches for user {event.user_id}"
                                    )
                                except Exception as remove_error:
-                                    logger.error(
+                                    logger.warning(
                                        f"Failed to clear batches for inactive recipient: {remove_error}"
                                    )

@@ -859,7 +863,7 @@ class NotificationManager(AppService):
                                "422" in error_message
                                or "unprocessable" in error_message
                            ):
-                                logger.error(
+                                logger.warning(
                                    f"Failed to send notification at index {i}: "
                                    f"Malformed notification data rejected by Postmark. "
                                    f"Error: {e}. Removing from batch permanently."
@@ -877,7 +881,7 @@ class NotificationManager(AppService):
                                            "Removed malformed notification from batch permanently"
                                        )
                                    except Exception as remove_error:
-                                        logger.error(
+                                        logger.warning(
                                            f"Failed to remove malformed notification: {remove_error}"
                                        )
                            # Check if it's a ValueError for size limit
@@ -885,14 +889,14 @@ class NotificationManager(AppService):
                                isinstance(e, ValueError)
                                and "too large" in error_message
                            ):
-                                logger.error(
+                                logger.warning(
                                    f"Failed to send notification at index {i}: "
                                    f"Notification size exceeds email limit. "
                                    f"Error: {e}. Skipping this notification."
                                )
                            # Other API errors
                            else:
-                                logger.error(
+                                logger.warning(
                                    f"Failed to send notification at index {i}: "
                                    f"Email API error ({error_type}): {e}. "
                                    f"Skipping this notification."
@@ -907,7 +911,9 @@ class NotificationManager(AppService):

                if not chunk_sent:
                    # Should not reach here due to single notification handling
-                    logger.error(f"Failed to send notifications starting at index {i}")
+                    logger.warning(
+                        f"Failed to send notifications starting at index {i}"
+                    )
                    failed_indices.append(i)
                    i += 1

@@ -946,7 +952,7 @@ class NotificationManager(AppService):
                should_retry=False
            ).get_user_email_by_id(event.user_id)
            if not recipient_email:
-                logger.error(f"User email not found for user {event.user_id}")
+                logger.warning(f"User email not found for user {event.user_id}")
                return False
            should_send = await self._should_email_user_based_on_preference(
                event.user_id, event.type
@@ -1007,7 +1013,10 @@ class NotificationManager(AppService):
                        # Let message.process() handle the rejection
                        pass
                    except Exception as e:
-                        logger.error(f"Error processing message in {queue_name}: {e}")
+                        logger.warning(
+                            f"Error processing message in {queue_name}: {e}",
+                            exc_info=True,
+                        )
                        # Let message.process() handle the rejection
                        raise
        except asyncio.CancelledError:
--- a/autogpt_platform/backend/backend/notifications/templates/base.html.jinja2
+++ b/autogpt_platform/backend/backend/notifications/templates/base.html.jinja2
@@ -29,7 +29,6 @@
    </noscript>
    <![endif]-->
  <style type="text/css">
-    /* RESET STYLES */
    html,
    body {
      margin: 0 !important;
@@ -85,7 +84,6 @@
      word-break: break-word;
    }

-    /* iOS BLUE LINKS */
    a[x-apple-data-detectors] {
      color: inherit !important;
      text-decoration: none !important;
@@ -95,13 +93,11 @@
      line-height: inherit !important;
    }

-    /* ANDROID CENTER FIX */
    div[style*="margin: 16px 0;"] {
      margin: 0 !important;
    }

-    /* MEDIA QUERIES */
-    @media all and (max-width:639px) {
+    @media all and (max-width: 639px) {
      .wrapper {
        width: 100% !important;
      }
@@ -113,8 +109,8 @@
      }

      .row {
-        padding-left: 20px !important;
-        padding-right: 20px !important;
+        padding-left: 24px !important;
+        padding-right: 24px !important;
      }

      .col-mobile {
@@ -136,11 +132,6 @@
        float: none !important;
      }

-      .mobile-left {
-        text-align: center !important;
-        float: left !important;
-      }
-
      .mobile-hide {
        display: none !important;
      }
@@ -155,9 +146,9 @@
        max-width: 100% !important;
      }

-      .ml-btn-container {
-        width: 100% !important;
-        max-width: 100% !important;
+      .card-inner {
+        padding-left: 24px !important;
+        padding-right: 24px !important;
      }
    }
  </style>
@@ -174,170 +165,139 @@
  <title>{{data.title}}</title>
 </head>

-<body style="margin: 0 !important; padding: 0 !important; background-color:#070629;">
+<body style="margin: 0 !important; padding: 0 !important; background-color: #070629;">
  <div class="document" role="article" aria-roledescription="email" aria-label lang dir="ltr"
-    style="background-color:#070629; line-height: 100%; font-size:medium; font-size:max(16px, 1rem);">
-    <!-- Main Content -->
+    style="background-color: #070629; line-height: 100%; font-size: medium; font-size: max(16px, 1rem);">
+
    <table width="100%" align="center" cellspacing="0" cellpadding="0" border="0">
      <tr>
-        <td class="background" bgcolor="#070629" align="center" valign="top" style="padding: 0 8px;">
-          <!-- Email Content -->
+        <td align="center" valign="top" style="padding: 48px 16px 40px;">
+
+          <!-- ============ CARD ============ -->
          <table class="container" align="center" width="640" cellpadding="0" cellspacing="0" border="0"
            style="max-width: 640px;">
+
+            <!-- Gradient Accent Strip -->
            <tr>
-              <td align="center">
-                <!-- Logo Section -->
-                <table class="container ml-4 ml-default-border" width="640" bgcolor="#E2ECFD" align="center" border="0"
-                  cellspacing="0" cellpadding="0" style="width: 640px; min-width: 640px;">
-                  <tr>
-                    <td class="ml-default-border container" height="40" style="line-height: 40px; min-width: 640px;">
-                    </td>
-                  </tr>
-                  <tr>
-                    <td>
-                      <table align="center" width="100%" border="0" cellspacing="0" cellpadding="0">
-                        <tr>
-                          <td class="row" align="center" style="padding: 0 50px;">
-                            <img
-                              src="https://storage.mlcdn.com/account_image/597379/8QJ8kOjXakVvfe1kJLY2wWCObU1mp5EiDLfBlbQa.png"
-                              border="0" alt="" width="120" class="logo"
-                              style="max-width: 120px; display: inline-block;">
-                          </td>
-                        </tr>
-                      </table>
-                    </td>
-                  </tr>
-                </table>
+              <td bgcolor="#7733F5"
+                style="background: linear-gradient(90deg, #7733F5 0%, #60A5FA 35%, #EC4899 65%, #7733F5 100%); height: 6px; border-radius: 16px 16px 0 0; font-size: 0; line-height: 0;">
+                &nbsp;</td>
+            </tr>

-                <!-- Main Content Section -->
-                <table class="container ml-6 ml-default-border" width="640" bgcolor="#E2ECFD" align="center" border="0"
-                  cellspacing="0" cellpadding="0" style="color: #070629; width: 640px; min-width: 640px;">
-                  <tr>
-                    <td class="row" style="padding: 0 50px;">
-                      {{data.message|safe}}
-                    </td>
-                  </tr>
-                </table>
+            <!-- Logo -->
+            <tr>
+              <td bgcolor="#FFFFFF" align="center" class="card-inner" style="padding: 32px 48px 24px;">
+                <img
+                  src="https://storage.mlcdn.com/account_image/597379/8QJ8kOjXakVvfe1kJLY2wWCObU1mp5EiDLfBlbQa.png"
+                  border="0" alt="AutoGPT" width="120"
+                  style="max-width: 120px; display: inline-block;">
+              </td>
+            </tr>

-                <!-- Signature Section -->
-                <table class="container ml-8 ml-default-border" width="640" bgcolor="#E2ECFD" align="center" border="0"
-                  cellspacing="0" cellpadding="0" style="color: #070629; width: 640px; min-width: 640px;">
+            <!-- Divider -->
+            <tr>
+              <td bgcolor="#FFFFFF" class="card-inner" style="padding: 0 48px;">
+                <table width="100%" cellpadding="0" cellspacing="0" border="0">
                  <tr>
-                    <td class="row mobile-center" align="left" style="padding: 0 50px;">
-                      <table class="ml-8 wrapper" border="0" cellspacing="0" cellpadding="0"
-                        style="color: #070629; text-align: left;">
-                        <tr>
-                          <td class="col center mobile-center" align>
-                            <p
-                              style="font-family: 'Poppins', sans-serif; color: #070629; font-size: 16px; line-height: 165%; margin-top: 0; margin-bottom: 0;">
-                              Thank you for being a part of the AutoGPT community! Join the conversation on our Discord <a href="https://discord.gg/autogpt" style="color: #4285F4; text-decoration: underline;">here</a> and share your thoughts with us anytime.
-                            </p>
-                          </td>
-                        </tr>
-                      </table>
-                    </td>
-                  </tr>
-                </table>
-
-                <!-- Footer Section -->
-                <table class="container ml-10 ml-default-border" width="640" bgcolor="#ffffff" align="center" border="0"
-                  cellspacing="0" cellpadding="0" style="width: 640px; min-width: 640px;">
-                  <tr>
-                    <td class="row" style="padding: 0 50px;">
-                      <table align="center" width="100%" border="0" cellspacing="0" cellpadding="0">
-                        <tr>
-                          <td>
-                            <!-- Footer Content -->
-                            <table align="center" width="100%" border="0" cellspacing="0" cellpadding="0">
-                              <tr>
-                                <td class="col" align="left" valign="middle" width="120">
-                                  <img
-                                    src="https://storage.mlcdn.com/account_image/597379/8QJ8kOjXakVvfe1kJLY2wWCObU1mp5EiDLfBlbQa.png"
-                                    border="0" alt="" width="120" class="logo"
-                                    style="max-width: 120px; display: inline-block;">
-                                </td>
-                                <td class="col" width="40" height="30" style="line-height: 30px;"></td>
-                                <td class="col mobile-left" align="right" valign="middle" width="250">
-                                  <table role="presentation" cellpadding="0" cellspacing="0" border="0">
-                                    <tr>
-                                      <td align="center" valign="middle" width="18" style="padding: 0 5px 0 0;">
-                                        <a href="https://x.com/auto_gpt" target="blank" style="text-decoration: none;">
-                                          <img
-                                            src="https://assets.mlcdn.com/ml/images/icons/default/rounded_corners/black/x.png"
-                                            width="18" alt="x">
-                                        </a>
-                                      </td>
-                                      <td align="center" valign="middle" width="18" style="padding: 0 5px;">
-                                        <a href="https://discord.gg/autogpt" target="blank"
-                                          style="text-decoration: none;">
-                                          <img
-                                            src="https://assets.mlcdn.com/ml/images/icons/default/rounded_corners/black/discord.png"
-                                            width="18" alt="discord">
-                                        </a>
-                                      </td>
-                                      <td align="center" valign="middle" width="18" style="padding: 0 0 0 5px;">
-                                        <a href="https://agpt.co/" target="blank" style="text-decoration: none;">
-                                          <img
-                                            src="https://assets.mlcdn.com/ml/images/icons/default/rounded_corners/black/website.png"
-                                            width="18" alt="website">
-                                        </a>
-                                      </td>
-                                    </tr>
-                                  </table>
-                                </td>
-                              </tr>
-                            </table>
-                          </td>
-                        </tr>
-                        <tr>
-                          <td align="center" style="text-align: left!important;">
-                            <h5
-                              style="font-family: 'Poppins', sans-serif; color: #070629; font-size: 15px; line-height: 125%; font-weight: bold; font-style: normal; text-decoration: none; margin-bottom: 6px;">
-                              AutoGPT
-                            </h5>
-                          </td>
-                        </tr>
-                        <tr>
-                          <td align="center" style="text-align: left!important;">
-                            <p
-                              style="font-family: 'Poppins', sans-serif; color: #070629; font-size: 14px; line-height: 150%; display: inline-block; margin-bottom: 0;">
-                              3rd Floor 1 Ashley Road, Cheshire, United Kingdom, WA14 2DT, Altrincham<br>United Kingdom
-                            </p>
-                          </td>
-                        </tr>
-                        <tr>
-                          <td height="8" style="line-height: 8px;"></td>
-                        </tr>
-                        <tr>
-                          <td align="left" style="text-align: left!important;">
-                            <p
-                              style="font-family: 'Poppins', sans-serif; color: #070629; font-size: 14px; line-height: 150%; display: inline-block; margin-bottom: 0;">
-                              You received this email because you signed up on our website.</p>
-                          </td>
-                        </tr>
-                        <tr>
-                          <td height="1" style="line-height: 12px;"></td>
-                        </tr>
-                        <tr>
-                          <td align="left">
-                            <p
-                              style="font-family: 'Poppins', sans-serif; color: #070629; font-size: 14px; line-height: 150%; display: inline-block; margin-bottom: 0;">
-                              <a href="{{data.unsubscribe_link}}"
-                                style="color: #4285F4; font-weight: normal; font-style: normal; text-decoration: underline;">Unsubscribe</a>
-                            </p>
-                          </td>
-                        </tr>
-                      </table>
-                    </td>
+                    <td style="border-top: 1px solid #E8E5F0; font-size: 0; line-height: 0; height: 1px;">&nbsp;</td>
                  </tr>
                </table>
              </td>
            </tr>
+
+            <!-- Content -->
+            <tr>
+              <td class="card-inner" bgcolor="#FFFFFF"
+                style="padding: 36px 48px 44px; color: #1F1F20; font-family: 'Poppins', sans-serif; border-radius: 0 0 16px 16px;">
+                {{data.message|safe}}
+              </td>
+            </tr>
+
          </table>
+          <!-- ============ END CARD ============ -->
+
+          <!-- Spacer -->
+          <table width="640" class="container" align="center" cellpadding="0" cellspacing="0" border="0"
+            style="max-width: 640px;">
+            <tr>
+              <td style="height: 40px; font-size: 0; line-height: 0;">&nbsp;</td>
+            </tr>
+          </table>
+
+          <!-- ============ FOOTER ============ -->
+          <table class="container" align="center" width="640" cellpadding="0" cellspacing="0" border="0"
+            style="max-width: 640px;">
+            <tr>
+              <td align="center" style="padding: 0 48px;">
+
+                <!-- Logo Text -->
+                <p
+                  style="font-family: 'Poppins', sans-serif; font-size: 22px; font-weight: 700; color: #FFFFFF; margin: 0 0 16px; letter-spacing: -0.5px;">
+                  AutoGPT</p>
+
+                <!-- Social Icons -->
+                <table role="presentation" cellpadding="0" cellspacing="0" border="0" align="center">
+                  <tr>
+                    <td align="center" valign="middle" style="padding: 0 8px;">
+                      <a href="https://x.com/auto_gpt" target="_blank" style="text-decoration: none;">
+                        <img
+                          src="https://assets.mlcdn.com/ml/images/icons/default/rounded_corners/white/x.png"
+                          width="22" alt="X" style="display: block; opacity: 0.6;">
+                      </a>
+                    </td>
+                    <td align="center" valign="middle" style="padding: 0 8px;">
+                      <a href="https://discord.gg/autogpt" target="_blank" style="text-decoration: none;">
+                        <img
+                          src="https://assets.mlcdn.com/ml/images/icons/default/rounded_corners/white/discord.png"
+                          width="22" alt="Discord" style="display: block; opacity: 0.6;">
+                      </a>
+                    </td>
+                    <td align="center" valign="middle" style="padding: 0 8px;">
+                      <a href="https://agpt.co/" target="_blank" style="text-decoration: none;">
+                        <img
+                          src="https://assets.mlcdn.com/ml/images/icons/default/rounded_corners/white/website.png"
+                          width="22" alt="Website" style="display: block; opacity: 0.6;">
+                      </a>
+                    </td>
+                  </tr>
+                </table>
+
+                <!-- Spacer -->
+                <table width="100%" cellpadding="0" cellspacing="0" border="0">
+                  <tr>
+                    <td style="height: 20px; font-size: 0; line-height: 0;">&nbsp;</td>
+                  </tr>
+                </table>
+
+                <!-- Divider -->
+                <table width="100%" cellpadding="0" cellspacing="0" border="0">
+                  <tr>
+                    <td
+                      style="border-top: 1px solid rgba(255,255,255,0.08); font-size: 0; line-height: 0; height: 1px;">
+                      &nbsp;</td>
+                  </tr>
+                </table>
+
+                <!-- Footer Text -->
+                <p
+                  style="font-family: 'Poppins', sans-serif; color: rgba(255,255,255,0.3); font-size: 12px; line-height: 165%; margin: 16px 0 0; text-align: center;">
+                  AutoGPT &middot; 3rd Floor 1 Ashley Road, Altrincham, WA14 2DT, United Kingdom
+                </p>
+                <p
+                  style="font-family: 'Poppins', sans-serif; color: rgba(255,255,255,0.3); font-size: 12px; line-height: 165%; margin: 4px 0 0; text-align: center;">
+                  You received this email because you signed up on our website.
+                  <a href="{{data.unsubscribe_link}}"
+                    style="color: rgba(255,255,255,0.45); text-decoration: underline;">Unsubscribe</a>
+                </p>
+
+              </td>
+            </tr>
+          </table>
+
        </td>
      </tr>
    </table>
  </div>
 </body>

-</html>
+</html>
--- a/autogpt_platform/backend/backend/notifications/templates/nightly_copilot.html.jinja2
+++ b/autogpt_platform/backend/backend/notifications/templates/nightly_copilot.html.jinja2
@@ -0,0 +1,41 @@
+<div style="font-family: 'Poppins', sans-serif; color: #1F1F20;">
+  {{ email_body_html|safe }}
+
+  {% if approval_summary_html %}
+  <!-- Approval Callout -->
+  <table width="100%" cellpadding="0" cellspacing="0" border="0" style="margin-top: 28px; margin-bottom: 8px;">
+    <tr>
+      <td bgcolor="#FFF3E6"
+        style="background-color: #FFF3E6; border-left: 4px solid #FE8700; border-radius: 12px; padding: 20px 24px;">
+        <table width="100%" cellpadding="0" cellspacing="0" border="0">
+          <tr>
+            <td style="padding: 0 0 12px 0;">
+              <span
+                style="display: inline-block; background-color: #FE8700; color: #FFFFFF; font-size: 11px; font-weight: 600; letter-spacing: 0.05em; text-transform: uppercase; padding: 4px 10px; border-radius: 999px;">
+                Approval needed
+              </span>
+            </td>
+          </tr>
+        </table>
+        {{ approval_summary_html|safe }}
+        <p style="font-size: 14px; line-height: 165%; margin-top: 8px; margin-bottom: 0; color: #505057;">
+          I thought this was a good idea. If you want it to happen, please hit approve.
+        </p>
+      </td>
+    </tr>
+  </table>
+  {% endif %}
+
+  <!-- CTA Button -->
+  <table cellpadding="0" cellspacing="0" border="0" style="margin-top: 32px;">
+    <tr>
+      <td align="center" bgcolor="#7733F5"
+        style="background-color: #7733F5; border-radius: 12px;">
+        <a href="{{ cta_url }}"
+          style="display: inline-block; padding: 16px 36px; background-color: #7733F5; color: #FFFFFF; text-decoration: none; font-family: 'Poppins', sans-serif; font-weight: 600; font-size: 16px; border-radius: 12px; line-height: 1;">
+          {{ cta_label }}
+        </a>
+      </td>
+    </tr>
+  </table>
+</div>
--- a/autogpt_platform/backend/backend/notifications/templates/nightly_copilot_callback.html.jinja2
+++ b/autogpt_platform/backend/backend/notifications/templates/nightly_copilot_callback.html.jinja2
@@ -0,0 +1,58 @@
+<div style="font-family: 'Poppins', sans-serif; color: #1F1F20;">
+  <!-- Header -->
+  <h2
+    style="font-size: 24px; line-height: 130%; font-weight: 700; margin-top: 0; margin-bottom: 8px; color: #1F1F20; letter-spacing: -0.5px;">
+    Autopilot picked up where you left off
+  </h2>
+  <p style="font-size: 14px; line-height: 165%; margin-top: 0; margin-bottom: 28px; color: #505057;">
+    We used your recent Copilot activity to prepare a concrete follow-up for you.
+  </p>
+
+  <!-- Divider -->
+  <table width="100%" cellpadding="0" cellspacing="0" border="0" style="margin-bottom: 28px;">
+    <tr>
+      <td style="border-top: 1px solid #E8E5F0; font-size: 0; line-height: 0; height: 1px;">&nbsp;</td>
+    </tr>
+  </table>
+
+  <!-- Body -->
+  {{ email_body_html|safe }}
+
+  {% if approval_summary_html %}
+  <!-- Approval Callout -->
+  <table width="100%" cellpadding="0" cellspacing="0" border="0" style="margin-top: 28px; margin-bottom: 8px;">
+    <tr>
+      <td bgcolor="#FFF3E6"
+        style="background-color: #FFF3E6; border-left: 4px solid #FE8700; border-radius: 12px; padding: 20px 24px;">
+        <table width="100%" cellpadding="0" cellspacing="0" border="0">
+          <tr>
+            <td style="padding: 0 0 12px 0;">
+              <span
+                style="display: inline-block; background-color: #FE8700; color: #FFFFFF; font-size: 11px; font-weight: 600; letter-spacing: 0.05em; text-transform: uppercase; padding: 4px 10px; border-radius: 999px;">
+                Approval needed
+              </span>
+            </td>
+          </tr>
+        </table>
+        {{ approval_summary_html|safe }}
+        <p style="font-size: 14px; line-height: 165%; margin-top: 8px; margin-bottom: 0; color: #505057;">
+          I thought this was a good idea. If you want it to happen, please hit approve.
+        </p>
+      </td>
+    </tr>
+  </table>
+  {% endif %}
+
+  <!-- CTA Button -->
+  <table cellpadding="0" cellspacing="0" border="0" style="margin-top: 32px;">
+    <tr>
+      <td align="center" bgcolor="#7733F5"
+        style="background-color: #7733F5; border-radius: 12px;">
+        <a href="{{ cta_url }}"
+          style="display: inline-block; padding: 16px 36px; background-color: #7733F5; color: #FFFFFF; text-decoration: none; font-family: 'Poppins', sans-serif; font-weight: 600; font-size: 16px; border-radius: 12px; line-height: 1;">
+          {{ cta_label }}
+        </a>
+      </td>
+    </tr>
+  </table>
+</div>
--- a/autogpt_platform/backend/backend/notifications/templates/nightly_copilot_invite_cta.html.jinja2
+++ b/autogpt_platform/backend/backend/notifications/templates/nightly_copilot_invite_cta.html.jinja2
@@ -0,0 +1,64 @@
+<div style="font-family: 'Poppins', sans-serif; color: #1F1F20;">
+  <!-- Header -->
+  <h2
+    style="font-size: 24px; line-height: 130%; font-weight: 700; margin-top: 0; margin-bottom: 8px; color: #1F1F20; letter-spacing: -0.5px;">
+    Your Autopilot beta access is waiting
+  </h2>
+  <p style="font-size: 14px; line-height: 165%; margin-top: 0; margin-bottom: 28px; color: #505057;">
+    You applied to try Autopilot. Here is a tailored example of how it can help once you jump back in.
+  </p>
+
+  <!-- Highlight Card -->
+  <table width="100%" cellpadding="0" cellspacing="0" border="0" style="margin-bottom: 28px;">
+    <tr>
+      <td bgcolor="#5424AE"
+        style="background-color: #5424AE; border-radius: 12px; padding: 20px 24px;">
+        <p
+          style="font-size: 14px; line-height: 165%; margin-top: 0; margin-bottom: 0; color: #FFFFFF; font-weight: 500;">
+          Autopilot works in the background to handle tasks, surface insights, and take action on your behalf.
+        </p>
+      </td>
+    </tr>
+  </table>
+
+  <!-- Body -->
+  {{ email_body_html|safe }}
+
+  {% if approval_summary_html %}
+  <!-- Approval Callout -->
+  <table width="100%" cellpadding="0" cellspacing="0" border="0" style="margin-top: 28px; margin-bottom: 8px;">
+    <tr>
+      <td bgcolor="#FFF3E6"
+        style="background-color: #FFF3E6; border-left: 4px solid #FE8700; border-radius: 12px; padding: 20px 24px;">
+        <table width="100%" cellpadding="0" cellspacing="0" border="0">
+          <tr>
+            <td style="padding: 0 0 12px 0;">
+              <span
+                style="display: inline-block; background-color: #FE8700; color: #FFFFFF; font-size: 11px; font-weight: 600; letter-spacing: 0.05em; text-transform: uppercase; padding: 4px 10px; border-radius: 999px;">
+                Approval needed
+              </span>
+            </td>
+          </tr>
+        </table>
+        {{ approval_summary_html|safe }}
+        <p style="font-size: 14px; line-height: 165%; margin-top: 8px; margin-bottom: 0; color: #505057;">
+          I thought this was a good idea. If you want it to happen, please hit approve.
+        </p>
+      </td>
+    </tr>
+  </table>
+  {% endif %}
+
+  <!-- CTA Button -->
+  <table cellpadding="0" cellspacing="0" border="0" style="margin-top: 32px;">
+    <tr>
+      <td align="center" bgcolor="#7733F5"
+        style="background-color: #7733F5; border-radius: 12px;">
+        <a href="{{ cta_url }}"
+          style="display: inline-block; padding: 16px 36px; background-color: #7733F5; color: #FFFFFF; text-decoration: none; font-family: 'Poppins', sans-serif; font-weight: 600; font-size: 16px; border-radius: 12px; line-height: 1;">
+          {{ cta_label }}
+        </a>
+      </td>
+    </tr>
+  </table>
+</div>
--- a/autogpt_platform/backend/backend/notifications/test_notifications.py
+++ b/autogpt_platform/backend/backend/notifications/test_notifications.py
@@ -256,9 +256,9 @@ class TestNotificationErrorHandling:
            assert 2 not in successful_indices  # Index 2 failed

            # Verify 422 error was logged
-            error_calls = [call[0][0] for call in mock_logger.error.call_args_list]
+            warning_calls = [call[0][0] for call in mock_logger.warning.call_args_list]
            assert any(
-                "422" in call or "malformed" in call.lower() for call in error_calls
+                "422" in call or "malformed" in call.lower() for call in warning_calls
            )

            # Verify all notifications were removed (4 successful + 1 malformed)
@@ -371,10 +371,10 @@ class TestNotificationErrorHandling:
            assert 3 not in successful_indices  # Index 3 was not sent

            # Verify oversized error was logged
-            error_calls = [call[0][0] for call in mock_logger.error.call_args_list]
+            warning_calls = [call[0][0] for call in mock_logger.warning.call_args_list]
            assert any(
                "exceeds email size limit" in call or "oversized" in call.lower()
-                for call in error_calls
+                for call in warning_calls
            )

    @pytest.mark.asyncio
@@ -478,10 +478,10 @@ class TestNotificationErrorHandling:
            assert 1 in failed_indices  # Index 1 failed

            # Verify generic error was logged
-            error_calls = [call[0][0] for call in mock_logger.error.call_args_list]
+            warning_calls = [call[0][0] for call in mock_logger.warning.call_args_list]
            assert any(
                "api error" in call.lower() or "skipping" in call.lower()
-                for call in error_calls
+                for call in warning_calls
            )

            # Only successful ones should be removed from batch (failed one stays for retry)
--- a/autogpt_platform/backend/backend/util/cloud_storage.py
+++ b/autogpt_platform/backend/backend/util/cloud_storage.py
@@ -613,5 +613,5 @@ async def cleanup_expired_files_async() -> int:
            )
            return deleted_count
        except Exception as e:
-            logger.error(f"[CloudStorage] Error during cloud storage cleanup: {e}")
+            logger.warning(f"[CloudStorage] Error during cloud storage cleanup: {e}")
            return 0
--- a/autogpt_platform/backend/backend/util/feature_flag.py
+++ b/autogpt_platform/backend/backend/util/feature_flag.py
@@ -39,6 +39,7 @@ class Flag(str, Enum):
    ENABLE_PLATFORM_PAYMENT = "enable-platform-payment"
    CHAT = "chat"
    COPILOT_SDK = "copilot-sdk"
+    NIGHTLY_COPILOT = "nightly-copilot"


 def is_configured() -> bool:
--- a/autogpt_platform/backend/backend/util/metrics.py
+++ b/autogpt_platform/backend/backend/util/metrics.py
@@ -10,7 +10,7 @@ from sentry_sdk.integrations.launchdarkly import LaunchDarklyIntegration
 from sentry_sdk.integrations.logging import LoggingIntegration

 from backend.util import feature_flag
-from backend.util.settings import Settings
+from backend.util.settings import BehaveAs, Settings

 settings = Settings()
 logger = logging.getLogger(__name__)
@@ -21,6 +21,95 @@ class DiscordChannel(str, Enum):
    PRODUCT = "product"  # For product alerts (low balance, zero balance, etc.)


+def _before_send(event, hint):
+    """Filter out expected/transient errors from Sentry to reduce noise."""
+    if "exc_info" in hint:
+        exc_type, exc_value, _ = hint["exc_info"]
+        exc_msg = str(exc_value).lower() if exc_value else ""
+
+        # AMQP/RabbitMQ transient connection errors — expected during deploys
+        amqp_keywords = [
+            "amqpconnection",
+            "amqpconnector",
+            "connection_forced",
+            "channelinvalidstateerror",
+            "no active transport",
+        ]
+        if any(kw in exc_msg for kw in amqp_keywords):
+            return None
+
+        # "connection refused" only for AMQP-related exceptions (not other services)
+        if "connection refused" in exc_msg:
+            exc_module = getattr(exc_type, "__module__", "") or ""
+            exc_name = getattr(exc_type, "__name__", "") or ""
+            amqp_indicators = ["aio_pika", "aiormq", "amqp", "pika", "rabbitmq"]
+            if any(
+                ind in exc_module.lower() or ind in exc_name.lower()
+                for ind in amqp_indicators
+            ) or any(kw in exc_msg for kw in ["amqp", "pika", "rabbitmq"]):
+                return None
+
+        # User-caused credential/auth errors — not platform bugs
+        user_auth_keywords = [
+            "incorrect api key",
+            "invalid x-api-key",
+            "missing authentication header",
+            "invalid api token",
+            "authentication_error",
+        ]
+        if any(kw in exc_msg for kw in user_auth_keywords):
+            return None
+
+        # Expected business logic — insufficient balance
+        if "insufficient balance" in exc_msg or "no credits left" in exc_msg:
+            return None
+
+        # Expected security check — blocked IP access
+        if "access to blocked or private ip" in exc_msg:
+            return None
+
+        # Discord bot token misconfiguration — not a platform error
+        if "improper token has been passed" in exc_msg or (
+            exc_type and exc_type.__name__ == "Forbidden" and "50001" in exc_msg
+        ):
+            return None
+
+        # Google metadata DNS errors — expected in non-GCP environments
+        if (
+            "metadata.google.internal" in exc_msg
+            and settings.config.behave_as != BehaveAs.CLOUD
+        ):
+            return None
+
+        # Inactive email recipients — expected for bounced addresses
+        if "marked as inactive" in exc_msg or "inactive addresses" in exc_msg:
+            return None
+
+    # Also filter log-based events for known noisy messages.
+    # Sentry's LoggingIntegration stores log messages under "logentry", not "message".
+    logentry = event.get("logentry") or {}
+    log_msg = (
+        logentry.get("formatted") or logentry.get("message") or event.get("message")
+    )
+    if event.get("logger") and log_msg:
+        msg = log_msg.lower()
+        noisy_patterns = [
+            "amqpconnection",
+            "connection_forced",
+            "unclosed client session",
+            "unclosed connector",
+        ]
+        if any(p in msg for p in noisy_patterns):
+            return None
+        # "connection refused" in logs only when AMQP-related context is present
+        if "connection refused" in msg and any(
+            ind in msg for ind in ("amqp", "pika", "rabbitmq", "aio_pika", "aiormq")
+        ):
+            return None
+
+    return event
+
+
 def sentry_init():
    sentry_dsn = settings.secrets.sentry_dsn
    integrations = []
@@ -35,6 +124,7 @@ def sentry_init():
        profiles_sample_rate=1.0,
        environment=f"app:{settings.config.app_env.value}-behave:{settings.config.behave_as.value}",
        _experiments={"enable_logs": True},
+        before_send=_before_send,
        integrations=[
            AsyncioIntegration(),
            LoggingIntegration(sentry_logs_level=logging.INFO),
--- a/autogpt_platform/backend/backend/util/retry.py
+++ b/autogpt_platform/backend/backend/util/retry.py
@@ -64,7 +64,7 @@ def send_rate_limited_discord_alert(
        return True

    except Exception as alert_error:
-        logger.error(f"Failed to send Discord alert: {alert_error}")
+        logger.warning(f"Failed to send Discord alert: {alert_error}")
        return False


@@ -182,7 +182,8 @@ def conn_retry(
        func_name = getattr(retry_state.fn, "__name__", "unknown")

        if retry_state.outcome.failed and retry_state.next_action is None:
-            logger.error(f"{prefix} {action_name} failed after retries: {exception}")
+            # Final failure is logged by sync_wrapper/async_wrapper — skip here to avoid duplicates
+            pass
        else:
            if attempt_number == EXCESSIVE_RETRY_THRESHOLD:
                if send_rate_limited_discord_alert(
@@ -225,7 +226,7 @@ def conn_retry(
                logger.info(f"{prefix} {action_name} completed successfully.")
                return result
            except Exception as e:
-                logger.error(f"{prefix} {action_name} failed after retries: {e}")
+                logger.warning(f"{prefix} {action_name} failed after retries: {e}")
                raise

        @wraps(func)
@@ -237,7 +238,7 @@ def conn_retry(
                logger.info(f"{prefix} {action_name} completed successfully.")
                return result
            except Exception as e:
-                logger.error(f"{prefix} {action_name} failed after retries: {e}")
+                logger.warning(f"{prefix} {action_name} failed after retries: {e}")
                raise

        return async_wrapper if is_coroutine else sync_wrapper
--- a/autogpt_platform/backend/backend/util/settings.py
+++ b/autogpt_platform/backend/backend/util/settings.py
@@ -1,6 +1,7 @@
 import json
 import os
 import re
+from datetime import date
 from enum import Enum
 from typing import Any, Dict, Generic, List, Set, Tuple, Type, TypeVar

@@ -89,6 +90,10 @@ class Config(UpdateTrackingModel["Config"], BaseSettings):
        le=500,
        description="Thread pool size for FastAPI sync operations. All sync endpoints and dependencies automatically use this pool. Higher values support more concurrent sync operations but use more memory.",
    )
+    tally_extraction_llm_model: str = Field(
+        default="openai/gpt-4o-mini",
+        description="OpenRouter model ID used for extracting business understanding from Tally form data",
+    )
    ollama_host: str = Field(
        default="localhost:11434",
        description="Default Ollama host; exempted from SSRF checks.",
@@ -117,6 +122,26 @@ class Config(UpdateTrackingModel["Config"], BaseSettings):
        default=True,
        description="If authentication is enabled or not",
    )
+    enable_invite_gate: bool = Field(
+        default=True,
+        description="If the invite-only signup gate is enforced",
+    )
+    nightly_copilot_callback_start_date: date = Field(
+        default=date(2026, 2, 8),
+        description="Users with sessions since this date are eligible for the one-off autopilot callback cohort.",
+    )
+    nightly_copilot_invite_cta_start_date: date = Field(
+        default=date(2026, 3, 13),
+        description="Invite CTA cohort does not run before this date.",
+    )
+    nightly_copilot_invite_cta_delay_hours: int = Field(
+        default=48,
+        description="Delay after invite creation before the invite CTA can run.",
+    )
+    nightly_copilot_callback_token_ttl_hours: int = Field(
+        default=24 * 14,
+        description="TTL for nightly copilot callback tokens.",
+    )
    enable_credit: bool = Field(
        default=False,
        description="If user credit system is enabled or not",
--- a/Show More
+++ b/Show More