AutoGPT/autogpt_platform/backend at 1f4105e8f9554d7bff016ce8216041bf4c378bf3 - AutoGPT - AtHeartEngineering

github/AutoGPT

mirror of https://github.com/Significant-Gravitas/AutoGPT.git synced 2026-02-10 06:45:28 -05:00

Bently caf9ff34e6 fix(backend): Handle stale RabbitMQ channels on connection drop (#11929)

### Changes 🏗️

Fixes
[**AUTOGPT-SERVER-1TN**](https://autoagpt.sentry.io/issues/?query=AUTOGPT-SERVER-1TN)
(~39K events since Feb 2025) and related connection issues
**6JC/6JD/6JE/6JF** (~6K combined).

#### Problem

When the RabbitMQ TCP connection drops (network blip, server restart,
etc.):

1. `connect_robust` (aio_pika) automatically reconnects the underlying
AMQP connection
2. But `AsyncRabbitMQ._channel` still references the **old dead
channel**
3. `is_ready` checks `not self._channel.is_closed` — but the channel
object doesn't know the transport is gone
4. `publish_message` tries to use the stale channel →
`ChannelInvalidStateError: No active transport in channel`
5. `@func_retry` retries 5 times, but each retry hits the same stale
channel (it passes `is_ready`)

This means every connection drop generates errors until the process is
restarted.
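
The failure sequence above can be reproduced without a broker. The following is a minimal sketch, not the project's code: `FakeConnection`, `FakeChannel`, and `NaiveClient` are hypothetical stand-ins for aio_pika's robust connection, its channel, and `AsyncRabbitMQ`. The local `ChannelInvalidStateError` class only mimics the error named above, and the plain loop stands in for `@func_retry`. It is synchronous for brevity, while the real client is async.

```python
class ChannelInvalidStateError(RuntimeError):
    """Local stand-in for the error named in the PR description."""


class FakeChannel:
    """Hypothetical channel that remembers which transport it was opened on."""

    def __init__(self, transport_id: int) -> None:
        self.transport_id = transport_id
        # Never flipped when the transport dies: this is the root of the bug.
        self.is_closed = False


class FakeConnection:
    """Hypothetical robust connection that silently swaps its transport."""

    def __init__(self) -> None:
        self.transport_id = 1

    def drop_and_reconnect(self) -> None:
        # Models the automatic reconnect: the connection gets a new transport,
        # but channels created earlier are not told about it.
        self.transport_id += 1

    def channel(self) -> FakeChannel:
        return FakeChannel(self.transport_id)

    def publish(self, channel: FakeChannel, body: str) -> str:
        if channel.transport_id != self.transport_id:
            raise ChannelInvalidStateError("No active transport in channel")
        return body


class NaiveClient:
    """Pre-fix behaviour: trusts `is_closed` and never replaces the channel."""

    def __init__(self, connection: FakeConnection) -> None:
        self._connection = connection
        self._channel = connection.channel()

    @property
    def is_ready(self) -> bool:
        return not self._channel.is_closed

    def publish_message(self, body: str, retries: int = 5) -> str:
        for _ in range(retries):  # stand-in for @func_retry
            if not self.is_ready:
                self._channel = self._connection.channel()
            try:
                return self._connection.publish(self._channel, body)
            except ChannelInvalidStateError:
                continue  # every retry reuses the same stale channel
        raise ChannelInvalidStateError(f"still failing after {retries} attempts")
```

In this sketch, once `drop_and_reconnect()` has been called, `is_ready` stays true and every later `publish_message` call exhausts its retries, which mirrors the "errors until the process is restarted" behaviour.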

#### Fix

**New `_ensure_channel()` helper** that resets stale channels before
reconnecting, so `connect()` creates a fresh one instead of
short-circuiting on `is_connected`.

**Explicit `ChannelInvalidStateError` handling in `publish_message`:**
1. First attempt uses `_ensure_channel()` (handles normal staleness)
2. If publish throws `ChannelInvalidStateError`, does a full reconnect
(resets both `_channel` and `_connection`) and retries once
3. `@func_retry` provides additional retry resilience on top

**Simplified `get_channel()`** to use the same resilient helper.
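
The recovery path described above could look roughly like the sketch below. It is an illustration under stated assumptions, not the merged implementation: `FakeBroker`, `FakeConnection`, `FakeChannel`, and `ResilientClient` are hypothetical names, the `ChannelInvalidStateError` class is a local stand-in, the code is synchronous where the real client is async, and the outer `@func_retry` layer is omitted.

```python
class ChannelInvalidStateError(RuntimeError):
    """Local stand-in for the error named in the PR description."""


class FakeBroker:
    """Hypothetical broker; `epoch` changes whenever the transport is replaced."""

    def __init__(self) -> None:
        self.epoch = 0
        self.delivered: list[str] = []

    def drop_connection(self) -> None:
        self.epoch += 1


class FakeChannel:
    def __init__(self, broker: FakeBroker) -> None:
        self._broker = broker
        self._epoch = broker.epoch
        self.is_closed = False  # stays False even after the transport is gone

    def publish(self, body: str) -> None:
        if self._epoch != self._broker.epoch:
            raise ChannelInvalidStateError("No active transport in channel")
        self._broker.delivered.append(body)


class FakeConnection:
    def __init__(self, broker: FakeBroker) -> None:
        self._broker = broker
        self.is_closed = False

    def channel(self) -> FakeChannel:
        return FakeChannel(self._broker)


class ResilientClient:
    def __init__(self, broker: FakeBroker) -> None:
        self._broker = broker
        self._connection: FakeConnection | None = None
        self._channel: FakeChannel | None = None
        self.reconnects = 0  # counter added for illustration only

    def connect(self) -> None:
        # Only creates what is missing, so a stale channel must be reset first.
        if self._connection is None or self._connection.is_closed:
            self._connection = FakeConnection(self._broker)
        if self._channel is None:
            self._channel = self._connection.channel()

    def _ensure_channel(self) -> FakeChannel:
        # Reset a missing or closed channel so connect() builds a fresh one.
        if self._channel is None or self._channel.is_closed:
            self._channel = None
            self.connect()
        assert self._channel is not None
        return self._channel

    def get_channel(self) -> FakeChannel:
        return self._ensure_channel()

    def publish_message(self, body: str) -> None:
        try:
            self._ensure_channel().publish(body)
        except ChannelInvalidStateError:
            # Full reconnect: drop both channel and connection, retry once.
            self._channel = None
            self._connection = None
            self.reconnects += 1
            self._ensure_channel().publish(body)
```

With these stand-ins, a publish after `broker.drop_connection()` hits the stale channel once, triggers a single full reconnect, and then succeeds. Later publishes reuse the fresh channel without reconnecting again.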

**1 file changed, 62 insertions, 24 deletions.**

#### Impact
- Eliminates ~39K `ChannelInvalidStateError` Sentry events
- RabbitMQ operations self-heal after connection drops without process
restart
- Related transport EOF errors (6JC/6JD/6JE/6JF) should also decrease

2026-02-09 10:24:08 +00:00

feat(backend): add default store agents for seeding test databases (#11552)

2025-12-05 16:08:37 +01:00

fix(backend): Handle stale RabbitMQ channels on connection drop (#11929)

2026-02-09 10:24:08 +00:00

fix(backend): implement retry mechanism for SmartDecisionMaker tool call validation (#11015)

2025-09-30 16:18:05 +00:00

chore(llm): remove deprecated Claude 3.7 Sonnet model with migration and defensive handling (#11841)

2026-01-30 08:40:55 +00:00

refactor(docs): restructure platform docs for GitBook and remove MkDo… (#11825)

2026-01-23 06:18:16 +00:00

fix(backend): Reduce GET /api/graphs expense + latency (#11986)

2026-02-06 19:13:21 +00:00

feat(platform): Add Redis-based SSE reconnection for long-running CoPilot operations (#11877)

2026-02-03 16:52:06 +01:00

| File | Last commit | Date |
| --- | --- | --- |
| .dockerignore | feat(platform/docker): add frontend service to docker-compose with env config improvements (#10615) | 2025-08-14 03:28:18 +00:00 |
| .env.default | feat(blocks): Add video editing blocks (#11796) | 2026-02-05 22:22:33 +00:00 |
| .gitignore | feat(frontend): Add progress indicator during agent generation [SECRT-1883] (#11974) | 2026-02-05 15:37:51 +01:00 |
| CLAUDE.md | refactor(claude): Split autogpt_platform/CLAUDE.md into project-specific files (#11788) | 2026-01-29 17:33:02 +00:00 |
| docker-compose.test.yaml | perf(backend/db): Optimize StoreAgent and Creator views with database indexes and materialized views (#10084) | 2025-07-10 14:57:55 +00:00 |
| Dockerfile | feat(blocks): Add video editing blocks (#11796) | 2026-02-05 22:22:33 +00:00 |
| gen_prisma_types_stub.py | feat(backend): add prisma types stub generator for pyright compatibility (#11736) | 2026-01-09 16:31:10 +01:00 |
| linter.py | feat(backend): add prisma types stub generator for pyright compatibility (#11736) | 2026-01-09 16:31:10 +01:00 |
| poetry.lock | chore(backend/deps-dev): bump the development-dependencies group across 1 directory with 3 updates (#12005) | 2026-02-09 04:26:58 +00:00 |
| pyproject.toml | chore(backend/deps-dev): bump the development-dependencies group across 1 directory with 3 updates (#12005) | 2026-02-09 04:26:58 +00:00 |
| README.advanced.md | docs(online/offline): #8887 Move to a single source of truth for docs — Remove duplicate info from backend readme[.advanced].md (#9580) | 2025-03-07 21:01:32 +00:00 |
| README.md | docs(online/offline): #8887 Move to a single source of truth for docs — Remove duplicate info from backend readme[.advanced].md (#9580) | 2025-03-07 21:01:32 +00:00 |
| run_tests.py | perf(backend/db): Optimize StoreAgent and Creator views with database indexes and materialized views (#10084) | 2025-07-10 14:57:55 +00:00 |
| schema.prisma | feat(platform): add User Workspace for persistent CoPilot file storage (#11867) | 2026-01-29 05:49:47 +00:00 |
| test_requeue_integration.py | feat(platform): implement graph-level Safe Mode toggle for HITL blocks (#11455) | 2025-12-02 09:55:55 +00:00 |
| TESTING.md | refactor(claude): Split autogpt_platform/CLAUDE.md into project-specific files (#11788) | 2026-01-29 17:33:02 +00:00 |

README.md

Getting Started (Released)