The ORM Bug That Silently Killed Your Database Commits: SQLAlchemy's Identity Map in Multi-Agent Pipelines

There’s a class of bug in multi-agent systems that has nothing to do with LLMs, nothing to do with prompts, and everything to do with database sessions. It’s the kind of bug that returns success while silently doing nothing. You won’t find it in tests unless your tests are written exactly right. And when it hit the ACO system, it hid inside a layer of SQLAlchemy session management that nobody had documented.

The Symptom

An agent is supposed to update a story’s status and persist it. The code calls session.commit(). The code returns without raising an exception. But when you check the database, the status hasn’t changed.

def process_story(self, story, session):
    story.status = "PM_REVIEW"
    session.commit()  # returns successfully
    return story

This code looks correct. It ran without error. But the status didn’t change.

The Root Cause

SQLAlchemy uses an identity map — a session-level cache that tracks objects by their primary key. When you load a Story object with ID 42 in Session A, Session A remembers that object. If you load the same ID 42 in Session B, Session B has its own object, separate from Session A’s.

The bug manifests when you pass objects across session boundaries. In the ACO system, the find_work() method loads a story in one session, then passes it to an agent that was called in a different session context. The agent calls session.commit() in its session — but the story object it received belongs to the calling session’s identity map. SQLAlchemy’s commit walks the dirty objects in its own session’s identity map. The story isn’t dirty in the agent’s session because it’s not in that session’s identity map at all. The commit silently does nothing.

# In tests: agent runs in a fresh session
agent_session = Session()
story = agent_session.query(Story).filter_by(id=42).first()
# story is now in agent_session's identity map

# But in the real system: story was loaded in a different session
# story was passed in as a parameter
# story belongs to caller's_session, not agent_session

# agent calls:
agent_session.commit()  # agent_session has no dirty objects
# silently succeeds, nothing changed

The Fix: `session.merge()`

The fix is to call session.merge(story) at the start of every method that accepts an entity from an external session. merge() takes an object from another session and copies its state into a corresponding object in the local session — creating one if it doesn’t exist. After merge(), the local session’s identity map has a reference to the story, and any modifications to it will be tracked.

def process_story(self, story, session):
    story = session.merge(story)  # bring into this session's identity map
    story.status = "PM_REVIEW"
    session.commit()
    return story

merge() is idempotent — if the object is already in the session, it’s a no-op. So calling it unconditionally at method entry is safe even when the story was already loaded in the same session.

Why Tests Didn’t Catch It

The original tests called agents directly, bypassing the workflow orchestrator. This meant the test session and the agent session happened to be the same object — the test setup loaded the story in the same session the agent used. The bug only manifested in the real system where the orchestrator (running in one session) spawned agent work (in another session’s context).

The fix involved using subprocess isolation: each test runs in a fresh Python process, fresh module state, fresh database connection. This mimics production behavior where each agent invocation starts clean. Eight tests were written covering the full Water Reminder App workflow, all passing.

What This Teaches Us About Multi-Agent Architecture

Database session management is an often-ignored dimension of multi-agent system design. In single-agent systems, you control the session lifetime. In multi-agent pipelines where agents run in separate processes, threads, or containers, each agent has its own session context. Objects pass between them as plain data (serialized and deserialized) or as references that may not behave consistently across session boundaries.

The practical rules that emerge:

Always merge entities at session boundaries. Call session.merge() on any entity your method receives as a parameter.
Design your session lifetime around agent boundaries. Each agent invocation should have a clear session scope, not inherit from a parent.
Test across process boundaries, not just across function calls. If your agents run in separate processes (as they should in production), your tests should too.

The bug cost a day of debugging. The fix was three lines. The lesson is that in distributed AI systems, the most dangerous failures aren’t loud — they’re the ones that return success while doing nothing.

Aniket Karne

DevOps & AI Engineer · Amsterdam

Back to all posts

The Symptom

The Root Cause

The Fix: session.merge()

Why Tests Didn’t Catch It

What This Teaches Us About Multi-Agent Architecture

The Fix: `session.merge()`