Publications

Preprint

Belief Engine: Configurable and Inspectable Stance Dynamics in Multi-Agent LLM Deliberation

arXiv preprint arXiv:2605.15343 2026

Belief Engine preview

TL;DR

LLM agents often drift because they do not have a stable inner core. Belief Engine gives an AI agent a traceable stance it can hold onto, while still letting it be persuaded by good evidence. The result is an agent that can stay coherent, change its mind, and show why it changed.

Key Takeaways

Belief is explicit state

The system maintains proposition-level stance instead of inferring belief only from generated text.

Two interpretable controls

Evidence uptake tunes responsiveness to new arguments; prior anchoring tunes persistence of the initial position.

Human replay is diagnostic

The DEBATE replay shows that one global human-like update profile misses stable, evidence-aligned, and evidence-opposed participants.

Useful beyond prediction

The main value is an audit trail: why an agent moved, stayed stable, or diverged from the evidence stream.

Why It Matters

Multi-agent LLM simulations can look persuasive while hiding why agents change their minds. This work makes stance change explicit, configurable, and empirically testable, so researchers can separate evidence-responsive movement from anchoring, prompt effects, retrieval artifacts, and model bias.

Abstract

We introduce Belief Engine, an auditable belief-update layer for multi-agent LLM deliberation. The system extracts arguments into structured memory and updates stance with configurable log-odds dynamics, making belief change more inspectable and useful for studying social simulation, disagreement, and deliberation.

Paper Content

The Problem

Most multi-agent LLM deliberation systems produce transcripts that look socially plausible, but the internal reason for a stance shift is hard to inspect. Did the agent change because it heard stronger evidence, because recent text dominated the prompt, because its persona drifted, or because the base model carried a hidden bias?

The deeper issue is that ordinary LLM agents do not really have a durable core. They can sound confident in one turn, drift in the next, or “change their mind” without any clear account of what convinced them. Belief Engine gives an agent a maintained stance: something stable enough to make the agent coherent, but flexible enough to be updated by evidence.

That makes opinion change more human-like in the important sense: the agent can start somewhere, listen, be persuaded, resist weak arguments, and explain the path from its earlier position to its later one.

What The System Does

Belief Engine runs a five-step loop:

  1. Extract arguments from incoming messages.
  2. Judge their polarity and evidential strength.
  3. Store active and archived claims in structured memory.
  4. Update a maintained stance using evidence uptake and prior anchoring.
  5. Compose the next response from stance, memory, and recent dialogue.

The key design choice is that stance is not hidden inside a prompt. It is maintained explicitly, with an audit trail of the claims that moved it. This gives the agent something like a deliberative core rather than a loose stream of plausible responses.

What The Paper Finds

In generated 15-round debates, the two parameters behave as intended across multiple model backends. Raising evidence uptake makes agents more responsive to later arguments; raising prior anchoring makes them hold more strongly to the initial position.

The paper also validates that this internal stance is visible in generated language: an external judge’s score of the generated text correlates strongly with the assigned stance (r = 0.967).

The most interesting result comes from replaying 2,495 human DEBATE trajectories. Belief Engine improves pooled final-stance reconstruction over no-change and net-evidence baselines, but the larger lesson is heterogeneity: stable participants, evidence-aligned movers, and evidence-opposed movers require different update profiles.

Why It Matters

For civic simulation, educational debate, negotiation agents, and deliberation research, “the model said something plausible” is not enough. The system should expose what it treated as evidence, how much that evidence mattered, and how strongly prior commitments persisted.

Belief Engine is useful because it turns those assumptions into explicit research objects. A future civic simulation can be asked not only what agents concluded, but what update rule made them get there.

Limitations To Read Carefully

The current representation is intentionally narrow: one scalar stance over one proposition. That makes the system inspectable, but it cannot fully capture multidimensional political positions, values, trust, strategic behavior, or sub-issue movement.

The evidence pipeline also depends on argument extraction and quality scoring. Making those steps explicit is a major improvement, but it does not make them automatically neutral.