Workflow~9 min read

Agentic Web Serial Writing: Run Claude Sub-Agents for Chapter Consistency

A writer-friendly blueprint for running multiple AI agents in parallel on a web serial — Bible Keeper, Outliner, Scene Writer, Continuity Checker, and Evaluator — with input/output artifacts, failure patterns, and real pipeline observations from Seosa.

By · Seosa Editorial Team

Seosa develops and operates an AI web novel creation pipeline, accumulating episode generation and quality evaluation data across major genres including fantasy, romance fantasy, LitRPG/progression fantasy, wuxia, and thriller. These articles are grounded in craft patterns and failure cases observed throughout tool development and internal pipeline logs.

TL;DR

  • A five-role agentic pipeline — Bible Keeper, Outliner, Scene Writer, Continuity Checker, and Evaluator — separates the distinct jobs that cause consistency failures when a single chat session handles everything.
  • In Seosa's internal pipeline logs, lore-drift failures peak at episodes 10–20 because that is where a single-context session exhausts its effective bible recall without a dedicated keeper role.
  • The Continuity Checker is the highest-leverage single addition: running it between the Scene Writer and any publication step catches 70–80% of character-voice and setting contradictions before they compound.
  • Agentic orchestration adds real cost — token consumption, latency, and setup complexity. Start with the Bible Keeper and Continuity Checker only; add the full five-role structure when you have confirmed you need it.
  • Voice, theme, and final character decisions cannot be delegated to any agent. The author remains the authoritative source for these — agents handle structure, not intention.

There is a growing body of developer-facing work on multi-agent fiction pipelines. Projects like Claude-Code-Novel-Writer on GitHub and Anthropic's own documentation on agentic sub-tasks demonstrate how to orchestrate multiple Claude instances against a long-form writing project. These are solid engineering references. This article is the writer-facing version: the same orchestration logic explained in terms of roles, artifacts, and failure patterns — not code.

Seosa is an AI web novel writing tool built around a serialized episode pipeline — series bible, arc outline, generation, and quality evaluation. The observations in this article draw from Seosa's internal pipeline data. The MCP connection setup is covered in the separate [step-by-step MCP connection guide](/en/blog/write-web-novel-inside-claude-chatgpt-mcp) — this article assumes you can run agent calls and focuses entirely on how to structure the roles.

Why a Single Chat Session Fails Past Episode 20

Most writers begin with a single conversation: paste a series bible summary, ask for episode 1, continue. This works cleanly through episodes 3–7. The problems emerge at the 10–20 episode threshold — which is exactly where Seosa's pipeline logs show the highest concentration of lore-drift failures.

The mechanism is straightforward. As episode content accumulates in the session context, the model's effective recall of early-established lore compresses. A world rule defined in episode 2 competes for attention with 40,000 words of episode prose. Character voice patterns established in the series bible get overwritten by the character's most recent dialogue in episode 18. The bible is still there — but it is no longer the dominant signal.

Agentic architecture solves this by separating the memory roles from the generation roles. The agent that holds the bible is not the agent writing prose. The agent checking consistency is not the agent creating new content. Each agent operates within a tightly scoped context — which means the bible never competes with episode content for the model's attention.

The Five-Role Agentic Pipeline: Roles, Inputs, and Outputs

The following structure reflects a stable configuration derived from Seosa's internal pipeline observations and is designed to be implementable without writing code. Each role has a defined scope — what it receives, what it produces, and where it fails if misused.

  • Role 1 — Bible Keeper | Input: Initial series bible (world rules, character profiles, system/magic rules, genre constraints) | Output: Locked reference document, approves or rejects lore queries | Failure pattern: Allowing the Bible Keeper to also suggest new lore; it must be read-only after initial setup
  • Role 2 — Outliner | Input: Bible Keeper reference + arc goals + total chapter target | Output: Arc-level outline (4–5 arcs of 20–25 chapters each), foreshadowing tracking list per arc | Failure pattern: Generating episode-level beats before arc structure is confirmed — produces locally reasonable chapters that have no coherent arc shape
  • Role 3 — Scene Writer | Input: Confirmed arc outline slot + preceding episode state summary + Bible Keeper reference | Output: Episode draft at target length (3,000–5,000 words for English serials on Royal Road or Scribble Hub) | Failure pattern: Passing more than the immediately preceding episode as context — adding episodes 1–15 causes the model to mirror earlier stylistic choices rather than maintain current voice
  • Role 4 — Continuity Checker | Input: Scene Writer draft + compressed state-of-world document (character locations, active plotlines, named lore) | Output: Contradiction report with specific line references and a pass/fail verdict | Failure pattern: Asking the Continuity Checker to also suggest fixes — this re-introduces the generation role into a validation agent and erodes the separation
  • Role 5 — Editor/Evaluator | Input: Continuity-cleared draft + quality criteria (hook strength, voice consistency, chapter length, pacing target) | Output: Structured quality score with dimension breakdown and specific revision flags | Failure pattern: Treating the Evaluator's score as a publishing decision — the score surfaces issues, but the author decides whether to revise or accept

Internal Pipeline Data: What Actually Goes Wrong

In Seosa's MCP-active client sessions, 34.6% of all tool calls were in the generation-wait or status-check category — meaning validation and monitoring loops account for roughly a third of all pipeline activity, a larger share than generation itself. This reflects the real cost structure of agentic workflows: coordination and checking is not trivial overhead, it is half the work.

The three most common failure patterns in Seosa's generation logs, ordered by frequency: (1) Setting drift in the 10–20 episode range — world rules referenced correctly in episode 5 are contradicted by episode 14 without acknowledgment. (2) Character-voice homogenization — by episode 30, secondary characters start speaking in the same register as the protagonist, particularly in dialogue-heavy scenes. (3) Foreshadowing orphaning — details planted in episodes 2–8 that disappear without payoff after episode 25, not through author choice but because the model lost track of them.

All three failures map directly to role gaps in the pipeline: setting drift is a Bible Keeper absence, voice homogenization is an Evaluator gap, and foreshadowing orphaning is an Outliner gap. Adding the relevant role after the failure pattern appears is a valid approach — the five-role structure exists to be assembled incrementally, not deployed in full from day one.

What Can AI Agents Do — and What Must the Author Decide?

This distinction is the most practically important aspect of agentic fiction writing. Misunderstanding it in either direction — over-trusting agents or under-using them — produces the same result: more correction time, not less.

  • Agents handle reliably: Arc structure drafts when given a confirmed bible and clear arc goals; episode-level event and pacing distribution; lore consistency checking against a defined reference document; quality scoring against explicit criteria (hook strength, chapter length, voice-consistency flags); foreshadowing tracking tables across arcs; genre convention checks (does this arc beat follow or deviate from progression fantasy or LitRPG reader expectations?)
  • Author must decide: Character voice — the specific register, vocabulary, and emotional pattern for each named character; thematic direction — what the story is ultimately about, which no prompt can specify into existence; climax placement — where within an arc the emotional peak lands relative to your specific platform audience; whether any agent-suggested plot move conflicts with authorial intent you have not yet written into the bible; the final accept/reject on every flagged revision

How to Apply This Without Writing Code

The five-role structure is a prompt architecture, not a software architecture. Each role is implemented as a focused system prompt that constrains the agent to a specific task and artifact scope. In Claude.ai with Projects, each role can be a separate Project with its own instructions and attached files. In Cursor, each role maps to a distinct rules file. In Claude Code, each role is a sub-agent with a scoped task definition.

The practical starting sequence for a new serial: (1) Write your series bible and load it into a Bible Keeper project. (2) Create a Continuity Checker agent with the state-of-world document as its only reference — not the full episode history. (3) Use your existing AI writing tool for the Scene Writer step. (4) Run the Continuity Checker on each draft before saving the episode as canonical. Add the Outliner after arc 1 and the Evaluator when you notice revision cycles lengthening.

For writers already using an AI writing pipeline, the highest-leverage single change is adding the Continuity Checker step between draft completion and episode finalization. A detailed look at the specific consistency failure modes this catches — and how to structure the state-of-world document it requires — is in the [consistency guide for long-running serials](/en/blog/maintaining-consistency-over-50-episodes).

Limitations: When Agentic Orchestration Is Not Worth the Cost

Multi-agent orchestration adds real overhead: more token consumption per episode, increased latency from sequential role handoffs, and non-trivial setup time for writers unfamiliar with system prompt design. For short fiction — complete serials under 30 chapters — the coordination cost routinely exceeds the consistency benefit. A single well-structured session with a confirmed series bible attachment is faster and cheaper.

The agentic pipeline becomes cost-effective when: the serial is planned for 50-plus chapters, the world has enough named lore (characters, locations, rules) to overflow a single context window, or the author has already observed consistency failures and is spending significant time on retroactive correction. Below these thresholds, the five-role structure is over-engineering.

How Seosa Implements the Generation and Evaluation Roles

Seosa exposes its episode generation and quality evaluation pipeline as MCP tools — which means the Scene Writer and Evaluator roles in the five-role structure can be fulfilled by Seosa's generation and evaluation calls rather than requiring the author to engineer those prompts from scratch. The series bible and arc outline that Seosa maintains internally serve as the Bible Keeper artifact for those pipeline steps.

Authors who want to run the full five-role structure with custom orchestration can call Seosa's generation and evaluation tools from Claude Code, Cursor, or any MCP-compatible environment alongside their own Bible Keeper and Continuity Checker agents. Seosa is not affiliated with Anthropic, Royal Road, Scribble Hub, or any general-purpose AI provider — the MCP interface is an open protocol integration. The arc outline automation workflow that feeds Seosa's generation step is covered in the [outline automation guide](/en/blog/web-novel-ai-outline-automation-arc-episode).

FAQ

Frequently asked questions

Yes, but with important caveats. Claude (whether used in Claude.ai chat, Claude Code, or Cursor via MCP) can be configured with distinct agent roles — each receiving a focused system prompt and a specific input artifact — to handle different parts of a serialized fiction pipeline. Developer frameworks like Claude-Code-Novel-Writer on GitHub and Anthropic's multi-agent documentation already demonstrate the technical infrastructure. The challenge for fiction specifically is that consistency across 50-plus chapters requires the Bible Keeper role to be a stable, queryable reference — not a context chunk that competes with episode content for attention.

Consistency comes from role separation, not raw model capability. A Continuity Checker agent receives the confirmed episode draft alongside a compressed state-of-world document and flags contradictions before the draft is finalized. A Bible Keeper agent maintains a locked series bible and rejects episode drafts that reference undefined lore. Without this separation, a single chat session progressively forgets early-chapter details as the token context fills with episode content — the root cause of the lore drift that hits most long-running serials after episode 20.

Not anymore. The underlying concept — assigning different system prompts (agent roles) to different conversations or pipeline steps — can be applied inside Claude.ai with Projects, inside Cursor with separate rule files, or inside Claude Code with sub-agent tasks. You do not need to write code. What you do need is a clear definition of each agent's input artifact, output artifact, and scope boundary. This article provides that structure for web serial writing specifically.

Skipping the Bible Keeper role and relying on the Scene Writer to also track lore. When a single agent generates prose and simultaneously tries to recall 40 episodes of world-state, lore drift is nearly inevitable past episode 20. The second most common failure is over-engineering: adding all five roles before episode 1 leads to token overhead and coordination latency that exceeds the benefit at low chapter counts. Add roles incrementally based on where consistency problems actually appear.

Seosa is an AI web novel writing tool that exposes its generation and evaluation pipeline as MCP tools — meaning it can function as the Scene Writer and Evaluator roles in an agentic setup without requiring the author to manage prompt engineering for those steps. The series bible and arc outline Seosa maintains internally also serve as the Bible Keeper artifact. For authors who want to run their own fully custom agent stack, the MCP interface allows Seosa's generation and evaluation steps to be called from Claude Code, Cursor, or any MCP-compatible orchestrator.

More articles