Agentic Workflow — chevp-ai-framework

The Roster

Six concrete agents + one cross-cutting role — each with a single, focused job

Challenger

Cross-cutting Role. Active in: Exploration · Production

The internal sceptic. Before any G2 transition, the AI must produce three concrete failure modes for its own plan, two genuinely considered alternatives, the strongest counter-argument against the chosen approach, and a product-coherence check. Generic output ("schedule slip", "scope creep") triggers automatic regeneration.

Output

4 sections inside the EXP plan: failures, alternatives, counter-argument, product-coherence

Triggers

EXP plan transitions draft → proposed; mid-Production scope changes

Source

02-exploration/challenger.md
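
As a rough illustration of the regeneration rule, the sketch below models the four Challenger sections and flags blocks that are incomplete or that contain stock phrases. The `ChallengerBlock` class, the `GENERIC_PHRASES` list, and the phrase-matching heuristic are assumptions made for this example; the framework's actual criteria live in 02-exploration/challenger.md and are not reproduced here.

```python
from dataclasses import dataclass

# The two phrases come from the text above; matching against a fixed
# list is an assumption, not the framework's documented mechanism.
GENERIC_PHRASES = ("schedule slip", "scope creep")


@dataclass
class ChallengerBlock:
    failures: list[str]        # three concrete failure modes
    alternatives: list[str]    # two genuinely considered alternatives
    counter_argument: str      # strongest argument against the plan
    product_coherence: str     # product-coherence check

    def problems(self) -> list[str]:
        """Return the reasons this block should be regenerated."""
        found = []
        if len(self.failures) < 3:
            found.append("fewer than three failure modes")
        if len(self.alternatives) < 2:
            found.append("fewer than two alternatives")
        if not self.counter_argument.strip():
            found.append("missing counter-argument")
        if not self.product_coherence.strip():
            found.append("missing product-coherence check")
        for text in self.failures:
            if text.strip().lower().rstrip(".") in GENERIC_PHRASES:
                found.append(f"generic failure mode: {text!r}")
        return found
```

An empty list from `problems()` would mean the block passes this particular heuristic; it says nothing about whether the content is actually insightful.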

gatekeeper-g1

Subagent · Read-only. Tools: Read · Glob · Grep

Validates the Context → Exploration transition. Confirms the CTX-Plan, the uncertainty triplet (problem-statement, hypotheses, risks), the System Spec, the Software Architecture doc, fundamental ADRs, the context inventory and scope confirmation. Spawns up to five PROP-NNN Plan-Proposals for out-of-scope items so nothing silently disappears.

Verdict

pass / conditional-pass / block

Triggers

Before any move from Context to Exploration; /gate-check G1

Source

agents/gatekeeper-g1.md
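
The proposal cap can be sketched as a small helper. The function name `file_proposals`, the three-digit zero padding for `PROP-NNN`, and the wording of the overflow note are assumptions for illustration only; the source states just that at most five proposals are spawned and that nothing should silently disappear.

```python
def file_proposals(items: list[str], first_number: int, cap: int = 5):
    """Turn out-of-scope items into PROP-NNN stubs, capped at `cap`."""
    proposals = [
        f"PROP-{first_number + i:03d}: {item}"
        for i, item in enumerate(items[:cap])
    ]
    overflow = items[cap:]
    # Items beyond the cap are kept in one note instead of being dropped.
    note = "Also out of scope: " + "; ".join(overflow) if overflow else ""
    return proposals, note
```

Returning the overflow as text, rather than discarding it, is one way to honour the "nothing silently disappears" intent while still limiting the number of proposals.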

gatekeeper-g2

Subagent · Read-only. Tools: Read · Glob · Grep

Validates the Exploration → Production transition. Checks that the EXP plan declares exploration-mode: A|B, that Kill Criteria, Acceptance Criteria, Risks and a UX-Prototype exist, that insights.md records what was learned, and, critically, that the Challenger output is concrete and not theatre. Generic Challenger output results in an automatic block verdict.

Verdict

pass / conditional-pass / block

Triggers

Before any move from Exploration to Production; /gate-check G2

Source

agents/gatekeeper-g2.md
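
A minimal sketch of the structural part of this check follows. It assumes the EXP plan is a markdown file in which `exploration-mode:` appears on its own line and the required items appear as headings; both are assumptions, since the source does not say how the plan is laid out. The function name `g2_gaps` is hypothetical, and the checks on insights.md and Challenger quality are left out.

```python
import re

# Section names come from the text above; treating them as markdown
# headings inside the EXP plan is an assumption.
REQUIRED_SECTIONS = ("Kill Criteria", "Acceptance Criteria", "Risks", "UX-Prototype")
MODE_LINE = re.compile(r"^exploration-mode:\s*([AB])\s*$", re.MULTILINE)


def g2_gaps(exp_plan: str) -> list[str]:
    """List what this sketch finds missing in an EXP plan."""
    gaps = []
    if not MODE_LINE.search(exp_plan):
        gaps.append("exploration-mode must be A or B")
    for name in REQUIRED_SECTIONS:
        heading = re.compile(rf"^#+\s*{re.escape(name)}\s*$", re.MULTILINE)
        if not heading.search(exp_plan):
            gaps.append(f"missing section: {name}")
    return gaps
```

How a list of gaps maps onto pass / conditional-pass / block is not specified in the source, so the sketch stops at reporting them.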

gatekeeper-g3

Subagent · Read-only (+ Bash for build). Tools: Read · Glob · Grep · Bash

Validates the Production → Done transition. Verifies every acceptance criterion against the actual code and tests, runs the build, checks that insights.md was updated with implementation surprises (not just copied from G2), checks the PRD provenance frontmatter for the human approval (approved-by / approved-at), and detects code changed outside the approved PRD scope.

Verdict

pass / conditional-pass / block

Triggers

Before declaring a Production task done; /gate-check G3

Source

agents/gatekeeper-g3.md
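
The approval check on the PRD frontmatter could look roughly like this. The sketch assumes the frontmatter is a block of simple `key: value` lines between two `---` lines at the top of the file, which the source does not confirm; the helper names `read_frontmatter` and `missing_approval` are hypothetical. Only the keys approved-by and approved-at are taken from the text.

```python
def read_frontmatter(text: str) -> dict[str, str]:
    """Parse simple `key: value` lines between the leading '---' fences."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    fields = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return fields
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return {}  # closing fence never found: treat as no frontmatter


def missing_approval(prd_text: str) -> list[str]:
    """Return the approval keys that are absent or empty."""
    fields = read_frontmatter(prd_text)
    return [k for k in ("approved-by", "approved-at") if not fields.get(k)]
```

Splitting on the first colon only keeps timestamps such as `2024-01-01T10:00` intact as values.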

architecture-reviewer

Subagent · Read-only. Tools: Read · Glob · Grep

Reviews individual changes (a plan, a code diff, a new ADR) against the project's documented architecture invariants and accepted ADRs. Flags forbidden layer crossings, wrong dependency directions, and patterns that conflict with prior decisions. Each finding carries a severity: info / warn / block.

Output

REVIEW · VERDICT · FINDINGS with severity per finding

Triggers

New pattern proposed, layer boundary crossed, ADR drafted

Source

agents/architecture-reviewer.md
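
To make "wrong dependency direction" concrete, here is a small sketch of a layer check. The three-layer `ALLOWED` map, the `Finding` shape, and the decision to rate every violation as `block` are all assumptions for the example; a real project would take its invariants from its own architecture documents and ADRs.

```python
from dataclasses import dataclass

# Hypothetical layering: each layer may depend only on the layers listed.
ALLOWED = {
    "ui": {"application"},
    "application": {"domain"},
    "domain": set(),
}


@dataclass(frozen=True)
class Finding:
    severity: str   # "info" | "warn" | "block", as in the text above
    path: str       # file in which the dependency was found
    message: str


def review_imports(edges: list[tuple[str, str, str]]) -> list[Finding]:
    """Check (file path, source layer, imported layer) triples."""
    findings = []
    for path, src, dst in edges:
        # Imports within the same layer are always allowed in this sketch.
        if src != dst and dst not in ALLOWED.get(src, set()):
            findings.append(
                Finding("block", path, f"{src} must not depend on {dst}")
            )
    return findings
```

Each finding names the file it came from, in line with the requirement that verdicts cite a path.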

governance-auditor

Subagent · Read-only. Tools: Read · Glob · Grep

Audits the whole repository for content-level drift against accepted ADRs and architecture invariants. Detects: ADRs whose constraints are violated by current code, recurring patterns that have no binding ADR, and accepted ADRs whose subject no longer exists in the codebase. Where the architecture-reviewer is per-change, the auditor is repo-wide.

Severity

BLOCK / CONCERN / INFO

Triggers

/governance-audit; per release; after any ADR is accepted/superseded

Source

agents/governance-auditor.md
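
Two of the three drift types described above (patterns with no binding ADR, and accepted ADRs whose subject has vanished) can be sketched as set comparisons. The `Adr` fields, the idea that patterns are identified by simple names, and the mapping of each drift type to INFO or CONCERN are assumptions; the source lists the severity levels but not which drift gets which level.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Adr:
    adr_id: str
    status: str    # e.g. "accepted" or "superseded"
    subject: str   # hypothetical field: the pattern the ADR governs


def audit(adrs: list[Adr], patterns_in_code: set[str]) -> list[tuple[str, str]]:
    """Return (severity, message) pairs for two kinds of drift."""
    accepted = [a for a in adrs if a.status == "accepted"]
    governed = {a.subject for a in accepted}
    findings = []
    # Accepted ADRs whose subject no longer exists in the codebase.
    for adr in accepted:
        if adr.subject not in patterns_in_code:
            findings.append(
                ("INFO", f"{adr.adr_id}: subject '{adr.subject}' no longer in codebase")
            )
    # Recurring patterns that no accepted ADR covers.
    for pattern in sorted(patterns_in_code - governed):
        findings.append(("CONCERN", f"pattern '{pattern}' has no binding ADR"))
    return findings
```

Detecting the third drift type, constraints violated by current code, needs per-ADR checks and is outside the scope of this sketch.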

gate-validator

Legacy · Dispatcher · Superseded

Backward-compatibility dispatcher. Older /gate-check invocations still route through here; it forwards to the matching gatekeeper-g1/g2/g3 and returns its output unchanged. New code should call the specialised gatekeepers directly.

Status

Retained for compatibility; do not extend

Replaced by

gatekeeper-g1, gatekeeper-g2, gatekeeper-g3

Source

agents/gate-validator.md
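
The forwarding behaviour can be pictured as a lookup table. In the sketch below the three gatekeepers are stand-in functions that return a string; in the framework they are subagents, and the function signatures, the `GATEKEEPERS` table, and the error handling are assumptions made for illustration.

```python
from typing import Callable


# Stand-ins for the three specialised gatekeepers (hypothetical signatures).
def gatekeeper_g1(task: str) -> str:
    return f"G1 verdict for {task}"


def gatekeeper_g2(task: str) -> str:
    return f"G2 verdict for {task}"


def gatekeeper_g3(task: str) -> str:
    return f"G3 verdict for {task}"


GATEKEEPERS: dict[str, Callable[[str], str]] = {
    "G1": gatekeeper_g1,
    "G2": gatekeeper_g2,
    "G3": gatekeeper_g3,
}


def gate_validator(gate: str, task: str) -> str:
    """Forward to the matching gatekeeper and return its output unchanged."""
    try:
        handler = GATEKEEPERS[gate.upper()]
    except KeyError:
        raise ValueError(f"unknown gate: {gate!r}") from None
    return handler(task)
```

Because the dispatcher adds nothing to the result, calling `gatekeeper_g2("task")` directly and calling `gate_validator("G2", "task")` give the same output, which is why new code can skip the dispatcher.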

Rule	Why
One agent = one job	An agent that reviews "everything" returns nothing useful. Narrow scope makes verdicts strong.
Read-only by default	Verdicts must be advisory. Only the orchestrator (with human approval) writes code or decisions.
Structured output, not prose	Verdicts must be diffable. Free-form “looks good” is forbidden — cite a path or a line.
Generic findings auto-fail	"Schedule slip", "scope creep" — if it could apply to any plan, it has not engaged with this one.
Out-of-scope → proposal, never silence	An agent that finds a tangent files a `PROP-NNN`; it does not expand its own scope. (Rule 12)
Cap output at 5 proposals	Prevents proposal spam. Excess is rolled into one collective-note (Sammel-Notiz) paragraph.
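
One way to satisfy the "structured output, not prose" rule is to serialise verdicts deterministically, so that two runs over the same state produce identical text. The sketch below is an assumption about how that might be done: the JSON shape, the `Finding` fields, and the `render` function are not taken from the framework.

```python
import json
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class Finding:
    severity: str
    path: str             # every finding must cite a path
    line: Optional[int]   # and a line where one applies
    message: str


def render(agent: str, verdict: str, findings: list[Finding]) -> str:
    """Serialise a verdict so that repeated runs can be diffed."""
    for f in findings:
        if not f.path:
            raise ValueError("finding without a cited path")
    # Stable ordering: the same findings always render in the same order.
    ordered = sorted(findings, key=lambda f: (f.path, f.line or 0, f.message))
    doc = {
        "agent": agent,
        "verdict": verdict,
        "findings": [asdict(f) for f in ordered],
    }
    return json.dumps(doc, indent=2, sort_keys=True)
```

Sorting the findings and the keys means a diff between two verdicts shows only real changes, not reordering noise. Rejecting findings without a path enforces the "cite a path or a line" part of the rule at the point of output.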
