Agents generate code with no record of what was tried, no gate it had to pass, no proof it works. 12-Factor AgentOps is the operating model that closes that gap. Twelve factors that make agent-built software something a team can stand behind, in the tradition of the Twelve-Factor App.
plan: token bucket, 5/min per IP, Redis-backed, jittered
// recorded → .agents/runs/2026-05-08-rate-limit/research.md
> /council --mixed validate this PR
// evidence packet sealed → 6 judges across Claude Code + Codex
claude · WARN rate limiting missing on /login
codex · WARN token-bucket refill lacks jitter
consensus: WARN — fix before shipping
// the bet
THE WAGER — Vendors will ship managed memory, review councils, and overnight learning loops natively; they will lock them to their runtime. Your corpus stays in .agents/ in your repo, runs on whichever harness you already pay for, and is portable across whichever frontier model wins next quarter.
Good architecture principles outlive every tool that implements them.