Competitive Radar

AgentOps should not try to be every agent workflow tool at once. The Claude Code skills/plugin ecosystem has consolidated into five lanes; the strongest position is narrower and harder to copy than any of them: a wiki for your agents, version-controlled in your repo, that compounds across sessions. The corpus is the moat. The tool is replaceable.

Source Set

Source	Lane	Link
AgentOps	Context library / wiki for agents (this doc)	boshu2/agentops
obra/superpowers	Methodology (TDD discipline, autonomy patterns)	obra/superpowers
EveryInc/compound-engineering-plugin	Methodology (ideate→compound, 7-phase loop)	EveryInc/compound-engineering-plugin
anthropics/claude-plugins-official	Curation (first-party marketplace)	anthropics/claude-plugins-official
jeremylongshore/claude-code-plugins-plus-skills	Volume marketplace (CCPI manager)	jeremylongshore/claude-code-plugins-plus-skills
alirezarezvani/claude-skills	Volume + breadth (skills × platforms)	alirezarezvani/claude-skills
affaan-m/everything-claude-code	Cross-harness (DRY parity across runtimes)	affaan-m/everything-claude-code
trailofbits/skills	Vertical authority (security domain)	trailofbits/skills

Market Read

The five-lane consolidation (see Lane segmentation below) means most "compete with AgentOps" framings are category errors. A volume marketplace sells inventory; a methodology plugin sells discipline; a vertical-authority collection sells domain credibility. None of the seven sells persistent context that compounds across sessions on your hardware. That lane is empty, and the rest of this radar is about staying in it.

Seven-competitor lane table

What each competitor wins on (the structural advantage AgentOps cannot beat them on) and what AgentOps wins on against them. Sourced from the 2026-05-07 council research (.agents/council/2026-05-07-research-readme-positioning.md in the repo).

Competitor	Lane	What they win on	What AgentOps wins on
obra/superpowers	Methodology	Official Anthropic marketplace placement, ~29K stars, Jesse Vincent brand, TDD red-green-refactor as a sharp methodology hook	Persistent corpus accumulating across sessions; cross-session memory in `.agents/`; multi-model council as a commit gate; off-API daemon on your hardware
EveryInc/compound-engineering-plugin	Methodology	10-target runtime conversion CLI; configurable per-project reviewer routing; ideation-to-compound surface area; closest philosophical neighbor	Model-independent per-phase routing (Claude for discovery, Codex for implementation, fresh Claude for validation, local model for overnight defrag — in one workflow); persistent corpus that lives in your repo, not the tool
anthropics/claude-plugins-official	Curation	First-party authority; default discovery channel; "trust before installing" posture	Operates underneath any harness — Claude Code, Codex, Cursor, OpenCode — turning sessions into a context library you own. AgentOps is not a coding harness; it sits on top of whichever harness you already use
jeremylongshore/claude-code-plugins-plus-skills	Volume marketplace	425 plugins / 2,810 skills inventory; CCPI package manager; sponsored placement; daily download metrics	Compounding context vs. static inventory: skills don't accumulate, a wiki does. AgentOps ships a bookkeeping schema that grows; volume marketplaces ship an inventory that doesn't
alirezarezvani/claude-skills	Volume + breadth	~5,200+ stars; 235 skills × 12 platforms; 305 stdlib Python tools; multi-domain coverage (engineering + marketing + compliance)	Same persistent-corpus argument plus model-independent phase routing inside one session — breadth of skills doesn't replace cross-session memory
affaan-m/everything-claude-code	Cross-harness	DRY parity across 5 runtimes; 48 subagents; "Anthropic Hackathon Winner" credential. (README also claims 140K+ stars / 21K forks; the entire Claude Code ecosystem is below those numbers — flagged as not validated.)	Cross-runtime distribution is not cross-model per-phase routing. AgentOps mixes models per phase within one RPI loop, with state preserved across boundaries — a different optimization
trailofbits/skills	Vertical authority (security)	Trail of Bits brand; security skills + a "Trophy Case" of CVE-shaped findings; domain credentialing no general-purpose tool can match	Different category — AgentOps is the substrate a vertical-authority collection runs on. Their skills can live inside an AgentOps corpus; the inverse is not true

What this table reveals: none of the seven are selling persistence, sovereignty, off-API operation, multi-model per-phase routing, or context-as-a-discipline. The substrate / wiki-for-agents lane is empty if AgentOps claims it.

Lane segmentation

The Claude Code skills/plugin ecosystem has consolidated into five lanes. Each row names the lane, the canonical examples, the buyer signal, and AgentOps's posture toward it (compete, complement, or ignore).

1. Volume (marketplace / many skills)

Examples: jeremylongshore/claude-code-plugins-plus-skills (425 plugins, 2,810 skills, CCPI manager, 100-point grading); alirezarezvani/claude-skills (235 skills × 12 platforms); affaan-m/everything-claude-code (182 skills, 48 subagents) overlaps here too.

Buyer signal: "We have the most stuff." Inventory bulk as the value prop. Costco bulk-buy of pre-built skills.

AgentOps's posture: Complement, do not compete. AgentOps is a different category — a context library / wiki, not a skills inventory. The two combine: install the volume marketplace for breadth; run AgentOps for the corpus discipline that turns those skills' outputs into persistent context.

2. Methodology (workflow / discipline)

Examples: obra/superpowers (TDD red-green-refactor); EveryInc/compound-engineering-plugin (ideate→compound, 7-phase loop, 10-target conversion).

Buyer signal: "Use our workflow and your agents will produce better code." A sharp opinion about how to drive agents.

AgentOps's posture: Compete obliquely, do not contest head-on. This lane is structurally hard to win — Superpowers has 29K stars and official-marketplace placement; Compound Engineer has the closest philosophical positioning. Winning the methodology lane is not AgentOps's bet. AgentOps offers methodology surfaces (/rpi, /council, /pre-mortem, /vibe) but its claim is one level deeper: methodology sits on top of context, and the context is what compounds. Anthropic's Managed Agents (May 2026) is also moving into this lane natively.

3. Vertical (specific domain)

Examples: trailofbits/skills (security skills + Trophy Case of findings).

Buyer signal: "We are the authoritative source for skills in domain X." Domain credibility no general-purpose tool can replicate.

AgentOps's posture: Complement. Vertical collections produce skills; AgentOps produces the context library those skills run inside. A security team can install Trail of Bits skills on top of an AgentOps corpus; the security findings then flow into .agents/ as persistent learnings. AgentOps does not chase vertical authority — that's a brand investment, not a tool feature.

4. Curation (small high-quality set)

Examples: anthropics/claude-plugins-official (first-party marketplace, "trust before installing"); selective collections that prioritize quality over volume.

Buyer signal: "We curate so you don't have to." Trust + discovery channel as the value prop.

AgentOps's posture: Ignore as a competitor; respect as distribution. First-party curation is unbeatable for trust and discovery, and the right move is to be installable through it rather than to compete with it. AgentOps's claim is orthogonal: it's not "trust this skill" but "build a corpus that survives whichever skill you used last week."

5. Cross-harness (multi-runtime parity)

Examples: affaan-m/everything-claude-code (DRY parity across Claude Code, Cursor, Codex, OpenCode, Gemini); EveryInc/compound-engineering-plugin (10-target conversion CLI) overlaps here.

Buyer signal: "Our skills run everywhere." Multi-runtime distribution as the value prop.

AgentOps's posture: Compete on a sharper claim. Cross-harness distribution is multi-runtime spread of one workflow shape. AgentOps does cross-harness and model-independent per-phase routing — Claude does discovery, Codex implements, fresh Claude validates, an open-weights local model handles overnight defrag, all in one RPI loop with state preserved across boundaries. Nobody else in the seven-competitor set does this. The pitch is "mix and match models per phase," not "the same skill on five runtimes."

Where AgentOps Wins

Six differentiators no competitor in the seven-competitor set has. Sourced from the 2026-05-07 council research (.agents/council/2026-05-07-research-readme-positioning.md in the repo).

1. Persistent corpus

A bookkeeping schema that grows: learnings, patterns, planning rules, and cited decisions accumulate in .agents/ as plain markdown, version-controlled with the code. Competitors ship skills (static inventory); AgentOps ships the discipline that turns sessions into a wiki. Receipts: as of 2026-05-04, this repo's .agents/ contained ~1,842 learnings, ~186 patterns, ~80 planning rules, and ~3,867 cited decisions captured by the system on itself.

2. Off-API daemon

ao schedule + ao daemon runs dream / evolve / compile / defrag / forge overnight, off-vendor, on your hardware, against your subscription. All seven competitors are in-session plugins. The daemon is the structural answer to "what if a frontier vendor ships native equivalents in 12 months" — your corpus and your scheduler keep running regardless.

3. Multi-model council

/council --mixed runs Claude + Codex (and other) judges in parallel against one evidence packet, producing a verdict before commit. Compound Engineer has reviewer agents but they are single-model and post-implementation. A multi-model commit gate is the strongest validation primitive in the set.

4. Model-independent phase routing

Pick Claude for ideation, Codex for validation, an open-weights local model for overnight defrag — per phase, in one workflow, with state preserved across the boundaries. Cross-harness distribution (everything-claude-code, Compound Engineer's 10-target conversion) is multi-runtime spread of one workflow shape; per-phase routing is mixing models inside one workflow. Different optimization. Nobody in the seven does it.

5. Context-engineering vocabulary

AgentOps owns "wiki for agents" + "context library" + "context compiler"

"CDLC" as a coherent vocabulary, anchored by the SE → context translation table (source code → context, SDLC → CDLC, libraries → context libraries, compilers → context compilers, code review → multi-model councils, CI/CD → validation gates, postmortems → automated postmortems, runbooks → skills + planning rules, software factories → software factory daemon, Markdown/Git /Linux → LLM Wiki of Markdown, open-source corpus → your private corpus). Vocabulary ownership is durable; Superpowers owns "TDD," AgentOps can own "context engineering."

6. Honest empirical disclosure

Δ=+0.0000 at workbench v1 difficulty, published in-repo. Independent 3-judge audit (2026-05-06) confirmed parity with Anthropic Managed Agents on rubric authoring, separate-context grading, and iterate-until-pass. In a market of inflated star counts and ungraded "100-point validations," a published null result and a transparent audit are credibility assets.

Current Vulnerabilities

Vulnerability	Impact	Best next move
Corpus durability under routine cleanup	The receipts claim ("1,842 learnings") becomes fragile if maintenance can wipe `.agents/` subdirs (observed 2026-05-07).	Snapshot/restore mechanism + tracked durability fix; in the meantime, receipts cite the 2026-05-04 stable snapshot with timestamp.
Methodology lane is being eaten	Anthropic's Managed Agents (May 2026) and Superpowers' marketplace placement compress the methodology buyer's choice set.	Stay out of the methodology lane head-on; lead with the wiki framing and lane-segmentation argument.
Compounding proof is still too implicit	Users have to trust the flywheel story before they feel it.	Put Dream reports, `ao demo`, and corpus-stats in the first-run path.
Reviewer routing is less configurable than Compound Engineer	CE can feel more tailored to a stack.	Document per-project validation profile selection; expose council config more visibly.
Volume-marketplace shoppers may bounce off "77 skills" framing	Inventory-comparison buyers will not see the corpus advantage.	Keep the volume comparison explicit (vs-tons-of-skills doc); reframe the question from "how many skills" to "what does the asset look like in 6 months".

Execution Bias

Do not respond to every competitor feature by adding another command. Favor moves that make the wiki visible, automatic, and verifiable:

Make the corpus inspectable on day one (scripts/corpus-stats.sh, ao inject traces).
Make first value obvious in under five minutes (skills install + first session writes to .agents/).
Keep the per-phase model-routing demo as the anchor (it's the killer feature buried in the founder pitch).
Defend the wiki/context-engineering vocabulary in every doc; do not drift to "skills repo" or "methodology" framings.
Keep comparison docs tied to current official sources; re-run scripts/check-competitive-freshness.sh before each release.