using-agentops

Explain AgentOps operating model, lifecycle, skills, hooks, and context.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "using-agentops" with this command: npx skills add boshu2/agentops/boshu2-agentops-using-agentops

AgentOps Operating Model

AgentOps is the operational layer for coding agents.

Publicly, it gives you four things:

  • Bookkeeping — captured learnings, findings, and reusable context
  • Validation — plan and code review before work ships
  • Primitives — single skills, hooks, and CLI surfaces
  • Flows — named compositions like /research, /validation, and /rpi

Technically, AgentOps acts as a context compiler: raw session signal becomes reusable knowledge, compiled prevention, and better next work.

Core Flow: RPI

Research → Plan → Implement → Validate
    ↑                            │
    └──── Knowledge Flywheel ────┘

Research Phase

/research <topic>      # Deep codebase exploration
ao search "<query>"    # Search existing knowledge
ao search "<query>" --cite retrieved  # Record adoption when a search result is reused
ao lookup <id>         # Pull full content of specific learning
ao lookup --query "x"  # Search knowledge by relevance

Output: .agents/research/<topic>.md

Plan Phase

/pre-mortem <spec>     # Simulate failures (error/rescue map, scope modes, prediction tracking)
/plan <goal>           # Decompose into trackable issues

Output: Beads issues with dependencies

Implement Phase

/implement <issue>     # Single issue execution
/crank <epic>          # Autonomous epic loop (uses swarm for waves)
/swarm                 # Parallel execution (fresh context per agent)

Output: Code changes, tests, documentation

Validate Phase

/vibe [target]         # Code validation (finding classification + suppression + domain checklists)
/post-mortem           # Validation + streak tracking + prediction accuracy + retro history
/retro                 # Quick-capture a single learning

Output: .agents/learnings/, .agents/patterns/

Phase-to-Skill Mapping

PhasePrimary SkillSupporting Skills
Discovery/discovery/brainstorm, /research, /plan, /pre-mortem
Implement/crank/implement (single issue), /swarm (parallel execution)
Validate/validation/vibe, /post-mortem, /retro, /forge

Choosing the skill:

  • Use /implement for single issue execution. Now defaults to TDD-first — writes failing tests before implementing. Skip with --no-tdd.
  • Use /crank for autonomous epic execution (loops waves via swarm until done). Auto-generates file-ownership maps to prevent worker conflicts.
  • Use /discovery for the discovery phase only (brainstorm → search → research → plan → pre-mortem).
  • Use /validation for the validation phase only (vibe → post-mortem → retro → forge).
  • Use /rpi for full lifecycle — delegates to /discovery/crank/validation.
  • Use /ratchet to gate/record progress through RPI.

Start Here (12 starters)

These are the skills every user needs first. Everything else is available when you need it.

SkillPurpose
/quickstartGuided onboarding — run this first
/bootstrapOne-command full AgentOps setup — fills gaps only
/researchDeep codebase exploration
/councilMulti-model consensus review + finding auto-extraction
/vibeCode validation (classification + suppression + domain checklists)
/rpiFull RPI lifecycle orchestrator (/discovery/crank/validation)
/implementExecute single issue
/retro --quickQuick-capture a single learning into the flywheel
/statusSingle-screen dashboard of current work and suggested next action
/goalsMaintain GOALS.yaml fitness specification
/pushAtomic test-commit-push workflow
/flywheelKnowledge flywheel health monitoring (σ×ρ > δ/100)

Advanced Skills (when you need them)

SkillPurpose
/compileActive knowledge intelligence — Mine → Grow → Defrag cycle
/llm-wikiExternal reading wiki proposal — raw sources to compiled wiki
/harvestCross-rig knowledge consolidation — sweep, dedup, promote to global hub
/knowledge-activationOperationalize a mature .agents corpus into beliefs, playbooks, briefings, and gap surfaces
/brainstormStructured idea exploration before planning
/discoveryFull discovery phase orchestrator (brainstorm → search → research → plan → pre-mortem)
/planEpic decomposition into issues
/designProduct validation gate — goal alignment, persona fit, competitive differentiation
/pre-mortemFailure simulation (error/rescue, scope modes, temporal, predictions)
/post-mortemValidation + streak tracking + prediction accuracy + retro history
/bug-huntRoot cause analysis
/releasePre-flight, changelog, version bumps, tag
/crankAutonomous epic loop (uses swarm for each wave)
/swarmFresh-context parallel execution (Ralph pattern)
/evolveGoal-driven fitness-scored improvement loop
/autodevPROGRAM.md autonomous development contract setup and validation
/dreamInteractive Dream operator surface for setup, bedtime runs, and morning reports
/docDocumentation generation
/retroQuick-capture a learning (full retro → /post-mortem)
/validationFull validation phase orchestrator (vibe → post-mortem → retro → forge)
/ratchetBrownian Ratchet progress gates for RPI workflow
/forgeMine transcripts for knowledge — decisions, learnings, patterns
/readmeGenerate gold-standard README for any project
/securityContinuous repository security scanning and release gating
/security-suiteBinary and prompt-surface security suite — static analysis, dynamic tracing, offline redteam, policy gating
/testTest generation, coverage analysis, and TDD workflow
/red-teamPersona-based adversarial validation — probe docs and skills from constrained user perspectives
/reviewReview incoming PRs, agent output, or diffs — SCORED checklist
/refactorSafe, verified refactoring with regression testing at each step
/depsDependency audit, update, vulnerability scanning, and license compliance
/perfPerformance profiling, benchmarking, regression detection, and optimization
/scaffoldProject scaffolding, component generation, and boilerplate setup
/scenarioAuthor and manage holdout scenarios for behavioral validation

Expert Skills (specialized workflows)

SkillPurpose
/grafana-platform-dashboardBuild Grafana platform dashboards from templates/contracts
/codex-teamParallel Codex agent execution
/openai-docsOfficial OpenAI docs lookup with citations
/oss-docsOSS documentation scaffold and audit
/reverse-engineer-rpiReverse-engineer a product into feature catalog and specs
/pr-researchUpstream repository research before contribution
/pr-planExternal contribution planning
/pr-implementFork-based PR implementation
/pr-validatePR-specific validation and isolation checks
/pr-prepPR preparation and structured body generation
/pr-retroLearn from PR outcomes
/complexityCode complexity analysis
/productInteractive PRODUCT.md generation
/handoffSession handoff for continuation
/recoverPost-compaction context recovery
/traceTrace design decisions through history
/provenanceTrace artifact lineage to sources
/beadsIssue tracking operations
/heal-skillDetect and fix skill hygiene issues
/converterConvert skills to Codex/Cursor formats
/updateReinstall all AgentOps skills from latest source

Knowledge Flywheel

Every /post-mortem promotes learnings and patterns into .agents/ so future /research starts with better context instead of zero.

Inspect, lint, and triage the .agents/ write surface contract via ao agents inspect | lint | doctor (doctor rolls up inspect + lint + orphan/stray-dir report; --strict fails on orphans).

Runtime Modes

AgentOps has four runtime modes. Do not assume hook automation exists everywhere.

ModeWhen it appliesStart pathCloseout pathGuarantees
gcGas City (gc) binary available and city.toml presentgc controller manages sessions; ao rpi auto-selects gc executorgc event bus captures phase/gate/failure/metric eventsDefault when gc is available. Phase execution via gc sessions, events via gc event bus, agent health via gc health patrol
hook-capableClaude/OpenCode with lifecycle hooks installed (no gc)Runtime hook or ao inject / ao lookupRuntime hook or ao forge transcript + ao flywheel close-loopAutomatic startup/context injection and session-end maintenance when hooks are installed
codex-native-hooksCodex CLI v0.115.0+ with native hook support (March 2026)Runtime hooks (same as hook-capable)Runtime hooks (same as hook-capable)Native lifecycle hooks — same guarantees as hook-capable mode
codex-hookless-fallbackCodex Desktop / Codex CLI pre-v0.115.0 without hook surfacesao codex startao codex stopExplicit startup context, citation tracking, transcript fallback, and close-loop metrics without hooks
manualNo hooks and no Codex-native runtime detectionao inject / ao lookupao forge transcript + ao flywheel close-loopWorks everywhere, but lifecycle actions are operator-driven

Issue Tracking

This workflow uses beads for git-native issue tracking:

bd ready              # Unblocked issues
bd show <id>          # Issue details
bd close <id>         # Close issue
bd vc status          # Inspect Dolt state if needed (JSONL auto-sync is automatic)

Examples

Startup Context Loading

Hook-capable runtimes

  1. session-start.sh (or equivalent) can run at session start.
  2. In manual mode, MEMORY.md is auto-loaded and the hook points to on-demand retrieval (ao search, ao lookup).
  3. In lean mode, the hook extracts pending knowledge and injects prior learnings with a reduced token budget.
  4. This skill can be injected automatically into session context.

Codex (v0.115.0+: native hooks, older: hookless fallback)

  1. v0.115.0+: hooks fire automatically — same behavior as hook-capable runtimes above.
  2. Pre-v0.115.0: run ao codex start explicitly, use ao lookup for citations, end with ao codex stop.

Result: The agent gets the RPI workflow, prior context, and a citation path in all modes.

Workflow Reference During Planning

User says: "How should I approach this feature?"

What happens:

  1. Agent references this skill's RPI workflow section
  2. Agent recommends Research → Plan → Implement → Validate phases
  3. Agent suggests /research for codebase exploration, /plan for decomposition
  4. Agent explains /pre-mortem for failure simulation before implementation
  5. User follows recommended workflow with agent guidance

Result: Agent provides structured workflow guidance based on this meta-skill, avoiding ad-hoc approaches.

Troubleshooting

ProblemCauseSolution
Skill not auto-loadedHook runtime unavailable or startup path not runHook-capable runtimes: verify hooks/session-start.sh exists and is enabled. Codex: run ao codex start explicitly
Outdated skill catalogThis file not synced with actual skills/ directoryUpdate skill list in this file after adding/removing skills
Wrong skill suggestedNatural language trigger ambiguousUser explicitly calls skill with /skill-name syntax
Workflow unclearRPI phases not well-documented hereRead full workflow guide in README.md or docs/ARCHITECTURE.md

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

council

No summary provided by upstream source.

Repository SourceNeeds Review
1.8K-boshu2
Automation

swarm

No summary provided by upstream source.

Repository SourceNeeds Review
532-boshu2
Automation

bug-hunt

No summary provided by upstream source.

Repository SourceNeeds Review
528-boshu2
Automation

readme

No summary provided by upstream source.

Repository SourceNeeds Review
509-boshu2