loop-until-done
Use when a task must be driven to guaranteed completion through a PRD-driven persistence loop — breaking work into user stories with structured acceptance criteria, iterating story-by-story with independent verification, and terminating only when every criterion has fresh passing evidence and an independent reviewer approves. Trigger phrases include "keep going until done", "loop until complete", "don't stop until", "finish this completely", "iterate until done", "persistence loop", "PRD-driven execution", "work through all stories", "drive this to completion", "until all tests pass", "keep iterating", "loop this", "self-loop until finished". Honest termination labels; no self-approval.
Repository SourceNeeds Review
proposal-reviewer
Critically reviews project proposals, grant applications, and business plans. Use when the user asks to review, critique, evaluate, or assess a proposal, pitch, grant application, or business plan for viability, competition, or flaws. Fact-checks claims, maps competitive landscape, identifies structural problems, and provides honest recommendations.
Repository SourceNeeds Review
deep-research-temporal
Use when researching, investigating, or exploring a topic systematically via a durable Temporal-backed workflow. Trigger phrases include "deep-research temporal", "sagaflow research", "durable research", "temporal research". Spawns parallel researchers across WHO/WHAT/HOW/WHERE/WHEN/WHY/LIMITS plus cross-cut dimensions (PRIOR-FAILURE, BASELINE, ADJACENT-EFFORTS, STRATEGIC-TIMING, ACTUAL-USAGE). Fact-verification, novelty classification, vocabulary bootstrap for cold-start topics. Fire-and-forget while you do other work.
Repository SourceNeeds Review
deep-qa-ensemble-v1
Benchmark variant of deep-qa that ensembles the severity-judge phase across three heterogeneous providers (Anthropic Sonnet 4.6 + OpenAI GPT-5.4 + Google Gemini 2.5 Pro) instead of using a single Haiku judge. Purpose is skill-bench A/B comparison against baseline deep-qa to measure whether heterogeneous judges calibrate severity better than a single homogeneous judge. Trigger phrases, argument semantics, and critic phase are identical to deep-qa. ONLY the pass-1 blind and pass-2 informed severity judges differ. Rationalization auditor (Phase 5.6) stays single-Haiku to preserve its independence role.
Repository SourceNeeds Review