smart-model-switching

Auto-route tasks to the cheapest Claude model that works correctly. Three-tier progression: Haiku → Sonnet → Opus. Classify before responding. HAIKU (default): factual Q&A, greetings, reminders, status checks, lookups, simple file ops, heartbeats, casual chat, 1-2 sentence tasks. ESCALATE TO SONNET: code >10 lines, analysis, comparisons, planning, reports, multi-step reasoning, tables, long writing >3 paragraphs, summarization, research synthesis, most user conversations. ESCALATE TO OPUS: architecture decisions, complex debugging, multi-file refactoring, strategic planning, nuanced judgment, deep research, critical production decisions. Rule: If a human needs >30 seconds of focused thinking, escalate. If Sonnet struggles with complexity, go to Opus. Save 50-90% on API costs by starting cheap and escalating only when needed.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "smart-model-switching" with this command: npx skills add victorking2005/opus

Smart Model Switching

Three-tier Claude routing: Haiku → Sonnet → Opus

Start with the cheapest model. Escalate only when needed. Save 50-90% on API costs.

The Golden Rule

If a human would need more than 30 seconds of focused thinking, escalate from Haiku to Sonnet. If the task involves architecture, complex tradeoffs, or deep reasoning, escalate to Opus.

Cost Reality

ModelInputOutputRelative Cost
Haiku$0.25/M$1.25/M1x (baseline)
Sonnet$3.00/M$15.00/M12x
Opus$15.00/M$75.00/M60x

Bottom line: Wrong model selection wastes money OR time. Haiku for simple, Sonnet for standard, Opus for complex.


💚 HAIKU — Default for Simple Tasks

Stay on Haiku for:

  • Factual Q&A — "what is X", "who is Y", "when did Z"
  • Quick lookups — definitions, unit conversions, short translations
  • Status checks — calendar, file reads, session monitoring
  • Heartbeats — periodic checks, HEARTBEAT_OK responses
  • Memory & reminders — "remember this", "remind me to..."
  • Casual conversation — greetings, small talk, acknowledgments
  • Simple file ops — read, list, basic writes
  • One-liner tasks — anything answerable in 1-2 sentences

NEVER do these on Haiku

  • ❌ Write code longer than 10 lines
  • ❌ Create comparison tables
  • ❌ Write more than 3 paragraphs
  • ❌ Do multi-step analysis
  • ❌ Write reports or proposals

💛 SONNET — Standard Work (The Workhorse)

Escalate to Sonnet for:

Code & Technical

  • Code generation — write functions, build features, scripts
  • Code review — PR reviews, quality checks
  • Debugging — standard bug investigation
  • Documentation — README, comments, user guides

Analysis & Planning

  • Analysis & evaluation — compare options, assess trade-offs
  • Planning — project plans, roadmaps, task breakdowns
  • Research synthesis — combining multiple sources
  • Multi-step reasoning — "first... then... finally"

Writing & Content

  • Long-form writing — reports, proposals, articles (>3 paragraphs)
  • Creative writing — blog posts, descriptions, copy
  • Summarization — long documents, transcripts
  • Structured output — tables, outlines, formatted docs

❤️ OPUS — Complex Reasoning Only

Escalate to Opus for:

Architecture & Design

  • System architecture decisions
  • Major codebase refactoring
  • Design pattern selection with tradeoffs
  • Database schema design

Deep Analysis

  • Complex debugging (multi-file, race conditions)
  • Security reviews
  • Performance optimization strategy
  • Root cause analysis of subtle bugs

Strategic & Creative

  • Strategic planning — business decisions, roadmaps
  • Nuanced judgment — ethics, ambiguity, competing values
  • Deep research — comprehensive multi-source analysis

🔄 Implementation

For Subagents

```javascript // Routine monitoring sessions_spawn(task="Check backup status", model="haiku")

// Standard code work
sessions_spawn(task="Build the REST API endpoint", model="sonnet")

// Architecture decisions sessions_spawn(task="Design the database schema for multi-tenancy", model="opus") ```

For Cron Jobs

```json { "payload": { "kind": "agentTurn", "model": "haiku" } } ``` Always use Haiku for cron unless the task genuinely needs reasoning.


📊 Quick Decision Tree

``` Is it a greeting, lookup, status check, or 1-2 sentence answer? YES → HAIKU NO ↓

Is it code, analysis, planning, writing, or multi-step? YES → SONNET
NO ↓

Is it architecture, deep reasoning, or critical decision? YES → OPUS NO → Default to SONNET, escalate if struggling ```


📋 Quick Reference Card

``` ┌─────────────────────────────────────────────────────────────┐ │ SMART MODEL SWITCHING │ │ Haiku → Sonnet → Opus │ ├─────────────────────────────────────────────────────────────┤ │ 💚 HAIKU (cheapest) │ │ • Greetings, status checks, quick lookups │ │ • Factual Q&A, definitions, reminders │ │ • Simple file ops, 1-2 sentence answers │ ├─────────────────────────────────────────────────────────────┤ │ 💛 SONNET (standard) │ │ • Code > 10 lines, debugging │ │ • Analysis, comparisons, planning │ │ • Reports, proposals, long writing │ ├─────────────────────────────────────────────────────────────┤ │ ❤️ OPUS (complex) │ │ • Architecture decisions │ │ • Complex debugging, multi-file refactoring │ │ • Strategic planning, deep research │ ├─────────────────────────────────────────────────────────────┤ │ 💡 RULE: If a human needs > 30 sec thinking → escalate │ │ 💰 COST: Haiku 1x → Sonnet 12x → Opus 60x │ └─────────────────────────────────────────────────────────────┘ ```


Built for Claude-only setups with Haiku, Sonnet, and Opus. Inspired by save-money skill, extended with three-tier progression.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

Web Research Assistant

AI-powered web research assistant that leverages BrowserAct API to supplement restricted web access by searching the internet for additional information. Designed for OpenClaw and Claude Code.

Registry SourceRecently Updated
2.5K3phheng
Coding

Pandoc Convert

Convert documents between 40+ formats using pandoc CLI. Handles Markdown ↔ Word ↔ PDF ↔ HTML ↔ LaTeX ↔ EPUB with smart defaults, professional templates, and comprehensive tooling.

Registry SourceRecently Updated
Coding

Monitored Ralph Loop

Generate copy-paste bash scripts for Ralph Wiggum/AI agent loops (Codex, Claude Code, OpenCode, Goose). Use when asked for a "Ralph loop", "Ralph Wiggum loop", or an AI loop to plan/build code via PROMPT.md + AGENTS.md, SPECS, and IMPLEMENTATION_PLAN.md, including PLANNING vs BUILDING modes, backpressure, sandboxing, and completion conditions.

Registry SourceRecently Updated
1.3K0endogen
Coding

Forge

Autonomous quality engineering swarm that forges production-ready code through continuous behavioral verification, exhaustive E2E testing, and self-healing fix loops. Combines DDD+ADR+TDD methodology with BDD/Gherkin specifications, 7 quality gates, defect prediction, chaos testing, and cross-context dependency awareness. Architecture-agnostic — works with monoliths, microservices, modular monoliths, and any bounded-context topology.

Registry SourceRecently Updated