# Test-Driven Development (TDD) Skill
Strict Red-Green-Refactor workflow enforcement with automated validation.
This skill acts as a TDD workflow state machine that enforces discipline through validators. It prevents common TDD anti-patterns and ensures proper test-first development.
## Quick Start

```bash
# Start TDD for a new feature
/sc:tdd "add user password validation"

# TDD with a specific framework
/sc:tdd payment-module --framework pytest

# Fast mode (skip the full suite after green; run it at the end)
/sc:tdd api-endpoint --fast

# Allow snapshot tests
/sc:tdd ui-component --allow-snapshots
```
## Behavioral Flow

This skill enforces a 7-step Red-Green-Refactor cycle:

1. **Initialize** - Detect scope and framework, create state
2. **RED: Write Failing Test** - Create a test for the missing functionality
3. **RED: Validate Failure** - Run the validator, confirm a SEMANTIC failure (assertion, not compile error)
4. **GREEN: Implement Minimally** - Write the smallest code that passes the test
5. **GREEN: Validate Passing** - Run the validators, confirm the test passes with no regressions
6. **REFACTOR: Improve Design** - Clean up with the tests as a safety net
7. **REFACTOR: Validate Stability** - Re-run the suite, confirm it is still green
## State Machine

```
IDLE → RED_PENDING → RED_CONFIRMED → GREEN_PENDING → GREEN_CONFIRMED → REFACTOR_PENDING → REFACTOR_COMPLETE → [cycle repeats]
```

Critical Rules:

- **Validator output is the source of truth** - If a validator says blocked, you MUST stop
- **Exit codes matter**: 0 = allowed, 2 = blocked, 3 = error (ask the user for help)
- **No bypassing** - Every phase transition requires validator approval
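The cycle above can be sketched as a tiny Python state machine. The phase names come from the diagram; the class and its transition rule are illustrative, not the actual `tdd_state_machine.py` implementation:

```python
# Minimal sketch of the phase ordering enforced above. Phase names come
# from the diagram; the class itself is illustrative, not the real
# tdd_state_machine.py API.
PHASES = [
    "IDLE", "RED_PENDING", "RED_CONFIRMED", "GREEN_PENDING",
    "GREEN_CONFIRMED", "REFACTOR_PENDING", "REFACTOR_COMPLETE",
]

class TDDStateMachine:
    def __init__(self) -> None:
        self.phase = "IDLE"
        self.history: list[tuple[str, str]] = []  # mirrors the --evidence flag

    def transition(self, target: str, evidence: str) -> None:
        """Allow only the next phase in the cycle; block everything else."""
        nxt = PHASES[(PHASES.index(self.phase) + 1) % len(PHASES)]
        if nxt == "IDLE":  # after REFACTOR_COMPLETE the cycle restarts at RED
            nxt = "RED_PENDING"
        if target != nxt:
            raise ValueError(f"blocked: {self.phase} -> {target} (expected {nxt})")
        self.phase = target
        self.history.append((target, evidence))
```

This captures the "no bypassing" rule: jumping from RED_PENDING straight to GREEN_PENDING raises, just as the validators would block it.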
## Phase 1: Initialize (IDLE → RED_PENDING)

When: User requests the TDD workflow

Your Actions:

1. Detect the scope root:

   ```bash
   python .claude/skills/sc-tdd/scripts/framework_detector.py \
     --detect-scope $(pwd) \
     --json
   ```

2. Initialize state:

   ```bash
   python .claude/skills/sc-tdd/scripts/tdd_state_machine.py \
     --scope-root <detected_scope> \
     --init \
     --json
   ```

3. Transition to RED_PENDING:

   ```bash
   python .claude/skills/sc-tdd/scripts/tdd_state_machine.py \
     --scope-root <scope> \
     --phase RED_PENDING \
     --evidence "User requested: <feature_description>" \
     --json
   ```

Output: Confirm initialization and explain the RED phase rules to the user
## Phase 2: Write Failing Test (RED_PENDING)

Rules:

- The user writes the test first
- You MAY help write the test
- You MAY add empty stubs/interfaces to make the test compile (e.g., `pass`, `raise NotImplementedError`, empty functions)
- You MUST NOT write production implementation yet

Your Actions:

1. Help the user write the test file
2. Explain that the test must fail with an assertion (not a compile error)
3. When the test is written, proceed to validation
## Phase 3: Validate Red (RED_PENDING → RED_CONFIRMED)

CRITICAL: You MUST run this validator before proceeding.

Command:

```bash
python .claude/skills/sc-tdd/scripts/validate_red.py \
  --scope-root <scope> \
  --json
```

Interpretation:

- Exit code 0 + `"allowed": true` → Proceed to RED_CONFIRMED
- Exit code 2 + `"allowed": false` → STOP, show the user the `reasons`, and ask them to fix

Common Blocking Reasons:

- "Test failed but NOT with semantic assertion" - Compile error; fix the test
- "Test PASSED (expected failure)" - The test is not actually testing the feature
- "No changed test files found" - Forgot to write a test

If Blocked: Show the user the `reasons` array and `artifacts.test_output`, and ask them to address the issue. DO NOT propose production code until the validator allows it.

If Allowed: Store the `artifacts.intent_test` information, transition state, and proceed to the GREEN phase.
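The interpretation rules above (and the identical ones in later phases) can be sketched as a small helper. The `allowed`, `reasons`, and `artifacts` field names come from this document; the helper itself is illustrative, not part of the validator scripts:

```python
import json

# Illustrative sketch: interpret a validator's exit code AND its JSON
# "allowed" field together, per the rules above.
def interpret_validator(returncode: int, stdout: str) -> dict:
    result = json.loads(stdout)
    if returncode == 0 and result.get("allowed") is True:
        return {"status": "proceed", "artifacts": result.get("artifacts", {})}
    if returncode == 2 and result.get("allowed") is False:
        return {"status": "blocked", "reasons": result.get("reasons", [])}
    # Exit code 3, or a mismatch between code and JSON: ask the user.
    return {"status": "error"}

print(interpret_validator(2, '{"allowed": false, "reasons": ["Test PASSED (expected failure)"]}'))
```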
## Phase 4: Implement Minimally (RED_CONFIRMED → GREEN_PENDING)

Rules:

- Write the MINIMAL implementation that passes the test
- Focus on making the test pass, not on perfection
- NO premature optimization
- NO extra features beyond the test's requirements

Your Actions:

1. Confirm the current state is RED_CONFIRMED
2. Implement minimal production code
3. Explain your implementation approach
4. Transition to GREEN_PENDING:

   ```bash
   python .claude/skills/sc-tdd/scripts/tdd_state_machine.py \
     --scope-root <scope> \
     --phase GREEN_PENDING \
     --evidence "Implemented: <description>" \
     --json
   ```

5. Proceed to validation
## Phase 5: Validate Green (GREEN_PENDING → GREEN_CONFIRMED)

CRITICAL: You MUST run this validator before claiming success.

Command (standard):

```bash
python .claude/skills/sc-tdd/scripts/validate_green.py \
  --scope-root <scope> \
  --json
```

Command (fast mode, skips the full suite):

```bash
python .claude/skills/sc-tdd/scripts/validate_green.py \
  --scope-root <scope> \
  --skip-full-suite \
  --json
```

Interpretation:

- Exit code 0 + `"allowed": true` → GREEN_CONFIRMED, proceed to refactor
- Exit code 2 + `"allowed": false` → STOP, the implementation is insufficient

Common Blocking Reasons:

- "Intent test still FAILING" - The implementation didn't make the test pass
- "Full test suite FAILED - REGRESSION DETECTED" - Existing tests were broken

If Blocked: Show the user the failure details, fix the implementation, and re-validate.

If Allowed: Celebrate! The test passes with no regressions. Ask the user whether they want to refactor or complete the cycle.
## Phase 6: Refactor (Optional, GREEN_CONFIRMED → REFACTOR_PENDING)

Rules:

- Improve code quality WITHOUT changing behavior
- Tests must continue passing
- Allowed changes: extract methods, rename variables, improve clarity, remove duplication
- NOT allowed: adding features, changing test expectations, weakening tests

Your Actions:

1. Ask the user if they want to refactor
2. If yes, transition to REFACTOR_PENDING
3. Propose refactorings
4. Apply the changes
5. Re-run the tests to confirm stability
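As a concrete illustration of an allowed refactoring (extract a helper and a named constant, behavior unchanged) -- `validate_password` is a hypothetical function, and both versions return identical results, so existing tests stay green:

```python
# REFACTOR sketch: behavior-preserving cleanup only (names illustrative).
MIN_LENGTH = 8  # extracted magic number into a named constant

def validate_password(password: str) -> bool:
    return _meets_min_length(password)

def _meets_min_length(password: str) -> bool:
    """Extracted helper: same behavior, clearer intent."""
    return len(password) >= MIN_LENGTH
```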
## Phase 7: Validate Refactor (REFACTOR_PENDING → REFACTOR_COMPLETE)

Command:

```bash
python .claude/skills/sc-tdd/scripts/validate_green.py \
  --scope-root <scope> \
  --json
```

(Same validator; it ensures the tests still pass.)

If Allowed: Cycle complete! Ask the user:

- Continue with the next test? (start a new RED cycle)
- Feature complete?
## Flags

| Flag | Type | Default | Description |
|------|------|---------|-------------|
| `--framework` | string | auto | Test framework: auto, pytest, jest, vitest, go, cargo |
| `--fast` | bool | false | Skip the full suite in GREEN; run it only at feature completion |
| `--allow-snapshots` | bool | false | Allow snapshot tests as semantic failures |
| `--strict` | bool | false | No stubs allowed in RED (must be pure test-first) |
## Personas Activated

- **agent-test-automator** - Test design expertise
- **guardian** - Test quality validation
- **agent-python-pro** / **agent-typescript-react-expert** - Language-specific implementation
- **architect** - Architecture guidance
## MCP Integration

- **PAL MCP**
  - `mcp__pal__codereview` - Review test quality and implementation
  - `mcp__pal__debug` - Investigate unexpected test failures
## Anti-Patterns Detected & Blocked

Blocked by validators:

- Production code before a failing test
- Compile errors counted as "red" (must be a runtime assertion)
- A test passing when it should fail
- Weakened assertions during the GREEN phase
- Regressions in the full suite

Warned by validators:

- Snapshot tests (blocked unless `--allow-snapshots`)
- Flaky patterns: sleep, network, or randomness without mocking
- Assertionless tests
- Tests that always pass
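As an illustration of fixing the "randomness without mocking" pattern, a seeded `random.Random` can be injected so the test becomes deterministic (`shuffle_deck` and the test are hypothetical examples, not part of this skill):

```python
import random

# Flaky-pattern fix sketch: inject a seeded RNG instead of relying on
# global, unseeded randomness. Same seed -> same order -> no flakiness.
def shuffle_deck(deck: list[int], rng: random.Random) -> list[int]:
    shuffled = deck[:]
    rng.shuffle(shuffled)
    return shuffled

def test_shuffle_is_deterministic_with_seed():
    deck = list(range(5))
    a = shuffle_deck(deck, random.Random(42))
    b = shuffle_deck(deck, random.Random(42))
    assert a == b             # deterministic under a fixed seed
    assert sorted(a) == deck  # still a permutation of the deck
```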
## Tool Coordination

- **Read** - Read existing test files, implementation code, framework configs
- **Write** - Create new test files and state files
- **Edit** - Modify tests and implementation incrementally
- **Bash** - Execute validator scripts (NON-NEGOTIABLE; always run validators)
- **Glob** - Find test files, manifest files, CI configs
- **Grep** - Search for test patterns and framework usage
## Quality Dimensions

Evaluated on:

- **Correctness (30%)** - Tests pass; the implementation works as specified
- **Completeness (20%)** - All test scenarios covered; edge cases handled
- **Testability (20%)** - Tests are clear, maintainable, fast, deterministic
- **Maintainability (15%)** - Code is clean and well-structured, with no duplication
- **Discipline (15%)** - Followed the TDD process with no shortcuts; proper Red-Green-Refactor

Threshold: a score of 70+ is required to mark a feature complete
## Examples

### Example 1: Feature Development

```
/sc:tdd "add password validation: minimum 8 characters"
```

Workflow enforced:

1. Initialize state for the current directory scope
2. User/Claude writes `test_password_validation.py` (with an empty `validate_password` stub so the test can run):

   ```python
   def test_password_min_length():
       assert validate_password("short") == False
       assert validate_password("longenough") == True
   ```

3. Run `validate_red.py` → confirms the test FAILS with an AssertionError (the stub returns nothing)
4. Claude implements:

   ```python
   def validate_password(password: str) -> bool:
       return len(password) >= 8
   ```

5. Run `validate_green.py` → confirms the test PASSES and the full suite is GREEN
6. Ask the user about refactoring
7. Feature complete or next cycle
### Example 2: Bug Fix with TDD

```
/sc:tdd "fix: user deletion doesn't cascade to posts" --framework pytest
```

Workflow:

1. Write a test exposing the bug (the test fails, showing the cascade doesn't work)
2. Validate RED (confirms the bug exists)
3. Fix the cascade behavior
4. Validate GREEN (confirms the bug is fixed, with no regressions)
### Example 3: Monorepo Package

```
cd backend/user-service
/sc:tdd "add email validation"
```

Auto-detects:

- `scope_root`: backend/user-service (nearest package.json)
- framework: jest (from package.json devDependencies)
- Runs tests only in this package's scope
## Troubleshooting

### "No testing framework detected"

Solution:

- Check that a test framework is installed
- Specify it manually: `--framework pytest`
- Ensure a framework config exists (pytest.ini, jest.config.js, etc.)

### "Test failed but NOT with semantic assertion"

Cause: Compile error, import error, or syntax error

Solution:

- Fix the test so it compiles and runs
- Add an empty stub if needed: `def my_function(): raise NotImplementedError`
- Re-run `validate_red.py`

### "Full test suite timed out"

Solution:

- Use `--fast` mode for faster cycles
- Run the full suite only at feature completion
- Investigate slow tests

### "Multiple test files changed"

Cause: More than one test file was modified (TDD requires focus)

Solution:

- Commit/stash the other changes
- Work on one test at a time
- Use multiple micro-cycles
## State Recovery

If the state file is lost or corrupted:

- The validator will attempt recovery from the git diff plus current test results
- You can manually re-initialize: `python tdd_state_machine.py --init --scope-root <path>`
- State is per-scope, so multiple packages can have independent TDD states
## Important Reminders

- **ALWAYS run validators before phase transitions** - This is non-negotiable
- **Trust validator output over assumptions** - If a validator says blocked, stop
- **Exit codes matter**: Check both the exit code AND the `"allowed"` field in the JSON
- **One test at a time** - TDD is about the smallest possible increments
- **Semantic failures only** - Compile errors don't count as a proper "red"
- **The full suite matters** - Don't skip it unless explicitly using `--fast` mode

Version: 1.0.0 | Last Updated: 2025-01-09 | Maintainer: SuperClaude Framework