Testing Strategies
Build comprehensive, effective test suites by strategically selecting and implementing the right testing approaches across unit, integration, E2E, and contract testing levels.
Purpose
This skill provides strategic frameworks for:
- Test Type Selection: Determine when to use unit vs. integration vs. E2E vs. contract testing
- Test Pyramid Balancing: Optimize test distribution for fast feedback and reliable coverage
- Multi-Language Implementation: Apply consistent testing patterns across TypeScript, Python, Go, and Rust
- Test Data Management: Choose appropriate strategies (fixtures, factories, property-based testing)
- CI/CD Integration: Integrate tests into automated pipelines with optimal execution patterns
Testing is foundational to reliable software. With microservices architectures and continuous delivery becoming standard in 2025, strategic testing across multiple levels is more critical than ever.
When to Use This Skill
Invoke this skill when:
- Building a new feature that requires test coverage
- Designing a testing strategy for a new project
- Refactoring existing tests to improve speed or reliability
- Setting up CI/CD pipelines with testing stages
- Choosing between unit, integration, or E2E testing approaches
- Implementing contract testing for microservices
- Managing test data with fixtures, factories, or property-based testing
The Testing Pyramid Framework
Core Concept
The testing pyramid guides test distribution for optimal speed and confidence:
/\
/ \ E2E Tests (10%)
/----\ - Slow but comprehensive
/ \ - Full stack validation
/--------\
/ \ Integration Tests (20-30%)
/ \ - Moderate speed
/--------------\ - Component interactions
/ \
/------------------\ Unit Tests (60-70%)
- Fast feedback
- Isolated units
Key Principle: More unit tests (fast, isolated), fewer E2E tests (slow, comprehensive). Integration tests bridge the gap.
Modern Adaptations (2025)
Microservices Adjustment:
- Add contract testing layer between unit and integration
- Increase integration/contract tests to 30% (validate service boundaries)
- Reduce E2E tests to critical user journeys only
Cloud-Native Patterns:
- Use containers for integration tests (ephemeral databases, test services)
- Parallel execution for fast CI/CD feedback
- Risk-based test prioritization (focus on high-impact areas)
For detailed pyramid guidance, see references/testing-pyramid.md.
Universal Testing Decision Tree
Which Test Type Should I Use?
START: Need to test [feature]
Q1: Does this involve multiple systems/services?
├─ YES → Q2
└─ NO → Q3
Q2: Is this a critical user-facing workflow?
├─ YES → E2E Test (complete user journey)
└─ NO → Integration or Contract Test
Q3: Does this interact with external dependencies (DB, API, filesystem)?
├─ YES → Integration Test (real DB, mocked API)
└─ NO → Q4
Q4: Is this pure business logic or a pure function?
├─ YES → Unit Test (fast, isolated)
└─ NO → Component or Integration Test
Test Type Selection Examples
| Feature | Test Type | Rationale |
|---|---|---|
calculateTotal(items) | Unit | Pure function, no dependencies |
POST /api/users endpoint | Integration | Tests API + database interaction |
| User registration flow (form → API → redirect) | E2E | Critical user journey, full stack |
| Microservice A → B communication | Contract | Service interface validation |
formatCurrency(amount, locale) | Unit + Property | Pure logic, many edge cases |
| Form validation logic | Unit | Isolated business rules |
| File upload to S3 | Integration | External service interaction |
For comprehensive decision frameworks, see references/decision-tree.md.
Testing Levels in Detail
Unit Testing (Foundation - 60-70%)
Purpose: Validate small, isolated units of code (functions, methods, components)
Characteristics:
- Fast (milliseconds per test)
- Isolated (no external dependencies)
- Deterministic (same input = same output)
- Broad coverage (many tests, small scope each)
When to Use:
- Pure functions (input → output)
- Business logic and algorithms
- Utility functions
- Component rendering (without integration)
- Validation logic
Recommended Tools:
- TypeScript/JavaScript: Vitest (primary, 10x faster than Jest), Jest (legacy)
- Python: pytest (industry standard)
- Go: testing package (stdlib) + testify (assertions)
- Rust: cargo test (stdlib)
For detailed patterns, see references/unit-testing-patterns.md.
Integration Testing (Middle Layer - 20-30%)
Purpose: Validate interactions between components, modules, or services
Characteristics:
- Moderate speed (seconds per test)
- Partial integration (real database, mocked external APIs)
- Focused scope (test component boundaries)
- API and database validation
When to Use:
- API endpoints (request → response)
- Database operations (CRUD, queries)
- Service-to-service communication
- Event handlers and message processing
- File I/O operations
Recommended Tools:
- TypeScript/JavaScript: Vitest + MSW (API mocking), Supertest (HTTP testing)
- Python: pytest + pytest-httpserver, pytest-postgresql
- Go: testing + httptest, testcontainers
- Rust: cargo test + mockito, testcontainers
For detailed patterns, see references/integration-testing-patterns.md.
End-to-End Testing (Top Layer - 10%)
Purpose: Validate complete user workflows across the entire application stack
Characteristics:
- Slow (minutes per test suite)
- Full integration (real browser, services, database)
- Wide scope (user journeys from start to finish)
- Prone to flakiness (requires careful design)
When to Use:
- Critical user journeys (login, checkout, payment)
- Cross-browser compatibility validation
- Real-world scenarios not covered by integration tests
- Regression prevention for core features
Best Practices:
- Limit E2E tests to high-value scenarios (not every edge case)
- Use stable selectors (data-testid, not CSS classes)
- Implement retry logic for network flakiness
- Run tests in parallel for speed
Recommended Tools:
- All Languages: Playwright (cross-browser, fast, Microsoft-backed)
For detailed patterns, see references/e2e-testing-patterns.md.
Contract Testing (Microservices)
Purpose: Validate service interfaces and API contracts without full integration
When to Use:
- Microservices architecture
- Service-to-service communication
- API contract validation
- Reducing E2E testing overhead
Recommended Tool: Pact (pact.io) - supports TypeScript, Python, Go, Rust
For detailed patterns, see references/contract-testing.md.
Test Data Management Strategies
When to Use Each Approach
Fixtures (Static Data):
- Pros: Deterministic, easy to debug
- Cons: Can become stale, doesn't test variety
- Use When: Testing known scenarios, regression tests
Factories (Generated Data):
- Pros: Flexible, generates variety
- Cons: Less deterministic, harder to debug
- Use When: Need diverse test data, testing edge cases
Property-Based Testing (Random Data):
- Pros: Finds edge cases not anticipated
- Cons: Can be slow, failures harder to reproduce
- Use When: Complex algorithms, parsers, validators
Recommended Combination:
- Unit Tests: Fixtures (known inputs) + Property-Based (edge cases)
- Integration Tests: Factories (flexible data) + Database seeding
- E2E Tests: Fixtures (reproducible scenarios)
Property-Based Testing Tools:
- TypeScript/JavaScript: fast-check
- Python: hypothesis (best-in-class)
- Go: gopter
- Rust: proptest (primary)
For detailed strategies, see references/test-data-strategies.md.
Mocking Decision Matrix
When to Mock vs. Use Real Dependencies
| Dependency | Unit Test | Integration Test | E2E Test |
|---|---|---|---|
| Database | Mock (in-memory) | Real (test DB, Docker) | Real (staging DB) |
| External API | Mock (MSW, nock) | Mock (MSW, VCR) | Real (or staging) |
| Filesystem | Mock (in-memory FS) | Real (temp directory) | Real |
| Time/Date | Mock (freezeTime) | Mock (if deterministic) | Real (usually) |
| Environment Variables | Mock (setEnv) | Mock (test config) | Real (test env) |
| Internal Services | Mock (stub) | Real (or container) | Real |
Guiding Principles:
- Unit Tests: Mock everything external
- Integration Tests: Use real database (ephemeral), mock external APIs
- E2E Tests: Use real everything (or staging equivalents)
- Contract Tests: Mock nothing (test real interfaces)
For detailed mocking patterns, see references/mocking-strategies.md.
Language-Specific Quick Starts
TypeScript/JavaScript
Unit Testing with Vitest:
import { describe, test, expect } from 'vitest'
test('calculates total with tax', () => {
const items = [{ price: 10, quantity: 2 }]
expect(calculateTotal(items, 0.1)).toBe(22)
})
Integration Testing with MSW:
import { setupServer } from 'msw/node'
import { http, HttpResponse } from 'msw'
const server = setupServer(
http.get('/api/user/:id', ({ params }) => {
return HttpResponse.json({ id: params.id, name: 'Test User' })
})
)
E2E Testing with Playwright:
import { test, expect } from '@playwright/test'
test('user can checkout', async ({ page }) => {
await page.goto('https://example.com')
await page.getByRole('button', { name: 'Add to Cart' }).click()
await expect(page.getByText('1 item in cart')).toBeVisible()
})
See examples/typescript/ for complete working examples.
Python
Unit Testing with pytest:
def test_calculate_total_with_tax():
items = [{"price": 10, "quantity": 2}]
assert calculate_total(items, tax_rate=0.1) == 22
Property-Based Testing with hypothesis:
from hypothesis import given, strategies as st
@given(st.lists(st.integers()))
def test_reverse_reverse_is_identity(lst):
assert reverse(reverse(lst)) == lst
See examples/python/ for complete working examples.
Go and Rust
See examples/go/ and examples/rust/ for complete working examples in these languages.
Coverage and Quality Metrics
Meaningful Coverage Targets
Recommended Targets:
- Critical Business Logic: 90%+ coverage
- API Endpoints: 80%+ coverage
- Utility Functions: 70%+ coverage
- UI Components: 60%+ coverage (focus on logic, not markup)
- Overall Project: 70-80% coverage
Anti-Pattern: Aiming for 100% coverage leads to testing trivial code and false confidence.
Coverage Tools:
- TypeScript/JavaScript: Vitest coverage (c8/istanbul)
- Python: pytest-cov
- Go: go test -cover
- Rust: cargo-tarpaulin
Mutation Testing (Validate Test Quality)
Mutation testing introduces bugs and verifies tests catch them.
Tools:
- TypeScript/JavaScript: Stryker Mutator
- Python: mutmut
- Go: go-mutesting
- Rust: cargo-mutants
For detailed coverage strategies, see references/coverage-strategies.md.
CI/CD Integration Patterns
Fast Feedback Loop Strategy
Stage 1: Pre-Commit (< 30 seconds)
- Lint and format checks
- Unit tests (critical paths only)
Stage 2: On Commit (< 2 minutes)
- All unit tests
- Static analysis
Stage 3: Pull Request (< 5 minutes)
- Integration tests
- Coverage reporting
Stage 4: Pre-Merge (< 10 minutes)
- E2E tests (critical paths)
- Cross-browser testing
Parallel Execution
Speed Up Test Runs:
- Vitest:
vitest --threads - Playwright:
playwright test --workers=4 - pytest:
pytest -n auto(requires pytest-xdist) - Go:
go test -parallel 4
Common Anti-Patterns to Avoid
The Ice Cream Cone (Inverted Pyramid)
Problem: Too many E2E tests, too few unit tests Result: Slow test suites, flaky tests, long feedback loops Solution: Rebalance toward more unit tests, fewer E2E tests
Testing Implementation Details
Problem: Tests coupled to internal structure, not behavior Solution: Test behavior (inputs → outputs), not implementation
Over-Mocking in Integration Tests
Problem: Mocking everything defeats purpose of integration tests Solution: Use real databases (ephemeral), mock only external APIs
Flaky E2E Tests
Problem: Tests fail intermittently due to timing issues Solutions:
- Use auto-wait features (Playwright has built-in auto-wait)
- Avoid hardcoded waits (
sleep(1000)) - Use stable selectors (data-testid)
- Implement retry logic for network requests
Integration with Other Skills
For form testing: See building-forms skill for form-specific validation and submission testing patterns.
For API testing: See api-patterns skill for REST/GraphQL endpoint testing and contract validation.
For CI/CD pipelines: See building-ci-pipelines skill for test automation, parallel execution, and coverage reporting.
For data visualization testing: See visualizing-data skill for snapshot testing chart configurations and visual regression testing.
Reference Documentation
For deeper exploration of specific topics:
references/testing-pyramid.md- Detailed testing pyramid framework and balancing strategiesreferences/decision-tree.md- Comprehensive decision frameworks for test type selectionreferences/unit-testing-patterns.md- Unit testing patterns across languagesreferences/integration-testing-patterns.md- Integration testing with databases, APIs, and containersreferences/e2e-testing-patterns.md- E2E testing best practices and Playwright patternsreferences/contract-testing.md- Consumer-driven contract testing with Pactreferences/test-data-strategies.md- Fixtures, factories, and property-based testingreferences/mocking-strategies.md- When and how to mock dependenciesreferences/coverage-strategies.md- Meaningful coverage metrics and mutation testing
Working Examples
Complete, runnable code examples are available in the examples/ directory:
examples/typescript/- Vitest unit/integration, Playwright E2E, MSW mocking, fast-check property testsexamples/python/- pytest unit/integration, fixtures, Playwright E2E, hypothesis property testsexamples/go/- stdlib testing, testify assertions, httptest integrationexamples/rust/- cargo test unit tests, proptest property tests, integration patterns
All examples include dependencies, usage instructions, and error handling.