Verification & Quality Assurance Skill
What This Skill Does
This skill provides a comprehensive verification and quality assurance system that ensures code quality and correctness through:
- Truth Scoring: Real-time reliability metrics (0.0-1.0 scale) for code, agents, and tasks
- Verification Checks: Automated code correctness, security, and best practices validation
- Automatic Rollback: Instant reversion of changes that fail verification (default threshold: 0.95)
- Quality Metrics: Statistical analysis with trends, confidence intervals, and improvement tracking
- CI/CD Integration: Export capabilities for continuous integration pipelines
- Real-time Monitoring: Live dashboards and watch modes for ongoing verification
Prerequisites
- Claude Flow installed (npx claude-flow@alpha)
- Git repository (for rollback features)
- Node.js 18+ (for dashboard features)
Quick Start
View current truth scores
npx claude-flow@alpha truth
Run verification check
npx claude-flow@alpha verify check
Verify specific file with custom threshold
npx claude-flow@alpha verify check --file src/app.js --threshold 0.98
Rollback last failed verification
npx claude-flow@alpha verify rollback --last-good
Complete Guide
Truth Scoring System
View Truth Metrics
Display comprehensive quality and reliability metrics for your codebase and agent tasks.
Basic Usage:
View current truth scores (default: table format)
npx claude-flow@alpha truth
View scores for specific time period
npx claude-flow@alpha truth --period 7d
View scores for specific agent
npx claude-flow@alpha truth --agent coder --period 24h
Find files/tasks below threshold
npx claude-flow@alpha truth --threshold 0.8
Output Formats:
Table format (default)
npx claude-flow@alpha truth --format table
JSON for programmatic access
npx claude-flow@alpha truth --format json
CSV for spreadsheet analysis
npx claude-flow@alpha truth --format csv
HTML report with visualizations
npx claude-flow@alpha truth --format html --export report.html
Real-time Monitoring:
Watch mode with live updates
npx claude-flow@alpha truth --watch
Export metrics automatically
npx claude-flow@alpha truth --export .claude-flow/metrics/truth-$(date +%Y%m%d).json
Truth Score Dashboard
Example dashboard output:
📊 Truth Metrics Dashboard
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Overall Truth Score: 0.947 ✅
Trend: ↗️ +2.3% (7d)

Top Performers:
  verification-agent   0.982 ⭐
  code-analyzer        0.971 ⭐
  test-generator       0.958 ⭐

Needs Attention:
  refactor-agent       0.821 ⚠️
  docs-generator       0.794 ⚠️

Recent Tasks:
  task-456   0.991 ⭐  "Implement auth"
  task-455   0.967 ⭐  "Add tests"
  task-454   0.743 ❌  "Refactor API"
Metrics Explained
Truth Scores (0.0-1.0):
- 0.95-1.0: Excellent ⭐ (production-ready)
- 0.85 to <0.95: Good ✅ (acceptable quality)
- 0.75 to <0.85: Warning ⚠️ (needs attention)
- <0.75: Critical ❌ (requires immediate action)
Trend Indicators:
- ↗️ Improving (positive trend)
- → Stable (consistent performance)
- ↘️ Declining (quality regression detected)
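The score bands and trend indicators above can be sketched as two small shell helpers. This is an illustration only, not the tool's own implementation: the function names are hypothetical, each band is assumed to include its lower bound, and the 1% band used to call a trend "stable" is an assumed tolerance that this guide does not specify.

```shell
# Map a truth score (0.0-1.0) to the band names used in this guide.
# Assumption: every band includes its lower bound.
classify_score() {
  awk -v s="$1" 'BEGIN {
    s = s + 0
    if (s >= 0.95)      print "Excellent"
    else if (s >= 0.85) print "Good"
    else if (s >= 0.75) print "Warning"
    else                print "Critical"
  }'
}

# Compare a previous and a current score and name the trend.
# Assumption: a relative change within +/-1% counts as stable.
trend_indicator() {
  awk -v prev="$1" -v curr="$2" 'BEGIN {
    prev = prev + 0; curr = curr + 0
    if (prev == 0) { print "unknown"; exit }
    change = (curr - prev) / prev * 100
    if (change > 1)       print "improving"
    else if (change < -1) print "declining"
    else                  print "stable"
  }'
}

classify_score 0.947          # prints "Good"
trend_indicator 0.90 0.947    # prints "improving"
```

Only awk is used for the floating-point comparisons, so the helpers do not depend on bc or jq.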
Statistics:
- Mean Score: Average truth score across all measurements
- Median Score: Middle value (less affected by outliers)
- Standard Deviation: Consistency of scores (lower = more consistent)
- Confidence Interval: Statistical reliability of measurements
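To make the four statistics concrete, here is a minimal sketch that computes them from a list of scores, one per line on stdin. The function name `score_stats` is hypothetical. The sample standard deviation and the normal-approximation 95% interval (mean ± 1.96 · sd / √n) are assumptions; this guide does not state which formulas the tool actually uses.

```shell
# Read one score per line on stdin and print mean, median,
# sample standard deviation, and an approximate 95% confidence interval.
score_stats() {
  sort -n | awk '
    { v[NR] = $1; sum += $1 }
    END {
      if (NR == 0) { print "no scores"; exit 1 }
      mean = sum / NR
      # Median: middle value, or the average of the two middle values.
      if (NR % 2) median = v[(NR + 1) / 2]
      else        median = (v[NR / 2] + v[NR / 2 + 1]) / 2
      for (i = 1; i <= NR; i++) ss += (v[i] - mean) ^ 2
      sd = (NR > 1) ? sqrt(ss / (NR - 1)) : 0
      half = 1.96 * sd / sqrt(NR)   # assumed normal approximation
      printf "mean=%.3f median=%.3f sd=%.3f ci95=[%.3f, %.3f]\n",
             mean, median, sd, mean - half, mean + half
    }'
}

# Example with the three task scores from the dashboard sample.
printf '%s\n' 0.991 0.967 0.743 | score_stats
```

The interval is not clamped to the 0.0-1.0 range, so with few or widely spread scores its upper bound can exceed 1.0.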
Verification Checks
Run Verification
Execute comprehensive verification checks on code, tasks, or agent outputs.
File Verification:
Verify single file
npx claude-flow@alpha verify check --file src/app.js
Verify directory recursively
npx claude-flow@alpha verify check --directory src/
Verify with auto-fix enabled
npx claude-flow@alpha verify check --file src/utils.js --auto-fix
Verify current working directory
npx claude-flow@alpha verify check
Task Verification:
Verify specific task output
npx claude-flow@alpha verify check --task task-123
Verify with custom threshold
npx claude-flow@alpha verify check --task task-456 --threshold 0.99
Verbose output for debugging
npx claude-flow@alpha verify check --task task-789 --verbose
Batch Verification:
Verify multiple files in parallel
npx claude-flow@alpha verify batch --files "*.js" --parallel
Verify with pattern matching
npx claude-flow@alpha verify batch --pattern "src/**/*.ts"
Integration test suite
npx claude-flow@alpha verify integration --test-suite full
Verification Criteria
The verification system evaluates:
Code Correctness
- Syntax validation
- Type checking (TypeScript)
- Logic flow analysis
- Error handling completeness
Best Practices
- Code style adherence
- SOLID principles
- Design patterns usage
- Modularity and reusability
Security
- Vulnerability scanning
- Secret detection
- Input validation
- Authentication/authorization checks
Performance
- Algorithmic complexity
- Memory usage patterns
- Database query optimization
- Bundle size impact
Documentation
- JSDoc/TypeDoc completeness
- README accuracy
- API documentation
- Code comments quality
JSON Output for CI/CD
Get structured JSON output
npx claude-flow@alpha verify check --json > verification.json
Example JSON structure:
{
  "overallScore": 0.947,
  "passed": false,
  "threshold": 0.95,
  "checks": [
    { "name": "code-correctness", "score": 0.98, "passed": true },
    { "name": "security", "score": 0.91, "passed": false, "issues": [...] }
  ]
}
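A CI job usually needs to know which checks failed, not just the overall score. The sketch below pulls the failing entries out of the `checks` array with jq. It assumes the field names shown in the example structure above (`checks`, `name`, `score`, `passed`); the function name `failed_checks` and the sample input are illustrative.

```shell
# Print "<name> <score>" for every check whose "passed" field is false.
# Reads the JSON from `verify check --json` on stdin; requires jq.
failed_checks() {
  jq -r '.checks[] | select(.passed == false) | "\(.name) \(.score)"'
}

# Sample input modelled on the structure above ("issues" left empty here).
sample='{"overallScore":0.947,"threshold":0.95,"checks":[
  {"name":"code-correctness","score":0.98,"passed":true},
  {"name":"security","score":0.91,"passed":false,"issues":[]}]}'

printf '%s\n' "$sample" | failed_checks
```

Against a real run, the same function would be fed from the saved file, for example `failed_checks < verification.json`.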
Automatic Rollback
Rollback Failed Changes
Automatically revert changes that fail verification checks.
Basic Rollback:
Rollback to last known good state
npx claude-flow@alpha verify rollback --last-good
Rollback to specific commit
npx claude-flow@alpha verify rollback --to-commit abc123
Interactive rollback with preview
npx claude-flow@alpha verify rollback --interactive
Smart Rollback:
Rollback only failed files (preserve good changes)
npx claude-flow@alpha verify rollback --selective
Rollback with automatic backup
npx claude-flow@alpha verify rollback --backup-first
Dry-run mode (preview without executing)
npx claude-flow@alpha verify rollback --dry-run
Rollback Performance:
- Git-based rollback: <1 second
- Selective file rollback: <500ms
- Backup creation: Automatic before rollback
Verification Reports
Generate Reports
Create detailed verification reports with metrics and visualizations.
Report Formats:
JSON report
npx claude-flow@alpha verify report --format json
HTML report with charts
npx claude-flow@alpha verify report --export metrics.html --format html
CSV for data analysis
npx claude-flow@alpha verify report --format csv --export metrics.csv
Markdown summary
npx claude-flow@alpha verify report --format markdown
Time-based Reports:
Last 24 hours
npx claude-flow@alpha verify report --period 24h
Last 7 days
npx claude-flow@alpha verify report --period 7d
Last 30 days with trends
npx claude-flow@alpha verify report --period 30d --include-trends
Custom date range
npx claude-flow@alpha verify report --from 2025-01-01 --to 2025-01-31
Report Content:
- Overall truth scores
- Per-agent performance metrics
- Task completion quality
- Verification pass/fail rates
- Rollback frequency
- Quality improvement trends
- Statistical confidence intervals
Interactive Dashboard
Launch Dashboard
Run an interactive web-based verification dashboard with real-time updates.
Launch dashboard on default port (3000)
npx claude-flow@alpha verify dashboard
Custom port
npx claude-flow@alpha verify dashboard --port 8080
Export dashboard data
npx claude-flow@alpha verify dashboard --export
Dashboard with auto-refresh
npx claude-flow@alpha verify dashboard --refresh 5s
Dashboard Features:
- Real-time truth score updates (WebSocket)
- Interactive charts and graphs
- Agent performance comparison
- Task history timeline
- Rollback history viewer
- Export to PDF/HTML
- Filter by time period/agent/score
Configuration
Default Configuration
Set verification preferences in .claude-flow/config.json:
{
  "verification": {
    "threshold": 0.95,
    "autoRollback": true,
    "gitIntegration": true,
    "hooks": {
      "preCommit": true,
      "preTask": true,
      "postEdit": true
    },
    "checks": {
      "codeCorrectness": true,
      "security": true,
      "performance": true,
      "documentation": true,
      "bestPractices": true
    }
  },
  "truth": {
    "defaultFormat": "table",
    "defaultPeriod": "24h",
    "warningThreshold": 0.85,
    "criticalThreshold": 0.75,
    "autoExport": {
      "enabled": true,
      "path": ".claude-flow/metrics/truth-daily.json"
    }
  }
}
Threshold Configuration
Adjust verification strictness:
Strict mode (99% accuracy required)
npx claude-flow@alpha verify check --threshold 0.99
Lenient mode (90% acceptable)
npx claude-flow@alpha verify check --threshold 0.90
Set default threshold
npx claude-flow@alpha config set verification.threshold 0.98
Per-environment thresholds:
{
  "verification": {
    "thresholds": {
      "production": 0.99,
      "staging": 0.95,
      "development": 0.90
    }
  }
}
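This guide does not say how the tool decides which environment applies. One way to make the choice explicit is to resolve the threshold in a wrapper script and pass it with the `--threshold` flag shown earlier. The sketch below mirrors the values in the configuration above; the function name `threshold_for_env` and the `DEPLOY_ENV` variable are hypothetical.

```shell
# Return the verification threshold for a named environment.
# Values mirror the per-environment configuration above.
threshold_for_env() {
  case "$1" in
    production)  echo 0.99 ;;
    staging)     echo 0.95 ;;
    development) echo 0.90 ;;
    *) echo "unknown environment: $1" >&2; return 1 ;;
  esac
}

threshold_for_env staging   # prints "0.95"

# Possible use in a pipeline (DEPLOY_ENV is an assumed variable):
#   npx claude-flow@alpha verify check \
#     --threshold "$(threshold_for_env "$DEPLOY_ENV")"
```

Failing on an unknown environment name is deliberate: a typo aborts the run instead of silently falling back to a lenient threshold.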
Integration Examples
CI/CD Integration
GitHub Actions:
name: Quality Verification
on: [push, pull_request]
jobs:
  verify:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Install Dependencies
        run: npm install
      - name: Run Verification
        run: |
          npx claude-flow@alpha verify check --json > verification.json
      - name: Check Truth Score
        run: |
          score=$(jq '.overallScore' verification.json)
          if (( $(echo "$score < 0.95" | bc -l) )); then
            echo "Truth score too low: $score"
            exit 1
          fi
      - name: Upload Report
        uses: actions/upload-artifact@v4
        with:
          name: verification-report
          path: verification.json
GitLab CI:
verify:
  stage: test
  script:
    - npx claude-flow@alpha verify check --threshold 0.95 --json > verification.json
    - |
      score=$(jq '.overallScore' verification.json)
      if [ "$(echo "$score < 0.95" | bc)" -eq 1 ]; then
        echo "Verification failed with score: $score"
        exit 1
      fi
  artifacts:
    paths:
      - verification.json
Swarm Integration
Run verification automatically during swarm operations:
Swarm with verification enabled
npx claude-flow@alpha swarm --verify --threshold 0.98
Hive Mind with auto-rollback
npx claude-flow@alpha hive-mind --verify --rollback-on-fail
Training pipeline with verification
npx claude-flow@alpha train --verify --threshold 0.99
Pair Programming Integration
Enable real-time verification during collaborative development:
Pair with verification
npx claude-flow@alpha pair --verify --real-time
Pair with custom threshold
npx claude-flow@alpha pair --verify --threshold 0.97 --auto-fix
Advanced Workflows
Continuous Verification
Monitor codebase continuously during development:
Watch directory for changes
npx claude-flow@alpha verify watch --directory src/
Watch with auto-fix
npx claude-flow@alpha verify watch --directory src/ --auto-fix
Watch with notifications
npx claude-flow@alpha verify watch --notify --threshold 0.95
Monitoring Integration
Send metrics to external monitoring systems:
Export to Prometheus
npx claude-flow@alpha truth --format json |
  curl -X POST https://pushgateway.example.com/metrics/job/claude-flow \
    -d @-
Send to DataDog
npx claude-flow@alpha verify report --format json |
  curl -X POST "https://api.datadoghq.com/api/v1/series?api_key=${DD_API_KEY}" \
    -H "Content-Type: application/json" \
    -d @-
Custom webhook
npx claude-flow@alpha truth --format json |
  curl -X POST https://metrics.example.com/api/truth \
    -H "Content-Type: application/json" \
    -d @-
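One caveat for the Prometheus case: a Pushgateway accepts the Prometheus text exposition format, not JSON, so the score has to be converted before it is pushed. A minimal sketch of that conversion follows. The metric name `claude_flow_truth_score` and the function name `to_prometheus` are hypothetical, and the usage comment assumes the `overallScore` field from the `verify check --json` output shown earlier.

```shell
# Render a single truth score as a Prometheus gauge in text exposition format.
# $1 = overall truth score (for example 0.947)
to_prometheus() {
  printf '# TYPE claude_flow_truth_score gauge\n'
  printf 'claude_flow_truth_score %s\n' "$1"
}

to_prometheus 0.947

# Possible use, pushing the score from a saved verification run:
#   to_prometheus "$(jq '.overallScore' verification.json)" |
#     curl --data-binary @- https://pushgateway.example.com/metrics/job/claude-flow
```

`--data-binary` is used in place of `-d` because `-d` strips newlines from the input, and the text format is newline-delimited.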
Pre-commit Hooks
Automatically verify before commits:
Install pre-commit hook
npx claude-flow@alpha verify install-hook --pre-commit
.git/hooks/pre-commit example:
#!/bin/bash
npx claude-flow@alpha verify check --threshold 0.95 --json > /tmp/verify.json

score=$(jq '.overallScore' /tmp/verify.json)
if (( $(echo "$score < 0.95" | bc -l) )); then
  echo "❌ Verification failed with score: $score"
  echo "Run 'npx claude-flow@alpha verify check --verbose' for details"
  exit 1
fi

echo "✅ Verification passed with score: $score"
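The hook and the CI examples compare the score to the threshold with bc, which is not installed on every minimal image. As an alternative sketch, the same comparison can be done with awk alone. The function name `score_meets_threshold` is hypothetical, and a score equal to the threshold is treated as passing, matching the exit-code convention in this guide (score ≥ threshold).

```shell
# Succeed (exit status 0) when score >= threshold, fail otherwise.
# $1 = score, $2 = threshold
score_meets_threshold() {
  awk -v s="$1" -v t="$2" 'BEGIN { exit !((s + 0) >= (t + 0)) }'
}

if score_meets_threshold 0.947 0.95; then
  echo "verification passed"
else
  echo "verification failed"   # this branch runs: 0.947 is below 0.95
fi
```

In the hook above, the `if (( ... bc -l ))` line could then be written as `if ! score_meets_threshold "$score" 0.95; then`.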
Performance Metrics
Verification Speed:
- Single file check: <100ms
- Directory scan: <500ms (per 100 files)
- Full codebase analysis: <5s (typical project)
- Truth score calculation: <50ms
Rollback Speed:
- Git-based rollback: <1s
- Selective file rollback: <500ms
- Backup creation: <2s
Dashboard Performance:
- Initial load: <1s
- Real-time updates: <100ms latency (WebSocket)
- Chart rendering: 60 FPS
Troubleshooting
Common Issues
Low Truth Scores:
Get detailed breakdown
npx claude-flow@alpha truth --verbose --threshold 0.0
Check specific criteria
npx claude-flow@alpha verify check --verbose
View agent-specific issues
npx claude-flow@alpha truth --agent <agent-name> --format json
Rollback Failures:
Check git status
git status
View rollback history
npx claude-flow@alpha verify rollback --history
Manual rollback (destructive: discards the last commit and any uncommitted changes)
git reset --hard HEAD~1
Verification Timeouts:
Increase timeout
npx claude-flow@alpha verify check --timeout 60s
Verify in batches
npx claude-flow@alpha verify batch --batch-size 10
Exit Codes
Verification commands return standard exit codes:
- 0: Verification passed (score ≥ threshold)
- 1: Verification failed (score < threshold)
- 2: Error during verification (invalid input, system error)
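Scripts that call the verifier usually need to treat "failed" and "error" differently, for instance retrying on an error but never on a genuine failure. The sketch below maps the three exit codes listed above to short messages. The function name `describe_exit` and the message wording are illustrative.

```shell
# Translate a verification exit code into a short status message.
# $1 = exit code returned by a verify command
describe_exit() {
  case "$1" in
    0) echo "passed" ;;
    1) echo "failed: score below threshold" ;;
    2) echo "error: verification did not complete" ;;
    *) echo "unexpected exit code: $1" ;;
  esac
}

describe_exit 1   # prints "failed: score below threshold"

# Possible use right after a verification run:
#   npx claude-flow@alpha verify check --threshold 0.95
#   describe_exit "$?"
```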
Related Commands
- npx claude-flow@alpha pair: Collaborative development with verification
- npx claude-flow@alpha train: Training with verification feedback
- npx claude-flow@alpha swarm: Multi-agent coordination with quality checks
- npx claude-flow@alpha report: Generate comprehensive project reports
Best Practices
- Set Appropriate Thresholds: Use 0.99 for critical code, 0.95 for standard, 0.90 for experimental
- Enable Auto-rollback: Prevent bad code from persisting
- Monitor Trends: Track improvement over time, not just current scores
- Integrate with CI/CD: Make verification part of your pipeline
- Use Watch Mode: Get immediate feedback during development
- Export Metrics: Track quality metrics in your monitoring system
- Review Rollbacks: Understand why changes were rejected
- Train Agents: Use verification feedback to improve agent performance
Additional Resources
- Truth Scoring Algorithm: See /docs/truth-scoring.md
- Verification Criteria: See /docs/verification-criteria.md
- Integration Examples: See /examples/verification/
- API Reference: See /docs/api/verification.md