openclaw-self-healing

4-tier autonomous self-healing system for OpenClaw Gateway with persistent learning, reasoning logs, and multi-channel alerts. Features Claude Code as Level 3 emergency doctor for AI-powered diagnosis and repair.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "openclaw-self-healing" with this command: npx skills add ramsbaby/openclaw-self-healing/ramsbaby-openclaw-self-healing-openclaw-self-healing

OpenClaw Self-Healing System

"The system that heals itself — or calls for help when it can't."

A 4-tier autonomous self-healing system for OpenClaw Gateway.

Architecture

Level 1: Watchdog (180s)     → Process monitoring (OpenClaw built-in)
Level 2: Health Check (300s) → HTTP 200 + 3 retries
Level 3: Claude Recovery     → 30min AI-powered diagnosis 🧠
Level 4: Discord Alert       → Human escalation

What's Special (v2.0)

  • World's first Claude Code as Level 3 emergency doctor
  • Persistent Learning - Automatic recovery documentation (symptom → cause → solution → prevention)
  • Reasoning Logs - Explainable AI decision-making process
  • Multi-Channel Alerts - Discord + Telegram support
  • Metrics Dashboard - Success rate, recovery time, trending analysis
  • Production-tested (verified recovery Feb 5-6, 2026)
  • macOS LaunchAgent integration

Quick Setup

1. Install Dependencies

brew install tmux
npm install -g @anthropic-ai/claude-code

2. Configure Environment

# Copy template to OpenClaw config directory
cp .env.example ~/.openclaw/.env

# Edit and add your Discord webhook (optional)
nano ~/.openclaw/.env

3. Install Scripts

# Copy scripts
cp scripts/*.sh ~/openclaw/scripts/
chmod +x ~/openclaw/scripts/*.sh

# Install LaunchAgent
cp launchagent/com.openclaw.healthcheck.plist ~/Library/LaunchAgents/
launchctl load ~/Library/LaunchAgents/com.openclaw.healthcheck.plist

4. Verify

# Check Health Check is running
launchctl list | grep openclaw.healthcheck

# View logs
tail -f ~/openclaw/memory/healthcheck-$(date +%Y-%m-%d).log

Scripts

ScriptLevelDescription
gateway-healthcheck.sh2HTTP 200 check + 3 retries + escalation
emergency-recovery.sh3Claude Code PTY session for AI diagnosis (v1)
emergency-recovery-v2.sh3Enhanced with learning + reasoning logs (v2) ⭐
emergency-recovery-monitor.sh4Discord/Telegram notification on failure
metrics-dashboard.sh-Visualize recovery statistics (NEW)

Configuration

All settings via environment variables in ~/.openclaw/.env:

VariableDefaultDescription
DISCORD_WEBHOOK_URL(none)Discord webhook for alerts
OPENCLAW_GATEWAY_URLhttp://localhost:18789/Gateway health check URL
HEALTH_CHECK_MAX_RETRIES3Restart attempts before escalation
EMERGENCY_RECOVERY_TIMEOUT1800Claude recovery timeout (30 min)

Testing

Test Level 2 (Health Check)

# Run manually
bash ~/openclaw/scripts/gateway-healthcheck.sh

# Expected output:
# ✅ Gateway healthy

Test Level 3 (Claude Recovery)

# Inject a config error (backup first!)
cp ~/.openclaw/openclaw.json ~/.openclaw/openclaw.json.bak

# Wait for Health Check to detect and escalate (~8 min)
tail -f ~/openclaw/memory/emergency-recovery-*.log

Links

License

MIT License - do whatever you want with it.

Built by @ramsbaby + Jarvis 🦞

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

Client Intake Bot Pro

Automated client qualification and intake system. Captures leads through conversational forms, scores them based on fit criteria, sends personalized auto-res...

Registry SourceRecently Updated
Coding

LMFiles

Upload files to lmfiles.com and return public download links via API. Use when a user wants CLI-based file hosting, quick share links, bot-accessible file up...

Registry SourceRecently Updated
Coding

Github App Authentication

Give your AI agents and automations their own GitHub (App) identity. Authenticate using GitHub Apps so every commit, PR, and action is attributed to the bot...

Registry SourceRecently Updated
5950rmorse
Coding

vet

Run vet immediately after ANY logical unit of code changes. Do not batch your changes, do not wait to be asked to run vet, make sure you are proactive.

Registry SourceRecently Updated