error-monitoring-agent

Error monitoring and alerting for AI agents. Detect, track, and resolve errors in real-time. Triggers: error monitoring, error tracking, alerting, exception handling, error analysis.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "error-monitoring-agent" with this command: npx skills add sky-lv/skylv-error-monitoring-agent

Error Monitoring Agent

Real-time error monitoring and alerting for AI agents. Detect, track, analyze, and resolve errors automatically.

Overview

A comprehensive error monitoring system that helps agents detect exceptions in real-time, track error patterns, set up intelligent alerts, and automate resolution workflows.

Capabilities

1. Error Detection

node monitor.js watch --source logs,api,workers --threshold 5/min
node monitor.js watch --pattern "UnhandledPromiseRejection|ENOTFOUND"

Monitors multiple sources for errors with configurable thresholds and pattern matching.

2. Error Aggregation

node monitor.js aggregate --group-by stacktrace --min-similarity 0.85
node monitor.js aggregate --time-window 1h --top 20

Groups similar errors together to reduce noise and identify patterns.

3. Alert Rules

node monitor.js alert --rule "error_rate > 10/min" --channel slack
node monitor.js alert --rule "new_error_type" --channel pagerduty --severity critical
node monitor.js alert --rule "error_spike > 3x_baseline" --channel email

Configurable alerting with rate thresholds, new error detection, and spike monitoring.

4. Root Cause Analysis

node monitor.js analyze --error-id err_abc123 --depth 5
node monitor.js analyze --correlate deploy-log,config-change

Traces error chains, correlates with deployments and config changes.

5. Auto-Resolution

node monitor.js auto-resolve --strategy restart,retry,rollback
node monitor.js auto-resolve --known-fixes db --apply-approved

Automatically resolves known error patterns with approved remediation strategies.

Configuration

{
  "monitoring": {
    "sources": ["application", "infrastructure", "api"],
    "sampling": 1.0,
    "retention": "30d",
    "alertRules": [
      { "condition": "error_rate > 10/min", "action": "page-oncall" },
      { "condition": "new_error_type", "action": "notify-channel" },
      { "condition": "error_spike > 3x", "action": "auto-investigate" }
    ],
    "autoResolve": {
      "enabled": true,
      "approvedStrategies": ["restart-service", "retry-request", "rollback-deploy"]
    }
  }
}

Use Cases

  • Production Monitoring: Watch production systems for errors 24/7
  • CI/CD Integration: Monitor deployment health after releases
  • Agent Health: Track AI agent errors and failures
  • Incident Response: Detect and respond to incidents automatically
  • Error Budgets: Track error rates against SLO targets

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

Monitor n8n Automations with OpenClaw

Monitor and debug n8n workflow executions via webhook. Provides health checks (GREEN/YELLOW/RED), failure analysis, error debugging, and formatted alerting t...

Registry SourceRecently Updated
1540Profile unavailable
Coding

Clawhub Skill Infra Watchdog

Self-hosted infrastructure monitoring for HTTP, TCP, SSL, disk, memory, load, Docker, DNS, and custom commands with alerting via OpenClaw messaging.

Registry SourceRecently Updated
9030Profile unavailable
Security

Exec Inspector

查看和分析 OpenClaw Agent exec 工具的执行历史,支持查询、搜索、统计和实时监控命令使用详情。

Registry SourceRecently Updated
1K0Profile unavailable
General

Vibe Billing Scan

Scan your OpenClaw logs to identify costly runs, sessions, retry storms, and looped tool calls driving your API bill higher, with no signup needed.

Registry SourceRecently Updated
1390Profile unavailable