Agent Safety Monitor
Monitor AI agent behavior in real-time, detect anomalies, and enforce safety constraints.
Quick Start
Run the monitor against an agent log stream:
python3 scripts/monitor.py --config safety_rules.yaml --input agent_logs/
Architecture
Agent Actions → Event Stream → Rule Engine → Alerts → Dashboard
↓
Anomaly Detector
↓
Auto-Pause/Kill
Safety Rule Types
- Rate limits: Max N actions per minute/hour
- Action allowlists: Only permit specific tool calls
- Content filters: Block PII exfiltration, harmful outputs
- Budget caps: Stop when cost exceeds threshold
- Scope limits: Restrict file/API/network access
- Behavioral anomalies: Flag unusual patterns (tool call frequency spikes, repetitive loops)
Rule Configuration
See references/rules-reference.md for the YAML schema.
Alert Channels
- Console (default)
- Webhook (Slack, Discord)
- File log
- Dashboard (HTTP server)
Dashboard
Launch the monitoring dashboard:
python3 scripts/dashboard.py --port 8080