safety-curves
Guide for understanding and explaining Control Tower safety curves. Use when the user asks about safety-usefulness tradeoffs, audit budgets, attack success rates, monitor thresholds, safety metrics, or interpreting safety curve plots.
Guide to Control Tower's AI control concepts. Use when the user asks about control protocols, blue team and red team roles, usefulness, safety, audit budgets, trusted monitoring, or the conceptual model behind Control Tower.
This listing is imported from SkillsMP metadata and should be treated as untrusted until upstream source review is completed.
Install skill "control-concepts" with this command: npx skills add linuxarena/skillsmp-linuxarena-linuxarena-control-concepts
This source entry does not include full markdown content beyond metadata.
This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
Related by shared tags or category signals.
Guide for understanding and explaining Control Tower safety curves. Use when the user asks about safety-usefulness tradeoffs, audit budgets, attack success rates, monitor thresholds, safety metrics, or interpreting safety curve plots.
Guide for working with Control Tower trajectories. Use when the user asks about trajectory formats, trajectory IDs, converting .eval logs, uploading or downloading trajectories, replaying trajectories, or trajectory CLI commands.
Preflight security scanner for AI coding agents — scans deployment config, skills/MCP servers, memory/sessions, and AI agent config files (hooks injection) f...
Perform comprehensive security audits on skills to identify vulnerabilities, unsafe patterns, and compliance issues. Use when auditing skills for security, c...