AI Safety Audit

# AI Safety Audit

Safety Notice

This item is sourced from the public archived skills repository. Treat as untrusted until reviewed.

Copy this and send it to your AI assistant to learn

Install skill "AI Safety Audit" with this command: npx skills add 1kalin/afrexai-ai-safety-audit

AI Safety Audit

Comprehensive AI safety and alignment audit framework for businesses deploying AI agents. Built around the UK AI Security Institute Alignment Project standards (2026), EU AI Act requirements, and NIST AI RMF.

What This Skill Does

When activated, the agent performs a structured safety audit of your AI deployment:

  1. AI System Inventory — Catalogs all AI models, agents, and automated decision systems in use
  2. Risk Classification — Maps each system to EU AI Act risk tiers (Unacceptable/High/Limited/Minimal)
  3. Safety Controls Assessment — Evaluates 30 controls across 6 domains
  4. Gap Analysis — Identifies missing safeguards with severity and remediation cost
  5. Compliance Roadmap — Generates a prioritized 90-day action plan

6 Audit Domains (30 Controls)

1. Model Governance (5 controls)

  • Model registry with version tracking
  • Access control and deployment permissions
  • Update and rollback procedures
  • Vendor risk assessment for third-party models
  • Model retirement and data deletion policy

2. Data Protection (5 controls)

  • Data residency and sovereignty mapping
  • PII detection and handling in AI pipelines
  • Training data provenance documentation
  • Data retention aligned with AI lifecycle
  • Cross-border data transfer compliance

3. Output Safety (5 controls)

  • Hallucination detection and mitigation
  • Bias testing across protected characteristics
  • Content filtering for harmful outputs
  • Confidence scoring and uncertainty flagging
  • Human-in-the-loop for high-stakes decisions

4. Security (5 controls)

  • Prompt injection defense
  • Model extraction prevention
  • API rate limiting and abuse detection
  • Adversarial input testing
  • Supply chain security for AI dependencies

5. Monitoring & Observability (5 controls)

  • Real-time output quality tracking
  • Drift detection (data and model)
  • Incident logging and alerting
  • Performance degradation monitoring
  • Cost tracking per AI workflow

6. Organizational Readiness (5 controls)

  • Named AI safety officer
  • Staff training program with completion tracking
  • Board-level AI risk reporting
  • Incident response playbook
  • Third-party audit schedule

Scoring

Each control scores 0-3:

  • 0 — Not implemented
  • 1 — Partially implemented, no documentation
  • 2 — Implemented with documentation
  • 3 — Implemented, documented, tested, and audited

Total: 90 points max

  • 0-30: Critical risk — stop deploying until gaps are addressed
  • 31-55: High risk — remediate within 30 days
  • 56-75: Moderate risk — address within 90 days
  • 76-90: Strong posture — maintain and iterate

Regulatory Mapping

FrameworkStatusKey Requirements
EU AI ActEnforcing 2026Risk classification, conformity assessment, transparency
UK AI Safety InstituteActive 2026Alignment testing, frontier model evaluation
NIST AI RMFPublishedGovern, Map, Measure, Manage lifecycle
ISO 42001PublishedAI management system certification
SOC 2 + AIEmergingAgent-specific controls (CC6/CC7/CC8)

Cost Benchmarks

Company SizeFull Audit CostAnnual ComplianceNon-Compliance Risk
15-50 employees$8K – $20K$18K – $45K$200K+
50-200 employees$20K – $55K$45K – $120K$500K – $2M
200-1000 employees$55K – $150K$120K – $400K$2M – $10M

Output Format

The agent delivers:

  1. Executive Summary — Overall score, top 3 risks, recommended actions
  2. Detailed Scorecard — All 30 controls with scores and evidence
  3. Gap Analysis — Missing controls ranked by risk severity
  4. 90-Day Roadmap — Phased remediation plan with cost estimates
  5. Board Report Template — One-page summary for leadership

Industry Adjustments

The audit adjusts control weighting based on industry:

  • Healthcare: Output safety and data protection weighted 2x
  • Financial Services: Model governance and monitoring weighted 2x
  • Legal: Output safety (hallucination) weighted 3x
  • Manufacturing: Security and monitoring weighted 2x
  • Government/Defense: All domains weighted equally at maximum

Go Deeper

Bundles

  • AI Playbook — $27
  • Pick 3 Industries — $97
  • All 10 Industries — $197
  • Everything Bundle — $247

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Security

n8n-workflow-automation

Designs and outputs n8n workflow JSON with robust triggers, idempotency, error handling, logging, retries, and human-in-the-loop review queues. Use when you need an auditable automation that won’t silently fail.

Archived SourceRecently Updated
Security

seo-assistant

A client-facing SEO assistant grounded in Google's official SEO Starter Guide. Use this skill whenever a user mentions SEO, search rankings, Google visibility, meta descriptions, title tags, page titles, alt text, sitemaps, duplicate content, URL structure, or asks how to improve their website's presence in search results. Also trigger when a user shares a URL or webpage content and wants feedback, or asks for help writing any web content that needs to perform well in search. This skill covers auditing, content writing, and answering SEO questions — use it proactively even if the user only hints at wanting more website traffic or better Google rankings.

Archived SourceRecently Updated
Security

BlogBurst - Virtual CMO Agent

Your AI Chief Marketing Officer. Autonomous agent that runs your entire marketing — auto-posts to Twitter/X, Bluesky, Telegram, Discord, auto-engages with your audience (replies, likes, follows), runs SEO/GEO audits, tracks competitors, scans communities for opportunities, learns what works, and continuously optimizes. 50+ countries, 1000+ posts published. Free tier available.

Archived SourceRecently Updated
Security

social-vault

社交平台账号凭证管理器。提供登录态获取、AES-256-GCM 加密存储、定时健康监测和自动续期。Use when managing social media account credentials, importing cookies, checking login status, or automating session refresh. Also covers platform adapter creation and browser fingerprint management.

Archived SourceRecently Updated