Skill Validator

When to use this skill

Use this skill when you need to validate an agent skill folder, checking its structure, content, and adherence to best practices. This includes verifying frontmatter, readability, workflow definitions, validation steps, cross-references, and isolation from other skills.

Validation steps

Read the skill folder structure: Ensure the folder contains a SKILL.md file. Check for optional subdirectories like scripts/ , references/ , assets/ , but note that the skill must work in isolation without relying on other skills.

Validate frontmatter:

The SKILL.md file must start with YAML frontmatter containing at least name (short identifier) and description (clear indication of when to use the skill).
The description must be clear enough for agents to determine relevance without ambiguity.
The description should include either "When to use" or "Use this skill when" to clearly indicate applicability. If this phrasing is missing, it can lead to confusion about when the skill should be applied. To fix this, ensure the description contains a clear statement of the conditions or scenarios in which the skill is relevant, using one of the recommended phrases for clarity.

Check cross-references: Parse the markdown content for links and references. Ensure internal links (e.g., to headings) point to existing sections. For file references, verify they exist within the skill's directory.

Assess readability and conciseness:

Instructions should use clear, concise language.
Avoid overly verbose explanations; aim for direct, actionable content.
Check for grammar, spelling, and logical flow.

Verify clear workflow definitions:

The skill should provide step-by-step instructions for performing the task.
Workflows must be unambiguous, with clear prerequisites, steps, and expected outcomes.

Check for validation steps:

The skill must include steps at the end to validate that it was correctly executed (e.g., verify output, check for side effects).
This ensures the skill can confirm success or failure.

Detect hallucinations:

Ensure instructions do not assume or reference non-existent tools, libraries, or capabilities.
All referenced tools or methods must be realistic and available in standard environments.

Confirm isolation:

The skill should not reference or depend on other skills.
All necessary assets, scripts, and references must be bundled within the skill's directory.

Detect duplicate content:

Scan all files (SKILL.md and supporting files) for overlapping or duplicate sections/instructions.
Check for repeated explanations, examples, or workflows across multiple files.
Identify content that could be consolidated or cross-referenced more efficiently.
Flag redundant subsections within the same file (e.g., repeated step descriptions).
Duplicates waste token budget and confuse users; consolidate where possible.

Estimate token cost (skill weight):

Calculate approximate token count for the entire skill (SKILL.md + references + assets)
Consider all text content, code examples, and documentation
Categorize the skill's "weight" based on token consumption:
Lightweight (< 2,000 tokens): Simple, focused skills
Small (2,000-4,000 tokens): Moderate skills with examples
Medium (4,000-8,000 tokens): Comprehensive skills with multiple sections
Large (8,000-15,000 tokens): Extensive skills with many examples
Heavy (15,000-25,000 tokens): Very comprehensive skills
Overweight (> 25,000 tokens): Potentially too large; consider splitting
Include weight in validation report for context awareness

Security audit ⭐ NEW:

Run the security validation module to check for common security vulnerabilities
Check for untrusted data sources (version control, subprocess calls, files, remote APIs, external input)
Verify all untrusted data is properly sanitized before use
Identify high-privilege operations and verify they have user confirmation
Detect injection attack vulnerabilities (prompt, shell, database, code)
Verify error handling is comprehensive and doesn't leak sensitive data
Confirm secrets/credentials are not hardcoded and .env is documented
The security module will flag potential issues for remediation
Tools: Use scripts/security_audit.py to automatically scan the skill
Details: See references/SECURITY_AUDIT_GUIDE.md for security check patterns

Summarize and validate execution:

After completing all checks, provide a concise summary of the validation results, confirming the skill's status (valid or invalid), listing any issues, and suggesting fixes.
Categorize issues by severity (Critical 🚨, Warning ⚠️, Info ℹ️) and group them accordingly.
Include the skill's weight classification and token estimate.
If issues are found, include examples and suggestions for fixes. If no issues, confirm validity with a positive note.
This step ensures the validation process itself was correctly executed and provides closure.

Check for user information presentation examples:

If the skill involves displaying or outputting information to the user (e.g., validation results, reports, or checklists), IT IS MANDATORY for it to include concrete examples of output formats.
Specify sample outputs, such as validation summaries with categorized issues (Critical 🚨, Warning ⚠️, Info ℹ️), checklists, or formatted messages.
This sets clear expectations and improves user experience by demonstrating the exact presentation style.

Validate security audit report (automated via scripts/security_audit.py):

The security audit script generates a detailed report with security findings
Review any flagged issues and ensure they are addressed
Critical issues (🚨) must be resolved before the skill is approved
Warnings (⚠️) should be reviewed and justified if not addressed
Info items (ℹ️) are recommendations for future improvements

Examples

When issues are found:

🚨 Critical Issues:

Missing required frontmatter (e.g., no name field): Fix by adding the missing field to the YAML frontmatter.

⚠️ Warnings:

Unclear description: Improve by making it more specific about when to use the skill.
Duplicate instructions detected in SKILL.md and references/workflow.md: Consolidate by moving to one location and cross-referencing.

ℹ️ Info:

Minor readability suggestions: Consider shortening verbose sections for conciseness.
Skill weight: Medium (6,500 tokens) - Consider breaking into smaller, focused skills if it grows beyond 8,000 tokens.

When no issues are found:

✅ No issues found. The skill is valid and ready for use.

Skill weight: Lightweight (1,200 tokens) - Efficient for loading and execution.

Duplicate Content Detection

Detection Strategy

Identify sections: Extract all major sections (headers) from SKILL.md and all supporting files
Extract content blocks: For each section, identify paragraphs, lists, code blocks, and examples
Semantic comparison: Compare content blocks across files for:
Exact duplicates (word-for-word matches)
Near-duplicates (same concept, slightly different wording, > 80% similarity)
Partial duplicates (repeated phrases or examples within a file)
Context analysis: Determine if duplication serves a purpose or is redundant
Report findings: List all duplicates with file locations and consolidation suggestions

Common Duplication Patterns to Flag

Pattern Example Action

Repeated workflow steps Step description appears in both SKILL.md and references/workflow.md Consolidate; cross-reference

Duplicate examples Same code example shown in multiple sections Keep in one place; reference from others

Overlapping explanations Same concept explained twice with different wording Merge explanations; remove redundancy

Repeated guidelines Same best practices listed in two sections Single source of truth; reference

Tool descriptions Same tool explained in multiple files Define once; reference elsewhere

Token Cost Estimation

Token Calculation Method

Estimate word count: Count all words across all skill files
Apply conversion ratio: Use ~1.3 tokens per word for English text (average for LLM tokenization)
Add overhead: Account for:
YAML frontmatter (50 tokens base)
Markdown formatting overhead (+10% of content tokens)
Code blocks (count as 1.0 tokens per word due to tokenization patterns)
Total calculation: Total Tokens = (SKILL.md words × 1.3) + (Reference files words × 1.3) + (Code blocks words × 1.0) + (Formatting overhead 10%) + 50

Weight Classification

Weight Token Range Description Agent Impact

🟢 Lightweight < 2,000 Simple, focused skill Minimal context usage; fast loading

🟢 Small 2,000-4,000 Moderate skill with examples Low context overhead; responsive

🟡 Medium 4,000-8,000 Comprehensive skill Balanced context usage; standard

🟠 Large 8,000-15,000 Extensive skill with many examples Significant context usage

🔴 Heavy 15,000-25,000 Very comprehensive skill High context consumption

🔴 Overweight

25,000 Too large; consider splitting Problematic for context limits

Weight Assessment Examples

Example 1: Lightweight Skill (1,200 tokens)

Simple workflow: 3-4 steps
Minimal supporting files
Few examples (1-2)
Limited configuration options

Example 2: Medium Skill (6,500 tokens)

Comprehensive workflow: 6-8 steps
2-3 reference files
Multiple examples (4-6)
Detailed configuration guide
Best practices section

Example 3: Heavy Skill (18,000 tokens)

Complex multi-phase workflow: 10+ steps
4-5 reference files with extensive content
Many examples (8+) with detailed output
Comprehensive configuration guide
Multiple use cases and edge cases
Troubleshooting section
Recommendation: Consider splitting into focused sub-skills

Output Format Example

Skill Validation Report

═══════════════════════════════════════════════════════════ SKILL VALIDATION REPORT ═══════════════════════════════════════════════════════════

Skill: custom-agent-creator Validation Date: 2024-02-21

─────────────────────────────────────────────────────────── GENERAL INFORMATION ───────────────────────────────────────────────────────────

Status: ✅ VALID Skill Weight: 🟡 Medium (6,800 tokens) Files Analyzed: 4

SKILL.md (3,200 tokens)
references/copilot-agents.md (1,500 tokens)
references/opencode-agents.md (1,400 tokens)
assets/ (2 templates, 700 tokens)

─────────────────────────────────────────────────────────── VALIDATION RESULTS ───────────────────────────────────────────────────────────

✅ Frontmatter: Valid ✅ Cross-References: All valid (3 internal, 2 file refs) ✅ Readability: Clear and concise ✅ Workflow: Well-defined (6 steps) ✅ Validation Steps: Comprehensive (5 categories) ✅ No Hallucinations: All tools/libraries verified ✅ Isolation: Self-contained (no skill dependencies) ✅ User Examples: 4 concrete examples with output ⚠️ Duplicate Content: 1 minor (see below)

─────────────────────────────────────────────────────────── DUPLICATE CONTENT DETECTED ───────────────────────────────────────────────────────────

⚠️ WARNING: Overlapping tool descriptions found

Location 1: SKILL.md, line 47 (OpenCode tools section) Location 2: references/opencode-agents.md, line 282 (tools config section)

Issue: "Tool permissions are boolean or ask/allow/deny" described in both locations with 85% similarity

Recommendation: Keep in SKILL.md (main reference), add cross-link in references file for clarity

─────────────────────────────────────────────────────────── WEIGHT ANALYSIS ───────────────────────────────────────────────────────────

Total Content: 6,800 tokens Content Distribution:

Instructions: 35% (2,380 tokens)
Examples: 40% (2,720 tokens)
References: 20% (1,360 tokens)
Formatting: 5% (340 tokens)

Classification: 🟡 MEDIUM Impact: Balanced context usage; suitable for most use cases Recommendation: Current size is optimal. No splitting needed.

If future expansion needed, consider:

Moving Copilot agent examples to separate skill
Creating OpenCode-specific variant
Extracting template examples to assets folder

─────────────────────────────────────────────────────────── ISSUES SUMMARY ───────────────────────────────────────────────────────────

🚨 Critical Issues: 0 ⚠️ Warnings: 1 (duplicate content - minor) ℹ️ Info: 0

─────────────────────────────────────────────────────────── CONCLUSION ───────────────────────────────────────────────────────────

Status: ✅ APPROVED FOR PRODUCTION

The skill is well-structured, comprehensive, and ready for use. Recommend addressing the minor duplicate content warning in the next maintenance cycle for optimization.

═══════════════════════════════════════════════════════════

Security Audit Module

The skill-validator now includes a built-in security audit module (scripts/security_audit.py ) that checks for common security vulnerabilities. This module implements six comprehensive validation rules:

Security Rules

Rule 1: Untrusted Data Detection

Identifies external data sources (version control, subprocess calls, files, remote APIs, external input)
Flags sources that need sanitization
Severity: HIGH (CRITICAL for version control data and subprocess calls)

Rule 2: Sanitization Requirement Verification

Verifies untrusted data is sanitized before use
Checks for sanitization functions in the skill code
Severity: CRITICAL if untrusted data found without sanitization

Rule 3: High-Privilege Operation Detection

Identifies dangerous operations: write/alter operations, remote repository operations, shell execution
Requires human confirmation for these operations
Severity: CRITICAL for force operations

Rule 4: Injection Risk Analysis

Detects potential injection attack vulnerabilities: prompt, shell, database, code
Flags suspicious keywords that indicate attack attempts
Severity: CRITICAL

Rule 5: Error Handling Completeness

Verifies try/catch blocks for external operations
Checks for timeout protection
Ensures no sensitive data in error messages
Severity: HIGH

Rule 6: Secrets Protection

Detects hardcoded credentials
Verifies .env and environment variables are documented
Flags missing secrets protection
Severity: CRITICAL for hardcoded secrets

Running Security Audit

Basic usage

python3 scripts/security_audit.py /path/to/SKILL.md

Example output

════════════════════════════════════════════════════════════ SECURITY AUDIT REPORT ════════════════════════════════════════════════════════════

Skill: /path/to/SKILL.md Status: ✅ PASSED

──────────────────────────────────────────────────────────── SUMMARY ──────────────────────────────────────────────────────────── 🚨 Critical Issues: 0 ⚠️ High Priority: 0 ℹ️ Medium Priority: 0 Total Issues: 0

✅ No security issues detected! ════════════════════════════════════════════════════════════

Integration with Validation Workflow

The security audit is automatically run as Step 11 of the validation process. Security issues are categorized by severity:

🚨 Critical: Must be fixed before production deployment
⚠️ Warning: Should be reviewed and justified
ℹ️ Info: Recommendations for future improvements

Tools to use

File reading and parsing tools to examine SKILL.md and associated files.
Markdown parsing for cross-reference checking and header extraction.
Text analysis for readability assessment and duplicate detection.
Token counting for weight estimation (approximate: 1.3 tokens/word).

Security Considerations

Data Validation & Sanitization

This skill validates user-provided skill directories and their YAML/markdown content. All validation is read-only:

Skill directory paths and file contents are validated and analyzed
No external input is executed or passed to command shells
All file paths are verified to exist before reading
YAML parsing errors are caught and reported as validation failures
Malformed content is reported without processing or modification

Safe Operations Only

This skill performs read-only validation only:

Does not create, write, or alter any files from validated skills
Does not execute version control commands (clone, push, pull, merges)
Does not modify remote repositories or version control systems
All operations are isolated to reading and analyzing skill content
Results are provided through validation reports only

Comprehensive Error Handling

All validator scripts include defensive error handling:

File I/O operations wrapped in try/except blocks with specific error messages
Subprocess calls configured with 30-second timeout protection
YAML parse errors caught and reported clearly
Missing or invalid skill paths handled gracefully
Invalid file permissions or access errors reported to user
No sensitive skill content included in error messages

Validator Scripts & References

Main validator scripts (in scripts/ ):

validator.py
Unified validator that runs all checks (structure, security, evals)
quick_validate.py
Structure validation for SKILL.md format
run_security_checks.py
Security vulnerability scanning
run_skill_evals.py
Skill trigger effectiveness testing
security_audit.py
Core security detection engine

Documentation (in references/ ):

VALIDATOR.md
Complete guide to the validator system with examples
SECURITY_AUDIT_GUIDE.md
Detailed security check patterns and remediation

Test data (in tests/ ):

security_test_cases.json
11 security test scenarios for validator validation
Security audit: scripts/security_audit.py for automated vulnerability scanning (NEW)

skill-validator

Safety Notice

Copy this and send it to your AI assistant to learn

Basic usage

Example output

Source Transparency

Related Skills

custom-agent-creator

semver-changelog

markdown-crossref-validator