rate-skill

Evaluate skill quality against best practices. Use when asked to "rate this skill", "review skill quality", "check skill formatting", "is this skill good", "evaluate SKILL.md", "grade this skill", or when validating skill files before publishing.


Rate Skill

Overview

Audit SKILL.md files against quality standards from generate-skill best practices. Provides letter grade (A-F) and actionable recommendations.

Core principle: Measure skill quality objectively to improve activation reliability and context efficiency.

When to Use

Always use when:

  • Reviewing skills before publishing
  • Validating skill structure and formatting
  • Checking if skill meets quality standards
  • User asks to "rate", "grade", or "review" a skill

Useful for:

  • Skill authors validating their work
  • Maintainers reviewing PRs with new skills
  • Quality audits of skill repositories
  • Before submitting skills to marketplaces

Avoid when:

  • Evaluating non-skill documentation
  • Reviewing code (not skill definitions)
  • General code quality auditing

How It Works

  1. Read specified SKILL.md file
  2. Evaluate against quality criteria
  3. Calculate scores per category
  4. Generate letter grade (A-F)
  5. Output findings with priorities
  6. Provide actionable recommendations
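The steps above can be sketched as a small pipeline. This is an illustrative sketch, not the skill's actual implementation: the `scorers` callables (one per category, returning 0.0-1.0) are an assumed interface, and the weights mirror the Quality Criteria table.

```python
from pathlib import Path

# Weights mirror the Quality Criteria table; they sum to 100.
WEIGHTS = {
    "length": 20, "conciseness": 20, "repetitiveness": 15,
    "structure": 15, "triggers": 15, "examples": 10, "troubleshooting": 5,
}

def rate(path: str, scorers: dict) -> dict:
    """Read a SKILL.md, score each category (0.0-1.0), weight, and total."""
    text = Path(path).read_text(encoding="utf-8")
    category_points = {
        cat: round(scorers[cat](text) * weight, 1)
        for cat, weight in WEIGHTS.items()
    }
    return {"points": category_points, "total": sum(category_points.values())}
```

A perfect scorer in every category yields a total of 100, which then maps onto the A-F grading scale.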

Quality Criteria

| Category | Weight | Criteria |
|----------|--------|----------|
| Length | 20% | Under 500 lines (or progressive disclosure) |
| Conciseness | 20% | Clear, scannable, no fluff |
| Repetitiveness | 15% | No redundant content |
| Structure | 15% | Required sections present and ordered |
| Triggers | 15% | 3-5+ specific activation phrases |
| Examples | 10% | Good/Bad code comparisons |
| Troubleshooting | 5% | Common issues addressed |

Length (20%)

Scores: A: <500 or progressive disclosure | B: 500-600 | C: 600-800 | D: 800-1000 | F: >1000

Checks: Line count, reference/ directory, progressive disclosure links
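These thresholds reduce to a direct mapping. In this sketch, progressive disclosure is assumed to be detectable as a link into a reference/ directory; the real check may be richer.

```python
import re

def length_grade(text: str) -> str:
    """Grade the Length category from line count and progressive disclosure."""
    lines = text.count("\n") + 1
    # Assumption: any reference/*.md link counts as progressive disclosure.
    has_disclosure = bool(re.search(r"reference/\S+\.md", text))
    if lines < 500 or has_disclosure:
        return "A"
    if lines <= 600:
        return "B"
    if lines <= 800:
        return "C"
    if lines <= 1000:
        return "D"
    return "F"
```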

Conciseness (20%)

Scores: A: High info density, scannable | B: Mostly concise | C: Some wordiness | D: Verbose | F: Excessive

Red flags: Long paragraphs (>5 sentences), redundant explanations, flowery language
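One way to surface the long-paragraph red flag is a naive sentence count per paragraph. The sentence splitting below is deliberately crude (it splits on `.`, `!`, `?`) and is a heuristic sketch, not a definitive detector.

```python
import re

def long_paragraphs(text: str, max_sentences: int = 5) -> list:
    """Return paragraphs whose naive sentence count exceeds the limit."""
    flagged = []
    for para in re.split(r"\n\s*\n", text):
        # Crude sentence split on ., !, or ? followed by whitespace or end.
        sentences = [s for s in re.split(r"[.!?](?:\s+|$)", para.strip()) if s]
        if len(sentences) > max_sentences:
            flagged.append(para)
    return flagged
```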

Repetitiveness (15%)

Scores: A: Zero redundancy | B: 1-2 overlaps | C: 3-4 overlaps | D: 5+ overlaps | F: Heavy redundancy

Common: Format in section AND example, repeated "use when", duplicate trigger phrases
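Exact-duplicate paragraphs, the simplest kind of overlap, can be counted as below; catching near-duplicates (the same point reworded) would need fuzzier matching.

```python
import re
from collections import Counter

def duplicate_paragraphs(text: str) -> dict:
    """Count paragraphs that appear more than once, whitespace-normalized."""
    paras = [re.sub(r"\s+", " ", p).strip() for p in re.split(r"\n\s*\n", text)]
    counts = Counter(p for p in paras if p)
    return {p: n for p, n in counts.items() if n > 1}
```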

Structure (15%)

Scores: A: All required sections | B: Missing 1 optional | C: Missing 2-3 | D: Missing required | F: Severely lacking

Required: Frontmatter, Overview, When to Use, Main content, Examples (Good/Bad), Troubleshooting, Integration
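Presence of the required pieces can be checked with simple heading matches. The heading spellings below are assumptions about how a SKILL.md names its sections ("Main content" has no fixed heading, so it is omitted here).

```python
REQUIRED_SECTIONS = [
    "Overview", "When to Use", "Examples", "Troubleshooting", "Integration",
]

def missing_sections(text: str) -> list:
    """List required pieces whose heading (or frontmatter fence) is absent."""
    missing = [] if text.lstrip().startswith("---") else ["Frontmatter"]
    for section in REQUIRED_SECTIONS:
        # Substring "# Section" also matches deeper headings like "## Section".
        if f"# {section}" not in text:
            missing.append(section)
    return missing
```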

Triggers (15%)

Scores: A: 5+ specific | B: 3-4 good | C: 2 phrases | D: 1 vague | F: None

Quality: User language ("when asked to X"), specific situations, multiple contexts, concrete not abstract
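Trigger phrases typically appear in the frontmatter description as quoted user language, so a rough count can extract quoted strings. The quoting convention is an assumption; descriptions that phrase triggers without quotes would need different parsing.

```python
import re

def trigger_phrases(description: str) -> list:
    """Extract double-quoted trigger phrases from a skill description."""
    return re.findall(r'"([^"]+)"', description)

def trigger_grade(description: str) -> str:
    """Map trigger count onto the A-F scale for this category."""
    n = len(trigger_phrases(description))
    return ("A" if n >= 5 else "B" if n >= 3 else
            "C" if n == 2 else "D" if n == 1 else "F")
```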

Examples (10%)

Scores: A: 3+ with Good/Bad | B: 2 with comparisons | C: 1 comparison | D: No comparisons | F: None

Quality: Uses labeled Good/Bad tags, includes explanations, real scenarios, syntax highlighting

Troubleshooting (5%)

Scores: A: 5+ pairs | B: 3-4 pairs | C: 1-2 basic | D: Vague | F: None

Quality: Clear problem, cause identified, solution with code, explanation

Output Format

# Skill Rating: [Letter Grade]

## Summary
- **File:** path/to/SKILL.md
- **Lines:** XXX lines
- **Overall Grade:** [A/B/C/D/F] ([Score]/100)
- **Status:** [Production Ready / Needs Work / Not Ready]

## Category Scores

| Category | Score | Grade | Status |
|----------|-------|-------|--------|
| Length | XX/20 | [A-F] | [✅/⚠️/❌] |
| Conciseness | XX/20 | [A-F] | [✅/⚠️/❌] |
| Repetitiveness | XX/15 | [A-F] | [✅/⚠️/❌] |
| Structure | XX/15 | [A-F] | [✅/⚠️/❌] |
| Triggers | XX/15 | [A-F] | [✅/⚠️/❌] |
| Examples | XX/10 | [A-F] | [✅/⚠️/❌] |
| Troubleshooting | XX/5 | [A-F] | [✅/⚠️/❌] |

## Findings by Priority

### ❌ Critical Issues (Fix Before Publishing)
1. [Issue description]
   - Impact: [Why this matters]
   - Fix: [Specific action to take]

### ⚠️ Important Issues (Should Fix)
1. [Issue description]
   - Impact: [Why this matters]
   - Fix: [Specific action to take]

### 📋 Nice to Have
1. [Suggestion]
   - Benefit: [Why this helps]

## Strengths
- [What this skill does well]
- [Another strength]

## Priority Action Items
1. [Priority 1 action]
2. [Priority 2 action]
3. [Priority 3 action]

## Estimated Improvements
- Fix critical issues: +[X] points
- Address important issues: +[X] points
- Potential grade: [Current] → [Target]

Usage

Basic rating:

/rate-skill skills/example-skill/SKILL.md

Rate after changes:

# Make improvements
[edit SKILL.md]

# Re-rate
/rate-skill skills/example-skill/SKILL.md

Compare before/after:

# Rate original
/rate-skill skills/track-session/SKILL.md

# Make improvements
[condense, remove redundancy]

# Rate again to see improvement
/rate-skill skills/track-session/SKILL.md

Grading Scale

| Grade | Score | Meaning |
|-------|-------|---------|
| A | 90-100 | Excellent - Production ready |
| B | 80-89 | Good - Minor improvements recommended |
| C | 70-79 | Acceptable - Needs work before publishing |
| D | 60-69 | Poor - Significant issues to address |
| F | 0-59 | Failing - Major overhaul needed |

Status mapping:

  • A-B: Production Ready ✅
  • C: Needs Work ⚠️
  • D-F: Not Ready ❌
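The score-to-grade and grade-to-status mappings above reduce to a single function:

```python
def grade_and_status(score: int) -> tuple:
    """Map a 0-100 score to its letter grade and publishing status."""
    if score >= 90:
        grade = "A"
    elif score >= 80:
        grade = "B"
    elif score >= 70:
        grade = "C"
    elif score >= 60:
        grade = "D"
    else:
        grade = "F"
    status = {"A": "Production Ready", "B": "Production Ready",
              "C": "Needs Work"}.get(grade, "Not Ready")
    return grade, status
```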

Examples

Example 1: Rating a High-Quality Skill

Input: /rate-skill skills/track-session/SKILL.md

Output:

# Skill Rating: A

## Summary
- **File:** skills/track-session/SKILL.md
- **Lines:** 489 lines
- **Overall Grade:** A (92/100)
- **Status:** Production Ready

## Category Scores

| Category | Score | Grade | Status |
|----------|-------|-------|--------|
| Length | 20/20 | A | Pass |
| Conciseness | 18/20 | A | Pass |
| Repetitiveness | 14/15 | A | Pass |
| Structure | 15/15 | A | Pass |
| Triggers | 15/15 | A | Pass |
| Examples | 9/10 | A | Pass |
| Troubleshooting | 5/5 | A | Pass |

## Strengths
- Excellent progressive disclosure with reference/VERIFICATION.md
- 10+ diverse trigger phrases
- Strong examples with Good/Bad comparisons

## Priority Action Items
1. Consider adding one more example for edge cases (optional)

Note: High-scoring skills get a short report focused on strengths and optional improvements.

Example 2: Rating a Skill That Needs Work

Input: /rate-skill skills/problematic-skill/SKILL.md

Output:

# Skill Rating: C

## Summary
- **File:** skills/problematic-skill/SKILL.md
- **Lines:** 742 lines
- **Overall Grade:** C (74/100)
- **Status:** Needs Work

## Findings by Priority

### Critical Issues
1. **Length: 742 lines without progressive disclosure**
   - Impact: High context usage, harder to scan
   - Fix: Move detailed content to reference/ directory

2. **Only 2 trigger phrases in description**
   - Impact: Poor activation reliability
   - Fix: Add 3-5 specific user phrases and situations

### Important Issues
1. **Verbose mode descriptions (30+ lines each)**
   - Fix: Condense to 2-3 lines per mode

## Priority Action Items
1. Implement progressive disclosure (move 200+ lines to reference/)
2. Add 3+ trigger phrases to description
3. Condense verbose sections

## Estimated Improvements
- Fix critical issues: +12 points -> 86 (B)
- Address important issues: +6 points -> 92 (A)
- Potential grade: C -> A

Note: Lower-scoring skills get detailed findings with specific fixes and an improvement roadmap.

Troubleshooting

Problem: Can't find SKILL.md file

Cause: Path incorrect or file doesn't exist.

Solution:

# Verify file exists
ls skills/skill-name/SKILL.md

# Use correct path
/rate-skill skills/skill-name/SKILL.md

Problem: Rating seems too harsh

Cause: Standards are intentionally strict; quality directly affects activation reliability.

Solution:

  • Review specific findings
  • Compare to high-quality skills
  • Focus on critical issues first
  • Remember: B grade is still "good"

Problem: Grade improved but still low

Cause: Multiple categories need attention.

Solution:

  • Focus on highest-weight categories first (Length, Conciseness)
  • Fix critical issues before nice-to-haves
  • Re-rate after each major change
  • Use "Estimated Improvements" as roadmap

Problem: Don't know how to fix an issue

Cause: Fix recommendation unclear.

Solution:

  • Check generate-skill examples for patterns
  • Review high-rated skills for reference
  • Ask for specific help on that issue
  • Consult CLAUDE.md for SkillBox guidelines

Integration

This skill works with:

  • generate-skill - Use after generating to validate quality
  • Skill development workflow - Rate before committing/publishing
  • Quality control - Gate for accepting skills into repositories
  • Continuous improvement - Track quality metrics over time

Workflow:

# Create skill
/generate-skill new-feature

# Rate it
/rate-skill skills/new-feature/SKILL.md

# Fix issues
[make improvements]

# Re-rate
/rate-skill skills/new-feature/SKILL.md

# When A or B grade, publish
git add skills/new-feature/
git commit -m "Add new-feature skill"

Quality gates:

  • A-B: Merge to main ✅
  • C: Request changes ⚠️
  • D-F: Reject until improved ❌

References

Based on:

  • generate-skill best practices
  • SkillBox CLAUDE.md guidelines
  • obra/superpowers patterns
  • Vercel agent-skills standards
