skill creator

Agent Skill Creator Standard

Priority: P0 (CRITICAL)

Strict guidelines for High-Density Agent Skills. Maximize info/token ratio.

Core Principles (Token Economy First ⚡)

Three-Level Loading System

Writing Rules

Strict Size Limits

| Element | Limit | Action if Exceeded |
|---|---|---|
| SKILL.md total | 100 lines | Extract to references/ |
| Inline code block | 10 lines | Extract to references/ |
| Anti-pattern item | 15 words | Compress to imperative |
| Tables | 8 rows | Extract to references/ |
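The line, code-block, and table limits above lend themselves to an automated check. The following is a minimal sketch, not part of the standard itself: the function name `check_size_limits`, the message wording, and the choice to exclude a table's header row from the 8-row count are all assumptions made for illustration.

```python
import re

# Limits copied from the size table; everything else here is hypothetical.
LIMITS = {"total_lines": 100, "code_block_lines": 10, "table_rows": 8}

# Matches a markdown table separator row such as |---|:---:|
SEPARATOR_ROW = re.compile(r"^\|?[\s:|-]+\|?$")


def check_size_limits(text):
    """Return a list of violation messages for the text of a SKILL.md file."""
    violations = []
    lines = text.splitlines()
    if len(lines) > LIMITS["total_lines"]:
        violations.append(
            f"SKILL.md total: {len(lines)} lines > {LIMITS['total_lines']}"
        )

    in_fence = False
    fence_start = 0
    fence_len = 0
    table_start = 0
    table_rows = 0

    def close_table():
        nonlocal table_rows
        # Assumption: the header row does not count toward the row limit.
        data_rows = table_rows - 1
        if data_rows > LIMITS["table_rows"]:
            violations.append(
                f"table at line {table_start}: {data_rows} rows > {LIMITS['table_rows']}"
            )
        table_rows = 0

    for number, line in enumerate(lines, start=1):
        stripped = line.strip()
        if stripped.startswith("```"):
            if table_rows:
                close_table()
            if in_fence:
                # Closing fence: report the block if its body was too long.
                if fence_len > LIMITS["code_block_lines"]:
                    violations.append(
                        f"code block at line {fence_start}: "
                        f"{fence_len} lines > {LIMITS['code_block_lines']}"
                    )
            else:
                fence_start = number
            in_fence = not in_fence
            fence_len = 0
        elif in_fence:
            fence_len += 1
        elif stripped.startswith("|"):
            if table_rows == 0:
                table_start = number
            if not SEPARATOR_ROW.match(stripped):
                table_rows += 1
        elif table_rows:
            close_table()

    if table_rows:
        close_table()
    return violations
```

A check like this could run before committing a skill; an empty list means none of the three structural limits were exceeded.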

Strict Formatting Rules

Anti-Patterns: No X: Do Y[, not Z]. [Context <= 15 words]
Example: No Logic in Builder: Perform calculations in BLoC, not UI.
No Redundancy: Do not repeat frontmatter descriptions.
Oversized Skills: If SKILL.md exceeds 100 lines, extract step-by-step guides and complex scenarios to references/.
Nested Formatting: Avoid nested emphasis, e.g. "Bold: More Bold".
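The anti-pattern template can be checked mechanically. This is a rough sketch under two assumptions: that the 15-word limit applies to the text after the "No X:" label, and that each item ends with a period. The name `check_anti_pattern` is hypothetical.

```python
import re

# Template from the formatting rules: "No X: Do Y[, not Z]."
ANTI_PATTERN = re.compile(r"^No (?P<label>[^:]+): (?P<body>.+\.)$")
MAX_WORDS = 15  # assumed to apply to the text after the label


def check_anti_pattern(item):
    """Return None if the item conforms, otherwise a short problem description."""
    match = ANTI_PATTERN.match(item.strip())
    if match is None:
        return "format: expected 'No X: Do Y[, not Z].'"
    words = len(match.group("body").split())
    if words > MAX_WORDS:
        return f"length: {words} words > {MAX_WORDS}"
    return None


# The document's own example passes the check.
print(check_anti_pattern("No Logic in Builder: Perform calculations in BLoC, not UI."))
```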

Test, Measure & Iterate

After writing a skill draft, validate it before shipping:

Write eval cases: Create evals/evals.json with 2–3 realistic prompts (see testing.md).
Design trigger queries: Generate 8–10 should-trigger and 8–10 should-not-trigger queries to measure description accuracy.
Optimize description: Make it "pushy" by listing explicit trigger contexts, not just what the skill does.
Catch regressions: Snapshot the skill before editing; compare before/after outputs for existing test cases.
Iterate: Re-run evals after each change; stop when trigger rate ≥ 80% on held-out queries.
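The stopping rule in the last step can be expressed as a small calculation. The sketch below assumes "trigger rate" means the fraction of should-trigger queries that actually activated the skill; the referenced testing guide may define it differently. `QueryResult`, `trigger_stats`, and `ready_to_ship` are hypothetical names, and how the `triggered` flag gets recorded is left to the eval harness.

```python
from dataclasses import dataclass


@dataclass
class QueryResult:
    query: str
    should_trigger: bool  # label assigned when the query was designed
    triggered: bool       # whether the skill actually activated


def trigger_stats(results):
    """Compute hit rate on should-trigger queries and false-trigger rate on the rest."""
    positives = [r for r in results if r.should_trigger]
    negatives = [r for r in results if not r.should_trigger]
    hits = sum(r.triggered for r in positives)
    false_hits = sum(r.triggered for r in negatives)
    return {
        "trigger_rate": hits / len(positives) if positives else 0.0,
        "false_trigger_rate": false_hits / len(negatives) if negatives else 0.0,
    }


def ready_to_ship(results, threshold=0.80):
    """Apply the 80% stopping rule to held-out query results."""
    return trigger_stats(results)["trigger_rate"] >= threshold
```

Tracking the false-trigger rate alongside the trigger rate is a design choice here: a "pushy" description can raise both, so the should-not-trigger queries act as a counterweight.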

See Testing, Trigger Rate & Regression Guide for eval schema, query design rules, and regression protocol.
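For the regression step, one possible way to compare before/after outputs is a plain text diff per test case. This is only an illustrative sketch: it assumes outputs are stored as strings keyed by a case id, which is not something the guide specifies, and `find_regressions` is a hypothetical helper built on Python's standard `difflib`.

```python
import difflib


def find_regressions(before, after):
    """Map each changed case id to a unified diff of its before/after output."""
    changed = {}
    for case_id, old in before.items():
        # A case missing from the new run is treated as empty output.
        new = after.get(case_id, "")
        if new != old:
            diff = difflib.unified_diff(
                old.splitlines(),
                new.splitlines(),
                fromfile=f"{case_id}@before",
                tofile=f"{case_id}@after",
                lineterm="",
            )
            changed[case_id] = "\n".join(diff)
    return changed
```

A changed output is not necessarily a regression; the diff only flags cases that need a human or grader to look again.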

Resources & Deep Knowledge

Source Transparency