axioma-skill-evaluator

Advanced skill evaluation for OpenClaw agents. Use when: (1) evaluating a skill before publishing, (2) improving a skill based on evaluation results, (3) checking skill quality with automated + manual analysis, (4) any skill audit or quality check. Combines dual evaluation systems: Axioma 5-dimension framework (100 max) with ISO 25010 international framework (25 criteria, 100 max). Features: colorful terminal output, dual evaluation, 25-criteria rubric, self-contained bundled scripts.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "axioma-skill-evaluator" with this command: npx skills add kofna3369/axioma-skill-evaluator

AXIOMA SKILL EVALUATOR 🧙‍♂️

Advanced Skill Evaluation: Dual System (Automated + Manual)

InfoValue
Version2.1.0 — 2026-05-07
StatusOPERATIONAL

1. PURPOSE AND SCOPE

Objective

Provide comprehensive skill evaluation using dual systems:

  • Axioma System (5 dimensions, 100 max) — colorful, fast
  • ISO 25010 System (25 criteria, 100 max) — international standard

When to Use

TriggerAction
Before publishing a skillRun both evaluations
Improving a skillGet both automated + manual scores
Quality auditUse 25-criteria rubric
Pre-publication checkRun all checks

2. BUNDLED TOOLS

evaluator.py (Axioma System)

# Run Axioma 5-dimension evaluation
python3 evaluator.py <skill-path> --verbose --improve

eval-skill.py (ISO 25010 System)

# Run automated ISO 25010 checks
python3 eval-skill.py <skill-path> --verbose

# JSON output
python3 eval-skill.py <skill-path> --json

3. AXIOMA EVALUATION SYSTEM

Quick Start

python3 evaluator.py <skill-path> --verbose --improve

5 Dimensions (100 max)

DimensionWeightFocus
Structure20%Header, sections, formatting, meta
Clarity20%Description, instructions, examples
Completeness20%Tools, prerequisites, errors, edge cases
Consistency20%Style, naming, integration
Functionality20%Commands work, expected results

Output Format

╔═══════════════════════════════════════════════════════════╗
║ 📊 SKILL EVALUATION REPORT — [Skill Name]              ║
║ Score: XX/100 [STATUS]                                  ║
╠═══════════════════════════════════════════════════════════╣
║ STRUCTURE:     XX/20 ████████████░░░░ XX%                ║
║ CLARITY:      XX/20 ████████████░░░░ XX%                ║
║ COMPLETENESS: XX/20 ████████████░░░░ XX%                ║
║ CONSISTENCY:  XX/20 ████████████░░░░ XX%                ║
║ FUNCTIONALITY: XX/20 ████████████░░░░ XX%                ║
╠═══════════════════════════════════════════════════════════╣
║ STATUS: ✅ APPROVED (score >= 70%)                      ║
╚═══════════════════════════════════════════════════════════╝

Thresholds

ScoreStatusAction
90-100🟢 EXCELLENTReady for production
70-89🟡 GOODPublishable, minor notes
50-69🟠 NEEDS_WORKFix before publishing
<50🔴 POORMajor rework needed

4. ISO 25010 EVALUATION SYSTEM

Automated Checks (eval-skill.py)

Runs 13 automated checks:

  • File structure validation
  • Frontmatter YAML parsing
  • Description quality (65+ words, trigger contexts)
  • Script syntax validation
  • Credential scanning
  • Dependency audit

Target: 90%+ (12+/13 checks passed)

Manual Assessment (25 Criteria)

CategoryFrameworkMaxCriteria
1. Functional SuitabilityISO 25010/12Completeness, Correctness, Appropriateness
2. ReliabilityISO 25010/12Fault Tolerance, Error Reporting, Recoverability
3. PerformanceISO 25010/8Token Cost, Execution Efficiency
4. Usability (AI)Shneiderman/12Learnability, Consistency, Feedback
5. Usability (Human)Tognazzini/8Discoverability, Forgiveness
6. SecurityISO 25010/12Credentials, Input Validation, Data Safety
7. MaintainabilityISO 25010/12Modularity, Modifiability, Testability
8. Agent-SpecificNovel/24Trigger Precision, Progressive Disclosure, Composability
TOTAL/100

5. COMPLETE EVALUATION WORKFLOW

1. AUTOMATED: python3 eval-skill.py <path> --verbose
   → Target: 90%+ structural score
   ↓
2. AXIOMA: python3 evaluator.py <path> --verbose --improve
   → Target: 70+ score
   ↓
3. MANUAL: Score 25 criteria rubric
   → Target: 80+ score
   ↓
4. FIX: Issues from all three sources
   ↓
5. RE-EVALUATE: Until all targets met
   ↓
6. PUBLISH: To ClawHub

6. ERROR HANDLING

Common Issues

IssueCauseSolution
No frontmatterYAML not at startAdd --- at start of SKILL.md
Poor descriptionMissing triggersAdd "Use when:" clauses
Empty directoriesUnused foldersRemove or populate
Name mismatchDirectory ≠ frontmatterRename to match

Security Issues

IssueSeverityAction
Hardcoded credentialsCRITICALRemove immediately
Missing input validationHIGHAdd validation
No error handlingMEDIUMAdd try/catch blocks

7. EDGE CASES

CaseInputExpected Output
Empty SKILL.mdEmpty fileError message, suggest template
Very long SKILL.md>500 linesWarning, recommend split
Missing descriptionNo frontmatterFail with instructions
No scriptsNo scripts/ dirPass, document as standalone

8. DEPENDENCIES

DependencyPurposeRequired
Python 3.6+Script executionYes
PyYAMLFrontmatter parsingOptional

In Altum Per Quality. 🧙‍♂️ Axioma Skill Evaluator v2.1

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

TeamWork

Dynamically creates and manages AI agent teams for complex tasks. Invoke when user requests multi-agent collaboration, complex project execution, or when tasks require specialized roles and coordinated workflow.

Registry SourceRecently Updated
Automation

Website Usability Testing using Nova Act

AI-orchestrated usability testing using Amazon Nova Act. The agent generates personas, runs tests to collect raw data, interprets responses to determine goal achievement, and generates HTML reports. Tests real user workflows (booking, checkout, posting) with safety guardrails. Use when asked to "test website usability", "run usability test", "generate usability report", "evaluate user experience", "test checkout flow", "test booking process", or "analyze website UX".

Registry SourceRecently Updated
Automation

Gen Paylink Govilo

Upload files to Govilo and generate unlock links via Bot API. Use when: (1) Creating a Govilo unlock link from a ZIP, folder, or individual files, (2) Automating file upload to Govilo R2 storage with presigned URLs, (3) Managing Govilo Bot API interactions (presign → upload → create item). Requires GOVILO_API_KEY and SELLER_ADDRESS env vars. If missing, guides user to register at https://govilo.xyz/.

Registry SourceRecently Updated
Automation

FlowFi

REST API instructions for FlowFi—authorization, smart accounts, workflows (AI generate, edit, deploy, undeploy, delete, pause, resume, stop), execution (list...

Registry SourceRecently Updated