axioma-skill-evaluator

Advanced skill evaluation for OpenClaw agents. Use when: (1) evaluating a skill before publishing, (2) improving a skill based on evaluation results, (3) checking skill quality with automated and manual analysis, (4) running any skill audit or quality check. Combines two evaluation systems: the Axioma 5-dimension framework (100 points max) and an ISO 25010-based framework (25 criteria, 100 points max). Features: colorful terminal output, dual evaluation, a 25-criteria rubric, and self-contained bundled scripts.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Install skill "axioma-skill-evaluator" with this command: npx skills add kofna3369/axioma-skill-evaluator

AXIOMA SKILL EVALUATOR 🧙‍♂️

Advanced Skill Evaluation: Dual System (Automated + Manual)

| Info | Value |
| --- | --- |
| Version | 2.1.0 (2026-05-07) |
| Status | OPERATIONAL |

1. PURPOSE AND SCOPE

Objective

Provide comprehensive skill evaluation using dual systems:

  • Axioma System (5 dimensions, 100 max) — colorful, fast
  • ISO 25010 System (25 criteria, 100 max) — international standard

When to Use

| Trigger | Action |
| --- | --- |
| Before publishing a skill | Run both evaluations |
| Improving a skill | Get both automated + manual scores |
| Quality audit | Use 25-criteria rubric |
| Pre-publication check | Run all checks |

2. BUNDLED TOOLS

evaluator.py (Axioma System)

# Run Axioma 5-dimension evaluation
python3 evaluator.py <skill-path> --verbose --improve

eval-skill.py (ISO 25010 System)

# Run automated ISO 25010 checks
python3 eval-skill.py <skill-path> --verbose

# JSON output
python3 eval-skill.py <skill-path> --json

3. AXIOMA EVALUATION SYSTEM

Quick Start

python3 evaluator.py <skill-path> --verbose --improve

5 Dimensions (100 max)

| Dimension | Weight | Focus |
| --- | --- | --- |
| Structure | 20% | Header, sections, formatting, meta |
| Clarity | 20% | Description, instructions, examples |
| Completeness | 20% | Tools, prerequisites, errors, edge cases |
| Consistency | 20% | Style, naming, integration |
| Functionality | 20% | Commands work, expected results |

Output Format

╔═══════════════════════════════════════════════════════════╗
║ 📊 SKILL EVALUATION REPORT — [Skill Name]              ║
║ Score: XX/100 [STATUS]                                  ║
╠═══════════════════════════════════════════════════════════╣
║ STRUCTURE:     XX/20 ████████████░░░░ XX%                ║
║ CLARITY:      XX/20 ████████████░░░░ XX%                ║
║ COMPLETENESS: XX/20 ████████████░░░░ XX%                ║
║ CONSISTENCY:  XX/20 ████████████░░░░ XX%                ║
║ FUNCTIONALITY: XX/20 ████████████░░░░ XX%                ║
╠═══════════════════════════════════════════════════════════╣
║ STATUS: ✅ APPROVED (score >= 70%)                      ║
╚═══════════════════════════════════════════════════════════╝

Thresholds

| Score | Status | Action |
| --- | --- | --- |
| 90-100 | 🟢 EXCELLENT | Ready for production |
| 70-89 | 🟡 GOOD | Publishable, minor notes |
| 50-69 | 🟠 NEEDS_WORK | Fix before publishing |
| <50 | 🔴 POOR | Major rework needed |
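
The scoring arithmetic and thresholds above can be sketched as follows. This is a minimal illustration, not the code in evaluator.py: the names `total_score` and `status_for` are hypothetical, and it assumes each dimension is scored from 0 to 20.

```python
# Hypothetical sketch of the Axioma scoring arithmetic; not taken from evaluator.py.
DIMENSIONS = ("structure", "clarity", "completeness", "consistency", "functionality")
MAX_PER_DIMENSION = 20  # five dimensions at 20% each give 100 max


def total_score(scores):
    """Sum the five dimension scores after checking each is within 0..20."""
    for name in DIMENSIONS:
        value = scores[name]
        if not 0 <= value <= MAX_PER_DIMENSION:
            raise ValueError(f"{name} must be between 0 and {MAX_PER_DIMENSION}")
    return sum(scores[name] for name in DIMENSIONS)


def status_for(score):
    """Map a 0-100 score to the status labels in the thresholds table."""
    if score >= 90:
        return "EXCELLENT"
    if score >= 70:
        return "GOOD"
    if score >= 50:
        return "NEEDS_WORK"
    return "POOR"


example = {"structure": 18, "clarity": 16, "completeness": 14,
           "consistency": 17, "functionality": 15}
print(total_score(example), status_for(total_score(example)))  # prints: 80 GOOD
```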

4. ISO 25010 EVALUATION SYSTEM

Automated Checks (eval-skill.py)

Runs 13 automated checks, including:

  • File structure validation
  • Frontmatter YAML parsing
  • Description quality (65+ words, trigger contexts)
  • Script syntax validation
  • Credential scanning
  • Dependency audit

Target: 90%+ (12+/13 checks passed)
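
Two of the ideas above lend themselves to a short illustration: the description rule (65+ words plus a trigger context) and the 90% pass-rate target. The logic inside eval-skill.py is not shown in this document, so the function names and the "Use when" heuristic below are assumptions.

```python
# Hypothetical sketch; the real checks in eval-skill.py may differ.

def description_ok(description, min_words=65):
    """True if the description has at least min_words words and a 'Use when' trigger."""
    has_enough_words = len(description.split()) >= min_words
    has_trigger = "use when" in description.lower()
    return has_enough_words and has_trigger


def pass_rate(passed, total=13):
    """Percentage of automated checks passed."""
    return 100.0 * passed / total


def meets_target(passed, total=13, target=90.0):
    """True if the pass rate reaches the target percentage."""
    return pass_rate(passed, total) >= target


print(meets_target(12))  # True: 12/13 is about 92.3%
print(meets_target(11))  # False: 11/13 is about 84.6%
```

With 13 checks, 12 is the smallest number of passes that clears 90%, which matches the "12+/13" target.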

Manual Assessment (25 Criteria)

| Category | Framework | Max | Criteria |
| --- | --- | --- | --- |
| 1. Functional Suitability | ISO 25010 | /12 | Completeness, Correctness, Appropriateness |
| 2. Reliability | ISO 25010 | /12 | Fault Tolerance, Error Reporting, Recoverability |
| 3. Performance | ISO 25010 | /8 | Token Cost, Execution Efficiency |
| 4. Usability (AI) | Shneiderman | /12 | Learnability, Consistency, Feedback |
| 5. Usability (Human) | Tognazzini | /8 | Discoverability, Forgiveness |
| 6. Security | ISO 25010 | /12 | Credentials, Input Validation, Data Safety |
| 7. Maintainability | ISO 25010 | /12 | Modularity, Modifiability, Testability |
| 8. Agent-Specific | Novel | /24 | Trigger Precision, Progressive Disclosure, Composability |
| TOTAL | | /100 | |
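
To tally a manual assessment, the category maxima from the rubric can be used to validate and sum the scores. The sketch below copies the maxima from the table; the helper `manual_total` and the example scores are hypothetical.

```python
# Category maxima copied from the rubric table; the scoring helper is a hypothetical sketch.
CATEGORY_MAX = {
    "Functional Suitability": 12,
    "Reliability": 12,
    "Performance": 8,
    "Usability (AI)": 12,
    "Usability (Human)": 8,
    "Security": 12,
    "Maintainability": 12,
    "Agent-Specific": 24,
}

assert sum(CATEGORY_MAX.values()) == 100  # the maxima add up to the stated total


def manual_total(scores):
    """Sum per-category scores, rejecting unknown categories and out-of-range values."""
    total = 0
    for category, value in scores.items():
        maximum = CATEGORY_MAX[category]  # raises KeyError for an unknown category
        if not 0 <= value <= maximum:
            raise ValueError(f"{category}: {value} is outside 0..{maximum}")
        total += value
    return total


scores = {"Functional Suitability": 10, "Reliability": 9, "Performance": 6,
          "Usability (AI)": 10, "Usability (Human)": 7, "Security": 11,
          "Maintainability": 9, "Agent-Specific": 20}
print(manual_total(scores))  # prints: 82
```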

5. COMPLETE EVALUATION WORKFLOW

1. AUTOMATED: python3 eval-skill.py <path> --verbose
   → Target: 90%+ structural score
   ↓
2. AXIOMA: python3 evaluator.py <path> --verbose --improve
   → Target: 70+ score
   ↓
3. MANUAL: Score 25 criteria rubric
   → Target: 80+ score
   ↓
4. FIX: Issues from all three sources
   ↓
5. RE-EVALUATE: Until all targets met
   ↓
6. PUBLISH: To ClawHub
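
The re-evaluation loop above ends once all three targets are met. A minimal sketch of that gate follows, assuming the results of each step are already available as numbers; the function name `unmet_targets` is hypothetical.

```python
# Hypothetical gate over the three targets in the workflow above.

def unmet_targets(auto_passed, auto_total, axioma_score, manual_score):
    """Return the names of targets not yet met; an empty list means ready to publish."""
    unmet = []
    if 100.0 * auto_passed / auto_total < 90.0:
        unmet.append("automated")
    if axioma_score < 70:
        unmet.append("axioma")
    if manual_score < 80:
        unmet.append("manual")
    return unmet


print(unmet_targets(12, 13, 80, 82))  # []
print(unmet_targets(11, 13, 65, 82))  # ['automated', 'axioma']
```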

6. ERROR HANDLING

Common Issues

| Issue | Cause | Solution |
| --- | --- | --- |
| No frontmatter | YAML not at start | Add --- at start of SKILL.md |
| Poor description | Missing triggers | Add "Use when:" clauses |
| Empty directories | Unused folders | Remove or populate |
| Name mismatch | Directory ≠ frontmatter | Rename to match |
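
The "No frontmatter" and "Name mismatch" issues can be detected with a few lines of string handling. The sketch below is an assumption about how such checks might look, not the eval-skill.py implementation; it only reads a top-level `name:` key and does no full YAML parsing.

```python
# Hypothetical sketch of two structural checks; not the eval-skill.py implementation.
import re


def has_frontmatter(text):
    """True if the text opens with a '---' line and has a later closing '---' line."""
    lines = [line.strip() for line in text.splitlines()]
    return bool(lines) and lines[0] == "---" and "---" in lines[1:]


def frontmatter_name(text):
    """Return the value of a top-level 'name:' key in the frontmatter, or None."""
    if not has_frontmatter(text):
        return None
    for line in text.splitlines()[1:]:
        if line.strip() == "---":
            break  # end of the frontmatter block
        match = re.match(r"name:\s*(.+)", line)
        if match:
            return match.group(1).strip().strip("\"'")
    return None


def name_matches(directory_name, text):
    """True if the frontmatter name equals the skill's directory name."""
    return frontmatter_name(text) == directory_name
```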

Security Issues

| Issue | Severity | Action |
| --- | --- | --- |
| Hardcoded credentials | CRITICAL | Remove immediately |
| Missing input validation | HIGH | Add validation |
| No error handling | MEDIUM | Add try/catch blocks |
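
For the CRITICAL case, a simple regex pass over a script can surface likely hardcoded credentials. The patterns below are illustrative assumptions and far from exhaustive; the credential scanning in eval-skill.py may work differently.

```python
# Hypothetical credential scan; the patterns are illustrative and not exhaustive.
import re

SECRET_PATTERNS = [
    # key = "value" style assignments with a suspicious key name
    re.compile(r"""(?i)\b(api[_-]?key|secret|token|password)\b\s*[:=]\s*["'][^"']{8,}["']"""),
    # PEM private key header
    re.compile(r"-----BEGIN [A-Z ]*PRIVATE KEY-----"),
]


def find_hardcoded_secrets(text):
    """Return (line_number, line) pairs for lines that match a secret pattern."""
    hits = []
    for number, line in enumerate(text.splitlines(), start=1):
        if any(pattern.search(line) for pattern in SECRET_PATTERNS):
            hits.append((number, line.strip()))
    return hits
```

A value read from the environment, such as `password = os.environ["PASSWORD"]`, does not match the first pattern because no quoted literal follows the assignment.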

7. EDGE CASES

| Case | Input | Expected Output |
| --- | --- | --- |
| Empty SKILL.md | Empty file | Error message, suggest template |
| Very long SKILL.md | >500 lines | Warning, recommend split |
| Missing description | No frontmatter | Fail with instructions |
| No scripts | No scripts/ dir | Pass, document as standalone |
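
The edge cases in the table map naturally onto a small pre-check. The sketch below is hypothetical: the function name, the severity labels, and the message wording are assumptions chosen to mirror the expected outputs.

```python
# Hypothetical pre-checks mirroring the edge-case table; messages are illustrative.

MAX_LINES = 500


def precheck(skill_md_text, has_scripts_dir):
    """Return a list of (level, message) findings for the edge cases in the table."""
    findings = []
    if not skill_md_text.strip():
        # An empty file cannot be checked any further.
        findings.append(("error", "SKILL.md is empty; start from a template"))
        return findings
    if len(skill_md_text.splitlines()) > MAX_LINES:
        findings.append(("warning", "SKILL.md exceeds 500 lines; consider splitting it"))
    if not skill_md_text.lstrip().startswith("---"):
        findings.append(("error", "No frontmatter; add a description between '---' lines"))
    if not has_scripts_dir:
        findings.append(("info", "No scripts/ directory; treated as a standalone skill"))
    return findings
```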

8. DEPENDENCIES

| Dependency | Purpose | Required |
| --- | --- | --- |
| Python 3.6+ | Script execution | Yes |
| PyYAML | Frontmatter parsing | Optional |
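
Because PyYAML is optional, frontmatter parsing needs a fallback when the package is missing. One way to do this is sketched below; it is an assumption rather than the bundled scripts' approach, and the fallback only understands flat `key: value` lines.

```python
# Hypothetical sketch of treating PyYAML as optional.
# The fallback handles only flat "key: value" lines, not nested YAML.
try:
    import yaml  # PyYAML, if installed
except ImportError:
    yaml = None


def parse_frontmatter(block):
    """Parse a frontmatter block (the text between the '---' lines) into a dict."""
    if yaml is not None:
        data = yaml.safe_load(block)
        return data if isinstance(data, dict) else {}
    result = {}
    for line in block.splitlines():
        if ":" in line and not line.lstrip().startswith("#"):
            key, _, value = line.partition(":")
            result[key.strip()] = value.strip()
    return result


print(parse_frontmatter("name: demo-skill\ndescription: Use when testing"))
```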

In Altum Per Quality. 🧙‍♂️ Axioma Skill Evaluator v2.1

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
