verification & quality assurance

Verification & Quality Assurance Skill

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "verification & quality assurance" with this command: npx skills add ruvnet/claude-flow/ruvnet-claude-flow-verification-quality-assurance

Verification & Quality Assurance Skill

What This Skill Does

This skill provides a comprehensive verification and quality assurance system that ensures code quality and correctness through:

  • Truth Scoring: Real-time reliability metrics (0.0-1.0 scale) for code, agents, and tasks

  • Verification Checks: Automated code correctness, security, and best practices validation

  • Automatic Rollback: Instant reversion of changes that fail verification (default threshold: 0.95)

  • Quality Metrics: Statistical analysis with trends, confidence intervals, and improvement tracking

  • CI/CD Integration: Export capabilities for continuous integration pipelines

  • Real-time Monitoring: Live dashboards and watch modes for ongoing verification

Prerequisites

  • Claude Flow installed (npx claude-flow@alpha )

  • Git repository (for rollback features)

  • Node.js 18+ (for dashboard features)

Quick Start

View current truth scores

npx claude-flow@alpha truth

Run verification check

npx claude-flow@alpha verify check

Verify specific file with custom threshold

npx claude-flow@alpha verify check --file src$app.js --threshold 0.98

Rollback last failed verification

npx claude-flow@alpha verify rollback --last-good

Complete Guide

Truth Scoring System

View Truth Metrics

Display comprehensive quality and reliability metrics for your codebase and agent tasks.

Basic Usage:

View current truth scores (default: table format)

npx claude-flow@alpha truth

View scores for specific time period

npx claude-flow@alpha truth --period 7d

View scores for specific agent

npx claude-flow@alpha truth --agent coder --period 24h

Find files$tasks below threshold

npx claude-flow@alpha truth --threshold 0.8

Output Formats:

Table format (default)

npx claude-flow@alpha truth --format table

JSON for programmatic access

npx claude-flow@alpha truth --format json

CSV for spreadsheet analysis

npx claude-flow@alpha truth --format csv

HTML report with visualizations

npx claude-flow@alpha truth --format html --export report.html

Real-time Monitoring:

Watch mode with live updates

npx claude-flow@alpha truth --watch

Export metrics automatically

npx claude-flow@alpha truth --export .claude-flow$metrics$truth-$(date +%Y%m%d).json

Truth Score Dashboard

Example dashboard output:

📊 Truth Metrics Dashboard ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Overall Truth Score: 0.947 ✅ Trend: ↗️ +2.3% (7d)

Top Performers: verification-agent 0.982 ⭐ code-analyzer 0.971 ⭐ test-generator 0.958 ✅

Needs Attention: refactor-agent 0.821 ⚠️ docs-generator 0.794 ⚠️

Recent Tasks: task-456 0.991 ✅ "Implement auth" task-455 0.967 ✅ "Add tests" task-454 0.743 ❌ "Refactor API"

Metrics Explained

Truth Scores (0.0-1.0):

  • 1.0-0.95 : Excellent ⭐ (production-ready)

  • 0.94-0.85 : Good ✅ (acceptable quality)

  • 0.84-0.75 : Warning ⚠️ (needs attention)

  • <0.75 : Critical ❌ (requires immediate action)

Trend Indicators:

  • ↗️ Improving (positive trend)

  • → Stable (consistent performance)

  • ↘️ Declining (quality regression detected)

Statistics:

  • Mean Score: Average truth score across all measurements

  • Median Score: Middle value (less affected by outliers)

  • Standard Deviation: Consistency of scores (lower = more consistent)

  • Confidence Interval: Statistical reliability of measurements

Verification Checks

Run Verification

Execute comprehensive verification checks on code, tasks, or agent outputs.

File Verification:

Verify single file

npx claude-flow@alpha verify check --file src$app.js

Verify directory recursively

npx claude-flow@alpha verify check --directory src/

Verify with auto-fix enabled

npx claude-flow@alpha verify check --file src$utils.js --auto-fix

Verify current working directory

npx claude-flow@alpha verify check

Task Verification:

Verify specific task output

npx claude-flow@alpha verify check --task task-123

Verify with custom threshold

npx claude-flow@alpha verify check --task task-456 --threshold 0.99

Verbose output for debugging

npx claude-flow@alpha verify check --task task-789 --verbose

Batch Verification:

Verify multiple files in parallel

npx claude-flow@alpha verify batch --files "*.js" --parallel

Verify with pattern matching

npx claude-flow@alpha verify batch --pattern "src/**/*.ts"

Integration test suite

npx claude-flow@alpha verify integration --test-suite full

Verification Criteria

The verification system evaluates:

Code Correctness

  • Syntax validation

  • Type checking (TypeScript)

  • Logic flow analysis

  • Error handling completeness

Best Practices

  • Code style adherence

  • SOLID principles

  • Design patterns usage

  • Modularity and reusability

Security

  • Vulnerability scanning

  • Secret detection

  • Input validation

  • Authentication$authorization checks

Performance

  • Algorithmic complexity

  • Memory usage patterns

  • Database query optimization

  • Bundle size impact

Documentation

  • JSDoc/TypeDoc completeness

  • README accuracy

  • API documentation

  • Code comments quality

JSON Output for CI/CD

Get structured JSON output

npx claude-flow@alpha verify check --json > verification.json

Example JSON structure:

{ "overallScore": 0.947, "passed": true, "threshold": 0.95, "checks": [ { "name": "code-correctness", "score": 0.98, "passed": true }, { "name": "security", "score": 0.91, "passed": false, "issues": [...] } ] }

Automatic Rollback

Rollback Failed Changes

Automatically revert changes that fail verification checks.

Basic Rollback:

Rollback to last known good state

npx claude-flow@alpha verify rollback --last-good

Rollback to specific commit

npx claude-flow@alpha verify rollback --to-commit abc123

Interactive rollback with preview

npx claude-flow@alpha verify rollback --interactive

Smart Rollback:

Rollback only failed files (preserve good changes)

npx claude-flow@alpha verify rollback --selective

Rollback with automatic backup

npx claude-flow@alpha verify rollback --backup-first

Dry-run mode (preview without executing)

npx claude-flow@alpha verify rollback --dry-run

Rollback Performance:

  • Git-based rollback: <1 second

  • Selective file rollback: <500ms

  • Backup creation: Automatic before rollback

Verification Reports

Generate Reports

Create detailed verification reports with metrics and visualizations.

Report Formats:

JSON report

npx claude-flow@alpha verify report --format json

HTML report with charts

npx claude-flow@alpha verify report --export metrics.html --format html

CSV for data analysis

npx claude-flow@alpha verify report --format csv --export metrics.csv

Markdown summary

npx claude-flow@alpha verify report --format markdown

Time-based Reports:

Last 24 hours

npx claude-flow@alpha verify report --period 24h

Last 7 days

npx claude-flow@alpha verify report --period 7d

Last 30 days with trends

npx claude-flow@alpha verify report --period 30d --include-trends

Custom date range

npx claude-flow@alpha verify report --from 2025-01-01 --to 2025-01-31

Report Content:

  • Overall truth scores

  • Per-agent performance metrics

  • Task completion quality

  • Verification pass$fail rates

  • Rollback frequency

  • Quality improvement trends

  • Statistical confidence intervals

Interactive Dashboard

Launch Dashboard

Run interactive web-based verification dashboard with real-time updates.

Launch dashboard on default port (3000)

npx claude-flow@alpha verify dashboard

Custom port

npx claude-flow@alpha verify dashboard --port 8080

Export dashboard data

npx claude-flow@alpha verify dashboard --export

Dashboard with auto-refresh

npx claude-flow@alpha verify dashboard --refresh 5s

Dashboard Features:

  • Real-time truth score updates (WebSocket)

  • Interactive charts and graphs

  • Agent performance comparison

  • Task history timeline

  • Rollback history viewer

  • Export to PDF/HTML

  • Filter by time period$agent$score

Configuration

Default Configuration

Set verification preferences in .claude-flow$config.json :

{ "verification": { "threshold": 0.95, "autoRollback": true, "gitIntegration": true, "hooks": { "preCommit": true, "preTask": true, "postEdit": true }, "checks": { "codeCorrectness": true, "security": true, "performance": true, "documentation": true, "bestPractices": true } }, "truth": { "defaultFormat": "table", "defaultPeriod": "24h", "warningThreshold": 0.85, "criticalThreshold": 0.75, "autoExport": { "enabled": true, "path": ".claude-flow$metrics$truth-daily.json" } } }

Threshold Configuration

Adjust verification strictness:

Strict mode (99% accuracy required)

npx claude-flow@alpha verify check --threshold 0.99

Lenient mode (90% acceptable)

npx claude-flow@alpha verify check --threshold 0.90

Set default threshold

npx claude-flow@alpha config set verification.threshold 0.98

Per-environment thresholds:

{ "verification": { "thresholds": { "production": 0.99, "staging": 0.95, "development": 0.90 } } }

Integration Examples

CI/CD Integration

GitHub Actions:

name: Quality Verification

on: [push, pull_request]

jobs: verify: runs-on: ubuntu-latest steps: - uses: actions$checkout@v3

  - name: Install Dependencies
    run: npm install

  - name: Run Verification
    run: |
      npx claude-flow@alpha verify check --json > verification.json

  - name: Check Truth Score
    run: |
      score=$(jq '.overallScore' verification.json)
      if (( $(echo "$score &#x3C; 0.95" | bc -l) )); then
        echo "Truth score too low: $score"
        exit 1
      fi

  - name: Upload Report
    uses: actions$upload-artifact@v3
    with:
      name: verification-report
      path: verification.json

GitLab CI:

verify: stage: test script: - npx claude-flow@alpha verify check --threshold 0.95 --json > verification.json - | score=$(jq '.overallScore' verification.json) if [ $(echo "$score < 0.95" | bc) -eq 1 ]; then echo "Verification failed with score: $score" exit 1 fi artifacts: paths: - verification.json reports: junit: verification.json

Swarm Integration

Run verification automatically during swarm operations:

Swarm with verification enabled

npx claude-flow@alpha swarm --verify --threshold 0.98

Hive Mind with auto-rollback

npx claude-flow@alpha hive-mind --verify --rollback-on-fail

Training pipeline with verification

npx claude-flow@alpha train --verify --threshold 0.99

Pair Programming Integration

Enable real-time verification during collaborative development:

Pair with verification

npx claude-flow@alpha pair --verify --real-time

Pair with custom threshold

npx claude-flow@alpha pair --verify --threshold 0.97 --auto-fix

Advanced Workflows

Continuous Verification

Monitor codebase continuously during development:

Watch directory for changes

npx claude-flow@alpha verify watch --directory src/

Watch with auto-fix

npx claude-flow@alpha verify watch --directory src/ --auto-fix

Watch with notifications

npx claude-flow@alpha verify watch --notify --threshold 0.95

Monitoring Integration

Send metrics to external monitoring systems:

Export to Prometheus

npx claude-flow@alpha truth --format json |
curl -X POST https:/$pushgateway.example.com$metrics$job$claude-flow
-d @-

Send to DataDog

npx claude-flow@alpha verify report --format json |
curl -X POST "https:/$api.datadoghq.com$api$v1$series?api_key=${DD_API_KEY}"
-H "Content-Type: application$json"
-d @-

Custom webhook

npx claude-flow@alpha truth --format json |
curl -X POST https:/$metrics.example.com$api$truth
-H "Content-Type: application$json"
-d @-

Pre-commit Hooks

Automatically verify before commits:

Install pre-commit hook

npx claude-flow@alpha verify install-hook --pre-commit

.git$hooks$pre-commit example:

#!$bin$bash npx claude-flow@alpha verify check --threshold 0.95 --json > $tmp$verify.json

score=$(jq '.overallScore' $tmp$verify.json) if (( $(echo "$score < 0.95" | bc -l) )); then echo "❌ Verification failed with score: $score" echo "Run 'npx claude-flow@alpha verify check --verbose' for details" exit 1 fi

echo "✅ Verification passed with score: $score"

Performance Metrics

Verification Speed:

  • Single file check: <100ms

  • Directory scan: <500ms (per 100 files)

  • Full codebase analysis: <5s (typical project)

  • Truth score calculation: <50ms

Rollback Speed:

  • Git-based rollback: <1s

  • Selective file rollback: <500ms

  • Backup creation: <2s

Dashboard Performance:

  • Initial load: <1s

  • Real-time updates: <100ms latency (WebSocket)

  • Chart rendering: 60 FPS

Troubleshooting

Common Issues

Low Truth Scores:

Get detailed breakdown

npx claude-flow@alpha truth --verbose --threshold 0.0

Check specific criteria

npx claude-flow@alpha verify check --verbose

View agent-specific issues

npx claude-flow@alpha truth --agent <agent-name> --format json

Rollback Failures:

Check git status

git status

View rollback history

npx claude-flow@alpha verify rollback --history

Manual rollback

git reset --hard HEAD~1

Verification Timeouts:

Increase timeout

npx claude-flow@alpha verify check --timeout 60s

Verify in batches

npx claude-flow@alpha verify batch --batch-size 10

Exit Codes

Verification commands return standard exit codes:

  • 0 : Verification passed (score ≥ threshold)

  • 1 : Verification failed (score < threshold)

  • 2 : Error during verification (invalid input, system error)

Related Commands

  • npx claude-flow@alpha pair

  • Collaborative development with verification

  • npx claude-flow@alpha train

  • Training with verification feedback

  • npx claude-flow@alpha swarm

  • Multi-agent coordination with quality checks

  • npx claude-flow@alpha report

  • Generate comprehensive project reports

Best Practices

  • Set Appropriate Thresholds: Use 0.99 for critical code, 0.95 for standard, 0.90 for experimental

  • Enable Auto-rollback: Prevent bad code from persisting

  • Monitor Trends: Track improvement over time, not just current scores

  • Integrate with CI/CD: Make verification part of your pipeline

  • Use Watch Mode: Get immediate feedback during development

  • Export Metrics: Track quality metrics in your monitoring system

  • Review Rollbacks: Understand why changes were rejected

  • Train Agents: Use verification feedback to improve agent performance

Additional Resources

  • Truth Scoring Algorithm: See $docs$truth-scoring.md

  • Verification Criteria: See $docs$verification-criteria.md

  • Integration Examples: See $examples$verification/

  • API Reference: See $docs$api$verification.md

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

skill builder

No summary provided by upstream source.

Repository SourceNeeds Review
General

swarm-advanced

No summary provided by upstream source.

Repository SourceNeeds Review
General

sparc-methodology

No summary provided by upstream source.

Repository SourceNeeds Review
General

v3 ddd architecture

No summary provided by upstream source.

Repository SourceNeeds Review