bench-debug

/bench-debug <doc_id>

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Install skill "bench-debug" with this command: npx skills add opendataloader-project/opendataloader-pdf/opendataloader-project-opendataloader-pdf-bench-debug

Compares parsing output with ground-truth for a specific document and analyzes failure causes.

Usage

/bench-debug 01030000000189

Execution Steps

Run benchmark for the specific document

./scripts/bench.sh --doc-id <doc_id>

Compare files

Ground-truth: tests/benchmark/ground-truth/markdown/<doc_id>.md
Prediction: tests/benchmark/prediction/opendataloader/markdown/<doc_id>.md
Original PDF: tests/benchmark/pdfs/<doc_id>.pdf
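The comparison can be scripted. The following is a minimal Python sketch, assuming it runs from the repository root and that the three paths listed above are correct; `doc_paths` and `diff_markdown` are hypothetical helper names for illustration, not part of the repository.

```python
import difflib
from pathlib import Path

# Layout taken from the file list above; relative to the repository root (assumption).
BENCH_ROOT = Path("tests/benchmark")


def doc_paths(doc_id: str, root: Path = BENCH_ROOT) -> dict:
    """Return the three files to inspect for one document (hypothetical helper)."""
    return {
        "ground_truth": root / "ground-truth" / "markdown" / f"{doc_id}.md",
        "prediction": root / "prediction" / "opendataloader" / "markdown" / f"{doc_id}.md",
        "pdf": root / "pdfs" / f"{doc_id}.pdf",
    }


def diff_markdown(truth: str, pred: str) -> list:
    """Unified diff of two Markdown strings.

    Lines starting with '-' exist only in the ground truth (missing text);
    lines starting with '+' exist only in the prediction (extra text).
    """
    return list(
        difflib.unified_diff(
            truth.splitlines(),
            pred.splitlines(),
            fromfile="ground-truth",
            tofile="prediction",
            lineterm="",
        )
    )
```

To use it, read the `ground_truth` and `prediction` files returned by `doc_paths("01030000000189")` with `Path.read_text(encoding="utf-8")` and pass both strings to `diff_markdown`. A line-based diff is only a first pass: it shows where text is missing or extra, not why.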

Analyze differences

Missing/extra text locations
Table structure differences (TEDS score causes)
Heading level mismatches (MHS score causes)
Reading order errors (NID score causes)
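Heading level mismatches are among the easier differences to surface automatically. The sketch below, in Python, lists ground-truth headings whose level differs in the prediction or that are absent from it. It assumes both files use ATX-style (`#`) headings and matches headings by exact title text. It is a rough diagnostic under those assumptions, not the benchmark's MHS computation, and the function names are hypothetical.

```python
import re
from typing import Optional

# ATX heading: 1-6 '#' characters, whitespace, then the title text.
HEADING_RE = re.compile(r"^(#{1,6})\s+(.*\S)\s*$")


def headings(markdown: str) -> list:
    """Return (level, title) for each ATX heading, skipping fenced code blocks."""
    found = []
    in_fence = False
    for line in markdown.splitlines():
        if line.lstrip().startswith("```"):
            in_fence = not in_fence
            continue
        match = None if in_fence else HEADING_RE.match(line)
        if match:
            found.append((len(match.group(1)), match.group(2)))
    return found


def heading_level_mismatches(truth: str, pred: str) -> list:
    """Return (title, truth_level, pred_level) for each differing heading.

    pred_level is None when the prediction has no heading with that exact title.
    """
    pred_levels = {}
    for level, title in headings(pred):
        pred_levels.setdefault(title, level)  # keep the first occurrence of a title

    mismatches = []
    for level, title in headings(truth):
        pred_level: Optional[int] = pred_levels.get(title)
        if pred_level != level:
            mismatches.append((title, level, pred_level))
    return mismatches
```

Exact title matching is deliberately strict: a heading that the parser emitted with slightly different text is reported as absent, which is itself worth checking against the original PDF.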

Identify root causes

Which PDF elements caused the issue
Which Java core components are involved

Suggest improvements

Java classes/methods that need modification
Expected impact scope

Reference Files

ground-truth/reference.json: Per-document element info (categories, coordinates, etc.)
java/opendataloader-pdf-core/: Core parsing logic

Example Output

Document 01030000000189 Analysis:

Overall: 0.2763 (one of the worst-performing documents)

Issues:

2 of 3 tables not detected (TEDS: 0.15)
- Table boundary detection failed
- Related code: TableDetector.java
Reading order errors (NID: 0.45)
- Multi-column layout handling failed
- Related code: ColumnDetector.java

Recommended Actions:

Adjust clustering threshold in TableDetector
Improve multi-column detection logic
