extraction-form

Extraction Form (systematic review)

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "extraction-form" with this command: npx skills add willoscar/research-units-pipeline-skills/willoscar-research-units-pipeline-skills-extraction-form

Extraction Form (systematic review)

Goal: create a consistent, analysis-ready extraction table that is directly grounded in the protocol.

Inputs

Required:

  • papers/screening_log.csv

  • output/PROTOCOL.md

Optional:

  • papers/paper_notes.jsonl (if you already have structured notes)

Outputs

  • papers/extraction_table.csv

Workflow

Determine the included set

  • From papers/screening_log.csv , collect all rows with decision=include .

Build/confirm the schema

  • Use the extraction schema defined in output/PROTOCOL.md .

  • If the protocol does not define fields yet, stop and update output/PROTOCOL.md first.

Populate papers/extraction_table.csv

  • One row per included paper.

  • If papers/paper_notes.jsonl exists, use it as a structured source for values/provenance (but keep the table schema governed by output/PROTOCOL.md ).

  • Always include provenance columns:

  • paper_id , title , year , url

  • For each protocol-defined field:

  • fill concrete values (units explicit)

  • use an explicit sentinel for unknowns (recommended: empty cell + notes )

Keep it auditable

  • If a value is inferred (not directly stated), mark it in a notes column.

  • Do not write synthesis; only extraction.

Quick QA

  • Ensure 1:1 coverage: included papers == extraction rows.

  • Spot-check a few rows against the paper text/notes.

Definition of Done

  • papers/extraction_table.csv exists.

  • Every included paper from papers/screening_log.csv has exactly one extraction row.

  • Column meanings match output/PROTOCOL.md (no ad-hoc columns without updating the protocol).

Troubleshooting

Issue: the protocol does not specify extraction fields

Fix:

  • Update output/PROTOCOL.md (extraction schema section) and re-run extraction.

Issue: extraction table mixes narrative text with fields

Fix:

  • Move narrative into a notes column and keep the rest as atomic values (numbers/enums/short strings).

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Research

pdf-text-extractor

No summary provided by upstream source.

Repository SourceNeeds Review
Research

latex-compile-qa

No summary provided by upstream source.

Repository SourceNeeds Review
Research

draft-polisher

No summary provided by upstream source.

Repository SourceNeeds Review
Research

citation-verifier

No summary provided by upstream source.

Repository SourceNeeds Review