Literature Review
Processes user-provided PDFs one at a time using isolated subagents to prevent context overflow, then synthesizes findings and drafts the Introduction section.
CRITICAL CONSTRAINTS
┌────────────────────────────────────────────────────────────────────┐ │ ⛔ ORCHESTRATOR MUST NEVER READ PDF FILES DIRECTLY ⛔ │ │ │ │ The orchestrator (you, running this skill) may ONLY read: │ │ - scope.md │ │ - inventory.md │ │ - notes/ethics-summary.md │ │ - notes/papers/*.md (the condensed notes, NOT PDFs) │ │ - notes/bibliography.md │ │ - notes/literature-synthesis.md │ │ │ │ If you find yourself about to use the Read tool on a .pdf file, │ │ STOP. Spawn a subagent instead. │ ├────────────────────────────────────────────────────────────────────┤ │ ⛔ EACH SUBAGENT PROCESSES EXACTLY ONE PAPER ⛔ │ │ │ │ - One Task tool call = One PDF │ │ - Never pass multiple PDFs to a single subagent │ │ - Never ask a subagent to "process the remaining papers" │ │ - Wait for each subagent to complete before spawning the next │ │ │ │ WRONG: Task("Process papers/a.pdf, papers/b.pdf, papers/c.pdf") │ │ RIGHT: Task("Process papers/a.pdf") → wait → Task("Process b.pdf")│ └────────────────────────────────────────────────────────────────────┘
Architecture: Subagent Pattern
Each paper is processed by an isolated subagent to prevent context overflow:
┌─────────────────────────────────────────────────────────────────┐ │ LITERATURE REVIEW WORKFLOW │ ├─────────────────────────────────────────────────────────────────┤ │ │ │ [ORCHESTRATOR] │ │ │ │ │ ├──► Read scope.md (extract research context) │ │ │ │ │ ├──► List papers/.pdf │ │ │ │ │ ├──► For each PDF, spawn isolated subagent: │ │ │ │ │ │ │ ├──► [Subagent] Read ONE PDF │ │ │ ├──► [Subagent] Generate notes/papers/.md │ │ │ └──► [Subagent] Exit (context cleared) │ │ │ │ │ ├──► Read all notes/papers/*.md (small files) │ │ │ │ │ ├──► Generate bibliography + synthesis │ │ │ │ │ └──► Draft Introduction │ │ │ └─────────────────────────────────────────────────────────────────┘
Why subagents?
-
Each PDF can be 10-50 pages of dense content
-
Processing 30 papers in one context overflows limits
-
Subagents process one paper, write condensed notes, then exit
-
Orchestrator only reads small markdown note files
Prerequisites
-
scope.md must exist (from scoping step)
-
notes/ethics-summary.md may exist (provides study objectives and population context)
-
papers/ folder with PDF files to process
-
Create directories: notes/papers/ , notes/papers-library/ , drafts/
Workflow Overview
[Step 1: Read scope.md for context] │ ▼ [Step 2: Inventory papers/ folder] │ ▼ [Step 3: Process each PDF via subagent] ─── One at a time │ └── notes/papers/*.md ▼ [Step 4: Generate bibliography] ─── From all notes │ └── notes/bibliography.md ▼ [Step 5: Synthesize literature] ─── Aggregate findings │ └── notes/literature-synthesis.md ▼ [Step 6: Draft Introduction] │ ▼ [Output] ─── drafts/introduction.md
Step 1: Read Scope Context
From scope.md , extract comprehensive context to pass to each subagent:
-
Research question (frames how to evaluate relevance)
-
Key findings (what results need literature context)
-
Target journal (determines citation depth)
-
Hypothesis (what the manuscript argues)
-
Population/methods (for identifying comparable studies)
Create a detailed scope summary to pass to each subagent:
Manuscript Context
You are helping write a scientific research manuscript. Your job is to read ONE paper and extract information most relevant to this manuscript.
Our Research Question
[Full research question from scope.md]
Our Hypothesis
[What we expect to find / are arguing]
Our Key Findings
[Summary of main results - what we found]
Our Methods
[Brief description of study design, population, techniques used]
Target Journal
[Journal name] - [word limit] words - [citation style]
What to Look For in This Paper
As you read, focus on extracting:
- Background claims we can cite to establish the problem's importance
- Methodological precedents that justify our approach
- Comparable results we can compare our findings against
- Contradictory findings we need to address in Discussion
- Gaps identified that our study fills
- Quotable statements that frame the field well
If notes/ethics-summary.md exists, also include:
-
Study objectives from ethics document
-
Target population details
-
Approved procedures
Step 2: Inventory Papers
List all PDFs in the papers/ folder:
ls papers/*.pdf
Create processing queue:
Papers to Process
| # | Filename | Status |
|---|---|---|
| 1 | smith-2023.pdf | Pending |
| 2 | jones-2022.pdf | Pending |
| 3 | wilson-2021.pdf | Pending |
| ... |
Copy PDFs to central library:
mkdir -p notes/papers-library cp papers/*.pdf notes/papers-library/
Step 3: Process Each PDF via Subagent
CRITICAL: Process ONE paper at a time using the Task tool.
For each PDF in the queue:
3a. Spawn Subagent
Use the Task tool to spawn an isolated subagent.
IMPORTANT: Each Task call must specify exactly ONE PDF file. Do not batch.
Task( subagent_type: "general-purpose", prompt: """ ⛔ YOU ARE PROCESSING EXACTLY ONE PAPER. DO NOT READ ANY OTHER PDFs. ⛔
Your Role
You are a research assistant helping write a scientific manuscript. Your job is to:
- Read ONE paper
- Extract information most relevant to our manuscript
- Write condensed notes highlighting what's useful for our paper
- Exit immediately after
Your ONE Paper
PDF to read: papers/{filename}.pdf Output file: notes/papers/{filename}.md
Manuscript Context (What We're Writing)
{scope_summary}
Instructions
- Read ONLY the PDF file specified above
- Extract citation metadata (authors, title, journal, DOI, PMID)
- Summarize the paper's main contribution
- Focus on relevance: What from this paper helps our manuscript?
- Background claims we can cite?
- Methods similar to ours?
- Results we can compare against?
- Contradictions we need to address?
- Quotes that frame the problem well?
- Recommend inclusion: Should this paper be cited in our manuscript?
- Write condensed notes to the output file
- Return a summary with your recommendation
- EXIT - Do not process any other papers
Note Template
[First Author] et al., [Year]
Processed: [timestamp]
Citation
- Authors: [full author list]
- Title: [title]
- Journal: [journal]
- Year: [year]
- DOI: [doi or "not found"]
- PMID: [pmid or "not found"]
Paper Summary
[2-3 sentences on what this paper is about]
Study Design
- Type: [RCT, cohort, retrospective, etc.]
- Population: [N, key characteristics]
- Methods: [key techniques used]
Key Findings
| Finding | Statistic | Interpretation |
|---|---|---|
| [finding] | [p-value, CI, effect size] | [what it means] |
Relevance to Our Manuscript
Relationship: [Supports / Contradicts / Provides Context / Methodological Reference]
For Introduction
- [Claims or context we can cite to frame our problem]
- [Gaps this paper identifies that we address]
For Methods
- [Methodological precedents that justify our approach]
For Discussion
- [Results to compare against ours]
- [Contradictions we need to explain]
Key Quotes
"[Quotable statement that frames the field or problem well]" (p. X)
"[Another useful quote if available]" (p. X)
Recommendation
Include in draft?: [Yes / No] Rationale: [1-2 sentences - Is this paper directly relevant? Would citing it feel natural or forced?]
Tags
#[topic1] #[topic2] #[methodology] """ )
3b. Wait for Completion
Wait for the subagent to complete before processing the next paper.
3c. Update Status
Track the subagent's recommendation for each paper:
| # | Filename | Status | Recommendation |
|---|---|---|---|
| 1 | smith-2023.pdf | ✓ Complete | Include |
| 2 | jones-2022.pdf | ✓ Complete | Include |
| 3 | wilson-2021.pdf | ✓ Complete | Exclude - different population |
| 4 | brown-2020.pdf | Processing... | - |
3d. Handle Errors
If a subagent fails:
-
Log the error
-
Mark as "Needs manual review"
-
Continue with next paper
Step 4: Generate Bibliography
After ALL papers are processed, read the notes and compile bibliography.
Read each notes/papers/*.md and extract citations:
Create notes/bibliography.md :
Master Bibliography
Generated: [timestamp] Total References: [n]
References
| # | Citation | DOI | PMID |
|---|---|---|---|
| 1 | Smith JA, et al. Title. Journal. 2023;1:1-10. | 10.xxx | 123456 |
| 2 | Jones BB, et al. Title. Journal. 2022;2:20-30. | 10.xxx | 234567 |
Papers Needing Attention
| Paper | Issue |
|---|---|
| [filename] | [missing DOI / couldn't process / etc.] |
Step 5: Synthesize Literature
Read all notes/papers/*.md files and create synthesis.
Note: At this point you're only reading small markdown files, not full PDFs.
Create notes/literature-synthesis.md :
Literature Synthesis
Generated: [timestamp] Research Question: [from scope.md] Papers Reviewed: [n]
Paper Citation Tracker
Use the subagent recommendations from Step 3 to populate this table. Papers recommended for exclusion by the subagent should be marked "Not Applicable."
| # | Citation | Note File | Subagent Rec | Natural Fit | Status |
|---|---|---|---|---|---|
| 1 | Smith et al., 2023 | notes/papers/smith-2023.md | Include | Intro: establishes prevalence | Planned |
| 2 | Jones et al., 2022 | notes/papers/jones-2022.md | Include | Discussion: comparable methods | Planned |
| 3 | Wilson et al., 2021 | notes/papers/wilson-2021.md | Exclude | Different population | Not Applicable |
Status Legend:
- Planned: Subagent recommended inclusion and paper has a natural place
- Not Applicable: Subagent recommended exclusion or paper doesn't fit naturally
Guidance: Trust the subagent's assessment of relevance. The goal is a coherent manuscript, not maximum citations.
Source Inventory
| # | Citation | Type | Relationship |
|---|---|---|---|
| 1 | Smith et al., 2023 | Original Research | Supports |
| 2 | Jones et al., 2022 | Review | Context |
Key Themes
Theme 1: [Theme Name]
Summary: [2-3 sentences]
Sources: Smith 2023, Jones 2022
Relevance: [connection to our study]
Theme 2: [Theme Name]
[repeat pattern]
Supporting Findings
| Finding | Source | How It Supports Our Work |
|---|---|---|
| [finding] | Smith 2023 | [explanation] |
Contradictory Findings
| Finding | Source | Possible Explanation |
|---|---|---|
| [finding] | Jones 2022 | [why it differs] |
Gaps in Literature
| Gap | Evidence | How We Address It |
|---|---|---|
| [gap] | No studies on X | We examine X |
Implications for Manuscript
Introduction
- Paragraph 1 (Problem): Cite [sources]
- Paragraph 2 (Knowledge): Cite [sources]
- Paragraph 3 (Gap): Cite [sources]
- Paragraph 4 (This Study): Reference scope.md
Discussion
- Comparisons: [sources for comparing results]
- Interpretation: [sources for mechanisms]
Step 6: Draft Introduction
Using notes/literature-synthesis.md , draft the Introduction.
Structure (4 paragraphs)
Paragraph 1: The Problem
-
Significance of the clinical/scientific problem
-
Epidemiology, impact, importance
Paragraph 2: Current Knowledge
-
What is currently known
-
Existing approaches and successes
Paragraph 3: The Gap
-
What remains unknown
-
Limitations of current approaches
Paragraph 4: This Study
-
Purpose and approach
-
What this paper contributes
Writing Guidelines
-
Present tense for established facts ("MRI provides...")
-
Past tense for specific findings ("Smith et al. found...")
-
Every claim needs a citation [1]
-
Numbered citations in order of appearance
-
Aim for ~500-800 words
Draft Format
Create drafts/introduction.md :
Introduction
[Paragraph 1 - Problem and significance]
[Paragraph 2 - Current knowledge]
[Paragraph 3 - Gap and limitations]
[Paragraph 4 - This study's purpose]
References Used
- [Citation] - notes/papers/smith-2023.md
- [Citation] - notes/papers/jones-2022.md ...
Output
Save to:
-
notes/papers-library/*.pdf
-
Central PDF storage
-
notes/papers/*.md
-
Condensed notes per paper
-
notes/bibliography.md
-
Master reference list
-
notes/literature-synthesis.md
-
Aggregated synthesis
-
drafts/introduction.md
-
Introduction draft
Return summary:
-
Papers processed: [n]
-
Papers recommended for inclusion: [n]
-
Papers recommended for exclusion: [n]
-
Papers with issues: [n]
-
Key themes identified: [n]
-
Introduction word count: [n]