ebook-extractor

Use when user wants to extract text from ebooks (EPUB, MOBI, PDF). Use for converting ebooks to plain text for analysis, processing, or reading. Handles all common ebook formats.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "ebook-extractor" with this command: npx skills add ratacat/claude-skills/ratacat-claude-skills-ebook-extractor

Ebook Text Extractor

Overview

Extract plain text from EPUB, MOBI, and PDF files using Python scripts. No LLM calls - pure text extraction.

Supported Formats

FormatTool UsedNotes
EPUBebooklib + BeautifulSoupDirect parsing, preserves structure
MOBICalibre ebook-convertConverts to EPUB first, then extracts
PDFPyMuPDF (fitz)Fast, handles most PDFs well

Usage

Unified extractor (auto-detects format):

python3 ~/.claude/skills/ebook-extractor/scripts/extract.py /path/to/book.epub
python3 ~/.claude/skills/ebook-extractor/scripts/extract.py /path/to/book.mobi
python3 ~/.claude/skills/ebook-extractor/scripts/extract.py /path/to/book.pdf

Output options:

# To stdout (default)
python3 scripts/extract.py book.epub

# To file
python3 scripts/extract.py book.epub -o output.txt
python3 scripts/extract.py book.epub > output.txt

Format-specific scripts:

python3 scripts/extract_epub.py book.epub
python3 scripts/extract_mobi.py book.mobi
python3 scripts/extract_pdf.py book.pdf

Setup

# One-command setup (installs all dependencies)
~/.claude/skills/ebook-extractor/setup.sh

# Or manually:
pip install -r ~/.claude/skills/ebook-extractor/requirements.txt
brew install calibre  # macOS, for MOBI support

Script Location

~/.claude/skills/ebook-extractor/scripts/

Common Issues

ProblemSolution
Missing packageRun setup.sh or pip install -r requirements.txt
MOBI failsEnsure Calibre is installed: brew install calibre
PDF garbledSome PDFs are image-based; OCR needed (not supported)

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Research

repo-research-analyst

No summary provided by upstream source.

Repository SourceNeeds Review
Research

best-practices-researcher

No summary provided by upstream source.

Repository SourceNeeds Review
Research

learnings-researcher

No summary provided by upstream source.

Repository SourceNeeds Review
Research

framework-docs-researcher

No summary provided by upstream source.

Repository SourceNeeds Review