instaparser-api

Use the Instaparser API to parse articles, PDFs, and generate summaries from URLs. Trigger when users want to extract content from web pages, parse PDF documents, or summarize articles using the Instaparser service.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "instaparser-api" with this command: npx skills add donohue/instaparser

Instaparser API Skill

Use this skill when the user wants to interact with the Instaparser API to parse articles, PDFs, or generate summaries.

Requirements

  • Network access: This skill makes HTTPS requests to https://www.instaparser.com/api/. The user must grant network access when prompted.
  • API key: All requests require an Instaparser API key set as the INSTAPARSER_API_KEY environment variable.

Getting an API key

  1. Go to https://www.instaparser.com and create an account.
  2. After signing in, navigate to the API section of your dashboard to generate an API key.
  3. Set the key in your environment:
    export INSTAPARSER_API_KEY="your_api_key_here"
    
  4. The free Trial plan includes a limited number of monthly credits. Paid plans are available for higher usage.

Authentication

All API requests require a Bearer token. The API key should be provided via the INSTAPARSER_API_KEY environment variable, or the user can provide it directly.

Authorization: Bearer $INSTAPARSER_API_KEY

API Endpoints

Article API

POST https://www.instaparser.com/api/1/article

Parse an article from a URL and extract its title, author, body content, images, and more. Uses 1 credit per call.

Request body (JSON):

ParameterTypeRequiredDescription
urlstringYesURL of the article to parse
contentstringNoRaw HTML content to parse instead of fetching from url
outputstringNo"html" (default) or "text"
use_cacheboolNoWhether to use cache. Defaults to true

Example:

curl -X POST https://www.instaparser.com/api/1/article \
  -H "Authorization: Bearer $INSTAPARSER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com/article", "output": "text"}'

Response fields:

FieldDescription
urlCanonical URL
titleArticle title
site_nameWebsite name
authorAuthor name
datePublished date (UNIX timestamp)
descriptionArticle description
thumbnailThumbnail image URL
htmlHTML body (when output is "html")
textPlain text body (when output is "text")
wordsWord count
is_rtltrue if Arabic or Hebrew
imagesArray of image URLs
videosArray of video URLs

PDF API

Parse PDFs from a URL (GET) or by uploading a file (POST). Uses 5 credits per page.

Parse from URL

GET https://www.instaparser.com/api/1/pdf

ParameterTypeRequiredDescription
urlstringYesURL of the PDF to parse
outputstringNo"html" (default) or "text"
use_cacheboolNoWhether to use cache. Defaults to true
curl "https://www.instaparser.com/api/1/pdf?url=https://example.com/report.pdf&output=text" \
  -H "Authorization: Bearer $INSTAPARSER_API_KEY"

Upload a file

POST https://www.instaparser.com/api/1/pdf

Send as multipart form-data with a file field.

curl -X POST https://www.instaparser.com/api/1/pdf \
  -H "Authorization: Bearer $INSTAPARSER_API_KEY" \
  -F "file=@report.pdf" \
  -F "output=text"

Response fields: Same as Article API.


Summary API

POST https://www.instaparser.com/api/1/summary

Generate an AI-powered summary with key sentences. Uses 10 credits per call.

Request body (JSON):

ParameterTypeRequiredDescription
urlstringYesURL of the article to summarize
contentstringNoHTML content to parse instead of fetching from URL
use_cacheboolNoWhether to use cache. Defaults to true
streamboolNoStream the response. Defaults to false
curl -X POST https://www.instaparser.com/api/1/summary \
  -H "Authorization: Bearer $INSTAPARSER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com/article"}'

Response fields:

FieldDescription
key_sentencesArray of key sentences extracted from the article
summaryConcise summary of the article

Status Codes

CodeReason
200Success
400Parameter missing or malformed
401API key is invalid
403Account suspended (payment error)
409Exceeded monthly credits (Trial plan only)
412Upstream parsing error
429Rate limit exceeded

SDK Usage

Python:

from instaparser import InstaparserClient

client = InstaparserClient(api_key="YOUR_API_KEY")

# Article
article = client.Article(url="https://example.com/article", output="text")

# PDF
pdf = client.PDF(url="https://example.com/report.pdf")

# Summary
summary = client.Summary(url="https://example.com/article")

JavaScript:

import { InstaparserClient } from 'instaparser-api';

const client = new InstaparserClient({ apiKey: 'YOUR_API_KEY' });

// Article
const article = await client.article({ url: 'https://example.com/article', output: 'text' });

// PDF
const pdf = await client.pdf({ url: 'https://example.com/report.pdf' });

// Summary
const summary = await client.summary({ url: 'https://example.com/article' });

Instructions

When the user asks to parse an article, PDF, or generate a summary:

  1. Check if INSTAPARSER_API_KEY is set in the environment. If not, ask the user for their API key.
  2. Use curl via the Bash tool to make the API request.
  3. For article parsing, default to output: "text" unless the user specifically wants HTML.
  4. For PDF parsing from a local file, use the multipart form-data POST method.
  5. For PDF parsing from a URL, use the GET method with query parameters.
  6. Present the results clearly — show title, author, word count, and the extracted content.
  7. For summaries, display both the overview/summary and the key sentences.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Research

Jina Reader

Extract clean, readable markdown content from any URL using Jina Reader API. Use when you need to fetch and parse web pages without dealing with HTML, JavaScript rendering, or paywalls. Ideal for research, article summarization, content analysis, and working with search results from tavily-search, web_search, or searxng skills.

Registry SourceRecently Updated
1.1K0Profile unavailable
General

Ai Meeting Helper

会议纪要生成器 - 自动将会议录音转为结构化纪要

Registry SourceRecently Updated
2530Profile unavailable
General

XPR Web Scraping

Tools for fetching and extracting cleaned text, metadata, and links from single or multiple web pages with format options and link filtering.

Registry SourceRecently Updated
1.9K0Profile unavailable
General

Huo15 Js Scraper

JavaScript渲染网站抓取工具。当需要抓取JS渲染的页面(如企微文档、Vue/React SPA)、企查查企业数据获取)、绕过反爬、或者普通curl/wget/web_fetch无法获取内容的网站时使用此技能。支持Playwright和scrapling双引擎自动切换。

Registry SourceRecently Updated
1720Profile unavailable