web-scraper

Fetch any URL and extract clean readable content as text or markdown. Use this skill when the user asks to scrape a webpage, extract text from a URL, fetch website content, read an article from a link, or download webpage content.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.


Install skill "web-scraper" with this command: npx skills add dalehurley/phpbot/dalehurley-phpbot-web-scraper

Skill: web-scraper

When to Use

Use this skill when the user asks to:

  • Scrape or fetch content from a URL
  • Extract text from a webpage
  • Read an article from a link
  • Get the content of a website
  • Download and convert a webpage to text/markdown
  • Summarize a web page

Input Parameters

| Parameter | Required | Description | Example |
| --- | --- | --- | --- |
| url | Yes | URL to scrape | https://example.com/article |
| format | No | Output format: markdown (default), text, or html | markdown |
| output | No | File path to save the output to | article.md |
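
The parameters above map naturally onto a small command-line interface. The sketch below shows one way such a parser could look using Python's standard argparse module. It is an illustration only, not the bundled script's actual code: the function name `build_parser` and the help strings are assumptions, and the `--metadata` flag shown under Script Usage is left out because it is not listed in this table.

```python
import argparse


def build_parser():
    """Build a parser mirroring the documented url/format/output parameters.

    Hypothetical sketch: the real scrape.py may define its options differently.
    """
    parser = argparse.ArgumentParser(
        prog="scrape.py",
        description="Fetch a URL and extract readable content.",
    )
    # Required positional argument: the page to fetch.
    parser.add_argument("url", help="URL to scrape")
    # Optional output format; markdown is the documented default.
    parser.add_argument(
        "--format",
        choices=["markdown", "text", "html"],
        default="markdown",
        help="Output format (default: markdown)",
    )
    # Optional file path; None means no file was requested.
    parser.add_argument("--output", default=None, help="File path to save the output to")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.url, args.format, args.output)
```

Restricting `--format` with `choices` means a misspelled format is rejected at parse time rather than after the page has been fetched.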

Procedure

  1. Get the URL from the user's request

  2. Run the bundled script:

    # Fetch and extract as markdown (default)
    python3 skills/web-scraper/scripts/scrape.py "https://example.com/article"
    
    # Extract as plain text
    python3 skills/web-scraper/scripts/scrape.py "https://example.com" --format text
    
    # Save to file
    python3 skills/web-scraper/scripts/scrape.py "https://example.com" --output article.md
    
  3. The script auto-installs trafilatura and requests if needed

  4. Present the extracted content to the user
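
Step 3 says the script installs trafilatura and requests on demand. How the bundled script does this is not shown here; a common pattern is to try the import and fall back to pip when it fails. The sketch below illustrates that pattern under this assumption. The helper name `ensure_package` is hypothetical.

```python
import importlib
import subprocess
import sys


def ensure_package(module_name, pip_name=None):
    """Import a module, installing it with pip first if it is missing.

    Hypothetical helper: the real scrape.py may handle dependencies differently.
    """
    try:
        return importlib.import_module(module_name)
    except ImportError:
        # Install into the same interpreter that is running this script.
        subprocess.check_call(
            [sys.executable, "-m", "pip", "install", pip_name or module_name]
        )
        # Make the freshly installed package visible to the import system.
        importlib.invalidate_caches()
        return importlib.import_module(module_name)
```

Under this sketch, a script would call `ensure_package("trafilatura")` and `ensure_package("requests")` before using either library. Because this runs pip automatically, it is worth reviewing the script before running it in a shared or locked-down environment.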

Bundled Scripts

| Script | Type | Description |
| --- | --- | --- |
| scripts/scrape.py | Python | Fetch URL and extract clean content |

Script Usage

# Extract article content as markdown
python3 scripts/scrape.py "https://example.com/blog-post"

# Extract as plain text
python3 scripts/scrape.py "https://example.com" --format text

# Keep raw HTML
python3 scripts/scrape.py "https://example.com" --format html

# Save to file
python3 scripts/scrape.py "https://example.com" --output page.md

# Include metadata (title, author, date)
python3 scripts/scrape.py "https://example.com/article" --metadata
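
The same invocations can be assembled programmatically when the script is called from other Python code. The sketch below builds the argument list from the flags shown above and runs it with subprocess. The names `build_command` and `run_scrape` are hypothetical, and the sketch assumes the script writes the extracted content to standard output when `--output` is not given, which this page does not confirm.

```python
import subprocess
import sys

# Script path as written in the usage examples above.
SCRIPT = "scripts/scrape.py"


def build_command(url, fmt="markdown", output=None, metadata=False):
    """Return the argv list for one scrape.py invocation (hypothetical helper)."""
    cmd = [sys.executable, SCRIPT, url]
    if fmt != "markdown":
        # markdown is the documented default, so the flag is only added otherwise.
        cmd += ["--format", fmt]
    if output:
        cmd += ["--output", output]
    if metadata:
        cmd.append("--metadata")
    return cmd


def run_scrape(url, **options):
    """Run the script and return whatever it prints.

    Assumes the extracted content goes to stdout; raises CalledProcessError
    if the script exits with a non-zero status.
    """
    result = subprocess.run(
        build_command(url, **options),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

Passing the arguments as a list rather than a shell string avoids quoting problems with URLs that contain characters such as `&` or `?`.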

Example

scrape this URL: https://example.com/article
extract the text from this webpage
fetch the content of this link and save it
read this article: https://blog.example.com/post-1
get the content from this website

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
