web-extractor

使用 jina.ai 提取网页干净文本并让 Agent 总结。触发词:提取网页、总结新闻、提取文章、获取页面内容

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "web-extractor" with this command: npx skills add kukuxnd/web-extractor

网页内容提取技能

使用 r.jina.ai 提取网页干净文本,过滤垃圾代码,提取干货内容。

工作流程

1. 提取网页内容

# 提取网页并保存为 md 文件
curl -s https://r.jina.ai/<URL> > /tmp/web-content.md

2. 读取并总结

让 Agent 读取生成的 md 文件,总结核心观点。

使用示例

用户说:"帮我总结这个新闻 https://www.bbc.com/news/tech..."

执行:

curl -s "https://r.jina.ai/https://www.bbc.com/news/technology-xxx" > /tmp/news.md

然后读取 /tmp/news.md 文件内容,分析并总结。

注意事项

  • r.jina.ai 会过滤掉 script、nav、广告 CSS 等垃圾代码
  • 提取后是极干净的纯文本,对 AI 零负担
  • 支持任何新闻网站、技术博客、文章页面
  • 文件默认保存到 /tmp/ 目录,可自定义路径

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

植物病害识别技能

Accurately identifies plant diseases based on computer vision and deep learning, supports both image and video input, outputs structured diagnostic reports i...

Registry SourceRecently Updated
Automation

Unbrowser

Cheap first-pass web browsing without launching Chrome — fetch SSR pages, follow links, query the DOM, run JS, detect bot-wall challenges. Escalate to OpenCl...

Registry SourceRecently Updated
Automation

Agent Analytics

Headless analytics management for AI builders shipping multi-surface products. Let your agent create projects, install tracking, compare surfaces, query resu...

Registry SourceRecently Updated
Automation

ml-engineer

Expert ML engineer specializing in machine learning model lifecycle, production deployment, and ML system optimization. Masters both traditional ML and deep...

Registry SourceRecently Updated