spider-cli-extraction

Spider CLI Extraction

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "spider-cli-extraction" with this command: npx skills add spider-rs/spider_skills/spider-rs-spider-skills-spider-cli-extraction

Spider CLI Extraction

Overview

Use this skill to run Spider CLI workflows with explicit runtime mode control.

Canonical source for cross-agent behavior: skills/core/spider-cli-extraction.md

Load references/cli-workflows.md when you need exact command patterns or mode-selection rules.

Workflow

  • Confirm CLI availability.

  • Prefer cargo run -p spider_cli -- ... from the Spider repo root.

  • If spider is globally installed, use spider ... for quick checks.

  • Choose the task mode.

  • Use crawl to collect links.

  • Use scrape to emit per-page JSON records and optionally include HTML.

  • Use download to persist page markup to disk.

  • Select runtime execution mode.

  • Use --headless for browser-rendered mode.

  • Use --http to force HTTP-only mode.

  • Omit both for default HTTP behavior.

  • Add scope controls.

  • Set --limit , --depth , --budget , and --blacklist-url .

  • Add --respect-robots-txt when policy compliance is required.

Quick Commands

Crawl links (default HTTP mode)

cargo run -p spider_cli -- --url https://example.com crawl --output-links

Browser mode on demand

cargo run -p spider_cli -- --url https://example.com --headless crawl --output-links

Scrape with HTML output

cargo run -p spider_cli -- --url https://example.com scrape --output-html

Script

Use scripts/spider_cli_helper.sh for wrappers:

./scripts/spider_cli_helper.sh verify-headless ./scripts/spider_cli_helper.sh crawl https://example.com --limit 20 --depth 2 ./scripts/spider_cli_helper.sh scrape https://example.com --output-html --output-links

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

github-tools

Interact with GitHub using the `gh` CLI. Use `gh issue`, `gh pr`, `gh run`, and `gh api` for issues, PRs, CI runs, and advanced queries.

Archived SourceRecently Updated
Coding

openclaw-version-monitor

监控 OpenClaw GitHub 版本更新,获取最新版本发布说明,翻译成中文, 并推送到 Telegram 和 Feishu。用于:(1) 定时检查版本更新 (2) 推送版本更新通知 (3) 生成中文版发布说明

Archived SourceRecently Updated
Coding

ask-claude

Delegate a task to Claude Code CLI and immediately report the result back in chat. Supports persistent sessions with full context memory. Safe execution: no data exfiltration, no external calls, file operations confined to workspace. Use when the user asks to run Claude, delegate a coding task, continue a previous Claude session, or any task benefiting from Claude Code's tools (file editing, code analysis, bash, etc.).

Archived SourceRecently Updated
Coding

ai-dating

This skill enables dating and matchmaking workflows. Use it when a user asks to make friends, find a partner, run matchmaking, or provide dating preferences/profile updates. The skill should execute `dating-cli` commands to complete profile setup, task creation/update, match checking, contact reveal, and review.

Archived SourceRecently Updated