pinchtab

Browser automation via HTTP API. Use for headless browser control, web automation, form filling, data extraction, and interaction with page elements. Supports launching instances, navigating pages, taking screenshots, extracting page structure, and clicking elements.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Install skill "pinchtab" with this command: npx skills add dexifried/dexifried-pinchtab

PinchTab Skill

Description

PinchTab is an HTTP server that provides programmatic control over a browser. It supports launching browser instances, navigating to pages, extracting page structure, and interacting with elements like buttons or forms.

When to Use

Use this skill for tasks like:

  • Automating browser workflows (e.g., logins, form submissions).
  • Extracting data or snapshots from web pages.
  • Testing interactive web elements.

Quick Start

The steps below use the helper scripts in the scripts/ folder:

1. Launching a Browser Instance

You can launch a new browser instance via the API:

bash scripts/launch_browser.sh

2. Navigating to a URL

Navigate to a URL with the following command:

bash scripts/navigate_to_url.sh https://example.com

3. Extracting Page Snapshot

Get the page structure and save it locally:

bash scripts/get_page_snapshot.sh

4. Clicking an Element

Click the element that matches a CSS selector:

bash scripts/click_element.sh "<css_selector>"
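
Steps 1 to 4 can also be chained from Python. The following is a minimal sketch, not part of the skill itself: the script paths are the ones shown above, while the helper names build_steps and run_steps are hypothetical, and it assumes each script exits with a non-zero status on failure.

```python
import subprocess
from typing import List


def build_steps(url: str, selector: str) -> List[List[str]]:
    """Return the argv list for each Quick Start step, in order.

    Script paths come from the steps above; this helper is hypothetical.
    """
    return [
        ["bash", "scripts/launch_browser.sh"],
        ["bash", "scripts/navigate_to_url.sh", url],
        ["bash", "scripts/get_page_snapshot.sh"],
        ["bash", "scripts/click_element.sh", selector],
    ]


def run_steps(steps: List[List[str]]) -> None:
    """Run each step in order and stop at the first failing script.

    check=True raises CalledProcessError on a non-zero exit status
    (assumption: the scripts report failure through their exit status).
    """
    for argv in steps:
        subprocess.run(argv, check=True)


# Example call (values are placeholders):
# run_steps(build_steps("https://example.com", "button[type=submit]"))
```

Passing each command as an argument list, rather than a single shell string, means the CSS selector needs no extra shell quoting.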

5. Taking Screenshots (Base64 Decode + Send to Telegram)

Capture a screenshot, decode the base64, and send to Telegram:

Bash:

export PINCHTAB_TOKEN="your_token"
export TELEGRAM_BOT_TOKEN="your_bot_token"
bash scripts/screenshot_and_send.sh <tab_id> <telegram_chat_id>

Python (more features):

export PINCHTAB_TOKEN="your_token"
python3 scripts/decode_screenshot.py <tab_id> \
  --output /path/to/screenshot.jpg \
  --send-telegram <chat_id> \
  --caption "My screenshot"
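
The decode step can be sketched in a few lines of Python. This is an illustration only, not the contents of scripts/decode_screenshot.py: the function names are hypothetical, and the handling of a "data:" prefix is an assumption, since the format of the screenshot payload is not described here. The sendPhoto URL follows the public Telegram Bot API.

```python
import base64
import binascii
from pathlib import Path


def decode_screenshot(payload: str) -> bytes:
    """Decode a base64 screenshot string into raw image bytes.

    Assumption: the payload may carry a "data:image/...;base64," prefix
    and may contain line breaks; both are stripped before decoding.
    """
    if payload.startswith("data:") and "," in payload:
        payload = payload.split(",", 1)[1]
    payload = "".join(payload.split())  # drop whitespace and newlines
    try:
        return base64.b64decode(payload, validate=True)
    except binascii.Error as exc:
        raise ValueError("payload is not valid base64") from exc


def save_screenshot(payload: str, output: str) -> Path:
    """Decode the payload and write the image bytes to the output path."""
    path = Path(output)
    path.write_bytes(decode_screenshot(payload))
    return path


def telegram_photo_url(bot_token: str) -> str:
    """Return the Telegram Bot API endpoint for the sendPhoto method."""
    return f"https://api.telegram.org/bot{bot_token}/sendPhoto"
```

Sending the saved file would then be a multipart POST to that URL with chat_id, photo, and an optional caption field.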

Example: Google Homepage

PinchTab successfully navigated to Google and extracted the page structure:

[Image: Google screenshot]

This demonstrates:

  • Browser launch and page navigation
  • Interactive element extraction (About, Store, Advertising links, etc.)
  • Readiness for further automation (clicking, form filling, data extraction)

Documentation

Check the references/ folder for detailed API documentation, common workflows, and troubleshooting tips.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
