redpincer

AI/LLM red team testing skill. Point at any LLM API endpoint and run automated security assessments. 160+ attack payloads across prompt injection, jailbreak, data extraction, and guardrail bypass. 20 variant transforms. Adaptive attack engine analyzes weaknesses and generates follow-ups. Heuristic response classifier, vulnerability heatmaps, regression testing, and exportable pen-test reports. For authorized security testing only.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "redpincer" with this command: npx skills add rustyorb/redpincer

RedPincer — AI/LLM Red Team Suite

Automated security testing for language models. Point at any LLM API endpoint, select attack modules, and run assessments with real-time results and exportable reports.

⚠️ For authorized security testing and research only. Only test systems you own or have explicit permission to audit.

Quick Start

# Clone and install
git clone https://github.com/rustyorb/pincer.git {baseDir}/redpincer
cd {baseDir}/redpincer
npm ci

# Run
npm run dev
# Dashboard at http://localhost:3000

For production:

npm run build
npx next start -H 0.0.0.0 -p 3000

What It Tests

CategoryPayloadsDescription
💉 Prompt Injection40Instruction override, delimiter confusion, indirect injection, payload smuggling
🔓 Jailbreak40Persona splitting, gradual escalation, hypothetical framing, roleplay exploitation
🔍 Data Extraction40System prompt theft, training data probing, membership inference, embedding extraction
🛡️ Guardrail Bypass40Output filter evasion, multi-language bypass, homoglyph tricks, context overflow

Total: 160 base payloads × 20 variant transforms = 3,200 test permutations

Supported Providers

OpenAI  ·  Anthropic  ·  OpenRouter  ·  Any OpenAI-compatible endpoint

Features

Attack Engine

  • 160+ payloads across 4 categories
  • Model-specific attacks (GPT, Claude, Llama variants)
  • 20 variant transforms (unicode, encoding, case rotation, etc.)
  • Attack chaining with template variables ({{previous_response}})
  • AI-powered payload generation — uses the target LLM to generate novel attacks against itself
  • Stop/cancel running attacks instantly

Analysis & Reporting

  • Heuristic response classifier with context-aware analysis
  • Reduced false positives — detects "explain then refuse" patterns
  • Vulnerability heatmap — visual category × severity matrix
  • Custom scoring rubrics with weighted grades (A+ to F)
  • Verbose 10-section pen-test reports with appendices
  • Multi-target comparison — side-by-side security profiles
  • Regression testing — save baselines, track fixes over time

Advanced Tools

ToolWhat It Does
CompareSame payloads against 2-4 targets simultaneously
AdaptiveAnalyzes weaknesses, generates targeted follow-ups
HeatmapVisual matrix of vulnerability rates by category/severity
RegressionSave baseline → re-run later → detect fixes or regressions
ScoringCustom rubrics with weighted category/severity/classification scores
ChainsMulti-step attacks with {{previous_response}} templates
Payload EditorCreate custom payloads with syntax highlighting + AI generation

Usage Workflow

1. Configure Target → Add LLM endpoint + API key + model
2. Select Categories → Pick attack types to test
3. Run Attack      → Stream results in real-time
4. Review Results  → Heuristic classification + severity scores
5. Adaptive        → Auto-generate follow-up attacks on weaknesses
6. Generate Report → Export comprehensive findings as Markdown

Architecture

  • All client-side — no server components, your API keys stay local
  • NDJSON streaming — real-time results during attack runs
  • Heuristic analysis — pattern-matching classifier (no LLM-based grading = no extra cost)
  • Zustand + localStorage — state persists across sessions

Companion Tool: RedClaw

For autonomous multi-strategy campaigns (CLI/TUI), see RedClaw — the autonomous red-teaming agent framework.

  • RedPincer = web dashboard, manual + automated testing
  • RedClaw = autonomous CLI agent, adaptive multi-strategy campaigns
  • Together = complete LLM security testing suite

Built by @rustyorb — Crack open those guardrails. 🦞

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Security

ZipCracker

The ultimate, high-performance ZIP password cracking suite by Hx0 Team. Empowers the Agent with autonomous CTF-level cracking workflows, dynamic dictionary g...

Registry SourceRecently Updated
0165
Profile unavailable
Security

web-recon

Website vulnerability scanner and security audit toolkit. Scan any website for security issues: open ports (nmap), exposed secrets, subdomain enumeration, di...

Registry SourceRecently Updated
1264
Profile unavailable
Security

Agentshield Audit

Trust Infrastructure for AI Agents - Like SSL/TLS for agent-to-agent communication. 77 security tests, cryptographic certificates, and Trust Handshake Protoc...

Registry SourceRecently Updated
0655
Profile unavailable