chough

Installation

Arch Linux: paru -S chough-bin macOS: brew install --cask hyperpuncher/tap/chough Windows: winget install chough Source: go install github.com/hyperpuncher/chough/cmd/chough@latest

Requires: ffmpeg for audio/video support

Quick Reference

# Basic transcription (text to stdout)
chough audio.mp3

# Pipe audio from stdin
cat audio.mp3 | chough

# JSON with timestamps
chough -f json podcast.mp3 > transcript.json

# WebVTT subtitles
chough -f vtt -o subs.vtt video.mp4

# Low memory (30s chunks)
chough -c 30 audiobook.mp3

# Use remote server (requires CHOUGH_URL)
chough --remote audio.mp3

Flags

Flag	Description	Default
`-c, --chunk-size`	Chunk size in seconds	60
`-f, --format`	Output: text, json, vtt	text
`-o, --output`	Output file	stdout
`-r, --remote`	Transcribe via CHOUGH_URL server	-
`--version`	Show version	-

Chunk Size Guide

15-30s: Low memory (~500MB), higher error rate
60s: Balanced (default) - ~1.6GB RAM

Remote Mode

Use --remote flag to transcribe via an HTTP server instead of local processing. Requires CHOUGH_URL environment variable.

# Set server URL
export CHOUGH_URL=http://localhost:8080

# Transcribe via remote server
chough --remote audio.mp3

Check for CHOUGH_URL env var → verify /health endpoint → use server if healthy, otherwise fallback to local CLI.

Endpoints

Method	Endpoint	Description
POST	`/transcribe`	Transcribe audio (file upload, URL, or base64)
GET	`/health`	Health check with queue status

Examples

# Upload file
curl -X POST http://localhost:8080/transcribe \
  -F "file=@audio.mp3" \
  -F "format=json" \
  -F "chunk_size=60"

# Transcribe from URL
curl -X POST http://localhost:8080/transcribe \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com/audio.mp3", "format": "vtt"}'

# Base64 audio
curl -X POST http://localhost:8080/transcribe \
  -H "Content-Type: application/json" \
  -d '{"base64": "...", "format": "text"}'

# Health check
curl http://localhost:8080/health

Performance

Duration	Time	Speed
15s	2.0s	7.4x realtime
1min	4.3s	14.1x realtime
5min	16.2s	18.5x realtime
30min	90.2s	19.9x realtime

Troubleshooting

Out of memory: Use -c 30 or -c 15 Model fails: Check internet, verify $XDG_CACHE_HOME is writable ffmpeg errors: Ensure ffmpeg is installed

Notes

First run downloads ~650MB model to $XDG_CACHE_HOME/chough/models
Auto-extracts audio from video files
Set CHOUGH_MODEL env var to use custom model path
Set CHOUGH_URL env var for --remote mode (must start with http:// or https://)
VTT groups tokens into subtitle cues automatically

Docs

GitHub

Safety Notice

Copy this and send it to your AI assistant to learn

Installation

Quick Reference

Flags

Chunk Size Guide

Remote Mode

Endpoints

Examples

Performance

Troubleshooting

Notes

Docs

Source Transparency

Related Skills

scrapling

brave-search

simplify-code

requesting-code-review