Groq Whisper
Cloud speech-to-text via Groq's Whisper API. No local model, no GPU, no fan spin.
Setup
- Get a free API key at
console.groq.com - Store it:
Or setmkdir -p ~/.config/groq echo '{"api_key":"gsk_your_key_here"}' > ~/.config/groq/credentials.json chmod 600 ~/.config/groq/credentials.jsonGROQ_API_KEYenv var.
Usage
# Transcribe an audio file
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg
# Specify language (default: en)
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg es
When to use
Call this script whenever you receive an audio/voice message attachment. Pass the file path directly — Groq handles ogg, mp3, wav, m4a, webm, and flac natively. No format conversion needed.
Details
- Model: whisper-large-v3 (best accuracy)
- Speed: Faster than real-time (typically <2s for a 5-minute clip)
- Cost: Free tier available, no credit card required
- Privacy: Groq does not retain input data or train on it
- Requires:
curl,jq, Groq API key