elevenlabs

Complete ElevenLabs AI audio platform: text-to-speech (TTS), speech-to-text (STT/Scribe), voice cloning, voice design, sound effects, music generation, dubbing, voice changer, voice isolator, and conversational voice agents. Use when working with audio generation, voice synthesis, transcription, audio processing, or building voice-enabled applications. Triggers: generate speech, clone voice, transcribe audio, create sound effects, compose music, dub video, change voice, isolate vocals, build voice agent, ElevenLabs API/SDK/CLI/MCP.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "elevenlabs" with this command: npx skills add jakerains/agentskills/jakerains-agentskills-elevenlabs

ElevenLabs AI Audio Platform

Complete guide to ElevenLabs' audio AI capabilities: speech synthesis, transcription, voice cloning, sound effects, music generation, dubbing, and conversational voice agents.

Quick Reference

CapabilityAPI/ToolUse Case
Text-to-Speechtext_to_speechGenerate lifelike speech from text
Speech-to-Textspeech_to_textTranscribe audio with Scribe v2
Voice Cloningvoice_cloneClone voices from audio samples
Voice Designtext_to_voiceCreate voices from text descriptions
Sound Effectstext_to_sound_effectsGenerate SFX from prompts
Musiccompose_musicGenerate studio-grade music
DubbingDubbing APITranslate video/audio (32 languages)
Voice Changerspeech_to_speechTransform voice while preserving emotion
Voice Isolatorisolate_audioRemove background noise
Voice AgentsAgents CLI/APIBuild conversational AI agents

Setup

API Key

# Environment variable
export ELEVENLABS_API_KEY="your-api-key"

# Or in .env file
ELEVENLABS_API_KEY=your-api-key

SDK Installation

# Python
pip install elevenlabs

# TypeScript/Node
npm install elevenlabs

MCP Server (for Claude Code, Cursor, etc.)

{
  "mcpServers": {
    "ElevenLabs": {
      "command": "uvx",
      "args": ["elevenlabs-mcp"],
      "env": {
        "ELEVENLABS_API_KEY": "your-api-key"
      }
    }
  }
}

Text-to-Speech (TTS)

Convert text to lifelike speech. See references/tts-models.md for model details.

Python SDK

from elevenlabs.client import ElevenLabs
from elevenlabs import play

client = ElevenLabs(api_key="your-api-key")

audio = client.text_to_speech.convert(
    text="Hello world!",
    voice_id="JBFqnCBsd6RMkjVDRZzb",  # George
    model_id="eleven_multilingual_v2",
    output_format="mp3_44100_128"
)
play(audio)

MCP Tool

mcp__ElevenLabs__text_to_speech
- text: "Your text here"
- voice_name: "Rachel" (or voice_id)
- model_id: "eleven_multilingual_v2"
- stability: 0.5, similarity_boost: 0.75
- speed: 1.0 (range: 0.7-1.2)

Model Selection

ModelLatencyLanguagesBest For
eleven_multilingual_v2~500ms29High quality, long-form
eleven_flash_v2_5~75ms32Real-time, agents
eleven_turbo_v2_5~250ms32Balanced quality/speed
eleven_v3 (alpha)Higher70+Emotional, dramatic

Speech-to-Text (Scribe)

Transcribe audio with 90+ language support. See references/stt-scribe.md for details.

Python SDK

result = client.speech_to_text.convert(
    file=open("audio.mp3", "rb"),
    model_id="scribe_v2",
    diarize=True  # Speaker detection
)
print(result.text)

MCP Tool

mcp__ElevenLabs__speech_to_text
- input_file_path: "/path/to/audio.mp3"
- diarize: true (speaker detection)
- language_code: "eng" (or auto-detect)

Features

  • 90+ languages with word-level timestamps
  • Speaker diarization (up to 48 speakers)
  • Keyterm prompting (bias toward specific words)
  • Entity detection (names, numbers, dates)
  • Realtime mode (~150ms latency)

Voice Cloning

Instant Voice Clone (MCP)

mcp__ElevenLabs__voice_clone
- name: "My Voice"
- files: ["/path/to/sample1.mp3", "/path/to/sample2.mp3"]
- description: "Professional male voice"

Requirements

  • Instant: 30+ seconds of clean audio
  • Professional: 30+ minutes for hyper-realistic clones
  • Creator plan or higher required

Voice Design

Create entirely new voices from text descriptions.

MCP Tool

mcp__ElevenLabs__text_to_voice
- voice_description: "A warm, friendly male voice with a slight British accent,
  perfect for audiobook narration"

Creates 3 voice previews to choose from. Use create_voice_from_preview to save.

Sound Effects

Generate cinematic sound effects from text. See references/sound-effects.md.

MCP Tool

mcp__ElevenLabs__text_to_sound_effects
- text: "Heavy wooden door creaking open slowly"
- duration_seconds: 3.0 (0.5-30 seconds)
- loop: false

Prompting Tips

  • Simple: "Glass shattering on concrete"
  • Sequences: "Footsteps on gravel, then a metallic door opens"
  • Musical: "90s hip-hop drum loop, 90 BPM"

Music Generation

Generate studio-grade music. See references/music-generation.md.

MCP Tool

mcp__ElevenLabs__compose_music
- prompt: "Upbeat electronic track with driving synths, 120 BPM"
- music_length_ms: 60000 (10s-5min)

Features

  • Complete control over genre, style, structure
  • Vocals or instrumental
  • Multilingual lyrics
  • Edit sections individually

Dubbing

Translate audio/video while preserving speaker identity. See references/dubbing.md.

  • 32 languages supported
  • Preserves emotion, timing, tone
  • Speaker separation (up to 9 speakers)
  • Files up to 1GB / 2.5 hours via API

Voice Changer (Speech-to-Speech)

Transform any voice while preserving performance nuances.

MCP Tool

mcp__ElevenLabs__speech_to_speech
- input_file_path: "/path/to/recording.mp3"
- voice_id: "target_voice_id"
  • Preserves whispers, laughs, emotional cues
  • 29 languages supported
  • Billed at 1000 chars/minute

Voice Isolator

Remove background noise from recordings.

MCP Tool

mcp__ElevenLabs__isolate_audio
- input_file_path: "/path/to/noisy_audio.mp3"
  • Supports audio and video files
  • Files up to 500MB / 1 hour

Conversational Voice Agents

Build and deploy voice-enabled AI agents. See references/voice-agents.md for comprehensive guide.

CLI Quick Start

# Install
npm install -g @elevenlabs/cli

# Initialize and authenticate
elevenlabs agents init
elevenlabs auth login

# Create agent
elevenlabs agents add "Support Bot" --template customer-service

# Deploy
elevenlabs agents push

Templates

TemplateUse Case
customer-serviceProfessional support, low temp
assistantGeneral purpose, balanced
voice-onlyVoice interactions only
text-onlyText conversations only
minimalQuick prototyping

Agent Tools

  • Server Tools: Webhook API calls
  • Client Tools: Frontend events
  • MCP Tools: Model Context Protocol servers
  • System Tools: transfer_to_number, agent_transfer, end_call

Voice Library

Search Voices (MCP)

mcp__ElevenLabs__search_voices
- search: "professional narrator"
- sort: "name" | "created_at_unix"

Search Public Library

mcp__ElevenLabs__search_voice_library
- search: "deep male"
- page_size: 10

Popular Voice IDs

VoiceIDStyle
Rachel21m00Tcm4TlvDq8ikWAMNeutral, professional
AdampNInz6obpgDQGcFmaJgBDeep, warm
BellaEXAVITQu4vr4xnSDxMaLSoft, gentle

Browse: elevenlabs.io/voice-library

Account & Billing

Check Subscription

mcp__ElevenLabs__check_subscription

List Models

mcp__ElevenLabs__list_models

Reference Documentation

TopicFile
TTS Models & Parametersreferences/tts-models.md
Speech-to-Text (Scribe)references/stt-scribe.md
Sound Effects Promptingreferences/sound-effects.md
Music Generationreferences/music-generation.md
Voice Agents (CLI/API)references/voice-agents.md
Agent Prompting Guidereferences/agent-prompting.md
Dubbing Guidereferences/dubbing.md

Pricing & Limits

  • TTS: Per character (Flash models 50% cheaper)
  • STT: Per hour of audio
  • Sound Effects: 40 credits/second when duration specified
  • Music: Per generation
  • See: elevenlabs.io/pricing

Concurrency Limits (by plan)

PlanMultilingual v2Flash/TurboSTT
Free248
Starter3612
Creator51020
Pro102040
Scale153060

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

onnx-webgpu-converter

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

nextjs-pwa

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

vercel-workflow

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

apple-foundation-models

No summary provided by upstream source.

Repository SourceNeeds Review