inworld

Inworld TTS API. Covers voice cloning, audio markups, timestamps. Use when integrating Inworld text-to-speech, cloning voices, adding audio markups (SSML-like), or aligning viseme timestamps. Keywords: Inworld, text-to-speech, TTS, voice cloning, visemes.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "inworld" with this command: npx skills add itechmeat/llm-code/itechmeat-llm-code-inworld

Inworld AI

Text-to-Speech platform with voice cloning, audio markups, and timestamp alignment.

Quick Navigation

TopicReference
Installationinstallation.md
Voice Cloningcloning.md
Voice Controlvoice-control.md
API Referenceapi.md

When to Use

  • Text-to-speech audio generation
  • Voice cloning from 5-15 seconds of audio
  • Emotion-controlled speech ([happy], [sad], etc.)
  • Word/phoneme timestamps for lip sync
  • Custom pronunciation with IPA

Models

ModelIDLatencyPrice
TTS 1.5 Maxinworld-tts-1.5-max~200ms$10/1M chars
TTS 1.5 Miniinworld-tts-1.5-mini~120ms$5/1M chars

Minimal Example

import requests, base64, os

response = requests.post(
    "https://api.inworld.ai/tts/v1/voice",
    headers={"Authorization": f"Basic {os.getenv('INWORLD_API_KEY')}"},
    json={"text": "Hello!", "voiceId": "Ashley", "modelId": "inworld-tts-1.5-max"}
)
audio = base64.b64decode(response.json()['audioContent'])

Key Features

  • 15 languages — en, zh, ja, ko, ru, it, es, pt, fr, de, pl, nl, hi, he, ar
  • Instant cloning — 5-15 seconds audio, no training
  • Audio markups[happy], [laughing], [sigh] (English only)
  • Timestamps — word, phoneme, viseme timing for lip sync
  • Streaming/voice:stream endpoint

Prohibitions

  • Audio markups work only in English
  • Use ONE emotion markup at text beginning
  • Match voice language to text language
  • Instant cloning may not work for children's voices or unique accents

Links

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

react-testing-library

No summary provided by upstream source.

Repository SourceNeeds Review
Coding

social-writer

No summary provided by upstream source.

Repository SourceNeeds Review
Coding

commits

No summary provided by upstream source.

Repository SourceNeeds Review
Coding

mantine-dev

No summary provided by upstream source.

Repository SourceNeeds Review