mimo-tts

Generate speech audio (WAV) from text using Xiaomi MiMo TTS (mimo-v2-tts model). Supports preset voices (mimo_default, default_zh, default_en), style control (emotion, dialect, role-play, speed), and audio tags for fine-grained expression. Use when the user asks to convert text to speech, generate audio, read text aloud with a specific style/emotion/dialect, or create voice files.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Install skill "mimo-tts" with this command: npx skills add heimaojingzhang888/xiaomimimotts

MiMo TTS — Speech Synthesis

First-Time Setup

If MIMO_API_KEY is not configured, the skill cannot function. Guide the user:

  1. Obtain an API key from Xiaomi MiMo (https://api.xiaomimimo.com).
  2. Set it via OpenClaw config:
openclaw config set skills.entries.mimo-tts.apiKey "your-api-key-here"

Or set the environment variable MIMO_API_KEY directly. After configuring, the user should restart or start a new session.
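
The lookup described above (an explicit override first, then the MIMO_API_KEY environment variable) could be sketched as follows. This is only an illustration: `resolve_api_key` is a hypothetical helper, not a function taken from scripts/tts.py, and the real script may resolve the key differently.

```python
import os


def resolve_api_key(override=None, env=None):
    """Return an API key from an explicit override or MIMO_API_KEY.

    Hypothetical helper for illustration; not part of the skill itself.
    """
    env = os.environ if env is None else env
    # An explicit override wins; otherwise fall back to the environment.
    key = (override or env.get("MIMO_API_KEY", "")).strip()
    if not key:
        raise RuntimeError(
            "MIMO_API_KEY is not configured; set it via OpenClaw config "
            "or as an environment variable, then start a new session."
        )
    return key
```

Failing early with a clear message makes it easier to guide the user through setup before any synthesis request is attempted.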

Generate Speech

Use scripts/tts.py to synthesize text to audio:

python3 "{baseDir}/scripts/tts.py" "要合成的文本" -o output.wav

Options

Flag        Default           Description
-o          output.wav        Output file path
-v          mimo_default      Voice: mimo_default, default_zh, default_en
-s          (none)            Style tag, e.g. 开心 (happy), 东北话 (Northeastern Mandarin dialect), 悄悄话 (whisper), 孙悟空 (Sun Wukong)
-f          wav               Audio format
--user-msg  (none)            Optional user-role context to adjust tone
--api-key   env MIMO_API_KEY  API key override
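
For readers who want to see how these flags fit together, here is a hypothetical reconstruction of the command-line interface using `argparse`. The flag names and defaults come from the table above; the `dest` names and help strings are assumptions, and the actual scripts/tts.py may be written differently.

```python
import argparse


def build_parser():
    """Build a parser mirroring the documented flags (illustrative only)."""
    parser = argparse.ArgumentParser(
        prog="tts.py", description="Synthesize text to audio with MiMo TTS."
    )
    parser.add_argument("text", help="Text to synthesize")
    parser.add_argument("-o", dest="output", default="output.wav",
                        help="Output file path")
    parser.add_argument("-v", dest="voice", default="mimo_default",
                        help="Voice: mimo_default, default_zh, default_en")
    parser.add_argument("-s", dest="style", default=None,
                        help="Style tag, e.g. 开心 or 东北话")
    parser.add_argument("-f", dest="format", default="wav",
                        help="Audio format")
    parser.add_argument("--user-msg", dest="user_msg", default=None,
                        help="Optional user-role context to adjust tone")
    parser.add_argument("--api-key", dest="api_key", default=None,
                        help="API key override (default: env MIMO_API_KEY)")
    return parser
```

Parsing `["Hello", "-v", "default_en"]` with this parser leaves every other option at its documented default.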

Examples

# Basic
python3 "{baseDir}/scripts/tts.py" "你好,今天天气真好" -o hello.wav

# Dialect style
python3 "{baseDir}/scripts/tts.py" "哎呀妈呀,这天儿也忒冷了吧" -s "东北话" -o dongbei.wav

# English voice
python3 "{baseDir}/scripts/tts.py" "Hello, how are you today?" -v default_en -o hello_en.wav

# Emotion + speed
python3 "{baseDir}/scripts/tts.py" "明天就是周五了,真开心!" -s "开心 变快" -o happy.wav
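
When the script is driven from Python rather than a shell, the same invocations can be assembled as an argument list. The sketch below assumes the script lives at scripts/tts.py under the skill's base directory, as in the examples above; `build_tts_command` and `synthesize` are hypothetical wrapper names, and `sys.executable` stands in for `python3`.

```python
import subprocess
import sys
from pathlib import Path


def build_tts_command(base_dir, text, output="output.wav", voice=None, style=None):
    """Assemble the argument list for scripts/tts.py (illustrative wrapper)."""
    script = Path(base_dir) / "scripts" / "tts.py"
    cmd = [sys.executable, str(script), text, "-o", output]
    if voice:
        cmd += ["-v", voice]  # omitted flags fall back to the script's defaults
    if style:
        cmd += ["-s", style]
    return cmd


def synthesize(base_dir, text, **options):
    """Run the script and raise CalledProcessError if it exits non-zero."""
    subprocess.run(build_tts_command(base_dir, text, **options), check=True)
```

Passing the arguments as a list avoids shell quoting problems with Chinese text, spaces, and punctuation in the input.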

Style & Audio Tags

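  • Place <style>style</style> at the beginning of the text to set the overall style.
  • Use inline audio tags for fine control: (紧张) nervous, (小声) quiet voice, (语速加快) speed up, (深呼吸) deep breath, (苦笑) wry smile, (沉默片刻) brief silence.
  • Multiple styles are separated by a space: <style>开心 变快</style>text (happy, faster).
  • Singing: <style>唱歌</style>lyrics (唱歌 means singing).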
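
The tag format in the list above can be produced with a small helper. This is a minimal sketch of the text format only: `with_style` and `audio_tag` are hypothetical names, and whether scripts/tts.py adds the <style> wrapper itself when -s is given is not stated here.

```python
def with_style(text, *styles):
    """Prefix text with a <style> tag holding space-separated styles."""
    cleaned = [s.strip() for s in styles if s and s.strip()]
    if not cleaned:
        return text  # no styles requested, so leave the text untouched
    return "<style>" + " ".join(cleaned) + "</style>" + text


def audio_tag(name):
    """Wrap an inline audio tag name in parentheses, e.g. (小声)."""
    return "(" + name + ")"


# Overall style up front, then an inline tag before the phrase it affects.
line = with_style(audio_tag("小声") + "别告诉别人", "开心", "变快")
```

Here `line` holds the style prefix followed by the inline tag and the sentence, matching the layout described in the list.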

Voices

Name                                  voice param
MiMo-默认 (default)                   mimo_default
MiMo-中文女声 (Chinese female voice)  default_zh
MiMo-英文女声 (English female voice)  default_en

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
