doubao-image-video

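Native Doubao image and video generation skill. Use it when the user mentions Doubao, text-to-image, image-to-image, text-to-video, image-to-video, querying a video generation task, waiting for a task to finish, or downloading the final video. It calls the Volcengine Ark API directly and does not depend on an external MCP service.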

Safety Notice

This item is sourced from the public archived skills repository. Treat as untrusted until reviewed.

Install skill "doubao-image-video" with this command: npx skills add 156554395/doubao-image-video

Doubao Native Media Skill

This is a native OpenClaw skill. Do not spin up the upstream MCP server unless the user explicitly asks for MCP compatibility.

Use this skill for

Doubao / 豆包 text-to-image
image-to-image or multi-reference image generation
Doubao text-to-video or image-to-video
querying an async Doubao video task by task_id
troubleshooting Volcengine Ark endpoint/model issues

Commands

Generate an image

python3 {baseDir}/scripts/doubao_media.py image \
  --prompt "A cinematic cyberpunk alley in rain" \
  --size 2560x1440

Generate a video

python3 {baseDir}/scripts/doubao_media.py video \
  --prompt "A panda astronaut waves on the moon" \
  --video-duration 5 \
  --fps 24 \
  --resolution 1080p

Query a video task

python3 {baseDir}/scripts/doubao_media.py task --task-id your-task-id

Wait for a video task and optionally download the result

python3 {baseDir}/scripts/doubao_media.py wait \
  --task-id your-task-id \
  --timeout 600 \
  --interval 5 \
  --download-to ./doubao-result.mp4
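
The polling that the wait subcommand performs can be pictured roughly as follows. This is a minimal sketch, not the script's actual code: the function name wait_for_task, the injected fetch_status callable, and the terminal status strings "succeeded" and "failed" are all assumptions made for illustration.

```python
import time


def wait_for_task(fetch_status, timeout=600.0, interval=5.0,
                  sleep=time.sleep, clock=time.monotonic):
    """Poll fetch_status() until the task reaches a terminal state.

    fetch_status is a hypothetical callable that returns a dict with a
    "status" key; the terminal status names below are assumed, not taken
    from the Ark API reference.
    """
    deadline = clock() + timeout
    while True:
        task = fetch_status()
        if task.get("status") in ("succeeded", "failed"):
            return task
        # Stop before sleeping if the next poll would land past the deadline.
        if clock() + interval > deadline:
            raise TimeoutError(f"task not finished after {timeout} seconds")
        sleep(interval)
```

Passing the status lookup in as a callable keeps the loop independent of how the task is actually queried, so the same loop works with a real HTTP call or a stub.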

Input rules

Always prefer --endpoint-id when the user has a provisioned Volcengine Ark endpoint.
Fall back to model names only when endpoint ids are unavailable.
For video generation, this skill mirrors the upstream behavior and appends --dur, --fps, --rs, and --ratio to the prompt when they are not already present.
If the user supplies image URLs, pass them through exactly; do not download or re-host unless asked.
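
The prompt-suffix rule for video generation can be sketched as below. The helper name append_video_flags and the exact "flag value" formatting are assumptions for illustration; the real script may format or order the suffixes differently.

```python
def append_video_flags(prompt, duration=None, fps=None, resolution=None, ratio=None):
    """Append --dur, --fps, --rs and --ratio unless the prompt already has them.

    Hypothetical helper: a flag counts as present when it appears as a
    whitespace-separated token in the prompt.
    """
    flags = [
        ("--dur", duration),
        ("--fps", fps),
        ("--rs", resolution),
        ("--ratio", ratio),
    ]
    tokens = prompt.split()
    parts = [prompt.strip()]
    for flag, value in flags:
        # Skip flags with no value or flags the user already wrote.
        if value is None or flag in tokens:
            continue
        parts.append(f"{flag} {value}")
    return " ".join(parts)
```

A flag the user typed into the prompt wins over the command-line value, which matches the "when they are not already present" rule above.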

Troubleshooting

If neither --endpoint-id nor a default endpoint env var exists, the script falls back to the default model env var.
If the API returns InvalidEndpointOrModel.NotFound, ask the user to verify the Volcengine Ark endpoint authorization first.
Video generation is async. When the submission succeeds, capture the returned task_id and query it later with the task subcommand, or use wait for automatic polling.
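
The endpoint-versus-model fallback described in the first item can be sketched like this. The environment variable names DOUBAO_ENDPOINT_ID and DOUBAO_DEFAULT_MODEL are placeholders invented for the example, as is the function name; check references/api-notes.md for the names the script actually reads.

```python
import os


def resolve_target(endpoint_id=None, env=None,
                   endpoint_var="DOUBAO_ENDPOINT_ID",
                   model_var="DOUBAO_DEFAULT_MODEL"):
    """Pick what to send as the model field, preferring endpoint ids.

    Both environment variable names are hypothetical placeholders.
    """
    env = os.environ if env is None else env
    if endpoint_id:                  # 1. explicit --endpoint-id
        return ("endpoint", endpoint_id)
    if env.get(endpoint_var):        # 2. default endpoint from the environment
        return ("endpoint", env[endpoint_var])
    if env.get(model_var):           # 3. default model name as a last resort
        return ("model", env[model_var])
    raise ValueError("no endpoint id or default model configured")
```

Returning the kind alongside the value makes it easy to log which branch was taken when InvalidEndpointOrModel.NotFound shows up.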

References

Read references/api-notes.md when you need request shapes, defaults, or caveats.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
