AI Content Pipeline

Build multi-step content creation pipelines via inference.sh CLI.

Quick Start

Requires inference.sh CLI (infsh ). Get installation instructions: npx skills add inference-sh/skills@agent-tools

infsh login

Simple pipeline: Generate image -> Animate to video

infsh app run falai/flux-dev --input '{"prompt": "portrait of a woman smiling"}' > image.json infsh app run falai/wan-2-5 --input '{"image_url": "<url-from-previous>"}'

Pipeline Patterns

Pattern 1: Image -> Video -> Audio

[FLUX Image] -> [Wan 2.5 Video] -> [Foley Sound]

Pattern 2: Script -> Speech -> Avatar

[LLM Script] -> [Kokoro TTS] -> [OmniHuman Avatar]

Pattern 3: Research -> Content -> Distribution

[Tavily Search] -> [Claude Summary] -> [FLUX Visual] -> [Twitter Post]

Complete Workflows

YouTube Short Pipeline

Create a complete short-form video from a topic.

1. Generate script with Claude

infsh app run openrouter/claude-sonnet-45 --input '{ "prompt": "Write a 30-second script about the future of AI. Make it engaging and conversational. Just the script, no stage directions." }' > script.json

2. Generate voiceover with Kokoro

infsh app run infsh/kokoro-tts --input '{ "prompt": "<script-text>", "voice": "af_sarah" }' > voice.json

3. Generate background image with FLUX

infsh app run falai/flux-dev --input '{ "prompt": "Futuristic city skyline at sunset, cyberpunk aesthetic, 4K wallpaper" }' > background.json

4. Animate image to video with Wan

infsh app run falai/wan-2-5 --input '{ "image_url": "<background-url>", "prompt": "slow camera pan across cityscape, subtle movement" }' > video.json

5. Add captions (manually or with another tool)

6. Merge video with audio

infsh app run infsh/media-merger --input '{ "video_url": "<video-url>", "audio_url": "<voice-url>" }'

Talking Head Video Pipeline

Create an AI avatar presenting content.

1. Write the script

infsh app run openrouter/claude-sonnet-45 --input '{ "prompt": "Write a 1-minute explainer script about quantum computing for beginners." }' > script.json

2. Generate speech

infsh app run infsh/kokoro-tts --input '{ "prompt": "<script>", "voice": "am_michael" }' > speech.json

3. Generate or use a portrait image

infsh app run falai/flux-dev --input '{ "prompt": "Professional headshot of a friendly tech presenter, neutral background, looking at camera" }' > portrait.json

4. Create talking head video

infsh app run bytedance/omnihuman-1-5 --input '{ "image_url": "<portrait-url>", "audio_url": "<speech-url>" }' > talking_head.json

Product Demo Pipeline

Create a product showcase video.

1. Generate product image

infsh app run falai/flux-dev --input '{ "prompt": "Sleek wireless earbuds on white surface, studio lighting, product photography" }' > product.json

2. Animate product reveal

infsh app run falai/wan-2-5 --input '{ "image_url": "<product-url>", "prompt": "slow 360 rotation, smooth motion" }' > product_video.json

3. Upscale video quality

infsh app run falai/topaz-video-upscaler --input '{ "video_url": "<product-video-url>" }' > upscaled.json

4. Add background music

infsh app run infsh/media-merger --input '{ "video_url": "<upscaled-url>", "audio_url": "https://your-music.mp3", "audio_volume": 0.3 }'

Blog to Video Pipeline

Convert written content to video format.

1. Summarize blog post

infsh app run openrouter/claude-haiku-45 --input '{ "prompt": "Summarize this blog post into 5 key points for a video script: <blog-content>" }' > summary.json

2. Generate images for each point

for i in 1 2 3 4 5; do infsh app run falai/flux-dev --input "{ "prompt": "Visual representing point $i: <point-text>" }" > "image_$i.json" done

3. Animate each image

for i in 1 2 3 4 5; do infsh app run falai/wan-2-5 --input "{ "image_url": "<image-$i-url>" }" > "video_$i.json" done

4. Generate voiceover

infsh app run infsh/kokoro-tts --input '{ "prompt": "<full-script>", "voice": "bf_emma" }' > narration.json

5. Merge all clips

infsh app run infsh/media-merger --input '{ "videos": ["<video1>", "<video2>", "<video3>", "<video4>", "<video5>"], "audio_url": "<narration-url>", "transition": "crossfade" }'

Pipeline Building Blocks

Content Generation

Step App Purpose

Script openrouter/claude-sonnet-45

Write content

Research tavily/search-assistant

Gather information

Summary openrouter/claude-haiku-45

Condense content

Visual Assets

Step App Purpose

Image falai/flux-dev

Generate images

Image google/imagen-3

Alternative image gen

Upscale falai/topaz-image-upscaler

Enhance quality

Animation

Step App Purpose

I2V falai/wan-2-5

Animate images

T2V google/veo-3-1-fast

Generate from text

Avatar bytedance/omnihuman-1-5

Talking heads

Audio

Step App Purpose

TTS infsh/kokoro-tts

Voice narration

Music infsh/ai-music

Background music

Foley infsh/hunyuanvideo-foley

Sound effects

Post-Production

Step App Purpose

Upscale falai/topaz-video-upscaler

Enhance video

Merge infsh/media-merger

Combine media

Caption infsh/caption-video

Add subtitles

Best Practices

Plan the pipeline first - Map out each step before running
Save intermediate results - Store outputs for iteration
Use appropriate quality - Fast models for drafts, quality for finals
Match resolutions - Keep consistent aspect ratios throughout
Test each step - Verify outputs before proceeding

Related Skills

Video generation models

npx skills add inference-sh/skills@ai-video-generation

Image generation

npx skills add inference-sh/skills@ai-image-generation

Text-to-speech

npx skills add inference-sh/skills@text-to-speech

LLM models for scripts

npx skills add inference-sh/skills@llm-models

Full platform skill

npx skills add inference-sh/skills@agent-tools

Browse all apps: infsh app list

Documentation

Content Pipeline Example - Official pipeline guide
Building Workflows - Workflow best practices

ai-content-pipeline

Safety Notice

Copy this and send it to your AI assistant to learn

Simple pipeline: Generate image -> Animate to video

1. Generate script with Claude

2. Generate voiceover with Kokoro

3. Generate background image with FLUX

4. Animate image to video with Wan

5. Add captions (manually or with another tool)

6. Merge video with audio

1. Write the script

2. Generate speech

3. Generate or use a portrait image

4. Create talking head video

1. Generate product image

2. Animate product reveal

3. Upscale video quality

4. Add background music

1. Summarize blog post

2. Generate images for each point

3. Animate each image

4. Generate voiceover

5. Merge all clips

Video generation models

Image generation

Text-to-speech

LLM models for scripts

Full platform skill

Source Transparency

Related Skills

python-executor

python-sdk

agent-tools

ai-image-generation