AI Content Pipeline — Documentation Index

73 AI models across 12 categories. Click-based CLI (aicp). Central model registry. YAML pipelines with parallel execution.

Key Facts

Package: video-ai-studio on PyPI, commands aicp / ai-content-pipeline
Version: 1.0.24, Python 3.10+
Registry: packages/core/ai_content_pipeline/ai_content_pipeline/registry.py, with model data in registry_data.py
CLI: Click framework in cli/click_app.py, commands in cli/commands/ (6 modules, 19 commands + vimax subgroup)
Providers: FAL AI (primary, 30+ models), Google (Gemini/Veo), ElevenLabs (TTS), OpenRouter (prompts), Replicate
API keys: FAL_KEY, GEMINI_API_KEY, ELEVENLABS_API_KEY, OPENROUTER_API_KEY in .env
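
Before running a pipeline it can help to confirm which of the keys listed above are present. The following is a minimal sketch, not part of the package: the helper name missing_api_keys is hypothetical, and it assumes the .env values have already been loaded into the process environment. Whether all four keys are required at once is not stated here; most likely only the keys for the providers you actually call are needed.

```python
import os

# Key names are taken from the Key Facts list; the helper itself is hypothetical.
REQUIRED_KEYS = (
    "FAL_KEY",
    "GEMINI_API_KEY",
    "ELEVENLABS_API_KEY",
    "OPENROUTER_API_KEY",
)


def missing_api_keys(env=None):
    """Return the names of keys that are unset or blank in `env`."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_KEYS if not env.get(name, "").strip()]


if __name__ == "__main__":
    missing = missing_api_keys()
    if missing:
        print("Missing API keys:", ", ".join(missing))
    else:
        print("All provider API keys are set.")
```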

Documentation Files

Load the specific file when Claude needs deeper information on that topic.

Models & Providers

File | Load When
models.md | Selecting models, checking pricing, comparing options across 12 categories
provider-comparison.md | Choosing between FAL AI, Google, ElevenLabs, OpenRouter, Replicate

CLI & API

File | Load When
cli-commands.md | CLI usage, command flags, global options (--json, --quiet, --stream, --input)
python-api.md | Python API: AIPipelineManager methods, data classes, error handling
aicp-vimax-commands.md | ViMax subgroup: novel2movie, idea2video, script2video pipelines

Architecture & Code

File | Load When
architecture.md | System design, data flow diagrams, component responsibilities
package-structure.md | File locations, import paths, module dependencies

Pipelines

File | Load When
yaml-pipelines.md | YAML config syntax, 10 step types, variable interpolation, dependencies
parallel-execution.md | Parallel groups, performance optimization, 2-3x speedup patterns

Content Creation

File | Load When
prompting.md | Writing effective prompts, templates, model-specific tips
video-tips.md | Image-to-video vs text-to-video, motion prompts, model selection
video-analysis.md | AI video analysis with Gemini, timeline/describe/transcribe modes

Optimization

File | Load When
cost-management.md | Pricing tables, budget strategies, cost estimation CLI/API
performance.md | Speed benchmarks, batching, caching, network optimization
best-practices.md | Project organization, pipeline design patterns, QA workflows

Troubleshooting

File | Load When
error-codes.md | Error codes AUTH/CFG/MDL/PIP/NET/FILE/RATE/COST/VAL with solutions
troubleshooting.md | Diagnostic steps, common issues, installation/API/network problems
faq.md | Frequently asked questions across 9 categories

Development

File | Load When
testing.md | Test strategies, mocks, fixtures, CI/CD, running pytest
security.md | API key security, input validation, production deployment
contributing.md | Development workflow, coding standards, adding models/providers
migration.md | Version upgrades, breaking changes, rollback instructions

Setup & Learning

File | Load When
setup.md | Installation (PyPI/source), venv, API keys, first pipeline
learning-path.md | Structured learning tracks: Quick Start, Comprehensive, Developer

Examples

File | Load When
basic-examples.md | Simple image/video generation, CLI examples
advanced-pipelines.md | Multi-stage production, A/B testing, batch processing
use-cases.md | Marketing, education, enterprise, gaming applications
integrations.md | Flask, FastAPI, Celery, webhook patterns

Model Categories Quick Reference

Category | Count | Key Models | Cost Range
Text-to-Image | 8 | flux_dev, flux_schnell, imagen4, nano_banana_pro | $0.001-0.08
Image-to-Image | 8 | photon, kontext, clarity, seededit | $0.015-0.05
Text-to-Video | 10 | veo3, kling_3_pro, sora_2, hailuo_pro | $0.08-6.00
Image-to-Video | 15 | veo_3_1_fast, kling_3_pro_i2v, sora_2_i2v | $0.08-3.60
Video-to-Video | 4 | kling_o3_pro_edit, kling_o3_standard_edit | $0.25-0.34/s
Avatar | 10 | omnihuman_v1_5, fabric_1_0, multitalk | $0.06-0.25/s
Image Understanding | 7 | gemini_describe, gemini_detailed, gemini_qa | $0.001-0.002
Prompt Generation | 5 | openrouter_video_prompt and style variants | $0.002
Text-to-Speech | 3 | elevenlabs, elevenlabs_turbo, elevenlabs_v3 | $0.03-0.08
Speech-to-Text | 1 | scribe_v2 | $0.008/min
Add Audio | 1 | thinksound | $0.001/s
Upscale Video | 1 | topaz | ~$1.50/video
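
For the categories priced per second, a rough budget follows from multiplying the range by the clip length. The sketch below only illustrates that arithmetic with the ranges from the table above. It is not the package's cost estimation CLI/API (see cost-management.md for that), and the dictionary keys and function name are hypothetical.

```python
# Per-second price ranges in USD, copied from the quick-reference table.
# The category keys are illustrative names, not identifiers from the package.
PER_SECOND_RANGES = {
    "video_to_video": (0.25, 0.34),
    "avatar": (0.06, 0.25),
    "add_audio": (0.001, 0.001),
}


def estimate_cost_range(category, seconds):
    """Return (low, high) USD estimates for `seconds` of output."""
    if seconds < 0:
        raise ValueError("seconds must be non-negative")
    low, high = PER_SECOND_RANGES[category]
    # Round to hide floating-point noise in the products.
    return round(low * seconds, 4), round(high * seconds, 4)


low, high = estimate_cost_range("video_to_video", 10)
print(f"10 s video-to-video edit: ${low:.2f} to ${high:.2f}")
# prints: 10 s video-to-video edit: $2.50 to $3.40
```

Actual charges depend on the specific model and its options, so treat the result as a bracket rather than a quote.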

CLI Commands Quick Reference

Core generation

aicp generate-image --text "prompt" --model flux_dev
aicp create-video --text "prompt" --video-model kling_3_pro
aicp run-chain --config pipeline.yaml [--parallel] [--stream] [--dry-run]

Media operations

aicp generate-avatar --image-url URL --audio-url URL --model omnihuman_v1_5
aicp analyze-video -i video.mp4 [-t timeline|describe|transcribe]
aicp transcribe --input audio.mp3 [--srt] [--raw-json]
aicp transfer-motion --image-url URL --video-url URL
aicp upscale-image --image photo.png --upscale 2
aicp generate-grid --text "prompt" --layout 2x2

Discovery

aicp list-models [--category X] [--provider X] [--json]
aicp list-avatar-models | list-video-models | list-motion-models | list-speech-models

Project

aicp setup | init-project | organize-project | structure-info | create-examples

ViMax (novel-to-video)

aicp vimax idea2video --idea "concept"
aicp vimax novel2movie --novel novel.txt [--storyboard-only]
aicp vimax script2video --script story.txt
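
When driving the CLI from a script, it is usually safer to build the argument list explicitly than to concatenate a shell string. The sketch below assembles a run-chain invocation using only the flags listed above. The helper run_chain_argv is hypothetical and not part of the package, and actually executing the command assumes aicp is installed and on PATH.

```python
import shlex


def run_chain_argv(config, parallel=False, stream=False, dry_run=False):
    """Build the argument list for `aicp run-chain` (hypothetical helper)."""
    argv = ["aicp", "run-chain", "--config", config]
    if parallel:
        argv.append("--parallel")
    if stream:
        argv.append("--stream")
    if dry_run:
        argv.append("--dry-run")
    return argv


argv = run_chain_argv("pipeline.yaml", parallel=True, dry_run=True)
print(shlex.join(argv))
# prints: aicp run-chain --config pipeline.yaml --parallel --dry-run
# To execute it: subprocess.run(argv, check=True)
```

Passing the list straight to subprocess.run avoids shell quoting problems with config paths that contain spaces.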

Architecture Summary

CLI (Click) → Pipeline Manager → Providers → External APIs
      ↓                                           ↑
Config (YAML)              FAL / Google / ElevenLabs / OpenRouter
      ↓
Executor → Parallel Engine → Results → Output Files

Central Registry: registry.py defines ModelDefinition and ModelRegistry; registry_data.py registers all 73 models
Auto-Discovery: Generator classes use MODEL_KEY class attributes
CLI: Root Click group in cli/click_app.py, commands auto-registered at import time
Pipeline: manager.py orchestrates, executor.py runs steps, parallel.py handles concurrency
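
The registry and MODEL_KEY pattern summarized above can be illustrated with a small, self-contained sketch. Only the names ModelDefinition, ModelRegistry, and MODEL_KEY come from this document. The fields, methods, category strings, and provider values below are assumptions made for illustration, and the real classes in registry.py may look quite different.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelDefinition:
    # Field names are illustrative; the real class may carry different data.
    key: str
    category: str
    provider: str


class ModelRegistry:
    """Minimal in-memory registry keyed by model key (assumed API)."""

    _models = {}

    @classmethod
    def register(cls, model):
        if model.key in cls._models:
            raise ValueError(f"duplicate model key: {model.key}")
        cls._models[model.key] = model

    @classmethod
    def get(cls, key):
        return cls._models[key]

    @classmethod
    def list_by_category(cls, category):
        return [m for m in cls._models.values() if m.category == category]


# Stand-in for registry_data.py: registrations run once at import time.
# Provider strings here are placeholders, not confirmed values.
ModelRegistry.register(ModelDefinition("flux_dev", "text_to_image", "fal"))
ModelRegistry.register(ModelDefinition("kling_3_pro", "text_to_video", "fal"))


class FluxDevGenerator:
    # The generator names its model once; everything else is looked up.
    MODEL_KEY = "flux_dev"

    @classmethod
    def definition(cls):
        return ModelRegistry.get(cls.MODEL_KEY)


print(FluxDevGenerator.definition().category)
# prints: text_to_image
```

Keeping model metadata in one place means a generator class only needs its key, which is presumably what makes auto-discovery by MODEL_KEY practical.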

Testing Quick Reference

python -m pytest tests/ -v                    # Full suite (~844 tests)
python -m pytest tests/test_registry.py -v    # Registry tests
python -m pytest tests/test_click_app.py -v   # CLI tests
python scripts/validate_registry.py           # Registry validation
