videoagent-video-studio

Generate short AI videos from text or images — text-to-video, image-to-video, and reference-based generation — with zero API key setup. Use when the user wants to create a video clip, animate an image, or generate video from a description.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "videoagent-video-studio" with this command: npx skills add pexoai/pexo-skills/pexoai-pexo-skills-videoagent-video-studio

🎬 VideoAgent Video Studio

Use when: User asks to generate a video, create a video from text, animate an image, make a short clip, or produce AI video.

Generate short AI videos with 7 backends. This skill picks the right mode (text-to-video or image-to-video), enhances the prompt for best results, and returns the video URL.


Quick Reference

User IntentModeTypical Duration
"Make a video of..." (no image)text-to-video4–10 s
"Animate this image" / "Make this move"image-to-video4–6 s
"Turn this into a video with..."image-to-video4–6 s
Cinematic, story, adPrefer text-to-video with detailed prompt5–10 s

Generation Modes

ModeDescriptionModels
text-to-videoText prompt only → videominimax, kling, veo, hunyuan, grok, seedance
image-to-videoSingle image + prompt → animated clipminimax, kling, veo, pixverse, grok, seedance
reference-basedReference images/video → consistent outputminimax, kling, veo, hunyuan, grok, seedance

Models (use --model <id>)

Model IDT2VI2VReferenceNotes
minimaxSubject reference image, character consistency
klingMulti-element / character / keyframe (O3)
veoGoogle Veo 3.1, multiple reference images
hunyuanVideo-to-video style transfer
pixverseStylized image-to-video
grokVideo editing via reference video
seedanceSeedance 1.5 Pro, synchronized audio, 4–12 s

Full model details and endpoint reference: references/models.md.


How to Generate a Video

Step 1 — Choose mode and enhance the prompt

  • Text-to-video: Expand with subject, action, camera movement, lighting, and style. Be specific about motion (e.g. "camera slowly zooms in", "character walks left to right").
  • Image-to-video: Describe the motion to apply to the image (e.g. "gentle breeze in the hair", "camera pans across the scene"). See references/prompt_guide.md for patterns.

Step 2 — Run the script

Text-to-video:

node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --prompt "<enhanced prompt>" \
  --duration <seconds> \
  --aspect-ratio <ratio>

Image-to-video:

node {baseDir}/tools/generate.js \
  --mode image-to-video \
  --prompt "<motion description>" \
  --image-url "<public image URL>" \
  --duration <seconds> \
  --aspect-ratio <ratio>

Parameters:

ParameterDefaultDescription
--modetext-to-videotext-to-video or image-to-video
--prompt(required)Scene or motion description
--image-urlRequired for image-to-video; public image URL
--duration5Length in seconds (typically 4–10)
--aspect-ratio16:916:9, 9:16, 1:1, 4:3, 3:4
--modelautoModel ID (e.g. kling, veo, grok, seedance); auto = proxy picks

Other commands:

CommandDescription
node tools/generate.js --list-modelsList available models from the proxy
node tools/generate.js --status --job-id <id>Check async job status

Step 3 — Return the result

The script returns JSON:

{
  "success": true,
  "mode": "text-to-video",
  "videoUrl": "https://...",
  "duration": 5,
  "aspectRatio": "16:9"
}

Send videoUrl to the user.


Example Conversations

User: "Generate a short video of a cat walking in the rain, cinematic."

node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --prompt "A cat walking through rain, wet streets, neon reflections, cinematic lighting, slow motion, 4K" \
  --duration 5 \
  --aspect-ratio 16:9

User: "Animate this photo" (user uploads a landscape)

node {baseDir}/tools/generate.js \
  --mode image-to-video \
  --prompt "Gentle clouds moving across the sky, subtle grass movement, cinematic atmosphere" \
  --image-url "https://..." \
  --duration 5 \
  --aspect-ratio 16:9

User: "Make a 10-second vertical video of a coffee pour, slow motion."

node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --prompt "Close-up of coffee pouring into a white cup, slow motion, steam rising, soft lighting, product shot" \
  --duration 10 \
  --aspect-ratio 9:16

User: "Use Google Veo for a cinematic shot."

node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --model veo \
  --prompt "A dragon flying through cloudy skies, cinematic lighting, 8s" \
  --duration 8 \
  --aspect-ratio 16:9

User: "Animate this portrait."

node {baseDir}/tools/generate.js \
  --mode image-to-video \
  --model grok \
  --prompt "Gentle smile, subtle head turn" \
  --image-url "https://..." \
  --duration 5

Setup

Zero API keys by default. Requests go through a hosted proxy. Set these for a custom proxy or token:

VariableRequiredDescription
VIDEO_STUDIO_PROXY_URLNoProxy base URL
VIDEO_STUDIO_TOKENNoAuth token if the proxy requires it

Knowledge Base

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

videoagent-image-studio

No summary provided by upstream source.

Repository SourceNeeds Review
4.9K-pexoai
Automation

videoagent-audio-studio

No summary provided by upstream source.

Repository SourceNeeds Review
3.2K-pexoai
Automation

pexo-agent

No summary provided by upstream source.

Repository SourceNeeds Review
843-pexoai
General

seedance-2.0-prompter

No summary provided by upstream source.

Repository SourceNeeds Review
349-pexoai