video-from-reference

Video from Reference

Generate videos with consistent subject appearance using reference images via Google DeepMind's Veo 3.1 model on fal.ai.

See references/fal-api.md for setup, Python patterns, and error handling.

API Endpoint

fal-ai/veo3.1/reference-to-video

Parameters

Required

prompt (string): Text description of the video scene and action
image_urls (array of strings): URLs of reference images for consistent subject appearance (use fal_client.upload_file() for local files)

Optional

Parameter Type Default Options

duration

string "8s" "8s"

resolution

string "720p" "720p", "1080p"

generate_audio

boolean true Disable to save ~50% credits

CLI Script

python3 scripts/fal_generate.py
--endpoint video-from-reference
--prompt "The character walks through a sunny park"
--images ref-front.jpg ref-side.jpg
--video-resolution 1080p
--output video.mp4

Tips

Provide multiple reference images from different angles for better subject consistency
Use clear, well-lit reference images with the subject prominently visible
Describe both the scene environment and the subject's actions in the prompt
More reference images generally lead to better subject consistency (2-4 recommended)
Works best with distinct, recognizable subjects (people, pets, products)
Upload local files with fal_client.upload_file() before passing to image_urls

video-from-reference

Safety Notice

Copy this and send it to your AI assistant to learn

Source Transparency

Related Skills

photographer-testino

photographer-lindbergh

photographer-lachapelle

photographer-vonunwerth