fal-vision

Analyze and understand images using fal.ai vision models — segmentation, detection, OCR, captioning, and visual QA.

Scripts

Script Purpose

analyze.sh

Analyze an image (segment, detect, OCR, describe, QA)

Usage

Segment Objects

./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation segment --query "the red car"

Detect Objects

./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation detect

Extract Text (OCR)

./scripts/analyze.sh --image-url "https://example.com/document.jpg" --operation ocr

Describe Image

./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation describe

Visual QA

./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation qa --query "How many people are in this image?"

Arguments

Argument Description Required

--image-url

URL of image to analyze Yes

--operation

segment, detect, ocr, describe, qa Yes

--query / -q

Text prompt for segment/qa operations For segment/qa

--model / -m

Override model endpoint No

Finding Models

To discover the best and latest vision/analysis models, use the search API:

Search for segmentation models

bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "segmentation"

Search for object detection models

bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "object detection"

Search for OCR models

bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "ocr"

Search for image captioning / visual QA models

bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "caption" bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "visual question"

Or use the search_models MCP tool with keywords like "segmentation", "detection", "ocr", "caption", "vision".

fal-vision

Safety Notice

Copy this and send it to your AI assistant to learn

Search for segmentation models

Search for object detection models

Search for OCR models

Search for image captioning / visual QA models

Source Transparency

Related Skills

fal-image-edit

fal-generate

fal-audio

fal-upscale