gemini-system

Gemini System — Multimodal File Processing

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "gemini-system" with this command: npx skills add del-taiseiozaki/claude-code-orchestra/del-taiseiozaki-claude-code-orchestra-gemini-system

Gemini System — Multimodal File Processing

Gemini CLI is specialized for multimodal file processing (PDF, video, audio, image).

Detailed rules: .claude/rules/gemini-delegation.md

Research/codebase analysis: Use general-purpose subagent (Opus) instead — Opus now supports 1M context.

Multimodal File Processing

Extract content from PDF, video, audio, and image files.

PDF

gemini -p "Extract: {what to extract}" < /path/to/file.pdf 2>/dev/null

Video

gemini -p "Summarize: key concepts, timestamps" < /path/to/video.mp4 2>/dev/null

Audio

gemini -p "Transcribe and summarize: decisions, action items" < /path/to/audio.mp3 2>/dev/null

Image (diagrams, charts)

gemini -p "Analyze: components, relationships, data flow" < /path/to/diagram.png 2>/dev/null

Target Extensions

PDF .pdf

Video .mp4 , .mov , .avi , .mkv , .webm

Audio .mp3 , .wav , .m4a , .flac , .ogg

Images (advanced analysis) .png , .jpg , .jpeg , .gif , .webp , .svg

Simple screenshot inspection can be done directly with Claude's Read tool.

Auto-Trigger

When multimodal files appear in a task, automatically pass them to Gemini without waiting for user instructions.

When NOT to Use Gemini

Task Correct Owner

Research and investigation general-purpose subagent (Opus)

Codebase analysis general-purpose subagent (Opus)

Design and planning Codex

Debugging Codex

Code implementation Claude / Subagents

How to Use

Subagent Pattern (for large outputs)

Task tool parameters:

  • subagent_type: "gemini-explore"

  • prompt: | {task description}

    gemini -p "{prompt}" < /path/to/file 2>/dev/null

    Return CONCISE summary (5-7 bullet points).

Direct Call (for short extractions)

gemini -p "{what to extract}" < /path/to/file 2>/dev/null

Language Protocol

  • Ask Gemini in English

  • Receive response in English

  • Report to user in the user's language

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

simplify

No summary provided by upstream source.

Repository SourceNeeds Review
Coding

codex-system

No summary provided by upstream source.

Repository SourceNeeds Review
Coding

init

No summary provided by upstream source.

Repository SourceNeeds Review
Coding

checkpointing

No summary provided by upstream source.

Repository SourceNeeds Review