π― Quick Reference (30 seconds)
Purpose: Professional image generation using Google's Nano Banana Pro (Gemini 3 Pro Image Preview).
Key Features:
-
Text-to-Image: Detailed prompts β 1K/2K/4K resolution images
-
Image-to-Image: Style transfer, object manipulation, editing
-
Real-time Grounding: Google Search integration for factual content
-
Multi-Reference: Up to 14 reference images (6 objects + 5 humans)
-
Advanced Text: Sophisticated text rendering directly in images
Two Models:
-
Nano Banana Pro (gemini-3-pro-image-preview) - Professional quality, 10-60s
-
Gemini 2.5 Flash (gemini-2.5-flash-image) - Fast, ~5-15s
Implementation Guide (5 minutes)
Features
-
Text-to-Image generation with 1K/2K/4K resolutions
-
Image-to-Image editing and style transfer
-
Multi-turn refinement for iterative improvements
-
Reference image guidance (up to 14 references)
-
Real-time Google Search grounding for factual content
-
Advanced text rendering directly in images
When to Use
-
Generating professional visual assets for documentation or marketing
-
Creating UI mockups and design concepts quickly
-
Producing social media graphics and promotional images
-
Illustrating technical documentation with custom diagrams
-
Rapid prototyping of visual ideas before final design work
Core Patterns
Pattern 1: Structured Prompt for Quality
prompt = """ A serene Japanese garden at golden hour. Lighting: warm sunset light filtering through maple trees. Camera: wide-angle 35mm lens, low angle shot. Composition: Rule of thirds, stone path leading to pagoda. Color palette: warm gold, jade green, soft cream. Style: photorealistic with slight cinematic color grading. Quality: 4K resolution. Final output: PNG. """
Pattern 2: Multi-Turn Refinement
-
Generate initial image with base prompt
-
Review output and identify areas for improvement
-
Provide targeted refinement: "Make sky more dramatic"
-
Iterate up to 5 turns for perfect result
Pattern 3: Reference-Guided Generation
Use reference images to guide style
generate_image( prompt="Mountain landscape in the style of reference", reference_images=["style_ref.png", "composition_ref.png"], resolution="2K", aspect_ratio="16:9" )
π Core Patterns (5-10 minutes)
Pattern 1: Prompt Structure for Quality Images
Key Concept: Well-structured prompts generate better images
Template:
[Scene Description] A [adjective] [subject] doing [action]. Setting: [location] with [environmental details].
[Photographic Elements] Lighting: [type], creating [mood]. Camera: [angle] shot with [lens] lens. Composition: [framing_details].
[Color & Style] Color palette: [colors]. Style: [art_style]. Quality: [resolution]. Final output: [format].
Example:
A serene Japanese garden at golden hour. Lighting: warm sunset light, creating peaceful mood. Camera: wide-angle 35mm lens shot. Color palette: gold, jade green, cream. Quality: 4K photorealistic. Final: PNG.
Pattern 2: Text-to-Image Generation
Key Concept: Generate professional images from text prompts
Basic Flow:
-
Write detailed, structured prompt
-
Choose resolution (1K, 2K, 4K)
-
Select aspect ratio (1:1, 16:9, 3:2, etc.)
-
Enable Google Search for current information (optional)
-
Generate and retrieve Base64 PNG
Execution:
image_data = generate_image( prompt="Your detailed prompt here", resolution="2K", aspect_ratio="16:9", enable_google_search=True, # For current info thinking_process=True # Auto-optimize )
Pattern 3: Image-to-Image Editing
Key Concept: Transform existing images with detailed instructions
Common Tasks:
-
Style Transfer: Convert to art style (Van Gogh, anime, etc.)
-
Object Manipulation: Add, remove, or modify elements
-
Composition Change: Reframe, zoom, or reposition subjects
-
Quality Enhancement: Upscale, improve detail, adjust colors
Flow:
-
Load original image
-
Write transformation instruction
-
Reference images (optional)
-
Apply edit maintaining coherence
-
Retrieve edited image
Pattern 4: Multi-Turn Refinement
Key Concept: Iteratively improve images through conversation
Workflow:
-
Generate initial image
-
Review output
-
Provide refinement instruction
-
Regenerate with improvements
-
Repeat (max 5 turns)
Example:
Turn 1: "A mountain landscape at sunset" Turn 2: "Make the sky more dramatic with purple clouds" Turn 3: "Add a lone tree in foreground"
Pattern 5: Reference Image Guidance
Key Concept: Use reference images to guide generation style
Supported References:
-
Up to 6 object references
-
Up to 5 human references
-
Style influences
-
Composition guides
Usage:
generate_image( prompt="Similar style to reference", reference_images=[ "path/to/style_reference.png", "path/to/composition_ref.png" ] )
π Advanced Documentation
This Skill uses Progressive Disclosure. For detailed implementation:
-
modules/prompt-engineering.md - Professional prompt templates
-
modules/api-reference.md - Complete API documentation
-
modules/examples.md - Real-world usage examples
-
modules/troubleshooting.md - Common issues and solutions
π¨ Model Selection Guide
Choose Nano Banana Pro when:
-
Professional quality required
-
2K/4K resolution needed
-
Complex compositions
-
Sophisticated text in images
-
Real-time information important
-
Budget allows (higher cost)
Choose Gemini 2.5 Flash when:
-
Quick iterations needed
-
Prototyping and testing
-
High volume generation
-
1K resolution sufficient
-
Speed critical
-
Cost sensitive
Quick Reference (30 seconds)
Core Purpose: Professional AI image generation using Nano Banana Pro (Gemini 3 Pro) and Gemini 2.5 Flash.
Key Features: Text-to-image, image-to-image editing, multi-turn refinement, reference guidance, 4K resolution.
When to Use: Visual asset creation, prototyping, documentation, UI mockups, marketing materials.
Works Well With
Agents:
-
design-uiux - UI/UX design integration
-
code-frontend - Frontend asset implementation
-
workflow-docs - Visual documentation generation
Skills:
-
moai-lang-unified - UI/UX implementation with generated assets
-
moai-docs-generation - Create visual documentation
-
moai-cc-claude-md - Embed generated images in markdown
-
moai-domain-frontend - Frontend integration
Commands:
-
/moai:3-sync
-
Documentation with visual assets
-
/moai:9-feedback
-
Image generation improvements
π Integration with Other Skills
Typical Workflow:
-
Use this Skill to generate visual assets
-
Use moai-domain-frontend to implement in UI
-
Use moai-docs-generation to document with images
π Version History
1.0.1 (2025-11-23)
-
π Refactored with Progressive Disclosure pattern
-
π Detailed prompts moved to modules/
-
β¨ Core patterns highlighted in SKILL.md
-
β¨ Added model selection guide
1.0.0 (2025-11-12)
-
β¨ Nano Banana Pro (Gemini 3 Pro) support
-
β¨ Text-to-Image and Image-to-Image
-
β¨ Multi-turn refinement capability
-
β¨ Reference image guidance
Maintained by: alfred Domain: Image Generation & Visual Creation Generated with: MoAI-ADK Skill Factory