Image Generation with Gemini

Name: nano-banana-pro
Rating: 62
Author: sanyuan0704

Generate images using Google's Gemini model (gemini-3-pro-image-preview).

Script Directory

Agent Execution:

•SKILL_DIR = this SKILL.md file's directory
•Script path = ${SKILL_DIR}/scripts/generate_image.py

Prerequisites

•Python 3.8+
•Google API key with Gemini access
•google-genai package installed

Installation

bash

pip install google-genai

Environment Variables

Variable	Description
`GOOGLE_API_KEY`	Google API key (required)

Usage

bash

# Basic usage
python ${SKILL_DIR}/scripts/generate_image.py --prompt "A serene mountain landscape" --output "output.png"

# With custom model (default: gemini-3-pro-image-preview)
python ${SKILL_DIR}/scripts/generate_image.py --prompt "A cute cat" --output "cat.png" --model "gemini-3-pro-image-preview"

# With temperature control
python ${SKILL_DIR}/scripts/generate_image.py --prompt "A futuristic city" --output "city.png" --temperature 0.8

Options

Option	Description
`--prompt <text>`	Prompt text for image generation (required)
`--output <path>`	Output image filename (default: output.png)
`--model <id>`	Gemini image model (default: gemini-3-pro-image-preview)
`--temperature <float>`	Sampling temperature (default: 1.0)

How It Works

•Takes a text prompt as input
•Sends the prompt to Gemini's image generation API
•Streams the response and extracts inline image data
•Saves the generated image to the specified output file

Example Prompts

•"A serene mountain landscape at sunset"
•"A cute cartoon cat playing with yarn"
•"A futuristic cityscape with flying cars"
•"A watercolor painting of a flower garden"
•"A minimalist logo design for a coffee shop"

Error Handling

•If no API key is set, the script will fail with an authentication error
•If image generation fails, the script outputs "生成图片失败"
•On success, outputs "图片已保存到 {output_path}"

Notes

•Uses streaming API for efficient response handling
•Supports PNG output format
•Model must support IMAGE response modality