Image Describer

Name: image-describer
Rating: 76
Author: usa-tech-lab

This skill allows you to analyze image files by providing a prompt and the file path. It uses the gemini-3-flash-preview model for fast and accurate vision analysis.

Workflow for Agents

When you encounter an image file and need to understand its content:

•Identify the absolute path of the image file.
•Decide what information you need from the image (e.g., general description, text extraction, UI component identification).
•Execute the describe.py script.

Example Usage

bash

python3 scripts/describe.py --prompt "Extract all text from this image." --image "screenshot.png"

Specific Use Cases

•OCR: python3 scripts/describe.py --prompt "Extract all text from this image." --image "screenshot.png"
•UI Analysis: python3 scripts/describe.py --prompt "Identify the main UI components and their layout." --image "mockup.jpg"
•Icon Identification: python3 scripts/describe.py --prompt "What does this icon represent?" --image "icon.svg"

Resource Details

•scripts/describe.py: Vertex AI SDK (google-genai) を使用して、画像や PDF を解析する Python スクリプト。