AgentSkillsCN

gemini-image

当用户想要创建图片、绘画、涂鸦或生成艺术作品时,使用AI生成图像。支持文本到图像和图像到图像的生成。

SKILL.md
--- frontmatter
name: gemini-image
description: Generate images using AI when user wants to create pictures, draw, paint, or generate artwork. Supports text-to-image and image-to-image generation.

Gemini Image Generation

Use this skill when user expresses intent to generate images (e.g., "draw a...", "generate an image...", "create a picture...").

Steps

1. Read Configuration

  • Read config/secrets.md to get API Key

2. Construct Prompt

ModePrompt FormatExample
Text-to-Imagedescription texta cute orange cat
Image-to-Imageimage_URL descriptionhttps://xxx.jpg draw similar style
Multi-Image ReferenceURL1 URL2 descriptionhttps://a.jpg https://b.jpg merge these two

For image-to-image, upload local images first. See tips/image-upload.md.

3. Call API

bash
curl -s -X POST "https://api.apicore.ai/v1/images/generations" \
  -H "Authorization: Bearer API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "model_name",
    "prompt": "prompt_text",
    "size": "aspect_ratio",
    "n": 1
  }'

4. Return Result

Extract data[0].url from response and return to user.

Reference Docs

  • tips/image-upload.md - Image upload methods
  • tips/chinese-text.md - Chinese text handling tips