Skip to main content
Glama

mcp_gemini_generate_image

Generate images using Google Gemini or Imagen models with automated API selection. Returns image file paths for user access, supporting prompts, aspect ratios, and customizable parameters.

Instructions

Google Gemini 또는 Imagen 모델을 사용하여 이미지를 생성합니다. 모델 이름에 따라 적절한 API가 자동으로 선택됩니다. 생성된 이미지 파일 경로를 반환하며, 이 경로는 반드시 사용자에게 알려주어야 합니다.

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
aspectRatioNo이미지 가로세로 비율 (Imagen 모델 전용)1:1
fileNameNo저장할 이미지 파일 이름 (확장자 제외)
imageDataNo이미지 편집 시 사용할 Base64로 인코딩된 이미지 데이터 (Gemini 모델 전용)
imageMimeTypeNo이미지 MIME 타입 (Gemini 모델 전용)image/png
modelNo사용할 모델 ID (예: imagen-3.0-generate-002, gemini-2.0-flash-exp-image-generation)imagen-3.0-generate-002
numberOfImagesNo생성할 이미지 수 (1-4, Imagen 모델 전용)
personGenerationNo사람 이미지 생성 허용 여부 (Imagen 모델 전용)ALLOW_ADULT
promptYes이미지 생성을 위한 텍스트 프롬프트
responseModalitiesNo응답에 포함할 모달리티 (Gemini 모델 전용)
saveDirNo이미지를 저장할 디렉토리./temp
sizeNo생성할 이미지 크기1024x1024
Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It reveals that the tool returns a file path ('생성된 이미지 파일 경로를 반환하며') and that this path must be communicated to the user ('이 경로는 반드시 사용자에게 알려주어야 합니다'). However, it doesn't disclose important behavioral aspects like whether this is a read-only or mutating operation, potential rate limits, authentication requirements, error conditions, or what happens when files are saved. The description adds some value but leaves significant gaps.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is reasonably concise with two sentences. The first sentence states the core functionality, and the second provides important behavioral information about the return value. There's no obvious fluff or redundancy. However, it could be slightly more front-loaded by immediately clarifying the tool's scope relative to siblings.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the complexity (11 parameters, no annotations, no output schema), the description is moderately complete. It covers the basic purpose and return behavior but lacks important context. Without annotations or output schema, the description should ideally explain more about what kind of operation this is (read vs write), error handling, and the format/meaning of the returned file path. The current description provides a foundation but leaves the agent with significant uncertainty about the tool's full behavior.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The schema description coverage is 100%, meaning all parameters are documented in the schema itself. The description doesn't add any parameter-specific information beyond what's already in the schema descriptions. It mentions model selection automation but doesn't explain parameter implications or interactions. With complete schema coverage, the baseline score of 3 is appropriate as the description doesn't enhance parameter understanding beyond the structured data.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'Google Gemini 또는 Imagen 모델을 사용하여 이미지를 생성합니다' (uses Google Gemini or Imagen models to generate images). It specifies the verb ('생성합니다' - generates) and resource ('이미지' - images). However, it doesn't explicitly differentiate from sibling tools like 'mcp_gemini_create_image' or 'mcp_imagen_generate', which appear to have similar functionality.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides some implied usage context: '모델 이름에 따라 적절한 API가 자동으로 선택됩니다' (the appropriate API is automatically selected based on the model name). This suggests the tool handles model selection automatically. However, it doesn't explicitly state when to use this tool versus alternatives like 'mcp_gemini_create_image' or 'mcp_imagen_generate', nor does it provide any exclusion criteria or prerequisites.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Related Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/bigdata-coss/agent_mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server