gemini_mcp

gemini

Destructive

Execute AI-driven tasks using Gemini via Agent Client Protocol with multi-modal support, session management, and tool execution capabilities.

Instructions

Invokes Gemini via ACP (Agent Client Protocol) for AI-driven tasks.

**Return structure:**
    - `success`: boolean indicating execution status
    - `SESSION_ID`: ACP session identifier (auto-managed per workspace)
    - `agent_messages`: concatenated assistant response text
    - `thought`: agent reasoning/thinking (when available)
    - `stop_reason`: why the agent stopped (end_turn, max_tokens, etc.)
    - `tool_calls`: list of tool invocations made by the agent (if any)
    - `plan`: agent execution plan entries (if any)
    - `error`: error description when `success=False`
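
As a rough illustration of how a caller might consume these fields, the sketch below reduces a result dictionary to a one-line status. The helper name `summarize_result` and the sample values are hypothetical; only the field names come from the return structure listed above.

```python
from typing import Any, Dict


def summarize_result(result: Dict[str, Any]) -> str:
    """Reduce a gemini tool result to a one-line status string.

    Hypothetical helper: only the field names are taken from the
    documented return structure.
    """
    if not result.get("success"):
        # `error` is documented as present when success is False.
        return f"failed: {result.get('error', 'unknown error')}"

    # Optional fields may be missing, so fall back to empty values.
    text = result.get("agent_messages", "")
    tool_calls = result.get("tool_calls") or []
    stop_reason = result.get("stop_reason", "unknown")
    return (
        f"ok ({stop_reason}): {len(text)} chars of output, "
        f"{len(tool_calls)} tool call(s)"
    )


# Sample values are invented for illustration.
sample_ok = {
    "success": True,
    "SESSION_ID": "example-session",
    "agent_messages": "Done.",
    "stop_reason": "end_turn",
    "tool_calls": [{"title": "read_file"}],
}
sample_err = {"success": False, "error": "CLI tool 'gemini' not found in PATH."}
```

Checking `success` first matters because the other fields are only meaningful on a successful run.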

**Best practices:**
    - Sessions auto-reuse per workspace with turn-count eviction
    - ALWAYS pass `model`. Use `gemini-3.1-pro-preview` for complex tasks, `gemini-3-flash-preview` for simple tasks
    - Use `approval_mode` to control tool approval: yolo (default), auto_edit, default, plan
    - On 429 capacity errors, automatically retries with `gemini-3-flash-preview`
    - Pass `image_path` for vision analysis (requires agent image support)
    - Pass `context` to inject text as embedded resource (ACP resource ContentBlock)
    - Pass `allowed_mcp_servers` to filter which MCP servers Gemini loads
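
The practices above can be sketched as a small argument builder. This is a minimal sketch, not part of the server: `build_gemini_args` and its `complex_task` flag are hypothetical, and the helper defaults to the `default` approval mode as a cautious choice of this example, whereas the tool itself defaults to `yolo`.

```python
from typing import Any, Dict, List, Optional

# Mode and model names are copied from the best practices above.
APPROVAL_MODES = ("yolo", "auto_edit", "default", "plan")
PRO_MODEL = "gemini-3.1-pro-preview"
FLASH_MODEL = "gemini-3-flash-preview"


def build_gemini_args(
    prompt: str,
    workspace: str,
    *,
    complex_task: bool = True,
    approval_mode: str = "default",
    context: str = "",
    allowed_mcp_servers: Optional[List[str]] = None,
) -> Dict[str, Any]:
    """Assemble an arguments dict for the gemini tool (hypothetical helper)."""
    if approval_mode not in APPROVAL_MODES:
        raise ValueError(f"unknown approval_mode: {approval_mode!r}")

    args: Dict[str, Any] = {
        "PROMPT": prompt,
        "cd": workspace,
        # Always pass `model` explicitly, as the best practices recommend.
        "model": PRO_MODEL if complex_task else FLASH_MODEL,
        "approval_mode": approval_mode,
    }
    # Optional fields are only included when the caller supplies them.
    if context:
        args["context"] = context
    if allowed_mcp_servers is not None:
        args["allowed_mcp_servers"] = allowed_mcp_servers
    return args
```

For example, `build_gemini_args("List TODOs", "/tmp/project", complex_task=False)` selects the flash model and omits the optional fields.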

Input Schema

| Name | Required | Description | Default |
| --- | --- | --- | --- |
| `PROMPT` | Yes | Instruction for the task to send to Gemini. | |
| `cd` | Yes | Set the workspace root for Gemini before executing the task. | |
| `model` | No | REQUIRED. Pass 'gemini-3.1-pro-preview' for complex tasks, 'gemini-3-flash-preview' for simple tasks. | gemini-3.1-pro-preview |
| `approval_mode` | No | Tool approval mode. 'yolo': auto-approve all (default). 'auto_edit': auto-approve edits only. 'default': prompt for every action (safest). 'plan': read-only mode. | yolo |
| `image_path` | No | Path to an image file for vision analysis. Sent as image ContentBlock. Empty string means no image. | |
| `context` | No | Text context to inject as ACP resource ContentBlock. Use for passing file contents, docs, or background info that Gemini should reference. | |
| `allowed_mcp_servers` | No | Filter which MCP servers Gemini loads. Pass a list of server names to include. None means load all discovered servers. | |
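
To show how these parameters travel over the wire, here is a minimal sketch of an MCP `tools/call` request, which is a JSON-RPC 2.0 message naming the tool and carrying its arguments. The function name `make_tools_call`, the request ID, and the workspace path are placeholders chosen for illustration.

```python
import json
from typing import Any, Dict


def make_tools_call(request_id: int, arguments: Dict[str, Any]) -> str:
    """Serialize an MCP `tools/call` request for the gemini tool."""
    request = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "gemini", "arguments": arguments},
    }
    return json.dumps(request)


payload = make_tools_call(
    1,
    {
        "PROMPT": "Summarize the README.",
        "cd": "/path/to/workspace",  # placeholder path
        "model": "gemini-3-flash-preview",
        "approval_mode": "plan",  # read-only mode per the table above
    },
)
```

In practice an MCP client library builds this envelope for you; the sketch only makes the mapping from schema fields to request arguments explicit.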

Output Schema

| Name | Required | Description | Default |
| --- | --- | --- | --- |
| `result` | Yes | | |

Implementation Reference

src/geminimcp/server.py:795-1036 (handler)

Handler for the 'gemini' MCP tool, which acts as a bridge to the Gemini CLI. It manages sessions, invokes the CLI, handles streaming responses, and implements retries/fallbacks.

@mcp.tool(
    name="gemini",
    annotations=ToolAnnotations(
        title="Gemini CLI Agent",
        readOnlyHint=False,
        destructiveHint=True,
        idempotentHint=False,
        openWorldHint=True,
    ),
    description="""
    Invokes Gemini via ACP (Agent Client Protocol) for AI-driven tasks.

    **Return structure:**
        - `success`: boolean indicating execution status
        - `SESSION_ID`: ACP session identifier (auto-managed per workspace)
        - `agent_messages`: concatenated assistant response text
        - `thought`: agent reasoning/thinking (when available)
        - `stop_reason`: why the agent stopped (end_turn, max_tokens, etc.)
        - `tool_calls`: list of tool invocations made by the agent (if any)
        - `plan`: agent execution plan entries (if any)
        - `error`: error description when `success=False`

    **Best practices:**
        - Sessions auto-reuse per workspace with turn-count eviction
        - ALWAYS pass `model`. Use `gemini-3.1-pro-preview` for complex tasks, `gemini-3-flash-preview` for simple tasks
        - Use `approval_mode` to control tool approval: yolo (default), auto_edit, default, plan
        - On 429 capacity errors, automatically retries with `gemini-3-flash-preview`
        - Pass `image_path` for vision analysis (requires agent image support)
        - Pass `context` to inject text as embedded resource (ACP resource ContentBlock)
        - Pass `allowed_mcp_servers` to filter which MCP servers Gemini loads
    """,
)
async def gemini(
    PROMPT: Annotated[
        str,
        Field(description="Instruction for the task to send to Gemini."),
    ],
    cd: Annotated[
        Path,
        Field(
            description="Set the workspace root for Gemini before executing the task."
        ),
    ],
    model: Annotated[
        str,
        Field(
            description="REQUIRED. Pass 'gemini-3.1-pro-preview' for complex tasks, "
            "'gemini-3-flash-preview' for simple tasks."
        ),
    ] = "gemini-3.1-pro-preview",
    approval_mode: Annotated[
        str,
        Field(
            description="Tool approval mode. "
            "'yolo': auto-approve all (default). "
            "'auto_edit': auto-approve edits only. "
            "'default': prompt for every action (safest). "
            "'plan': read-only mode."
        ),
    ] = "yolo",
    image_path: Annotated[
        str,
        Field(
            description="Path to an image file for vision analysis. "
            "Sent as image ContentBlock. Empty string means no image."
        ),
    ] = "",
    context: Annotated[
        str,
        Field(
            description="Text context to inject as ACP resource ContentBlock. "
            "Use for passing file contents, docs, or background info that Gemini should reference."
        ),
    ] = "",
    allowed_mcp_servers: Annotated[
        Optional[List[str]],
        Field(
            description="Filter which MCP servers Gemini loads. "
            "Pass a list of server names to include. None means load all discovered servers."
        ),
    ] = None,
) -> Dict[str, Any]:
    """Execute a Gemini session via ACP and return results."""
    if not shutil.which("gemini"):
        return {"success": False, "error": "CLI tool 'gemini' not found in PATH."}

    if not cd.exists():
        return {
            "success": False,
            "error": f"Workspace directory `{cd.absolute().as_posix()}` does not exist.",
        }

    if approval_mode not in _APPROVAL_MODES:
        return {
            "success": False,
            "error": f"Invalid approval_mode '{approval_mode}'. "
            f"Valid values: {', '.join(_APPROVAL_MODES.keys())}",
        }

    cwd = cd.absolute().as_posix()
    result = _bridge.prompt(
        cwd,
        PROMPT,
        model=model,
        approval_mode=approval_mode,
        image_path=image_path,
        context=context,
        allowed_mcp_servers=allowed_mcp_servers,
    )

    # Session error → retry with fresh session
    if not result["success"] and result.get("SESSION_ID"):
        _bridge._sessions.pop(cwd, None)
        result = _bridge.prompt(
            cwd,
            PROMPT,
            model=model,
            approval_mode=approval_mode,
            image_path=image_path,
            context=context,
            allowed_mcp_servers=allowed_mcp_servers,
        )

    # 429 fallback: capacity error → retry with flash model
    # Skip fallback for auto-* models (they handle routing internally)
    _FALLBACK_MODEL = "gemini-3-flash-preview"
    if (
        not result["success"]
        and model != _FALLBACK_MODEL
        and not model.startswith("auto-")
        and any(
            kw in result.get("error", "").lower()
            for kw in ("capacity", "429", "resource_exhausted", "overloaded")
        )
    ):
        _bridge._sessions.pop(cwd, None)
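        # Forward the same image, context, and MCP server filter as the original call
        result = _bridge.prompt(
            cwd,
            PROMPT,
            model=_FALLBACK_MODEL,
            approval_mode=approval_mode,
            image_path=image_path,
            context=context,
            allowed_mcp_servers=allowed_mcp_servers,
        )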
        result["fallback_model"] = _FALLBACK_MODEL

    return result


@mcp.tool(
    name="list_models",
    annotations=ToolAnnotations(
        title="List Available Models",
        readOnlyHint=True,
        destructiveHint=False,
        idempotentHint=True,
        openWorldHint=False,
    ),
    description="List available Gemini models and current bridge state. "
    "Returns known models, current active model, and agent info.",
)
async def list_models() -> Dict[str, Any]:
    """List available models and bridge status."""
    return {
        "models": _KNOWN_MODELS,
        "approval_modes": list(_APPROVAL_MODES.keys()),
        "current_model": _bridge._current_model or "(not started)",
        "agent_info": _bridge._agent_info or None,
        "bridge_version": VERSION,
        "process_running": _bridge._proc is not None and _bridge._proc.poll() is None,
    }


@mcp.tool(
    name="list_sessions",
    annotations=ToolAnnotations(
        title="List Active Sessions",
        readOnlyHint=True,
        destructiveHint=False,
        idempotentHint=True,
        openWorldHint=False,
    ),
    description="List all active ACP sessions managed by the bridge. "
    "Shows workspace path, session ID, turn count, and model for each session.",
)
async def list_sessions() -> Dict[str, Any]:
    """List active sessions."""
    sessions = []
    for workspace, info in _bridge._sessions.items():
        sessions.append(
            {
                "workspace": workspace,
                "session_id": info["session_id"],
                "turn_count": info["turn_count"],
                "max_turns": _MAX_TURNS_PER_SESSION,
                "model": info.get("actual_model", ""),
            }
        )
    return {
        "sessions": sessions,
        "count": len(sessions),
    }


@mcp.tool(
    name="reset_session",
    annotations=ToolAnnotations(
        title="Reset Session",
        readOnlyHint=False,
        destructiveHint=True,
        idempotentHint=True,
        openWorldHint=False,
    ),
    description="Reset (clear) the ACP session for a workspace. "
    "The next gemini call for this workspace will create a fresh session. "
    "Pass workspace path, or omit to reset all sessions.",
)
async def reset_session(
    workspace: Annotated[
        str,
        Field(description="Workspace path to reset. Empty string resets all sessions."),
    ] = "",
) -> Dict[str, Any]:
    """Reset session for a workspace or all sessions."""
    if workspace:
        removed = _bridge._sessions.pop(workspace, None)
        if not removed:
            # Try matching by suffix (user might pass partial path)
            matched = [k for k in _bridge._sessions if k.endswith(workspace)]
            if matched:
                for k in matched:
                    _bridge._sessions.pop(k)
                return {"reset": matched, "count": len(matched)}
            return {
                "reset": [],
                "count": 0,
                "message": "No session found for workspace",
            }
        return {"reset": [workspace], "count": 1}
    else:
        count = len(_bridge._sessions)
        _bridge._sessions.clear()
        return {"reset": "all", "count": count}


def run() -> None:
    """Start the MCP server over stdio transport."""

src/geminimcp/server.py:568-568 (handler)
The internal `AcpBridge.prompt` method which handles the communication with the underlying Gemini --acp subprocess.
```
def prompt(
```

Tool Definition Quality

Grade: A (4.6/5.0)

Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations declare destructive/open-world capabilities; the description adds critical runtime context missing from annotations: automatic session reuse with turn-count eviction, automatic fallback to flash-preview on rate limits, and detailed explanation of return fields (thought, tool_calls, plan) that describe the agent's internal reasoning. It could clarify the scope of possible destruction (file edits vs API calls) more explicitly.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
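
The automatic fallback noted in the Behavior assessment can be isolated as a predicate. The sketch below mirrors the keyword check in the handler listing; the function name `should_fall_back` is hypothetical, and it is a simplified restatement rather than the server's actual code path.

```python
FALLBACK_MODEL = "gemini-3-flash-preview"
# Keywords copied from the handler's capacity-error check.
CAPACITY_KEYWORDS = ("capacity", "429", "resource_exhausted", "overloaded")


def should_fall_back(result: dict, model: str) -> bool:
    """Return True when a failed call qualifies for the flash fallback."""
    if result.get("success"):
        return False
    if model == FALLBACK_MODEL or model.startswith("auto-"):
        # Already on the fallback model, or an auto-* model that
        # handles routing internally.
        return False
    error = result.get("error", "").lower()
    return any(keyword in error for keyword in CAPACITY_KEYWORDS)
```

Because the check is a plain substring match, any error text that happens to contain "429" or "capacity" would also trigger the fallback, which is worth keeping in mind when reading `fallback_model` in a result.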

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured with clear visual hierarchy: single-line purpose statement, bulleted return structure, and bulleted best practices. Every section serves a distinct purpose. Minor deduction for the return structure list being somewhat lengthy, though justified by the complex nested return type of an agent invocation.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a complex 7-parameter agent-invocation tool with destructive/open-world annotations, the description is comprehensive. It covers input parameters, return value semantics (8 distinct fields), session lifecycle, error handling strategies, and integration patterns (MCP server filtering, image ContentBlocks). This is sufficient given the tool's complexity and existing schema richness.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
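
The session lifecycle mentioned in the Completeness assessment (per-workspace reuse with turn-count eviction) is not shown in the handler listing, so the following is only a toy model of the idea. The class `SessionTable`, the limit of 3 turns, and the session ID format are all assumptions; the real `_MAX_TURNS_PER_SESSION` value and bridge logic may differ.

```python
from typing import Any, Dict

MAX_TURNS = 3  # hypothetical limit; the real value is not shown in the source


class SessionTable:
    """Toy per-workspace session table with turn-count eviction."""

    def __init__(self) -> None:
        self._sessions: Dict[str, Dict[str, Any]] = {}
        self._next_id = 0

    def acquire(self, workspace: str) -> str:
        """Return the session ID to use for the next turn in `workspace`."""
        info = self._sessions.get(workspace)
        if info is None or info["turn_count"] >= MAX_TURNS:
            # No session yet, or the existing one used up its turns:
            # replace it with a fresh session.
            self._next_id += 1
            info = {"session_id": f"session-{self._next_id}", "turn_count": 0}
            self._sessions[workspace] = info
        info["turn_count"] += 1
        return info["session_id"]


table = SessionTable()
ids = [table.acquire("/ws") for _ in range(4)]
# The fourth call exceeds MAX_TURNS, so it lands in a new session.
```

Keying the table by workspace path is what lets repeated calls with the same `cd` share conversation state until the turn limit is reached.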

Parameters: 4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 100% schema coverage, the baseline is 3. The description adds significant value through the Best Practices section, which provides semantic guidance on model selection (complex vs simple tasks) and approval_mode implications (yolo meaning 'auto-approve all') that raw schema descriptions don't convey. It effectively guides the agent toward correct parameter combinations.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description opens with a precise action statement ('Invokes Gemini via ACP') specifying the protocol, resource, and task type ('AI-driven tasks'). It clearly distinguishes from siblings like list_models or reset_session by being the primary execution tool versus management utilities.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The 'Best practices' section provides explicit guidance: model selection criteria (pro for complex, flash for simple), approval_mode behavior (yolo vs auto_edit vs default), automatic retry logic on 429 errors, and specific parameter usage patterns (image_path for vision, context for resources). It names concrete alternatives (flash model for retries) and safety thresholds.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/shenyunhuan/gemini_mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server.