Glama
BasisSetVentures

Grok CLI MCP Server

Grok Query

grok_query

Send prompts to Grok AI models through CLI to get text responses, with options for raw output and model selection.

Instructions

Send a single prompt to Grok via CLI headless mode. Returns the assistant's text. Use raw_output=true to get raw CLI output and parsed messages.

Input Schema

Name        Required  Description                                                Default
prompt      Yes       The user prompt to send to Grok.                           (none)
model       No        Grok model name; passed to the CLI with -m when provided.  None
raw_output  No        If true, returns {text, messages, raw, model}.             false
timeout_s   No        Subprocess timeout in seconds.                             120.0
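Since the schema table gives little detail beyond names, an example argument payload may help. The defaults below mirror the handler signature shown in the Implementation Reference; the model name is purely illustrative.

```python
# Example grok_query argument payload. Only "prompt" is required; the
# defaults shown mirror the handler signature (model=None, raw_output=False,
# timeout_s=120.0). The model name here is illustrative.
args = {
    "prompt": "Summarize the main loop in main.py",
    "model": "grok-4",      # optional; forwarded to the CLI via -m
    "raw_output": False,    # True returns {text, messages, raw, model}
    "timeout_s": 120.0,     # subprocess timeout in seconds
}

assert "prompt" in args  # the only required field
print(sorted(args))
```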

Output Schema

Name    Required  Description
result  Yes       Assistant's text response, or a dict with full details when raw_output=true.

Implementation Reference

  • Registration of the 'grok_query' tool with FastMCP, defining its name, title, and description; the decorated handler's typed parameters serve as the input schema.
    @server.tool(
        name="grok_query",
        title="Grok Query",
        description=(
            "Send a single prompt to Grok via CLI headless mode. Returns the assistant's text. "
            "Use raw_output=true to get raw CLI output and parsed messages."
        ),
    )
  • The core handler function for 'grok_query' that invokes the Grok CLI using _run_grok helper, processes the output with _collect_assistant_text, and returns the assistant's response text or structured data.
    async def grok_query(
        prompt: str,
        model: Optional[str] = None,
        raw_output: bool = False,
        timeout_s: float = 120.0,
        ctx: Optional[Context] = None,
    ) -> str | dict:
        """
        Send a single prompt to Grok.
    
        Args:
            prompt: The user prompt.
            model: Optional Grok model name (passed with -m if provided).
            raw_output: If true, returns {text, messages, raw, model}.
            timeout_s: Process timeout in seconds.
            ctx: FastMCP context.
    
        Returns:
            Assistant's text response, or dict with full details if raw_output=True.
        """
        result = await _run_grok(prompt, model=model, timeout_s=timeout_s, ctx=ctx)
    
        # Collate assistant text
        assistant_text = _collect_assistant_text(result.messages) if result.messages else (result.raw or "")
    
        if raw_output:
            return {
                "text": assistant_text,
                "messages": [m.model_dump() for m in result.messages],
                "raw": result.raw,
                "model": result.model,
            }
    
        return assistant_text
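The handler's two return shapes can be demonstrated with a small stand-in for the parsed result. `shape_response` below is a hypothetical helper written only for this illustration; it is not part of the server.

```python
from types import SimpleNamespace

def shape_response(assistant_text, result, raw_output):
    """Mirror grok_query's two return shapes: plain text by default,
    a structured dict when raw_output is true."""
    if raw_output:
        return {
            "text": assistant_text,
            "messages": list(result.messages),
            "raw": result.raw,
            "model": result.model,
        }
    return assistant_text

# Stand-in for a parsed CLI result; field values are illustrative.
fake = SimpleNamespace(
    messages=[{"role": "assistant", "content": "hello"}],
    raw='{"role": "assistant", "content": "hello"}',
    model="grok-4",
)

print(shape_response("hello", fake, raw_output=False))          # -> hello
print(shape_response("hello", fake, raw_output=True)["model"])  # -> grok-4
```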
  • Helper function _run_grok that executes the Grok CLI subprocess with the given prompt and model, handles timeouts and errors, parses JSON output into structured GrokParsedOutput used by the handler.
    async def _run_grok(
        prompt: str,
        *,
        model: Optional[str],
        timeout_s: float,
        ctx: Optional[Context] = None,
    ) -> GrokParsedOutput:
        """
        Run Grok CLI in headless mode: `grok -p "<prompt>" [-m <model>]`
    
        Parse JSON output and return a structured response.
    
        Args:
            prompt: The prompt to send to Grok.
            model: Optional Grok model name (passed with -m if provided).
            timeout_s: Process timeout in seconds.
            ctx: Optional FastMCP context for logging.
    
        Returns:
            GrokParsedOutput with messages, model, and raw output.
    
        Raises:
            FileNotFoundError: If Grok CLI binary not found.
            TimeoutError: If CLI execution exceeds timeout.
            RuntimeError: If CLI exits with non-zero code.
        """
        grok_bin = _resolve_grok_path()
        if not shutil.which(grok_bin) and not os.path.exists(grok_bin):
            raise FileNotFoundError(
                f"Grok CLI not found. Checked {grok_bin} and PATH. "
                f"Set {ENV_GROK_CLI_PATH} or install grok CLI."
            )
    
        _require_api_key()
    
        args = [grok_bin, "-p", prompt]
        if model:
            # Only pass -m when the caller supplied a model; an invalid model
            # surfaces via the non-zero exit check below
            args += ["-m", model]
    
        env = os.environ.copy()
        # The copy already inherits GROK_API_KEY from os.environ, and
        # _require_api_key() above guarantees it is set.
    
        if ctx:
            await ctx.info("Invoking Grok CLI" + (f" with model {model}" if model else "") + "...")
    
        proc = await asyncio.create_subprocess_exec(
            *args, stdout=asyncio.subprocess.PIPE, stderr=asyncio.subprocess.PIPE, env=env
        )
    
        try:
            stdout_b, stderr_b = await asyncio.wait_for(proc.communicate(), timeout=timeout_s)
        except asyncio.TimeoutError:
            try:
                proc.kill()
            except Exception:
                pass
            raise TimeoutError(f"Grok CLI timed out after {timeout_s:.0f}s")
    
        stdout = (stdout_b or b"").decode("utf-8", errors="replace")
        stderr = (stderr_b or b"").decode("utf-8", errors="replace")
    
        if proc.returncode != 0:
            # Grok CLI error; include stderr to help debugging
            raise RuntimeError(f"Grok CLI failed (exit {proc.returncode}): {stderr.strip() or stdout.strip()}")
    
        # Parse JSON payload
        parsed: Any
        try:
            parsed = _extract_json_from_text(stdout)
        except Exception as e:
            # If JSON parse fails, provide raw output in a structured wrapper
            if ctx:
                await ctx.warning(f"Failed to parse Grok JSON output: {e}. Returning raw output.")
            return GrokParsedOutput(messages=[], model=model, raw=stdout)
    
        # Normalize to list of GrokMessage
        messages: list[GrokMessage] = []
        if isinstance(parsed, dict) and "role" in parsed and "content" in parsed:
            messages = [GrokMessage(**parsed)]
        elif isinstance(parsed, list):
            # Either a list of messages or a list with one message
            for item in parsed:
                if isinstance(item, dict) and "role" in item and "content" in item:
                    messages.append(GrokMessage(**item))
        elif isinstance(parsed, dict) and "messages" in parsed:
            for item in parsed.get("messages", []) or []:
                if isinstance(item, dict) and "role" in item and "content" in item:
                    messages.append(GrokMessage(**item))
        else:
            # Unknown shape: keep raw and empty messages
            if ctx:
                await ctx.warning("Unrecognized JSON shape from Grok CLI. Returning raw output.")
            return GrokParsedOutput(messages=[], model=model, raw=stdout)
    
        return GrokParsedOutput(messages=messages, model=model, raw=stdout)
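`_extract_json_from_text` is referenced above but not shown on this page. A minimal sketch, assuming the CLI may print non-JSON noise around the payload, could look like the following; the function name and behavior here are assumptions, and the real helper may differ.

```python
import json

def extract_json_from_text(text: str):
    """Hypothetical sketch of the _extract_json_from_text helper: find the
    first '{' or '[' and decode a single JSON value from there, tolerating
    trailing output after the payload."""
    starts = [i for i in (text.find("{"), text.find("[")) if i != -1]
    if not starts:
        raise ValueError("no JSON payload found in CLI output")
    # raw_decode parses one JSON value and ignores anything after it
    obj, _end = json.JSONDecoder().raw_decode(text[min(starts):])
    return obj

print(extract_json_from_text('banner\n{"role": "assistant", "content": "hi"}\ndone'))
# -> {'role': 'assistant', 'content': 'hi'}
```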
  • Helper function _collect_assistant_text that extracts and concatenates text content from assistant messages in the parsed Grok output.
    def _collect_assistant_text(messages: Sequence[GrokMessage]) -> str:
        """
        Collate assistant message text from a sequence of messages.
    
        Handles:
          - content as a plain string
          - content as a list of blocks with 'type'=='text'
          - content as a dict with 'text' field
    
        Args:
            messages: Sequence of GrokMessage objects.
    
        Returns:
            Concatenated text from all assistant messages.
        """
        chunks: list[str] = []
        for m in messages:
            if m.role != "assistant":
                continue
            c = m.content
            if isinstance(c, str):
                chunks.append(c)
            elif isinstance(c, list):
                for block in c:
                    try:
                        if isinstance(block, dict) and block.get("type") == "text" and "text" in block:
                            chunks.append(str(block["text"]))
                        elif isinstance(block, dict) and "content" in block:
                            chunks.append(str(block["content"]))
                    except Exception:
                        continue
            elif isinstance(c, dict) and "text" in c:
                chunks.append(str(c["text"]))
            else:
                # Fallback: stringify structured content
                try:
                    chunks.append(json.dumps(c, ensure_ascii=False))
                except Exception:
                    chunks.append(str(c))
        return "\n".join([s for s in (s.strip() for s in chunks) if s])
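The content-shape handling above can be exercised with a self-contained demo. `Msg` is a minimal stand-in for GrokMessage, used only so this example runs without the server's dependencies.

```python
import json
from dataclasses import dataclass
from typing import Any

@dataclass
class Msg:  # minimal stand-in for GrokMessage, for this demo only
    role: str
    content: Any

def collect_assistant_text(messages):
    """Reproduce _collect_assistant_text's logic for a runnable demo."""
    chunks = []
    for m in messages:
        if m.role != "assistant":
            continue
        c = m.content
        if isinstance(c, str):
            chunks.append(c)
        elif isinstance(c, list):
            for block in c:
                if isinstance(block, dict) and block.get("type") == "text" and "text" in block:
                    chunks.append(str(block["text"]))
                elif isinstance(block, dict) and "content" in block:
                    chunks.append(str(block["content"]))
        elif isinstance(c, dict) and "text" in c:
            chunks.append(str(c["text"]))
        else:
            chunks.append(json.dumps(c, ensure_ascii=False))
    return "\n".join(s for s in (s.strip() for s in chunks) if s)

msgs = [
    Msg("user", "hi"),                                   # skipped: not assistant
    Msg("assistant", "plain string"),                    # str content
    Msg("assistant", [{"type": "text", "text": "block text"}]),  # block list
    Msg("assistant", {"text": "dict text"}),             # dict with 'text'
]
print(collect_assistant_text(msgs))  # prints the three assistant chunks, one per line
```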
  • Pydantic models GrokMessage and GrokParsedOutput used for type validation of parsed CLI output in _run_grok and processing in the handler.
    class GrokMessage(BaseModel):
        """A message in a Grok conversation."""
    
        role: str
        content: Any  # Grok may return str or structured content
    
    
    class GrokParsedOutput(BaseModel):
        """Parsed output from Grok CLI."""
    
        messages: list[GrokMessage] = Field(default_factory=list)
        model: Optional[str] = None
        raw: Optional[str] = None
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It discloses that the tool returns 'the assistant's text' and mentions a 'raw_output' option for CLI output, adding some behavioral context. However, it doesn't cover critical aspects like error handling, rate limits, authentication needs, or what 'CLI headless mode' entails operationally.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is brief and front-loaded with the core purpose, followed by a specific usage tip. Both sentences earn their place by adding value, though it could be slightly more structured for clarity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the complexity of a 4-parameter tool with no annotations and 0% schema coverage, the description is incomplete. It covers basic purpose and one parameter nuance, but lacks details on other parameters, error cases, or operational constraints. The presence of an output schema helps, but doesn't fully compensate for the gaps.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, so the description must compensate. It adds meaning for 'raw_output' by explaining its effect ('to get raw CLI output and parsed messages'), but doesn't address other parameters like 'model', 'timeout_s', or 'prompt' beyond what the schema titles imply. This partial compensation meets the baseline for low coverage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action ('Send a single prompt to Grok via CLI headless mode') and the resource (Grok), making the purpose understandable. However, it doesn't explicitly differentiate from sibling tools like 'grok_chat' or 'grok_code', which likely have overlapping functionality with Grok interactions.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides some implied usage context by mentioning 'raw_output=true' for specific output formats, but it lacks explicit guidance on when to use this tool versus alternatives like 'grok_chat' or 'grok_code'. No exclusions or prerequisites are stated.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
