Skip to main content
Glama

agent-vision-mcp

agent-vision-mcp exposes image analysis, inspection, cropping, OCR, and comparison tools through the Model Context Protocol.

Quickstart

Run the published package without installing it permanently:

uvx agent-vision-mcp

Example MCP client configuration:

{
  "mcpServers": {
    "agent-vision": {
      "command": "uvx",
      "args": ["agent-vision-mcp"],
      "env": {
        "VISION_API_KEY": "your-api-key",
        "VISION_BASE_URL": "https://your-provider.example/v1",
        "VISION_MODEL_ID": "your-vision-model"
      }
    }
  }
}

Related MCP server: mcp-see

Development

python -m venv .venv
.venv/bin/pip install -e ".[dev]"
cp .env.example .env
.venv/bin/agent-vision-mcp

Configure an OpenAI-compatible multimodal endpoint with VISION_API_KEY, VISION_BASE_URL, and VISION_MODEL_ID.

URL Handling

VISION_URL_MODE controls how remote images are sent to the model:

  • auto passes URLs through for analysis and comparison, but downloads them when inspection, cropping, or OCR requires image bytes.

  • passthrough prefers URL passthrough, except for tools that require bytes.

  • download always downloads and verifies remote images before model calls.

Downloads are streamed with byte limits, redirects are security checked, and all downloaded or encoded inputs are verified as supported images. URL passthrough relies on the configured model provider to fetch URLs safely; use download when the provider is not trusted to enforce outbound-network restrictions.

Dedicated OCR is disabled by default. Set OCR_ENABLED=true and configure the OCR_* variables to use a separate OCR model; otherwise OCR uses the VLM.

Run Tests

.venv/bin/python -m unittest discover -s tests -v

License

MIT

A
license - permissive license
-
quality - not tested
C
maintenance

Maintenance

Maintainers
Response time
Release cycle
Releases (12mo)
Commit activity

Resources

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/idealizing/agent-vision-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server