Skip to main content
Glama

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault
CODEX_VISION_MODELNoResponses model to use for vision analysis. Default: gpt-5.5gpt-5.5
CODEX_VISION_BASE_URLNoBase URL for the Codex Responses backend. Default: https://chatgpt.com/backend-api/codexhttps://chatgpt.com/backend-api/codex
CODEX_VISION_AUTH_JSONNoExplicit path to a JSON file containing OAuth credentials. If not set, the server attempts to discover auth from other sources.
CODEX_VISION_TIMEOUT_MSNoTimeout in milliseconds for API requests. Default: 300000300000
CODEX_VISION_IMAGE_DETAILNoImage detail level: auto, low, high, or original. Default: autoauto
CODEX_VISION_MAX_IMAGE_MBNoMaximum image file size in MB. Default: 1010
CODEX_VISION_MAX_VIDEO_MBNoMaximum video file size in MB. Default: 5050
CODEX_VISION_MCP_LOG_PATHNoOptional path to a log file for the MCP server.
CODEX_VISION_VIDEO_FRAMESNoNumber of frames to extract from video. Default: 44

Capabilities

Features and capabilities supported by this server

CapabilityDetails
tools
{
  "listChanged": true
}

Tools

Functions exposed to the LLM to take actions

NameDescription
ui_to_artifactA

Convert UI screenshots into artifacts: frontend code, AI prompts, design specifications, or natural-language descriptions.

Use only for UI screenshots. Do not use for OCR-only screenshots, error messages, diagrams, or charts.

extract_text_from_screenshotA

Extract and recognize text from screenshots, including code, terminal output, logs, documentation, and UI copy.

Use when the user needs faithful OCR or text reconstruction from an image.

diagnose_error_screenshotA

Diagnose screenshots of error messages, stack traces, exception dialogs, or failed command output.

Returns likely cause, actionable fixes, and prevention notes where possible.

understand_technical_diagramA

Analyze technical diagrams including architecture diagrams, flowcharts, UML, ER diagrams, network diagrams, and sequence diagrams.

analyze_data_visualizationA

Analyze charts, graphs, dashboards, and other data visualizations to extract metrics, trends, anomalies, and recommendations.

ui_diff_checkB

Compare an expected/reference UI screenshot with an actual/current UI screenshot and report visual implementation differences.

analyze_imageA

General-purpose image analysis for cases not covered by the specialized tools. Use as a fallback for flexible visual understanding.

analyze_videoA

Analyze a local or remote video by extracting sampled frames with ffmpeg and sending them to Codex vision analysis.

Requires ffmpeg on PATH. For precise video work, extract important frames manually and use analyze_image or ui_diff_check.

Prompts

Interactive templates invoked by user choice

NameDescription

No prompts

Resources

Contextual data attached and managed by the client

NameDescription

No resources

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/darkamenosa/codex-vision-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server