The AI Vision Debug MCP Server enables autonomous debugging and testing of web interfaces using AI models, powered by Playwright.
Visual Analysis: Comprehensive analysis of web pages with interactive element mapping, performance metrics, and screenshots.
UI Issue Detection: Identifies and debugs UI issues through inspection of visual elements.
User Workflow Testing: Automates and validates complex user workflows without manual test script creation.
API Endpoint Validation: Tests API endpoints to verify backend functionality.
Visual Change Tracking: Compares UI states between versions to detect visual regressions.
Console Monitoring: Captures console logs for error detection and debugging.
Performance Optimization: Analyzes metrics to identify bottlenecks.
Browser Automation: Executes low-level actions like navigation, clicks, form filling, and JavaScript.
Report Generation: Produces detailed reports with actionable recommendations.
Integration: Works with AI models (including non-vision models), Smithery, GLAMA, and CI/CD pipelines.
This server cannot be installed
hybrid server
The server is able to function both locally and remotely, depending on the configuration or use case.
Tools
A Model Context Protocol server that provides AI vision capabilities for analyzing UI screenshots, offering tools for screen analysis, file operations, and UI/UX report generation.
- Autonomous UI Debugging Agent
- Installation Options
- Complete Tool Reference
- Autonomous Debugging Workflows
- Visual Analysis Examples
- Integration Options
- CI/CD Integration
- License
Related Resources
Related MCP Servers
- AsecurityAlicenseAqualityA Model Context Protocol server that provides browser automation capabilities using Playwright. This server enables LLMs to interact with web pages, take screenshots, and execute JavaScript in a real browser environment.Last updated -327,9924,828MIT License
- AsecurityAlicenseAqualityAn official MCP server implementation that allows AI assistants to capture website screenshots through the ScreenshotOne API, enabling visual context from web pages during conversations.Last updated -1031MIT License
- -securityFlicense-qualityA server that provides rich UI context and interaction capabilities to AI models, enabling deep understanding of user interfaces through visual analysis and precise interaction via Model Context Protocol.Last updated -60
- -securityAlicense-qualityA Model Context Protocol server enabling AI assistants to generate images through OpenAI's DALL-E API with full support for all available options and fine-grained control.Last updated -21MIT License