Integrations
Supports exporting YOLO models to ONNX format for compatibility with different runtime environments
Leverages YOLO (You Only Look Once) models for advanced computer vision tasks including object detection, segmentation, classification, and pose estimation with support for model training, validation, and export
YOLO MCP Service
A powerful YOLO (You Only Look Once) computer vision service that integrates with Claude AI through Model Context Protocol (MCP). This service enables Claude to perform object detection, segmentation, classification, and real-time camera analysis using state-of-the-art YOLO models.
Features
- Object detection, segmentation, classification, and pose estimation
- Real-time camera integration for live object detection
- Support for model training, validation, and export
- Comprehensive image analysis combining multiple models
- Support for both file paths and base64-encoded images
- Seamless integration with Claude AI
Setup Instructions
Prerequisites
- Python 3.10 or higher
- Git (optional, for cloning the repository)
Environment Setup
- Create a directory for the project and navigate to it:Copy
- Download the project files or clone from repository:Copy
- Create a virtual environment:Copy
- Activate the virtual environment:Copy
- Run the setup script:The setup script will:Copy
- Check your Python version
- Create a virtual environment (if not already created)
- Install required dependencies
- Generate an MCP configuration file (mcp-config.json)
- Output configuration information for different MCP clients including Claude
- Note the output from the setup script, which will look similar to:Copy
Downloading YOLO Models
Before using the service, you need to download the YOLO models. The service looks for models in the following directories:
- The current directory where the service is running
- A
models
subdirectory - Any other directory configured in the
CONFIG["model_dirs"]
variable in server.py
Create a models directory and download some common models:
For Windows PowerShell users:
Configuring Claude
To use this service with Claude:
- For Claude web: Set up the service on your local machine and use the configuration provided by the setup script in your MCP client.
- For Claude Desktop:
- Run the setup script and note the configuration output
- Locate your Claude Desktop configuration file (the path is provided in the setup script output)
- Add or merge the configuration into your Claude Desktop configuration file
- Restart Claude Desktop
Using YOLO Tools in Claude
1. First Check Available Models
Always check which models are available on your system first:
2. Detecting Objects in an Image
For analyzing an image file on your computer:
You can also specify a different model:
3. Running Comprehensive Image Analysis
For more detailed analysis that combines object detection, classification, and more:
4. Image Segmentation
For identifying object boundaries and creating segmentation masks:
5. Image Classification
For classifying the entire image content:
6. Using Your Computer's Camera
Start real-time object detection using your computer's camera:
Get the latest camera detections:
Stop the camera when finished:
7. Advanced Model Operations
Training a Custom Model
Validating a Model
Exporting a Model to Different Formats
8. Testing Connection
Check if the YOLO service is running correctly:
Troubleshooting
Camera Issues
If the camera doesn't work, try different camera IDs:
Model Not Found
If a model is not found, make sure you've downloaded it to one of the configured directories:
Performance Issues
For better performance with limited resources, use the smaller models (e.g., yolov8n.pt instead of yolov8x.pt)
This server cannot be installed
local-only server
The server can only run on the client's local machine because it depends on local resources.
A computer vision service that allows Claude to perform object detection, segmentation, classification, and real-time camera analysis using state-of-the-art YOLO models.
Related MCP Servers
- -securityAlicense-qualityA powerful server that integrates the Moondream vision model to enable advanced image analysis, including captioning, object detection, and visual question answering, through the Model Context Protocol, compatible with AI assistants like Claude and Cline.Last updated -11JavaScriptApache 2.0
- -securityFlicense-qualityConnects Sketchup to Claude AI through the Model Context Protocol, allowing Claude to directly interact with and control Sketchup for prompt-assisted 3D modeling and scene manipulation.Last updated -20
- -securityFlicense-qualityConnects Houdini to Claude AI through Model Context Protocol, enabling AI-assisted 3D modeling, scene creation, simulation setup, and rendering through natural language commands.Last updated -37
- AsecurityFlicenseAqualityA Model Context Protocol server that enables Claude to generate and upscale images through the Letz AI API, allowing users to create images directly within Claude conversations.Last updated -21JavaScript