# OpenRouter MCP Server

A powerful Model Context Protocol (MCP) server that provides seamless access to multiple AI models through OpenRouter's unified API.

## Features
- **Collective Intelligence System**: Advanced multi-model collaboration and consensus building
  - 5 specialized MCP tools for ensemble reasoning and intelligent decision-making
  - Multi-model consensus with automated agreement analysis and quality scoring
  - Ensemble reasoning using specialized models for different task aspects
  - Adaptive model selection based on task context, requirements, and performance metrics
  - Cross-model validation for content quality assurance and accuracy verification
  - Collaborative problem-solving through iterative multi-model interaction
- **Multi-Model Access**: Chat with GPT-4o, Claude 3.5, Llama 3.3, Gemini 2.5, and 200+ other AI models
- **Vision/Multimodal Support**: Analyze images and visual content with vision-capable models
  - Support for base64-encoded images and image URLs
  - Automatic image resizing and optimization for API limits
  - Compatible with GPT-4o, Claude 3.5, Gemini 2.5, Llama Vision, and more
- **Latest Models (Jan 2025)**: Always up to date with the newest models
  - OpenAI o1, GPT-4o, GPT-4 Turbo
  - Claude 3.5 Sonnet, Claude 3 Opus
  - Gemini 2.5 Pro/Flash (1M+ context)
  - DeepSeek V3, Grok 2, and more
- **Intelligent Caching**: Smart model list caching for improved performance
  - Dual-layer memory + file caching with configurable TTL
  - Automatic model metadata enhancement and categorization
  - Advanced filtering by provider, category, capabilities, and performance tiers
  - Statistics tracking and cache optimization
- **Rich Metadata**: Comprehensive model information with intelligent extraction
  - Automatic provider detection (OpenAI, Anthropic, Google, Meta, DeepSeek, xAI, etc.)
  - Smart categorization (chat, image, audio, embedding, reasoning, code, multimodal)
  - Advanced capability detection (vision, functions, tools, JSON mode, streaming)
  - Performance tiers (premium/standard/economy) and cost analysis
  - Version parsing with family identification and latest-model detection
  - Quality scoring system (0-10) based on context length, pricing, and capabilities
- **Streaming Support**: Real-time response streaming for a better user experience
- **Advanced Model Benchmarking**: Comprehensive performance analysis system
  - Side-by-side model comparison with detailed metrics (response time, cost, quality, throughput)
  - Category-based model selection (chat, code, reasoning, multimodal)
  - Weighted performance analysis for different use cases
  - Multiple report formats (Markdown, CSV, JSON)
  - Historical benchmark tracking and trend analysis
  - 5 MCP tools for seamless integration with Claude Desktop
- **Usage Tracking**: Monitor API usage, costs, and token consumption
- **Error Handling**: Robust error handling with detailed logging
- **Easy Setup**: One-command installation with `npx`
- **Claude Desktop Integration**: Seamless integration with the Claude Desktop app
- **Full MCP Compliance**: Implements Model Context Protocol standards
## Quick Start

### Option 1: Using npx (Recommended)
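A minimal sketch of the flow, assuming the CLI is published as `openrouter-mcp` (matching this project's default app identifier) with hypothetical `init` and `start` subcommands:

```bash
# One-time setup: prompts for your OpenRouter API key (hypothetical subcommand)
npx openrouter-mcp init

# Launch the MCP server (hypothetical subcommand)
npx openrouter-mcp start
```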
### Option 2: Global Installation
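Under the same package-name assumption, a global install would look like:

```bash
# Install the CLI once, globally
npm install -g openrouter-mcp

# Then run it without npx
openrouter-mcp init
openrouter-mcp start
```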
## Prerequisites

- **Node.js 16+**: Required for the CLI interface
- **Python 3.9+**: Required for the MCP server backend
- **OpenRouter API Key**: Get one free at [openrouter.ai](https://openrouter.ai)
## Installation & Configuration

### 1. Get Your OpenRouter API Key

1. Visit [OpenRouter](https://openrouter.ai)
2. Sign up for a free account
3. Navigate to the API Keys section
4. Create a new API key

### 2. Initialize the Server
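Assuming the hypothetical `init` subcommand sketched in the Quick Start:

```bash
npx openrouter-mcp init
```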
This will:

- Prompt you for your OpenRouter API key
- Create a `.env` configuration file
- Optionally set up Claude Desktop integration

### 3. Start the Server
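Again assuming the hypothetical `start` subcommand:

```bash
npx openrouter-mcp start
```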
The server will start on `localhost:8000` by default.
## Usage

### Available Commands
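The command set below is an assumption inferred from the setup, start, and status-check sections of this README; consult the CLI's own help output for the authoritative list:

```bash
npx openrouter-mcp init      # interactive configuration (hypothetical)
npx openrouter-mcp start     # run the MCP server (hypothetical)
npx openrouter-mcp status    # check configuration and status (hypothetical)
```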
### Start Server Options
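A sketch of likely start flags, assumed to mirror the server bind address, port, and logging-level settings in the configuration table below; the exact flag names may differ:

```bash
# All flag names here are assumptions mirroring the server's environment settings
npx openrouter-mcp start --host 0.0.0.0 --port 9000 --log-level debug
```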
## Claude Desktop Integration

### Automatic Setup
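A sketch, assuming a hypothetical integration subcommand:

```bash
# Hypothetical subcommand name; check the project help for the real one
npx openrouter-mcp install-claude
```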
This automatically configures Claude Desktop to use OpenRouter models.
### Manual Setup

Add the following to your Claude Desktop config file (a sample entry is shown after the paths below):

- **macOS**: `~/Library/Application Support/Claude/claude_desktop_config.json`
- **Windows**: `%APPDATA%/Claude/claude_desktop_config.json`
- **Linux**: `~/.config/claude/claude_desktop_config.json`
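A sketch of the entry to add, using the standard `mcpServers` shape from the MCP ecosystem; the server key, args, and env values here are illustrative assumptions:

```json
{
  "mcpServers": {
    "openrouter": {
      "command": "npx",
      "args": ["openrouter-mcp", "start"],
      "env": {
        "OPENROUTER_API_KEY": "your-api-key-here"
      }
    }
  }
}
```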
Then restart Claude Desktop.
## Claude Code CLI Integration

### Automatic Setup
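A sketch, assuming a hypothetical integration subcommand:

```bash
# Hypothetical subcommand name
npx openrouter-mcp install-claude-code
```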
This automatically configures Claude Code CLI to use OpenRouter models.
### Manual Setup

Add an entry like the one below to your Claude Code CLI config file at `~/.claude/claude_code_config.json`:
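A sketch under the same assumptions as the Claude Desktop entry above; the actual schema of `claude_code_config.json` may differ:

```json
{
  "mcpServers": {
    "openrouter": {
      "command": "npx",
      "args": ["openrouter-mcp", "start"],
      "env": {
        "OPENROUTER_API_KEY": "your-api-key-here"
      }
    }
  }
}
```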
### Usage with Claude Code CLI
Once configured, you can use OpenRouter models directly in your terminal:
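For example (the prompt is illustrative; the CLI decides when to invoke the MCP tools):

```bash
claude "Use openai/gpt-4o via chat_with_model to summarize this repository's README"
```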
For detailed setup instructions, see Claude Code CLI Integration Guide.
## Available MCP Tools

Once integrated with Claude Desktop or Claude Code CLI, you'll have access to these tools:

### 1. `chat_with_model`

Chat with any available AI model.

**Parameters:**

- `model`: Model ID (e.g., `"openai/gpt-4o"`, `"anthropic/claude-3.5-sonnet"`)
- `messages`: Conversation history
- `temperature`: Creativity level (0.0-2.0)
- `max_tokens`: Maximum response length
- `stream`: Enable streaming responses

**Example:**
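A sketch of the tool arguments as JSON (the prompt text is illustrative):

```json
{
  "model": "openai/gpt-4o",
  "messages": [
    { "role": "user", "content": "Explain quantum computing in one paragraph" }
  ],
  "temperature": 0.7,
  "max_tokens": 500
}
```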
### 2. `list_available_models`

Get comprehensive information about all available models with enhanced metadata.

**Parameters:**

- `filter_by`: Optional filter by model name
- `provider`: Filter by provider (openai, anthropic, google, etc.)
- `category`: Filter by category (chat, image, reasoning, etc.)
- `capabilities`: Filter by specific capabilities
- `performance_tier`: Filter by tier (premium, standard, economy)
- `min_quality_score`: Minimum quality score (0-10)

**Returns:**

- Model IDs, names, and descriptions with enhanced metadata
- Provider and category classification
- Detailed pricing and context information
- Capability flags (vision, functions, streaming, etc.)
- Performance metrics and quality scores
- Version information and latest-model indicators
### 3. `get_usage_stats`

Track your API usage and costs.

**Parameters:**

- `start_date`: Start date (YYYY-MM-DD)
- `end_date`: End date (YYYY-MM-DD)

**Returns:**

- Total costs and token usage
- Request counts
- Model-specific breakdowns
### 4. `chat_with_vision`

Chat with vision-capable models by sending images.

**Parameters:**

- `model`: Vision-capable model ID (e.g., `"openai/gpt-4o"`, `"anthropic/claude-3-opus"`, `"google/gemini-pro-vision"`)
- `messages`: Conversation history (supports both text and image content)
- `images`: List of images (file paths, URLs, or base64 strings)
- `temperature`: Creativity level (0.0-2.0)
- `max_tokens`: Maximum response length

**Image Format Support:**

- File paths: `/path/to/image.jpg`, `./image.png`
- URLs: `https://example.com/image.jpg`
- Base64: Direct base64 strings (with or without a data URI prefix)

**Example - Multiple Images:**
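A sketch of the tool arguments, mixing a (hypothetical) local file path with a URL:

```json
{
  "model": "openai/gpt-4o",
  "messages": [
    { "role": "user", "content": "Compare these two charts and summarize the differences" }
  ],
  "images": [
    "./charts/q1_revenue.png",
    "https://example.com/q2_revenue.png"
  ],
  "max_tokens": 800
}
```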
**Features:**

- Automatic image format detection and conversion
- Image resizing for API size limits (20 MB max)
- Support for JPEG, PNG, GIF, and WebP formats
- Batch processing of multiple images
### 5. `list_vision_models`

Get all vision-capable models.

**Parameters:** None

**Returns:**

- List of models that support image analysis
- Model capabilities and pricing information
- Context window sizes for multimodal content

**Example Vision Models:**

- `openai/gpt-4o`: OpenAI's latest multimodal model
- `openai/gpt-4o-mini`: Fast and cost-effective vision model
- `anthropic/claude-3-opus`: Most capable Claude vision model
- `anthropic/claude-3-sonnet`: Balanced Claude vision model
- `google/gemini-pro-vision`: Google's multimodal AI
- `meta-llama/llama-3.2-90b-vision-instruct`: Meta's vision-capable Llama model
### 6. `benchmark_models`

Compare multiple AI models with the same prompt.

**Parameters:**

- `models`: List of model IDs to benchmark
- `prompt`: The prompt to send to each model
- `temperature`: Temperature setting (0.0-2.0)
- `max_tokens`: Maximum response tokens
- `runs_per_model`: Number of runs per model for averaging

**Returns:**

- Performance metrics (response time, cost, tokens)
- Model rankings by speed, cost, and reliability
- Individual responses from each model
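A sketch of typical arguments (the model choices and prompt are illustrative):

```json
{
  "models": ["openai/gpt-4o-mini", "anthropic/claude-3-haiku", "google/gemini-pro"],
  "prompt": "Write a haiku about distributed systems",
  "temperature": 0.7,
  "max_tokens": 100,
  "runs_per_model": 3
}
```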
### 7. `compare_model_categories`

Compare the best models from different categories.

**Parameters:**

- `categories`: List of categories to compare
- `prompt`: Test prompt
- `models_per_category`: Number of top models per category

**Returns:**

- Category-wise comparison results
- Best performers in each category
### 8. `get_benchmark_history`

Retrieve historical benchmark results.

**Parameters:**

- `limit`: Maximum number of results to return
- `days_back`: Number of days to look back
- `model_filter`: Optional model ID filter

**Returns:**

- List of past benchmark results
- Performance trends over time
- Summary statistics
### 9. `export_benchmark_report`

Export benchmark results in different formats.

**Parameters:**

- `benchmark_file`: Benchmark result file to export
- `format`: Output format (`"markdown"`, `"csv"`, `"json"`)
- `output_file`: Optional custom output filename

**Returns:**

- Exported report file path
- Export status and summary
### 10. `compare_model_performance`

Advanced model comparison with weighted metrics.

**Parameters:**

- `models`: List of model IDs to compare
- `weights`: Metric weights (speed, cost, quality, throughput)
- `include_cost_analysis`: Include detailed cost analysis

**Returns:**

- Weighted performance rankings
- Cost-effectiveness analysis
- Usage recommendations for different scenarios
## Collective Intelligence Tools

The following advanced tools leverage multiple AI models for enhanced accuracy and insights:

### 11. `collective_chat_completion`

Generate a chat completion using collective intelligence, where multiple models work toward a consensus.

**Parameters:**

- `prompt`: The prompt to process collectively
- `models`: Optional list of specific models to use
- `strategy`: Consensus strategy (`"majority_vote"`, `"weighted_average"`, `"confidence_threshold"`)
- `min_models`: Minimum number of models to use (default: 3)
- `max_models`: Maximum number of models to use (default: 5)
- `temperature`: Sampling temperature (default: 0.7)
- `system_prompt`: Optional system prompt for all models

**Returns:**

- `consensus_response`: The agreed-upon response
- `agreement_level`: Level of agreement between models
- `confidence_score`: Confidence in the consensus
- `participating_models`: List of models that participated
- `individual_responses`: Responses from each model
- `quality_metrics`: Accuracy, consistency, and completeness scores
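A sketch of typical arguments (the prompt is illustrative; `models` is omitted so the server picks participants):

```json
{
  "prompt": "What are the main trade-offs between microservices and monoliths?",
  "strategy": "majority_vote",
  "min_models": 3,
  "max_models": 5,
  "temperature": 0.7
}
```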
### 12. `ensemble_reasoning`

Perform ensemble reasoning using specialized models for different aspects of complex problems.

**Parameters:**

- `problem`: Problem to solve with ensemble reasoning
- `task_type`: Type of task (`"reasoning"`, `"analysis"`, `"creative"`, `"factual"`, `"code_generation"`)
- `decompose`: Whether to decompose the problem into subtasks
- `models`: Optional list of specific models to use
- `temperature`: Sampling temperature (default: 0.7)

**Returns:**

- `final_result`: The combined reasoning result
- `subtask_results`: Results from individual subtasks
- `model_assignments`: Which models handled which subtasks
- `reasoning_quality`: Quality metrics for the reasoning process
- `processing_time`: Total processing time
- `strategy_used`: Decomposition strategy used
### 13. `adaptive_model_selection`

Intelligently select the best model for a given task using adaptive routing.

**Parameters:**

- `query`: Query for adaptive model selection
- `task_type`: Type of task (`"reasoning"`, `"creative"`, `"factual"`, `"code_generation"`, `"analysis"`)
- `performance_requirements`: Performance requirements (accuracy, speed thresholds)
- `constraints`: Task constraints (max cost, timeout, etc.)

**Returns:**

- `selected_model`: The chosen model ID
- `selection_reasoning`: Why this model was selected
- `confidence`: Confidence in the selection (0-1)
- `alternative_models`: Other viable options with scores
- `routing_metrics`: Performance metrics used in selection
- `expected_performance`: Predicted performance characteristics
### 14. `cross_model_validation`

Validate content quality and accuracy across multiple models for quality assurance.

**Parameters:**

- `content`: Content to validate across models
- `validation_criteria`: Specific validation criteria (e.g., `"factual_accuracy"`, `"technical_correctness"`)
- `models`: Optional list of models to use for validation
- `threshold`: Validation threshold (0-1, default: 0.7)

**Returns:**

- `validation_result`: Overall validation result (`"VALID"` or `"INVALID"`)
- `validation_score`: Numerical validation score (0-1)
- `validation_issues`: Issues found by multiple models
- `model_validations`: Individual validation results from each model
- `recommendations`: Suggested improvements
- `confidence`: Confidence in the validation result
### 15. `collaborative_problem_solving`

Solve complex problems through collaborative multi-model interaction and iterative refinement.

**Parameters:**

- `problem`: Problem to solve collaboratively
- `requirements`: Problem requirements and constraints
- `constraints`: Additional constraints (budget, time, resources)
- `max_iterations`: Maximum number of iteration rounds (default: 3)
- `models`: Optional list of specific models to use

**Returns:**

- `final_solution`: The collaborative solution
- `solution_path`: Step-by-step solution development
- `alternative_solutions`: Alternative approaches considered
- `collaboration_quality`: Quality metrics for the collaboration
- `component_contributions`: Individual model contributions
- `convergence_metrics`: How the solution evolved over iterations
## Configuration

### Environment Variables

Create a `.env` file in your project directory, for example:
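A minimal sketch; `OPENROUTER_API_KEY` is the conventional key name, while the optional variable names below are assumptions aligned with the configuration table that follows:

```bash
# Required: your OpenRouter API key
OPENROUTER_API_KEY=your-api-key-here

# Optional overrides (names assumed; defaults listed in the table below)
HOST=localhost
PORT=8000
LOG_LEVEL=info
CACHE_TTL_HOURS=1
```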
### Configuration Options

| Variable | Description | Default |
|----------|-------------|---------|
| `OPENROUTER_API_KEY` | Your OpenRouter API key | Required |
| `OPENROUTER_APP_NAME` | App identifier for tracking | `"openrouter-mcp"` |
| `OPENROUTER_HTTP_REFERER` | HTTP referer header | `" "` |
| `HOST` | Server bind address | `"localhost"` |
| `PORT` | Server port | `"8000"` |
| `LOG_LEVEL` | Logging level | `"info"` |
| `CACHE_TTL_HOURS` | Model cache TTL in hours | `"1"` |
| `CACHE_MAX_ITEMS` | Max items in memory cache | `"1000"` |
| `CACHE_FILE` | Cache file path | `"openrouter_model_cache.json"` |
## Popular Models

Here are some popular models available through OpenRouter:

### OpenAI Models

- `openai/gpt-4o`: Most capable multimodal GPT-4 model (text + vision)
- `openai/gpt-4o-mini`: Fast and cost-effective with vision support
- `openai/gpt-4`: Most capable text-only GPT-4 model
- `openai/gpt-3.5-turbo`: Fast and cost-effective text model

### Anthropic Models

- `anthropic/claude-3-opus`: Most capable Claude model (text + vision)
- `anthropic/claude-3-sonnet`: Balanced capability and speed (text + vision)
- `anthropic/claude-3-haiku`: Fast and efficient (text + vision)

### Open Source Models

- `meta-llama/llama-3.2-90b-vision-instruct`: Meta's flagship vision model
- `meta-llama/llama-3.2-11b-vision-instruct`: Smaller vision-capable Llama
- `meta-llama/llama-2-70b-chat`: Meta's text-only flagship model
- `mistralai/mixtral-8x7b-instruct`: Efficient mixture of experts
- `microsoft/wizardlm-2-8x22b`: High-quality instruction following

### Specialized Models

- `google/gemini-pro-vision`: Google's multimodal AI (text + vision)
- `google/gemini-pro`: Google's text-only model
- `cohere/command-r-plus`: Great for RAG applications
- `perplexity/llama-3-sonar-large-32k-online`: Web-connected model

Use `list_available_models` to see all available models and their pricing.
## Troubleshooting

### Common Issues

**1. Python not found**

**2. Missing Python dependencies**

**3. API key not configured**

**4. Port already in use**

**5. Claude Desktop not detecting the server**

- Restart Claude Desktop after configuration
- Check the config file path and format
- Verify the API key is correct
### Debug Mode

Enable debug logging for detailed troubleshooting:
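For example, assuming the logging level is read from the environment as in the configuration section:

```bash
LOG_LEVEL=debug npx openrouter-mcp start
```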
### Status Check

Check server configuration and status:
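Assuming the hypothetical `status` subcommand sketched under Available Commands:

```bash
npx openrouter-mcp status
```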
## Development

### Running Tests
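A sketch, assuming a standard pytest suite for the Python backend (see the Testing Guide below for the project's actual setup):

```bash
pip install -r requirements.txt   # assumed dependency file
pytest -v
```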
### Project Structure
## Documentation

### Quick Links

- **Documentation Index** - Complete documentation overview
- **Installation Guide** - Detailed setup instructions
- **API Reference** - Complete API documentation
- **Troubleshooting** - Common issues and solutions
- **FAQ** - Frequently asked questions
### Integration Guides

- **Claude Desktop Integration** - Desktop app setup
- **Claude Code CLI Integration** - Terminal workflow
### Feature Guides

- **Multimodal/Vision Guide** - Image analysis capabilities
- **Benchmarking Guide** - Model performance comparison
- **Model Metadata Guide** - Enhanced filtering system
- **Model Caching** - Cache optimization
### Development

- **Architecture Overview** - System design documentation
- **Testing Guide** - TDD practices and test suite
- **Contributing Guide** - Development guidelines
### External Resources

- **OpenRouter API Docs** - Official OpenRouter documentation
- **MCP Specification** - Model Context Protocol standard
## Contributing

We welcome contributions! Please see our Contributing Guide for details.

1. Fork the repository
2. Create a feature branch
3. Make your changes
4. Add tests
5. Submit a pull request
## License
This project is licensed under the MIT License - see the LICENSE file for details.
## Links

- **OpenRouter** - Get your API key
- **Claude Desktop** - Download the Claude Desktop app
- **Model Context Protocol** - Learn about MCP
- **FastMCP** - The MCP framework we use
## Acknowledgments

- **OpenRouter** for providing access to multiple AI models
- **FastMCP** for the excellent MCP framework
- **Anthropic** for the Model Context Protocol specification

Made with ❤️ for the AI community

**Need help?** Open an issue or check our documentation!