# Technical Explanations and Deep-Dives
This document provides detailed technical explanations for various aspects of the Perplexity MCP implementation, covering core concepts, design decisions, and development insights.
---
## 1. Logger Mechanism - Low Level Deep Dive
### How Logging Works in Our Codebase
#### Logger Configuration and Setup
**In server.py (Lines 12-19):**
```python
logging.basicConfig(
level=logging.INFO,
format="%(asctime)s - %(name)s - %(levelname)s - %(message)s",
handlers=[logging.StreamHandler(sys.stderr)]
)
logger = logging.getLogger(__name__)
```
**Critical MCP Protocol Compliance:**
- **stderr Direction**: All log output goes to `sys.stderr`, NEVER `stdout`
- **Why Critical**: MCP uses `stdout` for JSON-RPC communication; any non-JSON output breaks the protocol
- **Format Structure**: Timestamp + Module Name + Level + Message for debugging
**In client.py (Lines 12-13):**
```python
logger = logging.getLogger(__name__)
```
This creates a child logger that inherits from the root logger configuration.
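Because `client.py` never configures handlers itself, its logger only produces output through propagation to the root logger. A minimal, self-contained sketch of that behaviour (illustrative only, not code from the repository):
```python
import logging
import sys

# Equivalent of server.py's one-time setup: the root logger writes to stderr
logging.basicConfig(
    level=logging.INFO,
    format="%(asctime)s - %(name)s - %(levelname)s - %(message)s",
    handlers=[logging.StreamHandler(sys.stderr)],
)

# Equivalent of client.py: a named logger with no handlers of its own
child = logging.getLogger("client")
print(child.handlers, file=sys.stderr)  # [] - nothing attached directly
child.info("hello")  # still emitted: the record propagates up to the root handler
```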
#### Logging During Test Script Execution
**When Running `uv run python tests/tests.py small`:**
1. **Logger Initialization**:
- Python imports `client.py` → Logger created but no handlers yet
- Python imports `server.py` → `logging.basicConfig()` configures global logging
- All loggers in the process now use stderr with INFO level
2. **API Request Logging**:
```python
# client.py line 57-58
logger.info(f"Making request to Perplexity API with model: {model}")
logger.debug(f"Request payload: {payload}")
```
- **INFO level**: Shows in console output
- **DEBUG level**: Hidden (level set to INFO)
- **Output destination**: stderr (doesn't interfere with test output)
3. **Response Logging**:
```python
# client.py lines 67-71
logger.info(f"Received response from Perplexity API")
if "usage" in result:
usage = result["usage"]
logger.info(f"Token usage - Prompt: {usage.get('prompt_tokens', 0)}, "
f"Completion: {usage.get('completion_tokens', 0)}, "
f"Total: {usage.get('total_tokens', 0)}")
```
4. **Error Logging**:
```python
# client.py lines 75-76, 82, 88
logger.error(f"HTTP error from Perplexity API: {e.response.status_code}")
logger.error(f"Request timeout after {self.timeout} seconds")
logger.exception("Unexpected error in chat_completion")
```
#### Logging During Claude Desktop MCP Usage
**Process Separation:**
1. **Claude Desktop Process**: Starts our MCP server as a child process
2. **MCP Server Process**: Our `server.py` runs independently with its own logger
3. **Communication**: Only JSON-RPC messages over stdout/stdin
**Logger Behavior in MCP Context:**
1. **Server Startup**:
```python
# server.py lines 140-142
logger.info("Starting Perplexity MCP server...")
logger.info("Available tools: perplexity_small, perplexity_medium, perplexity_large")
```
- These go to stderr of the MCP server process
- Claude Desktop may capture these for debugging
2. **Tool Execution**:
```python
# server.py line 118
logger.warning(f"Starting deep research query - this may take 10-30 minutes")
```
- When Claude calls a tool, logging continues to stderr
- API request/response logging from client.py happens
3. **Error Handling**:
```python
# server.py lines 65, 92, 125
logger.exception("Error in perplexity_small")
```
- Exceptions logged to stderr without breaking MCP protocol
- Claude Desktop receives proper error responses via stdout
**Logger Hierarchy Flow:**
```
Root Logger (configured in server.py)
├── __main__ (server.py logger)
├── client (client.py logger)
└── mcp.server.fastmcp (FastMCP internal logging)
```
**Key Differences Between Test and MCP Usage:**
- **Test**: Logs visible in terminal, mixed with test output
- **MCP**: Logs are captured via Claude Desktop's stderr handling (its MCP log files) and are not shown to the user
- **Protocol**: Both maintain clean stdout for JSON-RPC communication (illustrated in the sketch below)
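The sketch below makes the stdout/stderr split concrete. It illustrates the principle rather than the actual FastMCP transport code; the `send_jsonrpc` helper is hypothetical.
```python
import json
import logging
import sys

logging.basicConfig(level=logging.INFO, handlers=[logging.StreamHandler(sys.stderr)])
logger = logging.getLogger("mcp-demo")

def send_jsonrpc(result: dict, request_id: int) -> None:
    """stdout carries nothing but JSON-RPC frames."""
    frame = {"jsonrpc": "2.0", "id": request_id, "result": result}
    sys.stdout.write(json.dumps(frame) + "\n")
    sys.stdout.flush()

logger.info("handling tool call")       # -> stderr, safe
send_jsonrpc({"content": "answer"}, 1)  # -> stdout, protocol traffic only
# A stray print("debug") here would interleave non-JSON bytes into stdout
# and break the client's JSON-RPC parser.
```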
---
## 2. Object-Oriented Programming and 'self' Concept - Low Level Deep Dive
### Understanding 'self' in Our Codebase
#### What is 'self'?
**'self'** is Python's way of referring to the current instance of a class. It's the first parameter of every instance method and represents the specific object the method is being called on.
#### How 'self' Works in Our PerplexityClient Class
**Class Definition (client.py lines 16-27):**
```python
class PerplexityClient:
def __init__(self):
self.api_key = get_api_key()
self.base_url = PERPLEXITY_BASE_URL
self.timeout = PERPLEXITY_TIMEOUT
self.headers = {
"Authorization": f"Bearer {self.api_key}",
"Content-Type": "application/json"
}
```
**Memory Layout When Instance Created:**
```
client_instance_memory_address: 0x7f8b8c0d4a90
├── self.api_key = "pplx-abc123..."
├── self.base_url = "https://api.perplexity.ai"
├── self.timeout = 300
└── self.headers = {
"Authorization": "Bearer pplx-abc123...",
"Content-Type": "application/json"
}
```
#### Instance Creation and Method Calls
**In our server.py (lines 28-33):**
```python
def get_perplexity_client() -> PerplexityClient:
global perplexity_client
if perplexity_client is None:
perplexity_client = PerplexityClient() # __init__ called here
return perplexity_client
```
**What happens during `PerplexityClient()`:**
1. **Memory Allocation**: Python creates new object in memory
2. **`__init__` Call**: `__init__(self)` called automatically
3. **'self' Binding**: 'self' parameter bound to the new object's memory address
4. **Attribute Assignment**: Each `self.attribute = value` stores data in the object's memory space (see the sketch below)
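Step 4 can be observed directly: every `self.attribute = value` ends up in the instance's `__dict__`, while methods stay on the class. A quick interactive check (assumes the client can be constructed, i.e. a valid API key is configured):
```python
client = PerplexityClient()

# Instance attributes assigned in __init__ live in the object's own namespace
print(vars(client).keys())
# dict_keys(['api_key', 'base_url', 'timeout', 'headers'])

# Methods are not copied per instance; they live on the class object
print("chat_completion" in vars(client))            # False
print("chat_completion" in vars(PerplexityClient))  # True
```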
#### Method Execution with 'self'
**When we call `client.chat_completion(...)`:**
```python
# In server.py line 54
client = get_perplexity_client() # Returns our instance
response = client.chat_completion(messages=messages, **config)
```
**Behind the scenes transformation:**
```python
# What we write:
client.chat_completion(messages=messages, **config)
# What Python actually executes:
PerplexityClient.chat_completion(client, messages=messages, **config)
# ^^^^^^
# 'self' parameter gets the client instance
```
**Inside chat_completion method (client.py lines 57-58):**
```python
def chat_completion(self, messages, model, **kwargs):
# 'self' here refers to our specific client instance
logger.info(f"Making request to Perplexity API with model: {model}")
# When we access self.headers:
headers = self.headers # Gets the headers we stored in __init__
# When we access self.timeout:
with httpx.Client(timeout=self.timeout) as client:
# Gets the timeout value (300) we stored in __init__
```
#### Multiple Instances vs Singleton Pattern
**Our Implementation Uses Singleton Pattern:**
```python
# Global variable (server.py line 25)
perplexity_client = None
def get_perplexity_client():
global perplexity_client
if perplexity_client is None:
perplexity_client = PerplexityClient() # Create once
return perplexity_client # Reuse same instance
```
**Why Singleton?**
- **Performance**: Avoid repeated API key validation
- **Resource Management**: Single HTTP client configuration
- **State Consistency**: Same timeout/headers across all tool calls
**If We Used Multiple Instances:**
```python
# This would create new instances each time:
def get_perplexity_client():
return PerplexityClient() # New instance every call
# Each instance would have its own memory:
client1 = PerplexityClient() # Memory address: 0x7f8b8c0d4a90
client2 = PerplexityClient() # Memory address: 0x7f8b8c0d4b20
```
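A quick identity check (illustrative, not part of the test suite) confirms the singleton behaviour:
```python
a = get_perplexity_client()
b = get_perplexity_client()

print(a is b)          # True - the same object is returned every time
print(id(a) == id(b))  # True - identical memory address

# With the "new instance every call" variant above, `a is b` would be False.
```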
#### 'self' vs Local Variables
**Instance Variables (survive method calls):**
```python
def __init__(self):
self.api_key = get_api_key() # Stored in object memory
self.timeout = 300 # Persists across method calls
```
**Local Variables (destroyed after method ends):**
```python
def chat_completion(self, messages, model, **kwargs):
payload = { # Local variable - destroyed when method ends
"model": model,
"messages": messages,
**kwargs
}
# 'payload' only exists during this method execution
```
#### Method Types and 'self'
**Instance Methods (have 'self'):**
```python
def chat_completion(self, messages, model, **kwargs):
# Can access instance variables: self.api_key, self.timeout
# Can modify instance state: self.last_request = payload
```
**Class Methods (have 'cls'):**
```python
@classmethod
def from_config(cls, config_dict):
# 'cls' refers to the PerplexityClient class itself
instance = cls() # Same as PerplexityClient()
return instance
```
**Static Methods (no self/cls):**
```python
@staticmethod
def validate_api_key(api_key):
# No access to instance or class variables
# Just a function bundled with the class
return api_key.startswith("pplx-")
```
#### Related OOP Concepts in Our Code
**Encapsulation:**
```python
class PerplexityClient:
def __init__(self):
self.api_key = get_api_key() # Private data
def chat_completion(self, messages, model, **kwargs):
# Public interface - hides internal HTTP implementation
```
**Inheritance (if we extended the class):**
```python
class EnhancedPerplexityClient(PerplexityClient):
def __init__(self):
super().__init__() # Call parent's __init__
self.cache = {} # Add our own attributes
    def chat_completion(self, messages, model, **kwargs):
        # 'self' here includes both parent and child attributes
        cache_key = (model, str(messages))  # simple illustrative key
        if cache_key in self.cache:  # Our attribute
            return self.cache[cache_key]
        result = super().chat_completion(messages, model, **kwargs)  # Parent method
        self.cache[cache_key] = result
        return result
```
**Polymorphism (different classes, same interface):**
```python
class OpenAIClient:
def chat_completion(self, messages, model, **kwargs):
# Different implementation, same method signature
pass
class AnthropicClient:
def chat_completion(self, messages, model, **kwargs):
# Different implementation, same method signature
pass
# All can be used interchangeably:
clients = [PerplexityClient(), OpenAIClient(), AnthropicClient()]
for client in clients:
result = client.chat_completion(messages, model) # Polymorphism
```
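Duck typing is what makes this interchangeability work; no shared base class is required. If the common interface should also be checkable by type tools, a `typing.Protocol` can describe it. This is a hedged sketch; the codebase does not define such a protocol:
```python
from typing import Protocol

class ChatClient(Protocol):
    def chat_completion(self, messages: list, model: str, **kwargs) -> dict: ...

def run_query(client: ChatClient, messages: list, model: str) -> dict:
    # Accepts any object with a matching chat_completion method,
    # e.g. PerplexityClient, OpenAIClient, or AnthropicClient.
    return client.chat_completion(messages, model)
```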
---
## 3. Class vs Functions Design Decision - Low Level Deep Dive
### Why We Used a Class for PerplexityClient
#### Current Class-Based Implementation
**Our Implementation (client.py):**
```python
class PerplexityClient:
def __init__(self):
self.api_key = get_api_key()
self.base_url = PERPLEXITY_BASE_URL
self.timeout = PERPLEXITY_TIMEOUT
self.headers = {
"Authorization": f"Bearer {self.api_key}",
"Content-Type": "application/json"
}
def chat_completion(self, messages, model, **kwargs):
# Method uses self.headers, self.timeout, etc.
```
**Usage in server.py:**
```python
perplexity_client = None
def get_perplexity_client() -> PerplexityClient:
global perplexity_client
if perplexity_client is None:
perplexity_client = PerplexityClient() # Initialize once
return perplexity_client
# In each tool function:
client = get_perplexity_client() # Reuse same instance
response = client.chat_completion(messages=messages, **config)
```
#### Alternative Function-Based Implementation
**What it would look like with functions:**
```python
# client.py - Function-based approach
def create_headers():
api_key = get_api_key() # Called every time
return {
"Authorization": f"Bearer {api_key}",
"Content-Type": "application/json"
}
def chat_completion(messages, model, **kwargs):
headers = create_headers() # Recreated every call
base_url = PERPLEXITY_BASE_URL # Looked up every call
timeout = PERPLEXITY_TIMEOUT # Looked up every call
payload = {"model": model, "messages": messages, **kwargs}
with httpx.Client(timeout=timeout) as client:
response = client.post(
f"{base_url}/chat/completions",
headers=headers,
json=payload
)
return response.json()
def format_response(api_response):
# Same implementation
pass
```
**Usage in server.py would be:**
```python
from client import chat_completion, format_response
# In each tool function:
response = chat_completion(messages=messages, **config) # Direct function call
formatted = format_response(response)
```
#### Performance Analysis
**Class-Based Approach (Our Choice):**
1. **Initialization Cost (One-time)**:
```python
# Called once when MCP server starts
client = PerplexityClient()
```
- API key validation: 1x
- Header creation: 1x
- Configuration setup: 1x
2. **Per-Request Cost**:
```python
# Called for each tool invocation
response = client.chat_completion(messages, model)
```
- API key lookup: 0x (already in self.headers)
- Header creation: 0x (reuse self.headers)
- HTTP client configuration: 0x (reuse self.timeout)
**Function-Based Approach:**
1. **Initialization Cost**: None
2. **Per-Request Cost**:
```python
# Called for each tool invocation
response = chat_completion(messages, model)
```
- API key validation: 1x per request
- Header creation: 1x per request
- Configuration lookup: 1x per request (the benchmark sketch below makes this overhead concrete)
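A toy benchmark shows the shape of this difference. The sketch below stubs out the key lookup, so the absolute numbers are meaningless; the point is that the class pays the setup cost once while the function pays it on every call:
```python
import timeit

def get_api_key() -> str:  # stand-in for the real config lookup
    return "pplx-test-key"

class Client:
    def __init__(self):
        self.headers = {"Authorization": f"Bearer {get_api_key()}"}  # built once

    def request(self):
        return self.headers  # reuse pre-built headers

def request_functional():
    return {"Authorization": f"Bearer {get_api_key()}"}  # rebuilt every call

client = Client()
print(timeit.timeit(client.request, number=100_000))
print(timeit.timeit(request_functional, number=100_000))
```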
#### Memory Usage Comparison
**Class-Based Memory Usage:**
```python
# Single instance in memory:
PerplexityClient instance at 0x7f8b8c0d4a90:
├── api_key: "pplx-abc123..." (stored once)
├── base_url: "https://api.perplexity.ai" (stored once)
├── timeout: 300 (stored once)
└── headers: {...} (dict stored once)
# Total: ~200 bytes for instance data
```
**Function-Based Memory Usage:**
```python
# Per function call:
def chat_completion():
api_key = get_api_key() # 50 bytes
headers = create_headers() # 100 bytes
base_url = PERPLEXITY_BASE_URL # 30 bytes
timeout = PERPLEXITY_TIMEOUT # 8 bytes
# Total: ~188 bytes per call
# Multiplied by number of requests
```
#### State Management Advantages
**Class-Based State Management:**
```python
class PerplexityClient:
def __init__(self):
self.api_key = get_api_key()
self.request_count = 0 # Could track usage
self.last_error = None # Could store last error
def chat_completion(self, messages, model, **kwargs):
self.request_count += 1 # Persistent state
try:
# Make request
result = ...
self.last_error = None
return result
except Exception as e:
self.last_error = e # Store for debugging
raise
def get_stats(self):
return {
"requests_made": self.request_count,
"last_error": self.last_error
}
```
**Function-Based State Management:**
```python
# Would need global variables or external storage
request_count = 0
last_error = None
def chat_completion(messages, model, **kwargs):
global request_count, last_error
request_count += 1 # Global state mutation
# Less clean, harder to manage
```
#### Error Handling and Context
**Class-Based Error Context:**
```python
def chat_completion(self, messages, model, **kwargs):
try:
response = self._make_request(payload) # Helper method
return response
except httpx.HTTPStatusError as e:
# Can access self.api_key for logging without re-fetching
logger.error(f"API error for key {self.api_key[:10]}...")
return {"error": "http_error", "message": str(e)}
```
**Function-Based Error Context:**
```python
def chat_completion(messages, model, **kwargs):
api_key = get_api_key() # Must fetch again for error logging
try:
response = make_request(payload, api_key)
return response
except httpx.HTTPStatusError as e:
logger.error(f"API error for key {api_key[:10]}...")
return {"error": "http_error", "message": str(e)}
```
#### Extensibility and Future Features
**Class-Based Extension:**
```python
class PerplexityClient:
def __init__(self):
self.api_key = get_api_key()
self.cache = {} # Easy to add caching
def chat_completion(self, messages, model, **kwargs):
cache_key = self._generate_cache_key(messages, model, kwargs)
if cache_key in self.cache:
return self.cache[cache_key] # Use instance cache
result = self._make_api_request(messages, model, kwargs)
self.cache[cache_key] = result
return result
def clear_cache(self):
self.cache.clear() # Instance method for cache management
```
**Function-Based Extension:**
```python
# Would need module-level cache or external cache service
_cache = {}
def chat_completion(messages, model, **kwargs):
cache_key = generate_cache_key(messages, model, kwargs)
if cache_key in _cache: # Module-level global state
return _cache[cache_key]
result = make_api_request(messages, model, kwargs)
_cache[cache_key] = result
return result
def clear_cache():
global _cache
_cache.clear() # Global state mutation
```
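Both extension sketches rely on a cache-key helper that is never shown (`_generate_cache_key` / `generate_cache_key`). One hedged way to implement it; the real project currently has no caching layer:
```python
import hashlib
import json

def generate_cache_key(messages: list, model: str, kwargs: dict) -> str:
    # Messages are lists of dicts and therefore unhashable; serialising them
    # to a stable JSON string and hashing it yields a usable dictionary key.
    raw = json.dumps(
        {"messages": messages, "model": model, "kwargs": kwargs},
        sort_keys=True,
    )
    return hashlib.sha256(raw.encode("utf-8")).hexdigest()
```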
#### Thread Safety Considerations
**Class-Based Thread Safety:**
```python
import threading
class PerplexityClient:
def __init__(self):
self.api_key = get_api_key()
self._lock = threading.Lock() # Instance-level lock
self.request_count = 0
def chat_completion(self, messages, model, **kwargs):
with self._lock: # Protect instance state
self.request_count += 1
# Make request safely
```
**Function-Based Thread Safety:**
```python
import threading
_lock = threading.Lock() # Global lock
_request_count = 0
def chat_completion(messages, model, **kwargs):
global _request_count
with _lock: # Global lock affects all functions
_request_count += 1
# More complex coordination needed
```
#### Testing Advantages
**Class-Based Testing:**
```python
def test_api_client():
# Easy to create test instance with mock data
client = PerplexityClient()
client.api_key = "test-key" # Can override instance attributes
client.base_url = "http://localhost:8000" # Test server
result = client.chat_completion(test_messages, "test-model")
assert result is not None
def test_with_mock():
client = PerplexityClient()
client.headers = {"test": "headers"} # Easy to mock
# Test behavior
```
**Function-Based Testing:**
```python
def test_api_functions():
# Need to mock global functions or environment
with patch('client.get_api_key') as mock_key:
mock_key.return_value = "test-key"
result = chat_completion(test_messages, "test-model")
# More complex mocking required
```
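Either style can also be tested without touching the network by using httpx's built-in `MockTransport`. The handler and payload below are hypothetical; the current test suite does not use this pattern:
```python
import httpx

def fake_handler(request: httpx.Request) -> httpx.Response:
    # Return a minimal Perplexity-style payload for any request
    return httpx.Response(
        200,
        json={"choices": [{"message": {"content": "stubbed answer"}}]},
    )

def test_post_against_mock_transport():
    transport = httpx.MockTransport(fake_handler)
    with httpx.Client(transport=transport, base_url="https://testserver") as client:
        response = client.post("/chat/completions", json={"model": "sonar-pro"})
    assert response.json()["choices"][0]["message"]["content"] == "stubbed answer"
```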
#### Resource Management
**Class-Based Resource Management:**
```python
class PerplexityClient:
    def __init__(self):
        self.api_key = get_api_key()
        self.timeout = PERPLEXITY_TIMEOUT
        self._session = httpx.Client(timeout=self.timeout)  # Persistent session
def chat_completion(self, messages, model, **kwargs):
# Reuse same HTTP session for connection pooling
response = self._session.post(...)
return response.json()
def close(self):
if self._session:
self._session.close() # Clean resource management
```
**Function-Based Resource Management:**
```python
def chat_completion(messages, model, **kwargs):
# Create new client every time - no connection pooling
with httpx.Client(timeout=PERPLEXITY_TIMEOUT) as client:
response = client.post(...)
return response.json()
# Client closed after each request - less efficient
```
#### Summary: Why Class-Based Approach Wins
1. **Performance**: One-time initialization vs per-request overhead
2. **Memory Efficiency**: Shared state vs repeated allocations
3. **State Management**: Clean instance variables vs global state
4. **Extensibility**: Easy to add features like caching, metrics, etc.
5. **Resource Management**: Persistent connections, proper cleanup
6. **Testing**: Easier mocking and test instance creation
7. **Thread Safety**: Instance-level coordination vs global locks
8. **Code Organization**: Related functionality grouped together
9. **Error Handling**: Persistent context for better error reporting
10. **Maintainability**: Object-oriented design patterns vs scattered functions
**The class-based approach provides superior architecture for a production MCP server that may handle many requests and needs to be maintainable and extensible.**
---
## 4. HTTPX Library vs Requests - Medium Level Detail
### What is HTTPX?
HTTPX is a modern, async-capable HTTP client library for Python that serves as a more advanced alternative to the popular `requests` library. It is developed by Encode, the team behind Starlette and Uvicorn.
### Key Differences from Requests
#### 1. Async/Await Support
**HTTPX:**
```python
import httpx
# Synchronous (like requests)
with httpx.Client() as client:
response = client.get("https://api.example.com")
# Asynchronous (unique to httpx)
async with httpx.AsyncClient() as client:
response = await client.get("https://api.example.com")
```
**Requests:**
```python
import requests
# Only synchronous
response = requests.get("https://api.example.com")
# No native async support
```
#### 2. HTTP/2 Support
**HTTPX:**
- Optional HTTP/2 support (install `httpx[http2]` and pass `http2=True`) for better performance
- Multiplexing of multiple requests over a single connection
- Better bandwidth utilization
**Requests:**
- Only HTTP/1.1 support
- Requires separate connections for concurrent requests
#### 3. Modern Python Support
**HTTPX:**
- Built for Python 3.7+
- Uses modern Python type hints
- Better integration with async frameworks like FastAPI
**Requests:**
- Supports older Python versions
- Less modern architecture
### Why We Chose HTTPX for Our MCP
#### 1. Better Timeout Handling
```python
# Our implementation in client.py
with httpx.Client(timeout=self.timeout) as client:
response = client.post(
f"{self.base_url}/chat/completions",
headers=self.headers,
json=payload
)
```
**HTTPX Timeout Advantages:**
- More granular timeout control
- Separate timeouts for connection vs read
- Better timeout error reporting
#### 2. JSON Response Handling
```python
# HTTPX (our code)
response = client.post(url, headers=headers, json=payload)
response.raise_for_status()
result = response.json()
# Requests equivalent
response = requests.post(url, headers=headers, json=payload)
response.raise_for_status()
result = response.json()
```
Both libraries have similar JSON handling, but HTTPX provides better error context.
#### 3. Context Manager Support
```python
# HTTPX - Clean resource management
with httpx.Client(timeout=300) as client:
response = client.post(...)
# Connection automatically closed
# Requests - Manual session management
session = requests.Session()
try:
response = session.post(...)
finally:
session.close() # Manual cleanup
```
### Performance Comparison for Our Use Case
#### Connection Reuse
**Our Current Implementation:**
```python
# Creates new client for each request
with httpx.Client(timeout=self.timeout) as client:
response = client.post(...)
```
**Optimized Implementation (Future):**
```python
class PerplexityClient:
def __init__(self):
self.client = httpx.Client(timeout=PERPLEXITY_TIMEOUT)
def chat_completion(self, messages, model, **kwargs):
# Reuse same client for connection pooling
response = self.client.post(...)
def close(self):
self.client.close()
```
#### Memory Usage
- **HTTPX**: Slightly higher memory usage due to its richer feature set (async support, optional HTTP/2)
- **Requests**: Lower memory footprint for simple use cases
- **For our MCP**: Difference is negligible given our request patterns
### Specific Advantages for Perplexity API
#### 1. Better Error Handling
```python
# Our error handling with httpx
try:
response = client.post(...)
response.raise_for_status()
except httpx.HTTPStatusError as e:
logger.error(f"HTTP error: {e.response.status_code}")
logger.error(f"Response content: {e.response.text}")
except httpx.TimeoutException:
logger.error(f"Request timeout after {self.timeout} seconds")
```
HTTPX provides more specific exception types than requests.
#### 2. Timeout Granularity
```python
# HTTPX allows fine-grained timeout control
timeout = httpx.Timeout(
connect=10.0, # Connection timeout
read=300.0, # Read timeout (important for long Perplexity responses)
write=10.0, # Write timeout
pool=10.0 # Pool timeout
)
client = httpx.Client(timeout=timeout)
```
This is particularly valuable for Perplexity's `sonar-deep-research` model which can take 10-30 minutes.
#### 3. Response Streaming (Future Enhancement)
```python
# HTTPX supports streaming for large responses
with httpx.stream("POST", url, json=payload) as response:
for chunk in response.iter_bytes():
# Process streaming response
pass
```
Useful if we implement streaming responses from Perplexity API.
### Migration Considerations
#### From Requests to HTTPX
```python
# Requests code
import requests
response = requests.post(url, json=data, timeout=30)
# HTTPX equivalent
import httpx
with httpx.Client(timeout=30) as client:
response = client.post(url, json=data)
```
The APIs are very similar, making migration straightforward.
#### Dependency Size
- **requests**: ~50KB + dependencies (~200KB total)
- **httpx**: ~100KB + dependencies (~400KB total)
- **For our MCP**: Size difference is acceptable given the benefits
### When to Use Each Library
#### Use HTTPX When:
- Building modern Python applications (3.7+)
- Need async/await support
- Require HTTP/2 support
- Want better timeout control
- Building APIs or microservices
#### Use Requests When:
- Supporting older Python versions
- Simple, one-off HTTP requests
- Minimizing dependencies
- Working with legacy codebases
### Conclusion for Our MCP
We chose HTTPX because:
1. **Better timeout handling** for long-running Perplexity queries
2. **Modern architecture** aligns with FastMCP framework
3. **Future-proofing** for potential async enhancements
4. **Better error reporting** for debugging API issues
5. **HTTP/2 support** for potential performance benefits
The slight increase in dependency size is offset by the improved reliability and feature set for our MCP server use case.
---
## 5. JSON Response Parsing - Basic Detail
### JSON Parsing in Our Code
#### The `.json()` Method
**In our client.py (line 64):**
```python
response = client.post(
f"{self.base_url}/chat/completions",
headers=self.headers,
json=payload
)
result = response.json() # <-- This line
```
### No Additional Library Required
**Answer: No, we don't need any additional library to use `.json()`**
#### Why No Extra Library?
1. **Built into HTTPX**: The `.json()` method is part of the HTTPX Response object
2. **Uses Python's Built-in `json` Module**: Internally calls `json.loads()`
3. **Automatic Handling**: HTTPX handles the response parsing automatically
#### What Happens Behind the Scenes
```python
# What response.json() does internally:
import json
def json_method(self):
# Get the response text
text_content = self.text # Response body as string
# Parse JSON using Python's built-in json module
return json.loads(text_content)
```
#### Manual Alternative
If we wanted to do it manually:
```python
import json
response = client.post(url, headers=headers, json=payload)
text_content = response.text
result = json.loads(text_content) # Same result as response.json()
```
#### Error Handling
```python
try:
result = response.json()
except json.JSONDecodeError:
# Handle invalid JSON response
print("Response is not valid JSON")
```
HTTPX's `.json()` method automatically raises appropriate exceptions if the response isn't valid JSON.
### JSON in Our Project Context
#### Input JSON (to Perplexity API):
```python
payload = {
"model": "sonar-pro",
"messages": [
{"role": "user", "content": "What is Python?"}
]
}
# HTTPX automatically converts this to JSON string
```
#### Output JSON (from Perplexity API):
```python
# response.json() converts this JSON string back to Python dict:
{
"id": "abc123",
"model": "sonar-pro",
"choices": [
{
"message": {
"content": "Python is a programming language..."
}
}
],
"citations": ["https://python.org"],
"usage": {"total_tokens": 150}
}
```
The built-in JSON support in Python and HTTPX makes this seamless without requiring additional dependencies.
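Once `response.json()` has produced a dict like the one above, pulling out the answer, citations, and token counts is plain dictionary access. A self-contained sketch using the sample payload (field names mirror the example; real responses may omit some keys, hence `.get()`):
```python
result = {
    "choices": [{"message": {"content": "Python is a programming language..."}}],
    "citations": ["https://python.org"],
    "usage": {"total_tokens": 150},
}

answer = result["choices"][0]["message"]["content"]
citations = result.get("citations", [])                      # may be absent
total_tokens = result.get("usage", {}).get("total_tokens", 0)

print(answer, citations, total_tokens)
```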
---
## 6. __init__.py Import Issues and Project Structure - Low Level Deep Dive
### Understanding the __init__.py Problem
#### What is __init__.py?
`__init__.py` is a special file that tells Python "this directory is a package." It can be empty or contain initialization code.
#### The Problem You Faced in Your Previous MCP
**Scenario**: You likely had a project structure like this:
```
your_mcp/
├── server.py
├── client.py
├── utils.py
└── __init__.py # Empty or minimal
```
**The Import Error**: When running the MCP server, you got import errors like:
```
ModuleNotFoundError: No module named 'client'
```
#### Why This Happens - Python Import Resolution
**Python's Module Search Process:**
1. **Current Directory**: Where the script is running from
2. **PYTHONPATH**: Environment variable with additional paths
3. **sys.path**: List of directories Python searches for modules
4. **Standard Library**: Built-in Python modules
5. **Site-packages**: Installed packages
#### The Empty __init__.py Problem
**Original Problem Structure:**
```python
# server.py
from client import PerplexityClient # ❌ Fails to import
# __init__.py (empty file)
# No imports defined
```
**Why it fails:**
1. **Run context**: The server is often launched from a different working directory, so the project folder is not on `sys.path`
2. **Import Resolution**: `from client import ...` is an absolute import, and Python cannot find a top-level `client` module on its search path
3. **Empty __init__.py**: The package itself exposes nothing, so package-level imports (`from your_mcp import PerplexityClient`) fail as well
#### Our Current Solution
**Our Current __init__.py (lines 5-9):**
```python
# Export main components for package imports
from .client import PerplexityClient
from .config import TOOL_CONFIGS, get_api_key
__all__ = ["PerplexityClient", "TOOL_CONFIGS", "get_api_key"]
```
**How This Solves the Problem:**
1. **Explicit Imports**: We explicitly import and expose modules
2. **Relative Imports**: Using `.client` (dot notation) for relative imports
3. **__all__ Definition**: Controls what gets imported with `from package import *`
#### Import Flow in Our Current Structure
**When Python starts server.py:**
1. **Direct Imports in server.py:**
```python
from client import PerplexityClient # Line 9
from config import TOOL_CONFIGS # Line 10
```
2. **Python's Resolution Process:**
```
1. Look for 'client.py' in current directory ✓
2. Found: /Users/rohitseelam/Projects/Perplexity_MCP/client.py
3. Import successful
```
3. **__init__.py Role:**
- Makes the directory a package
- Enables relative imports within the package
- Controls package-level imports
#### Alternative Import Patterns
**Pattern 1: Direct Module Imports (Our Current Approach)**
```python
# server.py
from client import PerplexityClient
from config import TOOL_CONFIGS
```
**Pattern 2: Package-Level Imports**
```python
# server.py
from perplexity_mcp import PerplexityClient, TOOL_CONFIGS
# This would require __init__.py to have:
# from .client import PerplexityClient
# from .config import TOOL_CONFIGS
```
**Pattern 3: Absolute Package Imports**
```python
# server.py
from perplexity_mcp.client import PerplexityClient
from perplexity_mcp.config import TOOL_CONFIGS
```
#### The tests/ Directory Import Fix
**Original Problem in tests/tests.py:**
```python
# Add current directory to Python path for imports
sys.path.insert(0, os.path.dirname(os.path.abspath(__file__)))
from client import PerplexityClient # ❌ Looks in tests/ directory
```
**Fixed Version:**
```python
# Add parent directory to Python path for imports
sys.path.insert(0, os.path.dirname(os.path.dirname(os.path.abspath(__file__))))
from client import PerplexityClient # ✅ Looks in parent directory
```
**What this does:**
1. **`__file__`**: `/Users/rohitseelam/Projects/Perplexity_MCP/tests/tests.py`
2. **First `os.path.dirname()`**: `/Users/rohitseelam/Projects/Perplexity_MCP/tests`
3. **Second `os.path.dirname()`**: `/Users/rohitseelam/Projects/Perplexity_MCP`
4. **`sys.path.insert(0, ...)`**: Adds parent directory to module search path (a `pathlib` equivalent is sketched below)
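An equivalent fix using `pathlib` reads a little more clearly; this is an alternative sketch, not what `tests/tests.py` currently does:
```python
import sys
from pathlib import Path

# parents[1] is the project root: tests/tests.py -> tests/ -> Perplexity_MCP/
sys.path.insert(0, str(Path(__file__).resolve().parents[1]))

from client import PerplexityClient  # now resolved from the project root
```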
#### Complex Import Scenarios
**Circular Import Problem (potential issue):**
```python
# client.py
from config import TOOL_CONFIGS
# config.py
from client import PerplexityClient # ❌ Circular import
# This would cause ImportError
```
**Our Solution - Unidirectional Dependencies:**
```
config.py (no imports from other modules)
↑
client.py (imports from config)
↑
server.py (imports from client and config)
↑
tests.py (imports from all modules)
```
#### Package Structure Best Practices
**Flat Structure (Our Approach):**
```
Perplexity_MCP/
├── __init__.py # Package marker + exports
├── server.py # Main MCP server
├── client.py # API client
├── config.py # Configuration
└── tests/
└── tests.py # Test scripts
```
**Advantages:**
- Simple import paths
- Easy to understand
- Good for small to medium projects
**Nested Package Structure (Alternative):**
```
Perplexity_MCP/
├── __init__.py
├── perplexity_mcp/
│ ├── __init__.py
│ ├── server.py
│ ├── client.py
│ └── config.py
└── tests/
└── test_client.py
```
**When to use nested:**
- Large projects with many modules
- Need namespace separation
- Planning for distribution as a package
#### Import Debugging Techniques
**1. Check sys.path:**
```python
import sys
print("Python module search paths:")
for path in sys.path:
print(f" {path}")
```
**2. Check current working directory:**
```python
import os
print(f"Current working directory: {os.getcwd()}")
print(f"Script location: {os.path.abspath(__file__)}")
```
**3. Test import manually:**
```python
try:
from client import PerplexityClient
print("✅ Import successful")
except ImportError as e:
print(f"❌ Import failed: {e}")
```
#### Claude Desktop MCP Context
**When Claude Desktop starts our MCP:**
1. **Working Directory**: Set to our project directory via `--directory` argument
2. **Python Path**: Includes the working directory automatically
3. **Import Resolution**: Works because `client.py` is in working directory
**UV's Role:**
```bash
# Claude Desktop runs:
/Users/rohitseelam/.local/bin/uv \
--directory /Users/rohitseelam/Projects/Perplexity_MCP \
run python server.py
```
This ensures:
1. **Correct Working Directory**: UV changes to project directory
2. **Virtual Environment**: UV activates the project's virtual environment
3. **Dependencies**: All packages from pyproject.toml are available
#### Summary: Why __init__.py Matters
1. **Package Recognition**: Makes directory a Python package
2. **Import Control**: Controls what gets imported from the package
3. **Namespace Management**: Organizes related modules together
4. **Relative Imports**: Enables relative imports within the package
5. **Distribution**: Required for packaging and distribution
**Our __init__.py serves as both a package marker and an export controller, ensuring clean imports while maintaining the simple flat structure appropriate for our MCP server.**
---
## 7. UV vs UVX for MCP Server Execution - Low Level Deep Dive
### Understanding UV and UVX
#### What is UV?
UV is a Python package manager that can also execute Python projects and scripts. It manages virtual environments, dependencies, and project execution.
#### What is UVX?
UVX is UV's tool execution command, designed for running Python tools without installing them permanently or affecting your project's dependencies. It's similar to `npx` in the Node.js ecosystem.
### Current Implementation: UV for MCP Server
**Our Claude Desktop Configuration:**
```json
{
"mcpServers": {
"perplexity-mcp": {
"command": "/Users/rohitseelam/.local/bin/uv",
"args": [
"--directory",
"/Users/rohitseelam/Projects/Perplexity_MCP",
"run",
"python",
"server.py"
]
}
}
}
```
#### What Happens When Claude Desktop Starts Our MCP
**Process Creation Flow:**
1. **Claude Desktop Process** (main application)
2. **Spawns Child Process**: `uv --directory /path/to/project run python server.py`
3. **UV Process**: Manages environment and spawns Python
4. **Python Process**: Our actual MCP server running
5. **Communication**: Claude Desktop ↔ Python process via stdin/stdout
**Detailed Execution Steps:**
1. **Environment Setup by UV:**
```bash
# Conceptually, UV does the equivalent of:
cd /Users/rohitseelam/Projects/Perplexity_MCP
source .venv/bin/activate # Activate virtual environment
export PYTHONPATH=/path/to/project:.venv/lib/python3.11/site-packages
python server.py
```
2. **Process Tree:**
```
Claude Desktop (PID 1234)
└── uv (PID 1235)
└── python server.py (PID 1236) # Our MCP server
```
3. **File Descriptors:**
```
Claude Desktop stdout → UV stdin → Python stdin
Python stdout → UV stdout → Claude Desktop stdin
Python stderr → UV stderr → Claude Desktop stderr (logged)
```
### Alternative: UVX Implementation
**Hypothetical UVX Configuration:**
```json
{
"mcpServers": {
"perplexity-mcp": {
"command": "/Users/rohitseelam/.local/bin/uvx",
"args": [
"perplexity-mcp",
"--server-mode"
]
}
}
}
```
#### How UVX Would Work
**UVX Execution Model:**
1. **Tool Installation**: UVX would install our package in an isolated environment
2. **Temporary Environment**: Creates isolated environment for the tool
3. **Execution**: Runs the tool command
4. **Cleanup**: Environment persists for reuse but isolated from system
**Process Flow with UVX:**
```bash
# UVX internal process:
uvx perplexity-mcp --server-mode
# This would:
1. Check if perplexity-mcp is installed in UVX cache
2. If not, install from PyPI or specified source
3. Create/reuse isolated environment
4. Run the tool in that environment
```
### Comparison: UV vs UVX for MCP
#### 1. Development vs Production Usage
**UV (Current - Development Focused):**
- **Project Context**: Runs in our development project directory
- **Dependencies**: Uses our local pyproject.toml and uv.lock
- **Changes**: Immediate effect when we modify code
- **Environment**: Uses our local .venv
**UVX (Tool Focused):**
- **Tool Context**: Runs as an installed tool/application
- **Dependencies**: Uses published package dependencies
- **Changes**: Need to republish package for updates
- **Environment**: Isolated tool environment
#### 2. Process Startup Performance
**UV Process Startup:**
```bash
# Steps UV takes:
1. Parse pyproject.toml # ~5ms
2. Activate virtual environment # ~10ms
3. Set PYTHONPATH # ~1ms
4. Launch Python interpreter # ~50ms
5. Import our modules # ~20ms
6. Initialize MCP server # ~30ms
# Total: ~116ms
```
**UVX Process Startup:**
```bash
# Steps UVX would take:
1. Check tool cache # ~10ms
2. Resolve tool environment # ~15ms
3. Activate tool environment # ~10ms
4. Launch Python interpreter # ~50ms
5. Import installed package # ~25ms
6. Initialize MCP server # ~30ms
# Total: ~140ms
```
**Winner: UV (faster startup for development)**
#### 3. Memory Usage
**UV Memory Usage:**
```
UV Process: ~8MB
Python Process: ~25MB (our code + dependencies)
Total: ~33MB
```
**UVX Memory Usage:**
```
UVX Process: ~12MB (larger tool management overhead)
Python Process: ~25MB (our code + dependencies)
Total: ~37MB
```
**Winner: UV (lower memory overhead)**
#### 4. Dependency Management
**UV (Current):**
```toml
# pyproject.toml - Full control
[project]
dependencies = [
"mcp>=1.11.0",
"python-dotenv>=1.1.1",
"httpx>=0.28.1",
]
# Direct control over versions, no surprises
```
**UVX (Would require):**
```toml
# pyproject.toml - Published package
[project]
dependencies = [
"mcp>=1.11.0",
"python-dotenv>=1.1.1",
"httpx>=0.28.1",
]
[project.scripts]
perplexity-mcp = "perplexity_mcp.server:main"
# Users get whatever we published
```
**Winner: UV (development flexibility)**
#### 5. Error Handling and Debugging
**UV Error Flow:**
```
Python Exception → UV stdout/stderr → Claude Desktop logs
```
**Benefits:**
- Direct error messages
- Full stack traces
- Real-time debugging
- Can add print statements for debugging
**UVX Error Flow:**
```
Python Exception → UVX tool wrapper → Claude Desktop logs
```
**Potential Issues:**
- Tool wrapper might filter/modify errors
- Less direct debugging access
- Need to republish for debugging changes
#### 6. Configuration and Customization
**UV Configuration:**
```bash
# Can modify code directly
vim client.py # Edit timeout, add logging, etc.
# Changes take effect immediately
# Just restart Claude Desktop
```
**UVX Configuration:**
```bash
# Would need configuration file or environment variables
export PERPLEXITY_TIMEOUT=600
uvx perplexity-mcp --config=/path/to/config.json
# Or modify package and republish
```
### When UVX Would Be Better
#### 1. Distribution to Others
```bash
# Easy installation for end users
uv tool install perplexity-mcp   # or run without installing: uvx perplexity-mcp
# No need to clone repository or manage dependencies
```
#### 2. Version Management
```bash
# Run specific versions
uvx perplexity-mcp@1.0.0
uvx perplexity-mcp@latest
# Upgrade the installed tool
uv tool upgrade perplexity-mcp
```
#### 3. Isolation
```bash
# Complete isolation from system Python
# No conflicts with other projects
# Consistent environment across machines
```
#### 4. Production Deployment
```bash
# Standardized deployment
# No source code access needed
# Professional tool distribution
```
### Claude Desktop Child Process Management
#### Process Lifecycle
**Startup Sequence:**
1. **Claude Desktop reads config** → Validates MCP server configuration
2. **Spawns UV process** → `uv --directory /path run python server.py`
3. **UV initializes environment** → Activates venv, sets PYTHONPATH
4. **Python starts** → Our server.py executes
5. **MCP handshake** → FastMCP sends initialization response
6. **Ready state** → Tools available to Claude Desktop
**Communication Protocol:**
```
Claude Desktop → MCP Server:
{"jsonrpc":"2.0","method":"initialize","id":1,"params":{...}}
MCP Server → Claude Desktop:
{"jsonrpc":"2.0","id":1,"result":{"protocolVersion":"2024-11-05",...}}
```
**Process Monitoring:**
- Claude Desktop monitors the UV process health
- If UV process dies, MCP server is marked as failed
- Automatic restart on process failure
- Graceful shutdown on Claude Desktop exit
#### Resource Management
**File Descriptors:**
```
Claude Desktop:
├── stdin (receives MCP responses)
├── stdout (sends MCP requests)
└── stderr (logs MCP server errors)
UV Process:
├── stdin (forwards to Python)
├── stdout (forwards from Python)
└── stderr (captures Python errors + UV logs)
Python Process:
├── stdin (receives JSON-RPC from Claude)
├── stdout (sends JSON-RPC to Claude)
└── stderr (our logging.basicConfig output)
```
**Memory Management:**
- Each MCP server is a separate process
- No shared memory between Claude Desktop and MCP
- Process isolation prevents crashes from affecting Claude Desktop
- Automatic cleanup when processes terminate
### Recommendation: Stick with UV
**For our current use case, UV is superior because:**
1. **Development Workflow**: Immediate code changes without republishing
2. **Performance**: Faster startup and lower memory usage
3. **Debugging**: Direct access to code and error messages
4. **Flexibility**: Easy to modify configuration and behavior
5. **Simplicity**: No packaging/publishing overhead
**UVX would be better if:**
- We were distributing to many users
- We needed strong isolation guarantees
- We wanted professional tool deployment
- We had multiple versions to maintain
**Current Status: UV is the right choice for local development and personal use of our MCP server.**
---
## 8. UV Best Practices 2024-2025 - Basic Level with Core Concepts
### What is UV?
UV is a modern Python package manager and project manager written in Rust, designed to be extremely fast and replace traditional tools like pip, virtualenv, and poetry. Released in 2024, it has quickly become the recommended tool for Python development.
### Core UV Concepts
#### 1. Project Management
UV treats everything as a "project" with a `pyproject.toml` file that defines dependencies, configuration, and metadata.
```toml
# pyproject.toml
[project]
name = "my-project"
version = "0.1.0"
dependencies = [
"requests>=2.28.0",
"click>=8.0.0",
]
```
#### 2. Lock Files for Reproducibility
UV generates `uv.lock` files that pin exact versions of all dependencies, ensuring identical environments across machines and over time.
```
# uv.lock (auto-generated)
# Ensures everyone gets exactly the same dependency versions
```
#### 3. Virtual Environment Management
UV automatically creates and manages virtual environments in `.venv/` directories, eliminating the need to manually create or activate environments.
### Essential UV Commands
#### Project Initialization
```bash
# Create new project
uv init my-project
cd my-project
# Initialize existing directory
uv init # In existing directory
```
#### Dependency Management
```bash
# Add dependencies
uv add requests
uv add pytest --dev # Development dependency
uv add "django>=4.0,<5.0" # Version constraints
# Remove dependencies
uv remove requests
# Update dependencies
uv sync # Install/update all dependencies from lock file
uv lock --upgrade # Update lock file with latest versions
```
#### Running Code
```bash
# Run Python in project environment
uv run python script.py
uv run python -m module
# Run commands in environment
uv run pytest
uv run black .
uv run mypy src/
```
#### Tool Management with UVX
```bash
# Install global tools
uv tool install black
uv tool install ruff
# Run tools temporarily
uvx black . # Run without installing
uvx ruff check src/
```
### Best Practices for 2024-2025
#### 1. Always Use pyproject.toml
**Instead of requirements.txt:**
```toml
# pyproject.toml - Modern approach
[project]
dependencies = [
"fastapi>=0.104.0",
"uvicorn>=0.24.0",
]
[dependency-groups]
dev = [
"pytest>=7.0.0",
"black>=23.0.0",
"ruff>=0.1.0",
]
```
**Benefits:**
- Single source of truth for project metadata
- Better dependency resolution
- Standardized across Python ecosystem
- Future-proof project configuration
#### 2. Use uv run Instead of Virtual Environment Activation
**Modern approach:**
```bash
# Don't do this anymore:
source .venv/bin/activate
python script.py
deactivate
# Do this instead:
uv run python script.py
uv run pytest
uv run black .
```
**Advantages:**
- No need to remember to activate/deactivate
- Works from any directory in the project
- Prevents "wrong environment" mistakes
- Cleaner shell sessions
#### 3. Dependency Groups for Organization
```toml
[dependency-groups]
dev = ["pytest", "black", "ruff", "mypy"]
docs = ["sphinx", "sphinx-rtd-theme"]
testing = ["pytest", "coverage", "pytest-cov"]
production = ["gunicorn", "prometheus-client"]
```
```bash
# Install specific groups
uv sync --group dev
uv sync --group testing
uv sync --all-groups # Install everything
```
#### 4. Lock File Management
```bash
# Always commit uv.lock to version control
git add uv.lock
git commit -m "Lock dependencies"
# Regular maintenance
uv lock --upgrade # Update lock file
uv sync # Apply updates
```
#### 5. Use UVX for One-off Tools
```bash
# Instead of installing globally:
uvx black .
uvx ruff check src/
uvx mypy src/
uvx pytest
# Or install frequently used tools:
uv tool install black
uv tool install ruff
```
### Performance Best Practices
#### 1. Leverage UV's Speed
```bash
# UV is 10-100x faster than pip
# Take advantage by running operations frequently
uv add package # Nearly instant
uv sync # Much faster than pip install -r requirements.txt
```
#### 2. Use --no-sync for Quick Additions
```bash
# Add dependency without immediately syncing
uv add requests --no-sync
uv add pandas --no-sync
uv add matplotlib --no-sync
uv sync # Sync all at once
```
#### 3. Offline Mode
```bash
# Work offline using cache
uv sync --offline
uv add package --offline # Uses cached packages
```
### Project Structure Best Practices
#### 1. Standard Layout
```
my-project/
├── pyproject.toml # Project configuration
├── uv.lock # Exact dependency versions
├── .venv/ # Virtual environment (auto-created)
├── .gitignore           # Ignores .venv/ and caches; uv.lock stays committed
├── src/
│ └── my_project/
│ ├── __init__.py
│ └── main.py
└── tests/
└── test_main.py
```
#### 2. .gitignore Configuration
```gitignore
# Virtual environment
.venv/
# Python cache
__pycache__/
*.py[cod]
*$py.class
# Project-local UV cache (only present if configured)
.uv-cache/

# Do not ignore uv.lock - it stays committed for reproducibility
```
### Advanced UV Features
#### 1. Python Version Management
```bash
# Install Python versions
uv python install 3.11
uv python install 3.12
# Use specific Python version
uv init --python 3.11
uv run --python 3.12 python script.py
```
#### 2. Scripts in pyproject.toml
```toml
[project.scripts]
my-app = "my_project.main:main"
my-cli = "my_project.cli:cli"
```
```bash
# Run the console scripts defined above
uv run my-app
uv run my-cli
```
#### 3. Build and Publish
```bash
# Build package
uv build
# Publish to PyPI
uv publish
```
### Common Migration Patterns
#### From pip + virtualenv
```bash
# Old way:
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
# New way:
uv init
uv add -r requirements.txt   # import the existing pinned requirements
uv run python script.py
```
#### From Poetry
```bash
# Old poetry commands → UV equivalents
poetry init → uv init
poetry add package → uv add package
poetry install → uv sync
poetry run command → uv run command
poetry shell → (not needed with uv run)
```
#### From Conda
```bash
# Conda environment.yml → pyproject.toml
# Manual conversion needed, but UV handles dependencies better
uv init
uv add numpy pandas matplotlib # Instead of conda install
```
### Integration with Development Tools
#### 1. CI/CD Integration
```yaml
# GitHub Actions example
- name: Setup Python and UV
uses: actions/setup-python@v4
with:
python-version: '3.11'
- name: Install UV
run: curl -LsSf https://astral.sh/uv/install.sh | sh
- name: Install dependencies
run: uv sync
- name: Run tests
run: uv run pytest
```
#### 2. IDE Integration
Most modern IDEs automatically detect `.venv/` directories created by UV and configure themselves appropriately.
#### 3. Docker Integration
```dockerfile
# Dockerfile with UV
FROM python:3.11-slim
COPY --from=ghcr.io/astral-sh/uv:latest /uv /usr/local/bin/uv
COPY pyproject.toml uv.lock ./
RUN uv sync --frozen --no-cache
COPY . .
CMD ["uv", "run", "python", "main.py"]
```
### Performance and Monitoring
#### 1. Cache Management
```bash
# Check cache size
uv cache info
# Clean cache
uv cache clean
```
#### 2. Dependency Insights
```bash
# Show dependency tree
uv tree
# Show outdated packages
uv lock --upgrade --dry-run
```
### Security Best Practices
#### 1. Dependency Auditing
```bash
# Check for security vulnerabilities
uv audit # If/when implemented
# Meanwhile, use with uvx:
uvx safety check
uvx bandit -r src/
```
#### 2. Reproducible Builds
- Always commit `uv.lock`
- Use `uv sync --frozen` in production
- Regularly update dependencies: `uv lock --upgrade`
### Summary: UV Excellence in 2024-2025
1. **Replace pip/virtualenv/poetry** with UV for all Python projects
2. **Use `uv run`** instead of virtual environment activation
3. **Organize dependencies** with dependency groups
4. **Leverage UVX** for tools and one-off executions
5. **Commit lock files** for reproducibility
6. **Structure projects** with standard layouts
7. **Integrate with CI/CD** for consistent builds
8. **Monitor and update** dependencies regularly
UV represents the modern standard for Python development, offering unprecedented speed and a superior developer experience while maintaining compatibility with existing Python tooling and workflows.