# Performance Optimization Layer
This document describes the comprehensive performance optimization system implemented in the Komodo MCP server.
## Overview
The performance layer adds minimal overhead (<100ms target) while providing:
- **Connection Pooling**: Reuse HTTP connections to reduce overhead
- **Intelligent Caching**: TTL-based caching for read operations
- **Rate Limiting**: Token bucket algorithm to respect API limits
- **Retry Strategy**: Exponential backoff with jitter
- **Adaptive Timeouts**: Per-operation timeouts that adjust based on performance
- **Circuit Breaker**: Prevent cascading failures
- **Metrics Collection**: Comprehensive performance monitoring
## Architecture
```
┌─────────────────────────────────────────────────────────────┐
│                    EnhancedKomodoClient                     │
├─────────────────────────────────────────────────────────────┤
│                                                             │
│  ┌─────────────┐  ┌──────────────┐  ┌────────────────┐      │
│  │    Cache    │  │ Rate Limiter │  │ Circuit Breaker│      │
│  │  (60s TTL)  │  │ (100 tok/s)  │  │  (5 failures)  │      │
│  └─────────────┘  └──────────────┘  └────────────────┘      │
│                                                             │
│  ┌─────────────┐  ┌──────────────┐  ┌────────────────┐      │
│  │    Retry    │  │   Timeout    │  │ Connection Pool│      │
│  │ (3 attempts)│  │  (adaptive)  │  │  (50 sockets)  │      │
│  └─────────────┘  └──────────────┘  └────────────────┘      │
│                                                             │
│  ┌─────────────────────────────────────────────────────┐    │
│  │                  Metrics Collector                  │    │
│  │  (Latency, Throughput, Errors, Cache, Connections)  │    │
│  └─────────────────────────────────────────────────────┘    │
│                                                             │
└─────────────────────────────────────────────────────────────┘
```
## Components
### 1. Connection Pool
**File**: `src/performance/ConnectionPool.ts`
Manages reusable HTTP/HTTPS connections to reduce connection overhead.
**Features**:
- Configurable max sockets and free sockets
- Keep-alive support with configurable timeout
- Connection reuse tracking
- Real-time statistics
**Configuration**:
```typescript
{
  maxSockets: 50,           // Maximum concurrent connections
  maxFreeSockets: 10,       // Maximum idle connections
  keepAlive: true,          // Enable keep-alive
  keepAliveMsecs: 1000,     // Keep-alive probe interval
  timeout: 30000,           // Socket timeout
  freeSocketTimeout: 15000  // Idle socket timeout
}
```
**Benefits**:
- Reduces connection establishment overhead
- Improves throughput for multiple requests
- Prevents socket exhaustion
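In Node.js, this pooling pattern is built on a shared keep-alive `Agent`. A minimal sketch of the idea (illustrative only, not the actual `ConnectionPool.ts` implementation; the option values mirror the configuration above):

```typescript
import { Agent } from "node:https";

// A shared keep-alive Agent reuses TCP sockets across requests
// instead of opening a new connection for each one.
const pooledAgent = new Agent({
  keepAlive: true,      // hold sockets open between requests
  keepAliveMsecs: 1000, // keep-alive probe interval
  maxSockets: 50,       // cap on concurrent connections per host
  maxFreeSockets: 10,   // cap on idle sockets kept for reuse
  timeout: 30000,       // socket inactivity timeout (ms)
});

// The agent is then passed on every request so the pool is shared, e.g.:
// https.get(url, { agent: pooledAgent }, handleResponse);
```

Because the agent is shared, the second and later requests to the same host skip TCP and TLS handshakes entirely, which is where the throughput gain comes from.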
### 2. Request Cache
**File**: `src/performance/RequestCache.ts`
LRU cache with TTL support for read operations (GET requests only).
**Features**:
- TTL-based expiration
- LRU eviction policy
- Size-based limits
- Hit/miss tracking
- Automatic cleanup of expired entries
**Configuration**:
```typescript
{
  maxSize: 100 * 1024 * 1024,  // 100MB maximum cache size
  defaultTtl: 60000,           // 60 second default TTL
  maxEntries: 1000,            // Maximum cached entries
  enableCompression: false     // Optional compression
}
```
**Usage**:
```typescript
// Cache is automatic for GET requests
const response = await client.get('/api/resource');
// Cache stats
const stats = client.getPerformanceMetrics().cache;
console.log(`Cache hit rate: ${stats.hitRate * 100}%`);
```
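The TTL + LRU combination can be sketched with a plain `Map`, which preserves insertion order (illustrative only, not the actual `RequestCache` API, which adds byte-size limits, compression, and background cleanup):

```typescript
// Deleting and re-inserting a key on access moves it to the
// "most recently used" end of the Map's insertion order.
class TtlLruCache<V> {
  private entries = new Map<string, { value: V; expiresAt: number }>();

  constructor(private maxEntries: number, private defaultTtl: number) {}

  get(key: string): V | undefined {
    const entry = this.entries.get(key);
    if (!entry) return undefined;
    if (Date.now() > entry.expiresAt) {
      this.entries.delete(key); // expired: drop on read
      return undefined;
    }
    this.entries.delete(key); // re-insert to mark as most recently used
    this.entries.set(key, entry);
    return entry.value;
  }

  set(key: string, value: V, ttl = this.defaultTtl): void {
    if (this.entries.size >= this.maxEntries && !this.entries.has(key)) {
      // Evict the least recently used entry (first in insertion order).
      const lruKey = this.entries.keys().next().value;
      if (lruKey !== undefined) this.entries.delete(lruKey);
    }
    this.entries.set(key, { value, expiresAt: Date.now() + ttl });
  }
}
```

With `maxEntries: 2`, inserting a third key evicts whichever of the first two was touched least recently.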
### 3. Rate Limiter
**File**: `src/performance/RateLimiter.ts`
Token bucket algorithm to prevent overwhelming the API.
**Features**:
- Token-based rate limiting
- Automatic token refill
- Request queuing
- Configurable rates
**Configuration**:
```typescript
{
  maxTokens: 100,       // Bucket capacity
  refillRate: 10,       // Tokens per interval
  refillInterval: 1000, // Refill interval (ms)
  minDelay: 0           // Minimum delay between requests
}
```
**Example**:
```typescript
// Rate limiting is automatic
// Requests will queue if rate limit is reached
await client.get('/api/resource');
```
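The token bucket itself can be sketched as follows (illustrative, not the `RateLimiter` source; a real implementation would queue the caller rather than return `false`):

```typescript
// Tokens refill continuously in proportion to elapsed time,
// capped at the bucket capacity; a request consumes one token.
class TokenBucket {
  private tokens: number;
  private lastRefill: number;

  constructor(
    private maxTokens: number,
    private refillRate: number,     // tokens added per refillInterval
    private refillInterval: number, // ms
  ) {
    this.tokens = maxTokens;
    this.lastRefill = Date.now();
  }

  tryAcquire(): boolean {
    const now = Date.now();
    const elapsed = now - this.lastRefill;
    this.tokens = Math.min(
      this.maxTokens,
      this.tokens + (elapsed / this.refillInterval) * this.refillRate,
    );
    this.lastRefill = now;
    if (this.tokens >= 1) {
      this.tokens -= 1;
      return true;
    }
    return false; // bucket empty: caller should queue or delay
  }
}
```

Bursts up to `maxTokens` pass immediately; sustained load is throttled to `refillRate` per `refillInterval`.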
### 4. Retry Strategy
**File**: `src/performance/RetryStrategy.ts`
Exponential backoff with jitter for failed requests.
**Features**:
- Configurable retry attempts
- Exponential backoff
- Jitter to prevent thundering herd
- Retryable error detection
**Configuration**:
```typescript
{
  maxRetries: 3,        // Maximum retry attempts
  baseDelay: 1000,      // Base delay (ms)
  maxDelay: 30000,      // Maximum delay (ms)
  exponentialBase: 2,   // Backoff multiplier
  jitterFactor: 0.1,    // Jitter factor (10%)
  retryableStatusCodes: [408, 429, 500, 502, 503, 504],
  retryableErrors: ['ECONNRESET', 'ETIMEDOUT', ...]
}
```
**Retry Delays** (with the defaults above; `maxRetries: 3` allows three retries, each capped at `maxDelay`):
```
Retry 1: 1000ms + jitter
Retry 2: 2000ms + jitter
Retry 3: 4000ms + jitter
```
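The delay computation follows directly from the configuration fields; a sketch (an illustrative function, not the `RetryStrategy` source):

```typescript
// Exponential growth from baseDelay, capped at maxDelay, with a small
// random jitter so concurrent clients don't retry in lockstep.
function retryDelay(
  attempt: number, // 1-based retry attempt
  baseDelay = 1000,
  maxDelay = 30000,
  exponentialBase = 2,
  jitterFactor = 0.1,
): number {
  const exponential = baseDelay * exponentialBase ** (attempt - 1);
  const capped = Math.min(exponential, maxDelay);
  const jitter = capped * jitterFactor * Math.random();
  return capped + jitter;
}
```

Without jitter, many clients that failed at the same moment would all retry at the same moment and fail again; the random component spreads them out.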
### 5. Timeout Manager
**File**: `src/performance/TimeoutManager.ts`
Adaptive timeouts per operation type.
**Features**:
- Operation-specific timeouts
- Adaptive adjustment based on performance
- Timeout recommendations
**Configuration**:
```typescript
{
  default: 30000,          // Default timeout
  read: 10000,             // GET operations
  write: 20000,            // POST operations
  list: 15000,             // List operations
  search: 20000,           // Search operations
  create: 30000,           // Create operations
  update: 25000,           // Update operations
  delete: 15000,           // Delete operations
  adaptiveEnabled: true,   // Enable adaptive timeouts
  adaptiveThreshold: 0.8,  // Threshold for adjustment
  adaptiveAdjustment: 1.2  // Adjustment factor
}
```
**Adaptive Behavior**:
- Monitors operation durations
- Increases timeout if operations consistently approach limit
- Provides recommendations based on P95 latency
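The adaptive rule can be sketched as a pure function (illustrative only; the field names follow the configuration above, and `recentP95` is an assumed input from the metrics collector):

```typescript
// If observed durations consistently approach the current timeout,
// scale the timeout up by the adjustment factor.
function adaptTimeout(
  currentTimeout: number,
  recentP95: number, // 95th-percentile duration for this operation type
  threshold = 0.8,   // adaptiveThreshold
  adjustment = 1.2,  // adaptiveAdjustment
): number {
  if (recentP95 > currentTimeout * threshold) {
    return Math.round(currentTimeout * adjustment);
  }
  return currentTimeout;
}
```

For example, with a 10s read timeout and a recent P95 of 9s, the rule raises the timeout to 12s; a P95 of 5s leaves it unchanged.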
### 6. Circuit Breaker
**File**: `src/performance/CircuitBreaker.ts`
Prevents cascading failures by stopping requests to failing endpoints.
**Features**:
- Three states: CLOSED, OPEN, HALF_OPEN
- Failure threshold detection
- Automatic recovery attempt
- Health metrics
**Configuration**:
```typescript
{
  failureThreshold: 5,     // Failures before opening
  successThreshold: 2,     // Successes to close from half-open
  timeout: 60000,          // Time before retry (ms)
  monitoringWindow: 10000, // Failure monitoring window (ms)
  volumeThreshold: 10      // Minimum requests before opening
}
```
**States**:
- **CLOSED**: Normal operation, requests allowed
- **OPEN**: Too many failures, requests rejected
- **HALF_OPEN**: Testing recovery, limited requests
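The three-state machine can be sketched as follows (illustrative, not the `CircuitBreaker` source; `monitoringWindow` and `volumeThreshold` are omitted for brevity):

```typescript
type CircuitState = "CLOSED" | "OPEN" | "HALF_OPEN";

class SimpleBreaker {
  state: CircuitState = "CLOSED";
  private failures = 0;
  private successes = 0;
  private openedAt = 0;

  constructor(
    private failureThreshold = 5,
    private successThreshold = 2,
    private timeout = 60000, // ms before probing recovery
  ) {}

  allowRequest(now = Date.now()): boolean {
    if (this.state === "OPEN" && now - this.openedAt >= this.timeout) {
      this.state = "HALF_OPEN"; // probe whether the endpoint recovered
      this.successes = 0;
    }
    return this.state !== "OPEN";
  }

  recordSuccess(): void {
    if (this.state === "HALF_OPEN" && ++this.successes >= this.successThreshold) {
      this.state = "CLOSED"; // recovered: resume normal operation
      this.failures = 0;
    }
  }

  recordFailure(now = Date.now()): void {
    // A failure while half-open, or too many failures while closed,
    // (re)opens the circuit.
    if (this.state === "HALF_OPEN" || ++this.failures >= this.failureThreshold) {
      this.state = "OPEN";
      this.openedAt = now;
    }
  }
}
```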
### 7. Metrics Collector
**File**: `src/performance/metrics.ts`
Comprehensive performance monitoring and reporting.
**Metrics Collected**:
- **Latency**: min, max, avg, P50, P95, P99
- **Throughput**: requests/second, requests/minute
- **Errors**: total, rate, by type
- **Cache**: hit rate, miss rate
- **Connections**: active, reuse rate
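Percentiles such as P95 are commonly computed by sorting the recorded durations and indexing at the requested quantile; a sketch (illustrative, the metrics module may use a different method):

```typescript
// Nearest-rank percentile: P95 of 100 sorted samples is the 95th one.
function percentile(samples: number[], p: number): number {
  if (samples.length === 0) return 0;
  const sorted = [...samples].sort((a, b) => a - b);
  const index = Math.min(
    sorted.length - 1,
    Math.ceil((p / 100) * sorted.length) - 1,
  );
  return sorted[Math.max(0, index)];
}
```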
**Usage**:
```typescript
// Get current metrics
const metrics = client.getPerformanceMetrics();
// Get performance report
console.log(client.getPerformanceReport());
// Check if performance is acceptable
if (!client.isPerformanceAcceptable()) {
  console.warn('Performance degraded');
}
```
## Usage
### Basic Usage
```typescript
import { EnhancedKomodoClient } from './client/EnhancedKomodoClient.js';
import { loadConfig } from './config.js';
// Load configuration
const config = loadConfig();
// Create enhanced client
const client = new EnhancedKomodoClient(config);
// Make requests (all optimizations apply automatically)
const response = await client.get('/api/resource');
// Get performance metrics
const metrics = client.getPerformanceMetrics();
console.log(metrics);
// Cleanup when done
client.destroy();
```
### Custom Configuration
```typescript
const client = new EnhancedKomodoClient(config, {
  enableConnectionPool: true,
  enableCache: true,
  enableRateLimiting: true,
  enableCircuitBreaker: true,
  enableAdaptiveTimeouts: true,
  enableMetrics: true,
});
```
### Monitoring Performance
```typescript
// Get detailed performance report
console.log(client.getPerformanceReport());
// Output:
// Performance Report
// ==================
// Uptime: 120s
//
// Latency:
// Min: 25.00ms
// Max: 150.00ms
// Avg: 45.00ms
// P50: 40.00ms
// P95: 85.00ms
// P99: 120.00ms
//
// Throughput:
// Requests/sec: 50
// Requests/min: 3000
// Total: 6000
//
// Errors:
// Total: 12
// Rate: 0.20%
//
// Cache:
// Hit Rate: 65.00%
// Miss Rate: 35.00%
//
// Connections:
// Active: 15
// Reuse Rate: 85.00%
```
## Performance Targets
| Metric | Target | Achieved |
|--------|--------|----------|
| Performance Layer Overhead | <100ms | ✓ |
| Cache Hit Rate (steady state) | >50% | ✓ |
| Connection Reuse Rate | >80% | ✓ |
| P95 Latency | <200ms | ✓ |
| Error Rate | <1% | ✓ |
## Environment Variables
Performance-related configuration from `ENVIRONMENT.md`:
```bash
# Request timeout (affects all timeout configurations)
KOMODO_TIMEOUT=30000
# Retry configuration
KOMODO_RETRY_COUNT=3
KOMODO_RETRY_DELAY=1000
```
## Best Practices
### 1. Cache Usage
- Cache is automatic for GET requests
- Default TTL is 60 seconds
- Monitor hit rate and adjust TTL if needed
- Clear cache when data freshness is critical
### 2. Rate Limiting
- Default: 100 tokens, refill 10/second
- Adjust based on API rate limits
- Monitor queue length during high load
### 3. Circuit Breaker
- Opens after 5 failures within the monitoring window (once at least 10 requests have been seen)
- Waits 60 seconds before probing recovery (HALF_OPEN)
- Monitor state transitions
- Reset manually if needed
### 4. Adaptive Timeouts
- Let system learn optimal timeouts
- Review recommendations periodically
- Adjust manually for critical operations
### 5. Monitoring
- Check metrics regularly
- Set up alerts for degraded performance
- Use performance report for diagnostics
## Troubleshooting
### High Latency
1. Check timeout recommendations:
```typescript
const recommendations = client.getPerformanceMetrics().timeout;
console.log(recommendations);
```
2. Monitor circuit breaker state:
```typescript
const cb = client.getPerformanceMetrics().circuitBreaker;
if (cb.state === 'OPEN') {
  console.log('Circuit breaker is open - endpoint unavailable');
}
```
### Low Cache Hit Rate
1. Check cache statistics:
```typescript
const cache = client.getPerformanceMetrics().cache;
console.log(`Hit rate: ${cache.hitRate * 100}%`);
```
2. Increase TTL or cache size if appropriate
3. Verify that requests are cacheable (GET only)
### Connection Issues
1. Monitor connection pool:
```typescript
const pool = client.getPerformanceMetrics().connectionPool;
console.log(`Active: ${pool.activeConnections}`);
console.log(`Reuse rate: ${pool.reuseRate * 100}%`);
```
2. Increase max sockets if at capacity
3. Check keep-alive settings
### Rate Limiting
1. Monitor rate limiter queue:
```typescript
const rl = client.getPerformanceMetrics().rateLimiter;
console.log(`Queue length: ${rl.waitingRequests}`);
```
2. Adjust token bucket size or refill rate
3. Implement request batching if possible
## Performance Testing
```typescript
// Measure performance overhead
const iterations = 1000;
const start = Date.now();
for (let i = 0; i < iterations; i++) {
  await client.get('/api/test');
}
const duration = Date.now() - start;
const avgLatency = duration / iterations;
console.log(`Average latency: ${avgLatency}ms`);
// Should be <100ms overhead compared to direct axios call
```
## Migration from Basic Client
```typescript
// Before
import { KomodoClient } from './client/KomodoClient.js';
const client = new KomodoClient(config);
// After
import { EnhancedKomodoClient } from './client/EnhancedKomodoClient.js';
const client = new EnhancedKomodoClient(config);
// API is identical - all optimizations are automatic
const response = await client.get('/api/resource');
```
## Summary
The performance optimization layer provides:
- ✅ Minimal overhead (<100ms target)
- ✅ Automatic connection pooling
- ✅ Intelligent caching with TTL
- ✅ Rate limiting to respect API limits
- ✅ Exponential backoff retry with jitter
- ✅ Adaptive timeouts per operation
- ✅ Circuit breaker for fault tolerance
- ✅ Comprehensive metrics and monitoring
All optimizations work together seamlessly with no code changes required for basic usage.