# Redis-Only Architecture Documentation
## Overview
This document describes the Redis-only session management architecture implemented for the MCP Client system. This design ensures scalability, reliability, and stateless operation across multiple workers.
## Architecture Diagram
```mermaid
graph TB
subgraph "Client Request"
C[Claude Desktop]
B[MCP Bridge]
end
subgraph "Load Balancer"
LB[Nginx/Load Balancer]
end
subgraph "Flask Workers"
W1[Worker 1<br/>Flask Process]
W2[Worker 2<br/>Flask Process]
W3[Worker N<br/>Flask Process]
end
subgraph "Redis Storage"
R[(Redis<br/>Sessions + Cache)]
end
subgraph "MCP Managers"
M1[Temp MCP Manager 1]
M2[Temp MCP Manager 2]
M3[Temp MCP Manager N]
end
subgraph "External API"
API[Sandsiv+ API]
end
C --> B
B --> LB
LB --> W1
LB --> W2
LB --> W3
W1 -.->|"Create on-demand"| M1
W2 -.->|"Create on-demand"| M2
W3 -.->|"Create on-demand"| M3
W1 --> R
W2 --> R
W3 --> R
M1 -.->|"Cleanup after use"| W1
M2 -.->|"Cleanup after use"| W2
M3 -.->|"Cleanup after use"| W3
M1 --> API
M2 --> API
M3 --> API
R -.->|"Auto-expire<br/>24h idle TTL"| R
classDef worker fill:#e1f5fe
classDef redis fill:#ffecb3
classDef temp fill:#f3e5f5,stroke-dasharray: 5 5
classDef api fill:#e8f5e8
class W1,W2,W3 worker
class R redis
class M1,M2,M3 temp
class API api
```
## Component Descriptions
### Client Layer
- **Claude Desktop**: The AI assistant that initiates requests
- **MCP Bridge**: Translates MCP protocol to HTTP requests
### Load Balancer Layer
- **Nginx/Load Balancer**: Distributes requests across multiple Flask workers
  - Enables horizontal scaling and high availability
  - Can be configured with health checks and failover
### Application Layer
- **Flask Workers**: Multiple stateless Flask processes handling HTTP requests
  - Each worker can handle requests for any session (no process affinity required)
  - Workers scale independently based on load
### Storage Layer
- **Redis**: Centralized session storage with automatic TTL management
  - Stores session data, cached parameters, and workflow state
  - Handles automatic cleanup through an idle-based TTL (24 hours by default); see the sketch below
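A hypothetical sketch of what such a session record could look like in Redis. The key prefix, field names, and helper are illustrative assumptions, not the actual implementation; only the one-JSON-blob-per-session-with-TTL shape is the documented behavior:
```python
import json
import redis

r = redis.Redis(host="localhost", port=6379, decode_responses=True)
IDLE_TTL = 24 * 60 * 60  # default 24h idle TTL

def store_session(session_id: str, data: dict) -> None:
    # SETEX writes the value and its TTL in one atomic command
    r.setex(f"mcp:session:{session_id}", IDLE_TTL, json.dumps(data))

store_session("abc123", {
    "credentials": {"token": "..."},  # validated at /init
    "cached_params": {},              # cached tool parameters
    "workflow_state": "idle",         # current workflow state
})
```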
### Processing Layer
- **MCP Managers**: Temporary subprocess managers created on-demand (sketched below)
  - Created fresh for each request requiring MCP access
  - Immediately cleaned up after request completion
  - No persistent state or caching
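For illustration, a minimal sketch of such a temporary manager. The class name matches the snippets later in this document, but the constructor internals, stdio transport, and shutdown details are assumptions:
```python
import subprocess

class MCPServerManager:
    """Sketch: owns one MCP server subprocess for the life of one request."""

    def __init__(self, server_script: str):
        # A fresh subprocess per request; stdio transport is assumed here
        self.process = subprocess.Popen(
            ["python", server_script],
            stdin=subprocess.PIPE,
            stdout=subprocess.PIPE,
        )

    def stop(self) -> None:
        # Immediate cleanup: terminate the subprocess and reap it
        self.process.terminate()
        self.process.wait(timeout=5)
```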
### External Integration
- **Sandsiv+ API**: External data analysis API
  - Accessed through MCP managers for tool execution
## Key Design Principles
### 1. Stateless Workers
```python
# Each request creates a fresh MCP manager
mcp_manager = create_mcp_manager_for_request(session_id)
if mcp_manager is None:
    raise LookupError("session not found or expired")
try:
    # Execute the request
    result = mcp_manager.call_tool(tool_name, params)
finally:
    # Always clean up, even if the call raised
    mcp_manager.stop()
```
**Benefits:**
- Any worker can handle any request
- No worker affinity required
- Simplified load balancing
- Easy horizontal scaling
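The same create-use-cleanup pattern can be wrapped in a context manager so the cleanup step cannot be forgotten. This wrapper is a sketch around the documented `create_mcp_manager_for_request`, not part of the actual API:
```python
from contextlib import contextmanager

@contextmanager
def mcp_manager_for(session_id):
    manager = create_mcp_manager_for_request(session_id)
    if manager is None:
        raise LookupError(f"session {session_id} not found or expired")
    try:
        yield manager
    finally:
        manager.stop()  # runs even if the tool call raises

# Usage:
#   with mcp_manager_for(session_id) as mcp:
#       result = mcp.call_tool(tool_name, params)
```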
### 2. Redis-Only Storage
```python
# All session data in Redis with automatic TTL
session_data = session_manager.get_session_data(session_id) # Resets TTL
session_manager.update_session_data(session_id, updates) # Resets TTL
```
**Benefits:**
- Sessions survive service restarts
- Shared across all workers
- Automatic cleanup via TTL
- No background cleanup threads needed
### 3. On-Demand Resource Creation
```python
# Create an MCP manager only when needed
def create_mcp_manager_for_request(session_id):
    if not session_manager.touch_session(session_id):
        return None
    return MCPServerManager(server_script=MCPConfig.MCP.SERVER_SCRIPT)
```
**Benefits:**
- Optimal resource utilization
- No persistent subprocess management
- Immediate cleanup prevents resource leaks
- Simplified error handling
### 4. Idle-Based TTL
```python
# Every access resets the TTL to its full duration
def get_session_data(self, session_id):
    # ... get data ...
    self.redis.setex(redis_key, self.idle_ttl, updated_data)  # Reset TTL
```
**Benefits:**
- Active sessions never expire
- Inactive sessions automatically cleaned up
- No manual session management required
- Predictable resource usage
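Combining principles 2 and 4, a minimal session manager could look like the sketch below. The key scheme, JSON encoding, and method bodies are assumptions; only the reset-TTL-on-every-access behavior is taken from the snippets above:
```python
import json
import redis

class SessionManager:
    def __init__(self, redis_client: redis.Redis, idle_ttl: int = 24 * 60 * 60):
        self.redis = redis_client
        self.idle_ttl = idle_ttl

    def _key(self, session_id: str) -> str:
        return f"mcp:session:{session_id}"  # assumed key scheme

    def get_session_data(self, session_id: str):
        key = self._key(session_id)
        raw = self.redis.get(key)
        if raw is None:
            return None  # expired or never created
        # Rewriting with SETEX resets the idle TTL on every read
        self.redis.setex(key, self.idle_ttl, raw)
        return json.loads(raw)

    def update_session_data(self, session_id: str, updates: dict) -> bool:
        data = self.get_session_data(session_id)
        if data is None:
            return False
        data.update(updates)
        self.redis.setex(self._key(session_id), self.idle_ttl, json.dumps(data))
        return True

    def touch_session(self, session_id: str) -> bool:
        # EXPIRE resets the TTL and returns True only if the key exists
        return bool(self.redis.expire(self._key(session_id), self.idle_ttl))
```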
## Request Flow
### 1. Session Creation (`/init`)
```
1. Client → Load Balancer → Worker
2. Worker validates credentials with external API
3. Worker stores session data in Redis with TTL
4. Worker returns success
```
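A hedged sketch of the `/init` handler. The endpoint path matches the flow above, while the request fields, the `validate_with_sandsiv_api` helper, and the `redis_client` wiring are illustrative assumptions:
```python
import json
import uuid

from flask import Flask, request, jsonify

app = Flask(__name__)

@app.route("/init", methods=["POST"])
def init_session():
    creds = request.get_json()
    # Step 2: validate credentials with the external API (hypothetical helper)
    if not validate_with_sandsiv_api(creds):
        return jsonify({"error": "invalid credentials"}), 401
    # Step 3: store session data in Redis with the idle TTL
    session_id = uuid.uuid4().hex
    redis_client.setex(
        f"mcp:session:{session_id}",
        24 * 60 * 60,
        json.dumps({"credentials": creds, "cached_params": {}}),
    )
    # Step 4: return success
    return jsonify({"session_id": session_id})
```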
### 2. Tool Execution (`/call-tool`)
```
1. Client → Load Balancer → Any Available Worker
2. Worker retrieves session data from Redis (resets TTL)
3. Worker creates temporary MCP manager
4. Worker executes tool through MCP manager
5. Worker caches results in Redis session
6. Worker cleans up MCP manager
7. Worker returns filtered results to client
```
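And the matching `/call-tool` handler, continuing the Flask sketch above and following the seven steps just listed; the result caching and the `filter_results` helper are assumptions:
```python
@app.route("/call-tool", methods=["POST"])
def call_tool():
    body = request.get_json()
    session_id = body["session_id"]
    # Steps 2-3: touch the session (resets TTL) and create a fresh manager
    mcp_manager = create_mcp_manager_for_request(session_id)
    if mcp_manager is None:
        return jsonify({"error": "session not found"}), 404
    try:
        # Step 4: execute the tool through the temporary manager
        result = mcp_manager.call_tool(body["tool_name"], body.get("params", {}))
        # Step 5: cache the result in the Redis session (hypothetical helper)
        session_manager.update_session_data(session_id, {"last_result": result})
    finally:
        # Step 6: always clean up, even if the call failed
        mcp_manager.stop()
    # Step 7: return filtered results (filtering helper is an assumption)
    return jsonify(filter_results(result))
```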
### 3. Session Expiration
```
1. Redis automatically expires sessions after 24h of inactivity
2. No background cleanup processes needed
3. Next access to expired session returns "not found"
4. Client can re-initialize with /init if needed
```
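From the client's point of view, expiration is visible only as a "not found" response. A sketch of the resulting retry logic; the base URL and response shapes are illustrative:
```python
import requests

BASE = "http://mcp-client.example.com"  # hypothetical deployment URL

def call_tool_with_reinit(session_id, creds, tool_name, params):
    payload = {"session_id": session_id, "tool_name": tool_name, "params": params}
    resp = requests.post(f"{BASE}/call-tool", json=payload)
    if resp.status_code == 404:
        # Session expired after 24h idle: re-initialize and retry once
        session_id = requests.post(f"{BASE}/init", json=creds).json()["session_id"]
        payload["session_id"] = session_id
        resp = requests.post(f"{BASE}/call-tool", json=payload)
    return resp.json()
```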
## Scaling Characteristics
### Horizontal Scaling
- **Add Workers**: Simply start more Flask processes
- **Load Balancing**: Standard HTTP load balancing works
- **No Coordination**: Workers don't need to communicate
- **Shared State**: All workers access same Redis sessions
### Vertical Scaling
- **Redis Capacity**: A single Redis instance can hold thousands of concurrent sessions in memory
- **Worker Memory**: Each worker only holds temporary MCP managers
- **CPU Usage**: On-demand creation optimizes CPU usage
### Geographic Distribution
- **Redis Clustering**: For multi-region deployments
- **Local Workers**: Workers can be distributed geographically
- **Session Affinity**: Not required - any worker can serve any session
## Operational Benefits
### Deployment
- **Zero Downtime**: Rolling updates possible
- **Session Persistence**: Sessions survive deployments
- **Configuration**: Centralized in shared config system
- **Monitoring**: Single Redis instance to monitor
### Debugging
- **Session Inspection**: Direct Redis access for debugging
- **Stateless Workers**: No worker-specific state to debug
- **Centralized State**: All session activity is visible in one Redis instance
- **TTL Monitoring**: Easy to check session expiration
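Session inspection needs nothing beyond standard redis-py calls; the key pattern below is the same assumed scheme used in the sketches above:
```python
import redis

r = redis.Redis(decode_responses=True)

# List live sessions and their remaining idle TTL in seconds
for key in r.scan_iter("mcp:session:*"):
    print(key, "expires in", r.ttl(key), "s")
```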
### Maintenance
- **No Background Jobs**: No cleanup processes to manage
- **Automatic Cleanup**: Redis TTL handles all cleanup
- **Simple Architecture**: Fewer moving parts
- **Predictable Behavior**: No race conditions or timing issues
## Migration Benefits
### From Previous Architecture
- **Eliminated Complexity**: No dual-layer management
- **Removed Polling**: No background cleanup threads
- **Fixed Race Conditions**: No coordination between layers
- **Improved Reliability**: No orphaned processes
### Backward Compatibility
- **HTTP Endpoints**: All endpoints unchanged
- **Client Behavior**: Identical from client perspective
- **Configuration**: Same environment variables
- **Session IDs**: Existing session IDs continue to work
## Performance Characteristics
### Memory Usage
- **Redis**: Session data (~1-10KB per session)
- **Workers**: Only temporary MCP managers during requests
- **No Leaks**: Immediate cleanup prevents accumulation
### CPU Usage
- **On-Demand**: MCP managers created only when needed
- **Parallel**: Multiple workers can create managers simultaneously
- **Efficient**: No background polling or cleanup
### Network Usage
- **Redis**: Fast local connections for session data
- **MCP**: Temporary connections to external API
- **HTTP**: Standard request/response patterns
This architecture provides a robust, scalable foundation for the MCP Client system while maintaining simplicity and operational efficiency.