ContextForge MCP Gateway

Official

Overview Schema Related Servers Score Discussions

mcp-context-forge

.env.example•76.3 KiB

##################################### # MCP Gateway Configuration Example ##################################### # Basic Server Configuration # Application name displayed in UI and logs APP_NAME=MCP_Gateway # Host interface to bind to (0.0.0.0 = all interfaces, 127.0.0.1 = localhost only) HOST=0.0.0.0 # Port number for the HTTP server PORT=4444 # HTTP server to use (for containers) # Options: gunicorn (default, stable), granian (alternative, Rust-based) HTTP_SERVER=gunicorn # Runtime environment - affects CORS, cookies, and security defaults # Options: development, production # - development: Relaxed CORS (localhost:3000/8080), debug info, insecure cookies # - production: Strict CORS (APP_DOMAIN only), secure cookies, no debug info ENVIRONMENT=development # Domain name for CORS origins and cookie settings (use your actual domain in production) APP_DOMAIN=http://localhost # FastAPI root_path for reverse proxy deployments (empty = serve from root "/") # Used when gateway is behind a proxy with path prefix (e.g., "/api/v1") # See FastAPI docs: https://fastapi.tiangolo.com/advanced/behind-a-proxy/ APP_ROOT_PATH= # Enable HTTP Basic Auth for OpenAPI docs endpoints (/docs, /redoc) # Options: true, false (default: false) # When true: Allows accessing docs with BASIC_AUTH_USER/BASIC_AUTH_PASSWORD # When false: Only JWT Bearer token authentication is accepted DOCS_ALLOW_BASIC_AUTH=false # Database Configuration # SQLite (default) - good for development and small deployments # macOS note: If you see "sqlite3.OperationalError: disk I/O error" on macOS when running # `make serve`, move the DB to a safe APFS path (avoid iCloud/Dropbox/OneDrive/Google Drive, # network shares, or external exFAT) and use an absolute path, for example: # DATABASE_URL=sqlite:////Users/$USER/Library/Application Support/mcpgateway/mcp.db DATABASE_URL=sqlite:///./mcp.db # PostgreSQL - recommended for production deployments # Uses psycopg3 driver (psycopg[binary]) # IMPORTANT: Use postgresql+psycopg:// (not postgresql://) for psycopg3 # DATABASE_URL=postgresql+psycopg://postgres:mysecretpassword@localhost:5432/mcp # MariaDB/MySQL - fully supported for production # For container deployment: mysql+pymysql://mysql:changeme@mariadb:3306/mcp # For localhost: mysql+pymysql://mysql:changeme@localhost:3306/mcp # DATABASE_URL=mysql+pymysql://mysql:changeme@localhost:3306/mcp # Echo commands - debug only, used to identify N+1 issues, etc. # SQLALCHEMY_ECHO=0 # Database Connection Pool Configuration # ============================================================================ # IMPORTANT: Pool size depends on your database connection strategy: # # WITH PgBouncer (recommended for PostgreSQL, default in docker-compose): # - Use SMALL pool (10-20) since PgBouncer handles connection pooling # - docker-compose.yml sets DB_POOL_SIZE=15 by default # - Do NOT override here unless you know what you're doing # - Formula: (replicas × workers × pool) should be < PgBouncer MAX_CLIENT_CONN # # WITHOUT PgBouncer (direct PostgreSQL or SQLite): # - Use LARGER pool based on: (replicas × workers × pool) < max_connections # - Uncomment and set DB_POOL_SIZE=50-200 depending on workload # # DB_POOL_SIZE=200 # Uncomment for SQLite or direct PostgreSQL without PgBouncer # ============================================================================ # Additional connections beyond pool_size for burst handling (default: 10) DB_MAX_OVERFLOW=10 # Seconds to wait for connection before timeout (default: 30) DB_POOL_TIMEOUT=30 # Seconds before recreating connection to prevent stale connections (default: 3600) DB_POOL_RECYCLE=3600 # Database Startup Resilience (exponential backoff with jitter) # Retry progression: 2s → 4s → 8s → 16s → 30s (capped), ±25% jitter # 30 retries ≈ 5 minutes total wait before worker gives up DB_MAX_RETRIES=30 # Base retry interval in milliseconds (doubles each attempt) DB_RETRY_INTERVAL_MS=2000 # Maximum backoff cap in seconds (jitter ±25% applied after cap) DB_MAX_BACKOFF_SECONDS=30 # psycopg3: Number of query executions before auto-preparing server-side (default: 5) # Set to 0 to disable, 1 to prepare immediately. Higher values reduce memory usage. DB_PREPARE_THRESHOLD=5 # Database Performance Optimization # Use database-native percentile functions for observability performance metrics # When true: PostgreSQL uses native percentile_cont (5-10x faster for large datasets) # When false: Falls back to Python-based percentile calculations (works with all databases) # Recommended: true for PostgreSQL production deployments, auto-detected for SQLite USE_POSTGRESDB_PERCENTILES=true # The number of rows fetched from the database at a time when streaming results, # to limit memory usage and avoid loading all rows into RAM at once. YIELD_BATCH_SIZE=1000 # Cache Backend Configuration # Options: database (default), memory (in-process), redis (distributed) # - database: Uses SQLite/PostgreSQL for persistence (good for single-node) # - memory: Fast in-process caching (lost on restart, not shared between workers) # - redis: Distributed caching for multi-node deployments CACHE_TYPE=database # Session Registry Database Polling (Adaptive Backoff) # When CACHE_TYPE=database, sessions use polling to check for messages. # Adaptive backoff reduces database load by ~90% during idle periods while # maintaining responsiveness when messages arrive. # # How it works: # - Starts polling at POLL_INTERVAL (1.0s default) # - When no messages found, interval increases by BACKOFF_FACTOR (1.5x) # - Backs off until reaching MAX_INTERVAL (5.0s cap) # - Immediately resets to POLL_INTERVAL when a message arrives # # Example progression: 1.0s → 1.5s → 2.25s → 3.375s → 5.0s (capped) # # Tuning guide: # - Lower POLL_INTERVAL (0.1-0.5s) for real-time applications needing <1s latency # - Higher MAX_INTERVAL (10-30s) for batch workloads to minimize DB queries # - Higher BACKOFF_FACTOR (2.0) for faster backoff, lower (1.2) for gradual POLL_INTERVAL=1.0 MAX_INTERVAL=5.0 BACKOFF_FACTOR=1.5 # Redis connection URL (only used when CACHE_TYPE=redis) # Format: redis://[username:password@]host:port/database # Example: redis://localhost:6379/0 (local), redis://redis:6379/0 (container) # REDIS_URL=redis://localhost:6379/0 # Cache key prefix for Redis (used to namespace keys in shared Redis instances) # Default: "mcpgw:" CACHE_PREFIX=mcpgw: # Session time-to-live in seconds (how long sessions remain valid) # Default: 3600 (1 hour) SESSION_TTL=3600 # Message time-to-live in seconds (how long messages are retained) # Default: 600 (10 minutes) MESSAGE_TTL=600 # Redis Startup Resilience (exponential backoff with jitter) # Same behavior as DB retries: 2s → 4s → 8s → 16s → 30s (capped), ±25% jitter # 30 retries ≈ 5 minutes total wait before worker gives up REDIS_MAX_RETRIES=30 # Base retry interval in milliseconds (doubles each attempt) REDIS_RETRY_INTERVAL_MS=2000 # Maximum backoff cap in seconds (jitter ±25% applied after cap) REDIS_MAX_BACKOFF_SECONDS=30 # ============================================================================= # Redis Connection Pool - Performance Tuned # ============================================================================= # Connection pool size per worker process # Formula: (concurrent_requests / workers) * 1.5 # Default 50 handles ~500 concurrent requests with 10 workers REDIS_MAX_CONNECTIONS=50 # Socket read/write timeout (seconds) # Keep low for fast failure detection; Redis ops typically <100ms REDIS_SOCKET_TIMEOUT=2.0 # Connection establishment timeout (seconds) # Keep low to avoid blocking event loop on network issues REDIS_SOCKET_CONNECT_TIMEOUT=2.0 # Retry commands that timeout (recommended: true) REDIS_RETRY_ON_TIMEOUT=true # Connection health check interval (seconds, 0=disabled) # Prevents stale connections in pool REDIS_HEALTH_CHECK_INTERVAL=30 # Return strings instead of bytes (recommended: true) REDIS_DECODE_RESPONSES=true # ============================================================================= # Redis Parser Configuration (Performance - ADR-026) # ============================================================================= # Redis protocol parser selection # Options: # - auto (default): Use hiredis C parser if available, fallback to pure-Python # - hiredis: Require hiredis C parser (fails if not installed) # - python: Force pure-Python parser (useful for debugging) # # Performance benchmarks (hiredis vs pure-Python): # - Simple SET/GET: ~1.1x faster # - LRANGE (10 items): ~2.7x faster # - LRANGE (100 items): ~10x faster # - LRANGE (999 items): ~83x faster # # Recommendation: Leave as "auto" - hiredis is installed by default with redis[hiredis] REDIS_PARSER=auto # ============================================================================= # Redis Leader Election - Multi-Node Deployments # ============================================================================= # Leader TTL in seconds (time before failover if leader dies) # Lower = faster failover, but more sensitive to network blips # Recommended: 15s for production, 5s for development REDIS_LEADER_TTL=15 # Leader heartbeat interval (seconds) # Must be < leader_ttl/2 to prevent false failovers # Rule: heartbeat_interval <= leader_ttl / 3 REDIS_LEADER_HEARTBEAT_INTERVAL=5 # Leader key name in Redis REDIS_LEADER_KEY=gateway_service_leader ##################################### # Protocol Settings ##################################### # MCP protocol version supported by this gateway PROTOCOL_VERSION=2025-06-18 ##################################### # Authentication ##################################### # Admin UI HTTP Basic Auth credentials # Used for: Admin UI login, /docs endpoint (if DOCS_ALLOW_BASIC_AUTH=true) # PRODUCTION: Change these to strong, unique values! BASIC_AUTH_USER=admin BASIC_AUTH_PASSWORD=changeme # Global authentication requirement # Options: true (default), false # When true: All endpoints require authentication (Basic or JWT) # When false: Endpoints are publicly accessible (NOT RECOMMENDED) AUTH_REQUIRED=true # MCP endpoint authentication requirement # Options: true, false (default) # When true: All /mcp requests require a valid Bearer token # When false: Unauthenticated /mcp requests can access public-only tools/resources/prompts MCP_REQUIRE_AUTH=false # Content type for outgoing HTTP requests to upstream services # Options: application/json (default), application/x-www-form-urlencoded, multipart/form-data FORGE_CONTENT_TYPE=application/json # JWT Algorithm Selection # Supported algorithms: # HMAC (Symmetric): HS256, HS384, HS512 - Simple deployments, shared secret # RSA (Asymmetric): RS256, RS384, RS512 - Enterprise, distributed systems # ECDSA (Asymmetric): ES256, ES384, ES512 - High performance, modern crypto JWT_ALGORITHM=HS256 # === HMAC (Symmetric) Configuration - Default for Development === # Secret used to sign JWTs (required for HMAC algorithms: HS256, HS384, HS512) # PRODUCTION: Use a strong, random secret (minimum 32 characters) # Generate with: openssl rand -base64 32 JWT_SECRET_KEY=my-test-key # === RSA/ECDSA (Asymmetric) Configuration - Recommended for Production === # Public and private key paths (required for asymmetric algorithms: RS*, ES*) # Generate RSA keys with: make certs-jwt # (creates certs/jwt/private.pem and certs/jwt/public.pem with proper permissions) # Generate ECDSA keys with: make certs-jwt-ecdsa # (creates certs/jwt/ec_private.pem and certs/jwt/ec_public.pem with proper permissions) # Generate both SSL and JWT keys: make certs-all #JWT_PUBLIC_KEY_PATH=certs/jwt/public.pem #JWT_PRIVATE_KEY_PATH=certs/jwt/private.pem # JWT Claims Configuration # PRODUCTION: Set these to your service-specific values JWT_AUDIENCE=mcpgateway-api JWT_ISSUER=mcpgateway # JWT Validation Options # Set to false for Dynamic Client Registration (DCR) scenarios where audience varies JWT_AUDIENCE_VERIFICATION=true # Set to false for custom auth flows where issuer varies or is not present JWT_ISSUER_VERIFICATION=true # Expiry time for generated JWT tokens (in minutes; e.g. 7 days) TOKEN_EXPIRY=10080 REQUIRE_TOKEN_EXPIRATION=false # Require JTI (JWT ID) claim in all tokens for revocation support # When true, tokens without JTI will be rejected REQUIRE_JTI=false # Require all authenticated users to exist in the database # When true, disables the platform admin bootstrap mechanism # WARNING: Enabling this on a fresh deployment will lock you out! REQUIRE_USER_IN_DB=false ##################################### # Security Validation & Sanitization ##################################### # Enable experimental input validation and output sanitization # This implements gateway-level security controls to protect against: # - Path traversal attacks (../../../etc/passwd) # - Command injection (file.jpg; rm -rf /) # - SQL injection ('; DROP TABLE users; --) # - XSS attacks (<script>alert(1)</script>) # - Control character injection (\x1b[31m) # # Roll-out phases: # Phase 0: EXPERIMENTAL_VALIDATE_IO=false (disabled, default) # Phase 1: EXPERIMENTAL_VALIDATE_IO=true, VALIDATION_STRICT=false (log-only) # Phase 2: EXPERIMENTAL_VALIDATE_IO=true, VALIDATION_STRICT=true (enforce in staging) # Phase 3: Production deployment with all features enabled EXPERIMENTAL_VALIDATE_IO=true # Enable validation middleware for all requests # When enabled, validates all incoming request parameters and paths # Options: true, false (default) VALIDATION_MIDDLEWARE_ENABLED=true # Strict validation mode # Options: # - true: Reject requests with validation failures (422 status) # - false: Log warnings but allow requests (log-only mode) # Recommended: false for dev/staging, true for production VALIDATION_STRICT=true # Sanitize output to remove control characters # Removes ANSI escape sequences and C0/C1 control characters from responses # Preserves newlines (\n) and tabs (\t) # Options: true (default), false SANITIZE_OUTPUT=true # Allowed root paths for resource access # Restricts file system access to specific directories # Format: JSON array or comma-separated list # Examples: # - JSON: ["/srv/data", "/var/app/uploads"] # - CSV: /srv/data,/var/app/uploads # - Empty: [] (no restrictions, not recommended) # PRODUCTION: Always configure this to limit resource access ALLOWED_ROOTS=[] # Maximum allowed path depth # Prevents deeply nested path attacks # Default: 10 levels MAX_PATH_DEPTH=10 # Maximum parameter length (characters) # Prevents buffer overflow and DoS attacks # Default: 10000 characters MAX_PARAM_LENGTH=10000 # Regex patterns for dangerous input (JSON array) # Used to detect and block malicious input patterns # Default patterns: # 1. Shell metacharacters: [;&|`$(){}\[\]<>] # 2. Path traversal: \.\.[/\\] # 3. Control characters: [\x00-\x1f\x7f-\x9f] # 4. SQL injection: (drop|delete|insert|update|select)\s+(table|from|into|where) # Format: JSON array of regex patterns DANGEROUS_PATTERNS=["[;&|`$(){}\\[\\]<>]", "\\.\\.[/\\\\]", "[\\x00-\\x1f\\x7f-\\x9f]", "(?i)(drop|delete|insert|update|select)\\s+(table|from|into|where)"] ##################################### # Email-Based Authentication ##################################### # Enable email-based authentication system EMAIL_AUTH_ENABLED=true # Platform admin user (bootstrap from environment) # PRODUCTION: Change these to your actual admin credentials! PLATFORM_ADMIN_EMAIL=admin@example.com PLATFORM_ADMIN_PASSWORD=changeme PLATFORM_ADMIN_FULL_NAME=Platform Administrator # Argon2id Password Hashing Configuration # Time cost (iterations) - higher = more secure but slower ARGON2ID_TIME_COST=3 # Memory cost (KB) - higher = more secure but uses more RAM ARGON2ID_MEMORY_COST=65536 # Parallelism (threads) - typically 1 for web apps ARGON2ID_PARALLELISM=1 # Password Policy Configuration PASSWORD_MIN_LENGTH=8 PASSWORD_REQUIRE_UPPERCASE=false PASSWORD_REQUIRE_LOWERCASE=false PASSWORD_REQUIRE_NUMBERS=false PASSWORD_REQUIRE_SPECIAL=false # Password Change Enforcement # Master switch for all password change enforcement checks PASSWORD_CHANGE_ENFORCEMENT_ENABLED=true # Force admin to change password after bootstrap ADMIN_REQUIRE_PASSWORD_CHANGE_ON_BOOTSTRAP=true # Detect default password during login and mark user for change DETECT_DEFAULT_PASSWORD_ON_LOGIN=true # Require password change when using default password REQUIRE_PASSWORD_CHANGE_FOR_DEFAULT_PASSWORD=true # Enable password complexity validation for new/changed passwords PASSWORD_POLICY_ENABLED=true # Prevent reusing the current password when changing PASSWORD_PREVENT_REUSE=true # Password maximum age in days before expiry forces a change PASSWORD_MAX_AGE_DAYS=90 # Account Security Configuration # Maximum failed login attempts before account lockout MAX_FAILED_LOGIN_ATTEMPTS=5 # Account lockout duration in minutes ACCOUNT_LOCKOUT_DURATION_MINUTES=30 # MCP Client Authentication # Controls JWT authentication for /mcp endpoints MCP_CLIENT_AUTH_ENABLED=true TRUST_PROXY_AUTH=false PROXY_USER_HEADER=X-Authenticated-User # SECURITY NOTE: MCP Access Control Dependencies # Full MCP access control (visibility + team scoping + membership validation) requires: # 1. MCP_CLIENT_AUTH_ENABLED=true (JWT auth extracts user identity and teams) # 2. Valid Bearer token with teams claim for team-scoped access # When MCP_CLIENT_AUTH_ENABLED=false: # - Access control relies on MCP_REQUIRE_AUTH + tool/resource visibility only # - Team membership validation is skipped (no JWT to extract teams from) # - Use TRUST_PROXY_AUTH=true with a reverse proxy for user identification # Used to derive an AES encryption key for secure auth storage # Must be a non-empty string (e.g. passphrase or random secret) AUTH_ENCRYPTION_SECRET=my-test-salt # OAuth Configuration OAUTH_REQUEST_TIMEOUT=30 OAUTH_MAX_RETRIES=3 OAUTH_DEFAULT_TIMEOUT=3600 # OAuth Security Settings # When MCP servers require OAuth authorization code flow, # tokens are stored per-user to prevent cross-user token access. # Users must individually authorize each OAuth-protected gateway. ##################################### # OAuth Dynamic Client Registration (DCR) and PKCE ##################################### # Enable Dynamic Client Registration (RFC 7591) # When enabled, MCP Gateway can automatically register as an OAuth client with Authorization Servers # that support DCR, eliminating the need for manual client credential configuration. DCR_ENABLED=true # Auto-register when gateway has issuer but no client_id # When true, gateway automatically registers with the Authorization Server when configured # with an issuer URL but no client credentials. DCR_AUTO_REGISTER_ON_MISSING_CREDENTIALS=true # Default scopes to request during DCR # JSON array of OAuth scopes to request when auto-registering DCR_DEFAULT_SCOPES=["mcp:read"] # Optional allowlist of issuer URLs for DCR (empty = allow any) # JSON array of trusted Authorization Server issuer URLs # Example: ["https://auth.example.com", "https://auth2.example.com"] # Empty array [] allows DCR with any issuer (not recommended for production) DCR_ALLOWED_ISSUERS=[] # Token endpoint authentication method for DCR # Options: client_secret_basic (default), client_secret_post, none # - client_secret_basic: Send credentials via HTTP Basic Auth header # - client_secret_post: Send credentials in POST body # - none: Public client (no client secret, PKCE-only) DCR_TOKEN_ENDPOINT_AUTH_METHOD=client_secret_basic # AS metadata cache TTL in seconds (RFC 8414 discovery) # How long to cache Authorization Server metadata after discovery DCR_METADATA_CACHE_TTL=3600 # Template for client_name in DCR requests # {gateway_name} will be replaced with the actual gateway name DCR_CLIENT_NAME_TEMPLATE=MCP Gateway ({gateway_name}) # Enable OAuth AS metadata discovery (RFC 8414) # When enabled, gateway automatically discovers Authorization Server endpoints # from the issuer URL using well-known metadata endpoints OAUTH_DISCOVERY_ENABLED=true # Preferred PKCE code challenge method # Options: S256 (SHA-256, recommended), plain (not recommended) # PKCE (Proof Key for Code Exchange) is always enabled for Authorization Code flows OAUTH_PREFERRED_CODE_CHALLENGE_METHOD=S256 # ============================================================================== # SSO (Single Sign-On) Configuration # ============================================================================== # Master SSO switch - enable Single Sign-On authentication # Options: true, false (default) # When true: Enables SSO login options alongside local auth SSO_ENABLED=false # GitHub OAuth Configuration # Options: true, false (default) # Requires: GitHub OAuth App (Settings > Developer settings > OAuth Apps) SSO_GITHUB_ENABLED=false # SSO_GITHUB_CLIENT_ID=your-github-client-id # SSO_GITHUB_CLIENT_SECRET=your-github-client-secret # Google OAuth Configuration SSO_GOOGLE_ENABLED=false # SSO_GOOGLE_CLIENT_ID=your-google-client-id.googleusercontent.com # SSO_GOOGLE_CLIENT_SECRET=your-google-client-secret # IBM Security Verify OIDC Configuration SSO_IBM_VERIFY_ENABLED=false # SSO_IBM_VERIFY_CLIENT_ID=your-ibm-verify-client-id # SSO_IBM_VERIFY_CLIENT_SECRET=your-ibm-verify-client-secret # SSO_IBM_VERIFY_ISSUER=https://your-tenant.verify.ibm.com/oidc/endpoint/default # Okta OIDC Configuration SSO_OKTA_ENABLED=false # SSO_OKTA_CLIENT_ID=your-okta-client-id # SSO_OKTA_CLIENT_SECRET=your-okta-client-secret # SSO_OKTA_ISSUER=https://your-okta-domain.okta.com # Keycloak OIDC Configuration (with auto-discovery) SSO_KEYCLOAK_ENABLED=false # SSO_KEYCLOAK_BASE_URL=https://keycloak.example.com # SSO_KEYCLOAK_REALM=master # SSO_KEYCLOAK_CLIENT_ID=mcp-gateway # SSO_KEYCLOAK_CLIENT_SECRET=your-keycloak-client-secret # SSO_KEYCLOAK_MAP_REALM_ROLES=true # SSO_KEYCLOAK_MAP_CLIENT_ROLES=false # SSO_KEYCLOAK_USERNAME_CLAIM=preferred_username # SSO_KEYCLOAK_EMAIL_CLAIM=email # SSO_KEYCLOAK_GROUPS_CLAIM=groups # Microsoft Entra ID (Azure AD) OIDC Configuration SSO_ENTRA_ENABLED=false # SSO_ENTRA_CLIENT_ID=your-entra-application-client-id # SSO_ENTRA_CLIENT_SECRET=your-entra-client-secret-value # SSO_ENTRA_TENANT_ID=your-entra-tenant-id # Generic OIDC Provider Configuration (Keycloak, Auth0, Authentik, etc.) SSO_GENERIC_ENABLED=false # SSO_GENERIC_PROVIDER_ID=keycloak # SSO_GENERIC_DISPLAY_NAME=Keycloak # SSO_GENERIC_CLIENT_ID=your-oidc-client-id # SSO_GENERIC_CLIENT_SECRET=your-oidc-client-secret # SSO_GENERIC_AUTHORIZATION_URL=https://keycloak.company.com/auth/realms/master/protocol/openid-connect/auth # SSO_GENERIC_TOKEN_URL=https://keycloak.company.com/auth/realms/master/protocol/openid-connect/token # SSO_GENERIC_USERINFO_URL=https://keycloak.company.com/auth/realms/master/protocol/openid-connect/userinfo # SSO_GENERIC_ISSUER=https://keycloak.company.com/auth/realms/master # SSO_GENERIC_SCOPE=openid profile email # SSO General Settings SSO_AUTO_CREATE_USERS=true # JSON array of trusted email domains, e.g., ["example.com", "company.org"] SSO_TRUSTED_DOMAINS=[] # Keep local admin authentication when SSO is enabled SSO_PRESERVE_ADMIN_AUTH=true # SSO Issuers Configuration # Optional JSON array of issuer URLs for SSO providers # Example: ["https://idp1.example.com", "https://idp2.example.com"] # Default: null (not set) # SSO_ISSUERS=["https://idp.example.com"] # SSO Admin Assignment Settings # Email domains that automatically get admin privileges, e.g., ["yourcompany.com"] SSO_AUTO_ADMIN_DOMAINS=[] # GitHub organizations whose members get admin privileges, e.g., ["your-org", "partner-org"] SSO_GITHUB_ADMIN_ORGS=[] # Google Workspace domains that get admin privileges, e.g., ["company.com"] SSO_GOOGLE_ADMIN_DOMAINS=[] # Require admin approval for new SSO registrations SSO_REQUIRE_ADMIN_APPROVAL=false ##################################### # Personal Teams Configuration ##################################### # Enable automatic personal team creation for new users AUTO_CREATE_PERSONAL_TEAMS=true # Personal team naming prefix PERSONAL_TEAM_PREFIX=personal # Team Limits MAX_TEAMS_PER_USER=50 MAX_MEMBERS_PER_TEAM=100 # Team Invitation Settings INVITATION_EXPIRY_DAYS=7 REQUIRE_EMAIL_VERIFICATION_FOR_INVITES=true ##################################### # Admin UI and API Toggles ##################################### # Enable the web-based Admin UI at /admin # Options: true (default), false # PRODUCTION: Set to false for security unless needed MCPGATEWAY_UI_ENABLED=true # Enable Admin REST API endpoints (/tools, /servers, /resources, etc.) # Options: true (default), false # Required for: Admin UI functionality, programmatic management MCPGATEWAY_ADMIN_API_ENABLED=true # Use local CDN assets for airgapped deployments # Options: true, false (default) # When enabled, UI loads CSS/JS from local files instead of external CDNs # Requires container build with downloaded assets (automatic in Containerfile.lite) MCPGATEWAY_UI_AIRGAPPED=false # Enable bulk import feature for mass tool/resource registration # Options: true (default), false # Allows importing multiple tools/resources in a single API call MCPGATEWAY_BULK_IMPORT_ENABLED=true # Maximum number of tools allowed per bulk import request MCPGATEWAY_BULK_IMPORT_MAX_TOOLS=200 # Rate limiting for bulk import endpoint (requests per minute) MCPGATEWAY_BULK_IMPORT_RATE_LIMIT=10 ##################################### # A2A (Agent-to-Agent) Configuration ##################################### # Enable A2A agent features (true/false) # Allows registration and management of external AI agents MCPGATEWAY_A2A_ENABLED=true # Maximum number of A2A agents allowed MCPGATEWAY_A2A_MAX_AGENTS=100 # Default timeout for A2A agent HTTP requests (seconds) MCPGATEWAY_A2A_DEFAULT_TIMEOUT=30 # Maximum retry attempts for failed A2A agent calls MCPGATEWAY_A2A_MAX_RETRIES=3 # Enable A2A agent metrics collection (true/false) MCPGATEWAY_A2A_METRICS_ENABLED=true ##################################### # MCP Server Catalog Configuration ##################################### # Enable MCP server catalog feature # Allows defining a catalog of pre-configured MCP servers in a YAML file # for easy discovery and management via the Admin UI # Options: true (default), false MCPGATEWAY_CATALOG_ENABLED=true # Path to the catalog configuration file # YAML file containing MCP server definitions # Default: mcp-catalog.yml MCPGATEWAY_CATALOG_FILE=mcp-catalog.yml # Automatically health check catalog servers on startup and periodically # Options: true (default), false MCPGATEWAY_CATALOG_AUTO_HEALTH_CHECK=true # Catalog cache TTL in seconds # How long to cache catalog data before refreshing # Default: 3600 (1 hour) MCPGATEWAY_CATALOG_CACHE_TTL=3600 # Number of catalog servers to display per page # Default: 100 MCPGATEWAY_CATALOG_PAGE_SIZE=100 ##################################### # Elicitation Support (MCP 2025-06-18) ##################################### # Enable elicitation passthrough - allows upstream MCP servers to request # structured user input through connected clients (e.g., Claude Desktop) # Per MCP spec 2025-06-18, elicitation enables interactive workflows where # servers can dynamically gather information from users during operations MCPGATEWAY_ELICITATION_ENABLED=true # Default timeout for user responses (seconds) # How long to wait for users to respond to elicitation requests MCPGATEWAY_ELICITATION_TIMEOUT=60 # Maximum concurrent elicitation requests # Prevents resource exhaustion from too many pending user input requests MCPGATEWAY_ELICITATION_MAX_CONCURRENT=100 ##################################### # Header Passthrough Configuration ##################################### # SECURITY WARNING: Header passthrough is disabled by default for security. # Only enable if you understand the security implications and have reviewed # which headers should be passed through to backing MCP servers. # ENABLE_HEADER_PASSTHROUGH=false # Enable overwriting of base headers (advanced usage only) # When disabled, passthrough headers cannot override gateway headers like Content-Type, Authorization # ENABLE_OVERWRITE_BASE_HEADERS=false # Default headers to pass through (when feature is enabled) # JSON array format recommended: ["X-Tenant-Id", "X-Trace-Id"] # Comma-separated also supported: X-Tenant-Id,X-Trace-Id # NOTE: Authorization header removed from defaults for security # DEFAULT_PASSTHROUGH_HEADERS=["X-Tenant-Id", "X-Trace-Id"] # Passthrough headers source priority # Controls where header configuration is read from: # - "db": Database wins if configured, env as fallback (default, backward compatible) # - "env": Environment variable always wins (ideal for Kubernetes/containerized deployments) # - "merge": Union of both sources - env provides base, DB can add more headers # PASSTHROUGH_HEADERS_SOURCE=db ##################################### # Security and CORS ##################################### # Skip SSL/TLS certificate verification for upstream requests # Options: true, false (default) # WARNING: Only use in development or with self-signed certificates! # PRODUCTION: Must be false for security SKIP_SSL_VERIFY=false # CORS allowed origins (JSON array of URLs) # Controls which domains can make cross-origin requests to the gateway # Format: JSON array starting with [ and ending with ] # Example: ["http://localhost:3000", "https://app.example.com"] # Use ["*"] to allow all origins (NOT RECOMMENDED) ALLOWED_ORIGINS='["http://localhost", "http://localhost:4444"]' # Enable CORS (Cross-Origin Resource Sharing) handling # Options: true (default), false # Required for: Web browser clients, cross-domain API access CORS_ENABLED=true # CORS allow credentials (true/false) CORS_ALLOW_CREDENTIALS=true # Environment setting (development/production) - affects security defaults # development: Auto-configures CORS for localhost:3000, localhost:8080, etc. # production: Uses APP_DOMAIN for HTTPS origins, enforces secure cookies # ENVIRONMENT is already defined in Basic Server Configuration section # Domain configuration for production CORS origins # In production, automatically creates origins: https://APP_DOMAIN, https://app.APP_DOMAIN, https://admin.APP_DOMAIN # For production: set to your actual domain (e.g., mycompany.com) # APP_DOMAIN is already defined in Basic Server Configuration section # Security settings for cookies # production: Automatically enables secure cookies regardless of this setting # development: Set to false for HTTP development, true for HTTPS SECURE_COOKIES=false # Cookie SameSite attribute for CSRF protection # strict: Maximum security, may break some OAuth flows # lax: Good balance of security and compatibility (recommended) # none: Requires Secure=true, allows cross-site usage COOKIE_SAMESITE=lax ##################################### # Security Headers Configuration ##################################### # Enable security headers middleware (true/false) SECURITY_HEADERS_ENABLED=true # X-Frame-Options setting - Controls iframe embedding (also sets CSP frame-ancestors) # DENY: Prevents all iframe embedding (recommended for security) → frame-ancestors 'none' # SAMEORIGIN: Allows embedding from same domain only → frame-ancestors 'self' # "" (empty string): Allows all iframe embedding → frame-ancestors * file: http: https: # null or none: Completely removes iframe restrictions (no headers sent) # ALLOW-FROM uri: Allows specific domain (deprecated, use CSP instead) # ALLOW-ALL uri: Allows all (*, http, https) # # Both X-Frame-Options header and CSP frame-ancestors directive are automatically synced. # Modern browsers prioritize CSP frame-ancestors over X-Frame-Options. X_FRAME_OPTIONS=DENY # Other security headers (true/false) X_CONTENT_TYPE_OPTIONS_ENABLED=true X_XSS_PROTECTION_ENABLED=true X_DOWNLOAD_OPTIONS_ENABLED=true # HSTS (HTTP Strict Transport Security) settings HSTS_ENABLED=true # HSTS max age in seconds (31536000 = 1 year) HSTS_MAX_AGE=31536000 HSTS_INCLUDE_SUBDOMAINS=true # Remove server identification headers (true/false) REMOVE_SERVER_HEADERS=true # Enable HTTP Basic Auth for docs endpoints (in addition to Bearer token auth) # Uses the same credentials as BASIC_AUTH_USER and BASIC_AUTH_PASSWORD # DOCS_ALLOW_BASIC_AUTH is already defined in Basic Server Configuration section ##################################### # Response Compression Configuration ##################################### # Enable response compression (Brotli, Zstd, GZip) # Options: true (default), false # Reduces bandwidth by 30-70% for text-based responses (JSON, HTML, CSS, JS) # Automatically negotiates compression algorithm based on client Accept-Encoding header # Priority: Brotli (best compression) > Zstd (fast) > GZip (universal fallback) COMPRESSION_ENABLED=true # Minimum response size in bytes to compress # Responses smaller than this won't be compressed (compression overhead not worth it) # Default: 500 bytes # Set to 0 to compress all responses COMPRESSION_MINIMUM_SIZE=500 # GZip compression level (1-9) # 1 = fastest compression, larger files # 6 = balanced (recommended default) # 9 = best compression, slower # Default: 6 COMPRESSION_GZIP_LEVEL=6 # Brotli compression quality (0-11) # 0-3 = fast compression (lower quality) # 4-9 = balanced compression (recommended) # 10-11 = maximum compression (slower) # Default: 4 (balanced) # Note: Brotli offers 15-20% better compression than GZip at similar speeds COMPRESSION_BROTLI_QUALITY=4 # Zstd compression level (1-22) # 1-3 = fast compression # 4-9 = balanced compression # 10+ = slower, maximum compression # Default: 3 (fast) # Note: Zstd is the fastest algorithm with good compression ratio COMPRESSION_ZSTD_LEVEL=3 ##################################### # HTTPX Client Connection Pool Configuration ##################################### # Controls HTTP client settings for outbound requests (federation, health checks, # A2A, SSO, MCP server connections, etc.). Most requests use a shared singleton # client for ~20x better performance. SSE/streaming MCP connections use factory # clients with the same settings for proper connection lifecycle management. # Maximum total connections in the pool (default: 200, range: 10-1000) # Formula: concurrent_outbound_requests × 1.5 HTTPX_MAX_CONNECTIONS=200 # Maximum keepalive connections (default: 100, range: 1-500) # Connections held open for reuse; typically 50% of max_connections HTTPX_MAX_KEEPALIVE_CONNECTIONS=100 # Keepalive connection expiry in seconds (default: 30.0, range: 5.0-300.0) # How long idle connections stay in the pool before being closed HTTPX_KEEPALIVE_EXPIRY=30.0 # Connection timeout in seconds (default: 5.0, range: 1.0-60.0) # Time to establish a new TCP connection (5s for LAN, increase for WAN) HTTPX_CONNECT_TIMEOUT=5.0 # Read timeout in seconds (default: 120.0, range: 1.0-600.0) # Time to wait for response data after connection established # Set high to accommodate slow MCP tool calls (60-90s+) HTTPX_READ_TIMEOUT=120.0 # Write timeout in seconds (default: 30.0, range: 1.0-600.0) # Time to wait when sending request data HTTPX_WRITE_TIMEOUT=30.0 # Pool timeout in seconds (default: 10.0, range: 1.0-120.0) # Time to wait for a connection from the pool (fail fast on exhaustion) HTTPX_POOL_TIMEOUT=10.0 # Enable HTTP/2 support (default: false) # HTTP/2 provides multiplexing but may not be supported by all upstream servers HTTPX_HTTP2_ENABLED=false # Admin operations read timeout in seconds (default: 30.0, range: 1.0-120.0) # Shorter timeout for admin UI operations (model fetching, health checks) # Use this to fail fast on admin pages instead of waiting for httpx_read_timeout HTTPX_ADMIN_READ_TIMEOUT=30.0 ##################################### # Retry Config for HTTP Requests ##################################### RETRY_MAX_ATTEMPTS=3 # seconds RETRY_BASE_DELAY=1.0 # seconds RETRY_MAX_DELAY=60.0 # fraction of delay RETRY_JITTER_MAX=0.5 ##################################### # Logging ##################################### # Logging verbosity level # Options: DEBUG, INFO, WARNING, ERROR (default), CRITICAL # DEBUG: Detailed diagnostic info (verbose) # INFO: General operational messages # WARNING: Warning messages for potential issues # ERROR: Error messages for failures (recommended for production) # CRITICAL: Only critical failures # PRODUCTION: Use ERROR to minimize I/O overhead and improve performance LOG_LEVEL=ERROR # Disable access logging for performance # Options: true, false (default) # When true: Disables both gunicorn and uvicorn access logs # Access logs create massive I/O overhead under high concurrency # Default: true (disabled for performance) # Set to false to enable access logging for debugging # DISABLE_ACCESS_LOG=true # Log output format # Options: json (default), text # json: Structured JSON logs (good for log aggregation) # text: Human-readable plain text LOG_FORMAT=json # Enable file logging (in addition to console output) # Options: true, false (default) LOG_TO_FILE=false # Enable request payload logging for debugging # Options: true, false (default) # When enabled, logs HTTP request method, headers, query params, and body # Sensitive data (passwords, tokens, etc.) is automatically masked LOG_REQUESTS=false # File write mode when LOG_TO_FILE=true # Options: a+ (append, default), w (overwrite on startup) LOG_FILEMODE=a+ LOG_FILE=mcpgateway.log LOG_FOLDER=logs LOG_ROTATION_ENABLED=false LOG_MAX_SIZE_MB=1 LOG_BACKUP_COUNT=5 LOG_BUFFER_SIZE_MB=1.0 # Maximum request body size to log in detailed mode (bytes) # Controls how much of the request body is parsed and logged when LOG_REQUESTS=true # Separate from LOG_MAX_SIZE_MB which is for log file rotation # Default: 16384 (16KB), Range: 1024-1048576 (1KB-1MB) LOG_DETAILED_MAX_BODY_SIZE=16384 # ═══════════════════════════════════════════════════════════════════════════════ # Structured Log Database Persistence # ═══════════════════════════════════════════════════════════════════════════════ # Persist structured logs to the database for search, tracing, and metrics. # Options: true, false (default) # # When ENABLED, you get: # - Log Search API (/api/logs/search) - search logs by level, component, user, time # - Request Tracing (/api/logs/trace/{id}) - trace all logs for a correlation ID # - Performance Metrics - aggregated p50/p95/p99 latencies, error rates # - Admin UI log viewer with filtering and search # # When DISABLED: # - Logs only go to console/file (no database writes) # - Better performance (no DB I/O per log entry) # - Log search/trace/metrics APIs return empty results # - Use this if you have an external log aggregator (ELK, Datadog, etc.) # # PERFORMANCE NOTE: Each log entry triggers a synchronous database write. # Disable this in high-throughput production environments or use external logging. STRUCTURED_LOGGING_DATABASE_ENABLED=false # Correlation ID / Request Tracking # Enable automatic correlation ID tracking for unified request tracing # Options: true (default), false CORRELATION_ID_ENABLED=true # HTTP header name for correlation ID (default: X-Correlation-ID) CORRELATION_ID_HEADER=X-Correlation-ID # Preserve incoming correlation IDs from clients (default: true) CORRELATION_ID_PRESERVE=true # Include correlation ID in HTTP response headers (default: true) CORRELATION_ID_RESPONSE_HEADER=true # ═══════════════════════════════════════════════════════════════════════════════ # Database Query Logging (N+1 Detection) # ═══════════════════════════════════════════════════════════════════════════════ # Enable database query logging to file for N+1 detection and performance analysis # Use: make dev-query-log (starts server with logging enabled) # Use: make query-log-analyze (analyze logs for N+1 patterns) # DB_QUERY_LOG_ENABLED=false # DB_QUERY_LOG_FILE=logs/db-queries.log # DB_QUERY_LOG_JSON_FILE=logs/db-queries.jsonl # DB_QUERY_LOG_FORMAT=both # DB_QUERY_LOG_MIN_QUERIES=1 # DB_QUERY_LOG_DETECT_N1=true # DB_QUERY_LOG_N1_THRESHOLD=3 # ============================================================================= # Execution Metrics Recording # ============================================================================= # Controls whether tool/resource/prompt/server/A2A execution metrics are written # to the database. Each MCP operation (tool call, resource read, etc.) creates # one database row with: entity_id, timestamp, response_time (seconds), is_success, error_message. # # Disable if you rely entirely on external observability (ELK, Datadog, Splunk) # to reduce database I/O overhead. # # Note: This does NOT affect: # - Log aggregation (METRICS_AGGREGATION_ENABLED) - aggregates StructuredLogEntry into PerformanceMetric # - Prometheus metrics (ENABLE_METRICS) - /metrics endpoint for Prometheus scraping # - Observability metrics (OBSERVABILITY_METRICS_ENABLED) - internal observability system # # To disable log aggregation as well, also set METRICS_AGGREGATION_ENABLED=false DB_METRICS_RECORDING_ENABLED=true # ============================================================================= # Metrics Buffer Configuration # ============================================================================= # Batches tool/resource/prompt/server metric writes to reduce DB pressure under load # Enable buffered metrics writes (default: true) # When enabled, metrics are accumulated in memory and flushed periodically METRICS_BUFFER_ENABLED=true # Seconds between automatic metrics buffer flushes (default: 60, range: 5-300) # Lower values = more frequent writes, higher values = better batching METRICS_BUFFER_FLUSH_INTERVAL=60 # Maximum buffered metrics before forced flush (default: 1000, range: 100-10000) # Prevents unbounded memory growth under very high load METRICS_BUFFER_MAX_SIZE=1000 # Metrics Cache Configuration # ============================================================================= # Caches aggregate metrics queries to reduce database load under high traffic # See GitHub Issue #1734 for performance optimization details # Enable in-memory caching for aggregate metrics queries (default: true) # When enabled, aggregate_metrics() results are cached to reduce database load METRICS_CACHE_ENABLED=true # TTL for cached aggregate metrics in seconds (default: 60, range: 1-300) # Lower values = fresher data, higher values = better performance # Recommended: 60-300 seconds for high-traffic deployments (see Issue #1906) METRICS_CACHE_TTL_SECONDS=60 # Metrics Cleanup Configuration # ============================================================================= # Automatically deletes old metrics data to prevent unbounded table growth # Enable automatic cleanup of old metrics data (default: true) METRICS_CLEANUP_ENABLED=true # Days to retain raw metrics when rollup is disabled (default: 7, range: 1-365) # This is a fallback - when METRICS_DELETE_RAW_AFTER_ROLLUP=true, raw metrics # are deleted based on METRICS_DELETE_RAW_AFTER_ROLLUP_HOURS instead. METRICS_RETENTION_DAYS=7 # Hours between automatic cleanup runs (default: 1, range: 1-168) METRICS_CLEANUP_INTERVAL_HOURS=1 # Batch size for metrics deletion (default: 10000, range: 100-100000) # Larger batches are faster but may cause longer table locks METRICS_CLEANUP_BATCH_SIZE=10000 # Metrics Rollup Configuration # ============================================================================= # Aggregates raw metrics into hourly summaries for efficient historical queries # Rollups preserve counts, averages, and percentiles (p50, p95, p99) # Enable hourly metrics rollup for efficient historical queries (default: true) METRICS_ROLLUP_ENABLED=true # Hours between rollup runs (default: 1, range: 1-24) METRICS_ROLLUP_INTERVAL_HOURS=1 # Days to retain hourly rollup data (default: 365, range: 30-3650) METRICS_ROLLUP_RETENTION_DAYS=365 # Hours to re-process on each rollup run to catch late-arriving data (default: 1, range: 1-48) # Smaller = less CPU/IO overhead, larger = more tolerance for delayed metrics METRICS_ROLLUP_LATE_DATA_HOURS=1 # Delete raw metrics after hourly rollup exists (default: true) # When true, raw metrics older than METRICS_DELETE_RAW_AFTER_ROLLUP_HOURS are # deleted once hourly rollups exist. Rollups preserve all analytics. # # Set to false only if you need raw metrics indefinitely (e.g., exact error # messages, individual request debugging without external observability). # # If using ELK, Datadog, Splunk, or similar platforms for debugging, keep this # true - your external platform handles detailed logs and traces. METRICS_DELETE_RAW_AFTER_ROLLUP=true # Hours to retain raw metrics when hourly rollup exists (default: 1, range: 1-8760) # After this period, raw metrics are deleted but hourly rollups remain. # Increase to 168 if you need raw data for debugging without external observability. METRICS_DELETE_RAW_AFTER_ROLLUP_HOURS=1 # Authentication Cache Configuration # ============================================================================= # Caches authentication data (user, team, revocation) to reduce database queries # Uses Redis when available, falls back to in-memory cache # Enable Redis/in-memory caching for authentication data (default: true) # Significantly reduces database queries during authentication AUTH_CACHE_ENABLED=true # TTL in seconds for cached user data (default: 60, range: 10-300) AUTH_CACHE_USER_TTL=60 # TTL in seconds for token revocation cache (default: 30, range: 5-120) # Security-critical: keep short to limit exposure window for revoked tokens AUTH_CACHE_REVOCATION_TTL=30 # TTL in seconds for team membership cache (default: 60, range: 10-300) AUTH_CACHE_TEAM_TTL=60 # TTL in seconds for user role in team cache (default: 60, range: 10-300) # Caches get_user_role_in_team() which is called 11+ times per team operation AUTH_CACHE_ROLE_TTL=60 # Enable caching for get_user_teams() (default: true) # Set to false to disable teams list caching (useful for debugging) AUTH_CACHE_TEAMS_ENABLED=true # TTL in seconds for user teams list cache (default: 60, range: 10-300) # Caches get_user_teams() which is called 20+ times per request for auth checks AUTH_CACHE_TEAMS_TTL=60 # Batch auth DB queries into single call (default: true) # Reduces 3 separate queries to 1, improving performance under load AUTH_CACHE_BATCH_QUERIES=true # Registry Cache Configuration # ============================================================================= # Caches registry list endpoints (tools, prompts, resources, agents, servers, gateways) # Uses Redis when available, falls back to in-memory cache # Reduces DB queries for frequently accessed list endpoints # Enable registry caching (default: true) REGISTRY_CACHE_ENABLED=true # TTL in seconds for tools list cache (default: 20, range: 5-300) REGISTRY_CACHE_TOOLS_TTL=20 # TTL in seconds for prompts list cache (default: 15, range: 5-300) REGISTRY_CACHE_PROMPTS_TTL=15 # TTL in seconds for resources list cache (default: 15, range: 5-300) REGISTRY_CACHE_RESOURCES_TTL=15 # TTL in seconds for A2A agents list cache (default: 20, range: 5-300) REGISTRY_CACHE_AGENTS_TTL=20 # TTL in seconds for servers list cache (default: 20, range: 5-300) REGISTRY_CACHE_SERVERS_TTL=20 # TTL in seconds for gateways list cache (default: 20, range: 5-300) REGISTRY_CACHE_GATEWAYS_TTL=20 # TTL in seconds for catalog servers list cache (default: 300, range: 60-600) # Longer TTL since external catalog changes infrequently REGISTRY_CACHE_CATALOG_TTL=300 # Tool Lookup Cache Configuration # ============================================================================= # Caches tool lookup by name in the invoke_tool hot path # Uses in-memory L1 cache and optional Redis L2 cache when CACHE_TYPE=redis # Enable tool lookup caching (default: true) TOOL_LOOKUP_CACHE_ENABLED=true # TTL in seconds for tool lookup cache entries (default: 60, range: 5-600) TOOL_LOOKUP_CACHE_TTL_SECONDS=60 # TTL in seconds for negative cache entries (default: 10, range: 1-60) # Used for missing/inactive/offline tool lookups TOOL_LOOKUP_CACHE_NEGATIVE_TTL_SECONDS=10 # Max entries for in-memory L1 tool cache (default: 10000, range: 100-1000000) TOOL_LOOKUP_CACHE_L1_MAXSIZE=10000 # Enable Redis L2 cache when CACHE_TYPE=redis (default: true) TOOL_LOOKUP_CACHE_L2_ENABLED=true # Admin Stats Cache Configuration # ============================================================================= # Caches admin dashboard statistics (entity counts, observability metrics) # Reduces expensive aggregate queries under dashboard load # Enable admin stats caching (default: true) ADMIN_STATS_CACHE_ENABLED=true # TTL in seconds for system stats cache (default: 60, range: 10-300) ADMIN_STATS_CACHE_SYSTEM_TTL=60 # TTL in seconds for observability stats cache (default: 30, range: 10-120) ADMIN_STATS_CACHE_OBSERVABILITY_TTL=30 # TTL in seconds for tags listing cache (default: 120, range: 30-600) ADMIN_STATS_CACHE_TAGS_TTL=120 # TTL in seconds for plugin stats cache (default: 120, range: 30-600) ADMIN_STATS_CACHE_PLUGINS_TTL=120 # TTL in seconds for performance aggregates cache (default: 60, range: 15-300) ADMIN_STATS_CACHE_PERFORMANCE_TTL=60 # Team Member Count Cache # Reduces N+1 queries in admin UI team listings # Enable team member count caching (default: true) TEAM_MEMBER_COUNT_CACHE_ENABLED=true # TTL in seconds for team member count cache (default: 300, range: 30-3600) TEAM_MEMBER_COUNT_CACHE_TTL=300 # Transport Protocol Configuration # Options: all (default), sse, streamablehttp, http # - all: Enable all transport protocols # - sse: Server-Sent Events only # - streamablehttp: Streaming HTTP only # - http: Standard HTTP JSON-RPC only TRANSPORT_TYPE=all # WebSocket keepalive ping interval in seconds # Prevents connection timeout for idle WebSocket connections WEBSOCKET_PING_INTERVAL=30 # SSE client retry timeout in milliseconds # Time client waits before reconnecting after SSE connection loss SSE_RETRY_TIMEOUT=5000 # Enable SSE keepalive events to prevent proxy/firewall timeouts # Options: true (default), false SSE_KEEPALIVE_ENABLED=true # SSE keepalive event interval in seconds # How often to send keepalive events when SSE_KEEPALIVE_ENABLED=true SSE_KEEPALIVE_INTERVAL=30 # Streaming HTTP Configuration # Enable stateful sessions (stores session state server-side) # Options: true, false (default) # false: Stateless mode (better for scaling) USE_STATEFUL_SESSIONS=false # Enable JSON response format for streaming HTTP # Options: true (default), false # true: Return JSON responses, false: Return SSE stream JSON_RESPONSE_ENABLED=true # Federation Configuration # Timeout for federation requests in seconds # Default: 120 seconds (matches config.py) FEDERATION_TIMEOUT=120 # Resource Configuration RESOURCE_CACHE_SIZE=1000 RESOURCE_CACHE_TTL=3600 MAX_RESOURCE_SIZE=10485760 # Allowed MIME types for resources (JSON array) # Controls which content types are allowed for resource handling # Default includes common text, image, and data formats # Example: ["text/plain", "text/markdown", "application/json", "image/png"] # To add custom types: ["text/plain", "application/pdf", "video/mp4"] # ALLOWED_MIME_TYPES=["text/plain", "text/markdown", "text/html", "application/json", "application/xml", "image/png", "image/jpeg", "image/gif"] # Tool Configuration TOOL_TIMEOUT=60 MAX_TOOL_RETRIES=3 TOOL_RATE_LIMIT=100 TOOL_CONCURRENT_LIMIT=10 GATEWAY_TOOL_NAME_SEPARATOR=- # Prompt Configuration PROMPT_CACHE_SIZE=100 MAX_PROMPT_SIZE=102400 PROMPT_RENDER_TIMEOUT=10 ##################################### # MCP Server Health Check Configuration ##################################### # Interval between health checks in seconds (default: 300) HEALTH_CHECK_INTERVAL=300 # Health check timeout in seconds (default: 5) HEALTH_CHECK_TIMEOUT=5 # Per-check timeout (seconds) to bound total time of one gateway health check (default: 5.0) GATEWAY_HEALTH_CHECK_TIMEOUT=5.0 # Consecutive failures before marking gateway offline (default: 3) UNHEALTHY_THRESHOLD=3 # Gateway URL validation timeout in seconds (default: 5) GATEWAY_VALIDATION_TIMEOUT=5 # Maximum concurrent health checks per worker (default: 10) MAX_CONCURRENT_HEALTH_CHECKS=10 # Enable automatic tools/prompts/resources refresh from the mcp servers during health checks (default: false) # If the tools/prompts/resources in the mcp servers are not updated frequently, it is recommended to keep this disabled to reduce load on the servers AUTO_REFRESH_SERVERS=false # File lock name for gateway service leader election # Used to coordinate multiple gateway instances when running in cluster mode # Default: "gateway_service_leader.lock" FILELOCK_NAME=gateway_service_leader.lock ##################################### # MCP Session Pool Configuration ##################################### # Enable MCP session pooling for reduced latency (10-20x improvement) # Sessions are isolated per user/tenant via identity hashing # Default: false (enable explicitly after testing) MCP_SESSION_POOL_ENABLED=false # Max sessions per (URL, identity, transport) tuple # Default: 10 MCP_SESSION_POOL_MAX_PER_KEY=10 # Session TTL before forced close (seconds) # Default: 300 MCP_SESSION_POOL_TTL=300.0 # Idle time before session health check (seconds) # Auto-aligned with min(HEALTH_CHECK_INTERVAL, this value) # Default: 60 MCP_SESSION_POOL_HEALTH_CHECK_INTERVAL=60.0 # Timeout waiting for available session slot (seconds) # Default: 30 MCP_SESSION_POOL_ACQUIRE_TIMEOUT=30.0 # Timeout creating new session (seconds) # Default: 30 MCP_SESSION_POOL_CREATE_TIMEOUT=30.0 # Circuit breaker: failures before opening circuit # Default: 5 MCP_SESSION_POOL_CIRCUIT_BREAKER_THRESHOLD=5 # Circuit breaker: seconds before reset # Default: 60 MCP_SESSION_POOL_CIRCUIT_BREAKER_RESET=60.0 # Evict idle pool keys after this time (seconds) # Prevents unbounded growth with rotating tokens # Default: 600 MCP_SESSION_POOL_IDLE_EVICTION=600.0 # Transport timeout for pooled sessions (seconds) # Applies to all HTTP operations (connect, read, write) on pooled sessions. # Use a higher value for deployments with long-running tool calls. # Default: 30 (matches MCP SDK default) MCP_SESSION_POOL_TRANSPORT_TIMEOUT=30.0 # Force explicit RPC (list_tools) on gateway health checks # Off by default: pool's internal staleness check is sufficient # Enable for stricter verification at ~5ms latency cost per check # Default: false MCP_SESSION_POOL_EXPLICIT_HEALTH_RPC=false # Configurable health check chain - ordered list of methods to try (JSON array) # Options: ping, list_tools, list_prompts, list_resources, skip # Default: ["ping", "skip"] (try lightweight ping, skip if unsupported) # Examples: # ["ping", "skip"] - Modern servers (recommended, fastest) # ["ping", "list_tools", "skip"] - Legacy server support # ["skip"] - No health check (maximum performance, use with caution) # ["ping"] - Strict (fail if ping unsupported) MCP_SESSION_POOL_HEALTH_CHECK_METHODS=["ping", "skip"] # Timeout in seconds for each health check attempt # Default: 5.0 MCP_SESSION_POOL_HEALTH_CHECK_TIMEOUT=5.0 ##################################### # Default Root Paths ##################################### # Default root paths (JSON array) # List of default root paths for resource resolution # Example: ["/api/v1", "/mcp"] # Default: [] DEFAULT_ROOTS=[] ##################################### # OpenTelemetry Observability Configuration ##################################### # Enable distributed tracing and metrics collection # Options: true (default), false OTEL_ENABLE_OBSERVABILITY=false # Traces exporter backend # Options: otlp (default), jaeger, zipkin, console, none # - otlp: OpenTelemetry Protocol (works with many backends) # - jaeger: Direct Jaeger integration # - zipkin: Direct Zipkin integration # - console: Print to stdout (debugging) # - none: Disable tracing OTEL_TRACES_EXPORTER=otlp # OTLP endpoint for traces and metrics # Examples: # - Phoenix: http://localhost:4317 # - Jaeger: http://localhost:4317 # - Tempo: http://localhost:4317 OTEL_EXPORTER_OTLP_ENDPOINT=http://localhost:4317 # OTLP protocol # Options: grpc (default), http OTEL_EXPORTER_OTLP_PROTOCOL=grpc # Use insecure connection (no TLS) for OTLP # Options: true (default for localhost), false (use TLS) OTEL_EXPORTER_OTLP_INSECURE=true # OTEL_EXPORTER_OTLP_HEADERS=key1=value1,key2=value2 # OTEL_EXPORTER_JAEGER_ENDPOINT=http://localhost:14268/api/traces # OTEL_EXPORTER_ZIPKIN_ENDPOINT=http://localhost:9411/api/v2/spans OTEL_SERVICE_NAME=mcp-gateway # OTEL_RESOURCE_ATTRIBUTES=service.version=1.0.0,environment=production OTEL_BSP_MAX_QUEUE_SIZE=2048 OTEL_BSP_MAX_EXPORT_BATCH_SIZE=512 OTEL_BSP_SCHEDULE_DELAY=5000 # Copy resource attributes to span attributes (for Arize compatibility) # Some observability backends like Arize require certain attributes as span attributes # rather than resource attributes. Enable this to copy arize.project.name and model_id. # OTEL_COPY_RESOURCE_ATTRS_TO_SPANS=false # Prometheus Metrics Configuration # Enable Prometheus-compatible metrics exposition for monitoring and alerting # Options: true (default), false # When true: Exposes metrics at /metrics/prometheus in Prometheus format # When false: Returns HTTP 503 on metrics endpoint ENABLE_METRICS=true # Comma-separated regex patterns for endpoints to exclude from metrics collection # Use this to avoid high-cardinality issues with dynamic paths or reduce overhead # Examples: # - Exclude SSE endpoints: /servers/.*/sse # - Exclude static files: /static/.* # - Exclude health checks: .*health.* # - Multiple patterns: /servers/.*/sse,/static/.*,.*health.* # Default: "" (no exclusions) METRICS_EXCLUDED_HANDLERS= # Prometheus metrics namespace (prefix for all metric names) # Used to group metrics by application or organization # Example: mycompany_gateway_http_requests_total # Default: "default" METRICS_NAMESPACE=default # Prometheus metrics subsystem (secondary prefix for metric names) # Used for further categorization within namespace # Example: mycompany_api_http_requests_total (if subsystem=api) # Default: "" (no subsystem) METRICS_SUBSYSTEM= # Custom static labels for app_info gauge metric # Format: comma-separated "key=value" pairs (low-cardinality values only) # WARNING: Never use high-cardinality values (user IDs, request IDs, timestamps) # Examples: # - Single label: environment=production # - Multiple labels: environment=production,region=us-east-1,team=platform # - K8s example: cluster=prod-us-east,namespace=mcp-gateway # Default: "" (no custom labels) METRICS_CUSTOM_LABELS= # Plugin Framework Configuration # Enable the plugin system for extending gateway functionality # Options: true, false (default) # When true: Loads and executes plugins from PLUGIN_CONFIG_FILE PLUGINS_ENABLED=false # Path to the plugin configuration file # Contains plugin definitions, hooks, and settings # Default: plugins/config.yaml PLUGIN_CONFIG_FILE=plugins/config.yaml # Optional defaults for mTLS when connecting to external MCP plugins (STREAMABLEHTTP transport) # Provide file paths inside the container. Plugin-specific TLS blocks override these defaults. # PLUGINS_CLIENT_MTLS_CA_BUNDLE=/app/certs/plugins/ca.crt # PLUGINS_CLIENT_MTLS_CERTFILE=/app/certs/plugins/gateway-client.pem # PLUGINS_CLIENT_MTLS_KEYFILE=/app/certs/plugins/gateway-client.key # PLUGINS_CLIENT_MTLS_KEYFILE_PASSWORD= # PLUGINS_CLIENT_MTLS_VERIFY=true # PLUGINS_CLIENT_MTLS_CHECK_HOSTNAME=true ##################################### # Well-Known URI Configuration ##################################### # Enable well-known URI endpoints (/.well-known/*) WELL_KNOWN_ENABLED=true # robots.txt content - Default blocks all crawlers (private API) # Use multiline with proper escaping or keep on one line WELL_KNOWN_ROBOTS_TXT="User-agent: *\nDisallow: /\n\n# MCP Gateway is a private API gateway\n# Public crawling is disabled by default" # security.txt content - Define your security contact information # Format: RFC 9116 (https://www.rfc-editor.org/rfc/rfc9116.html) # Leave empty to disable security.txt # Example: # WELL_KNOWN_SECURITY_TXT="Contact: mailto:security@example.com\nExpires: 2025-12-31T23:59:59Z\nPreferred-Languages: en\nCanonical: https://example.com/.well-known/security.txt" WELL_KNOWN_SECURITY_TXT="" # Additional custom well-known files (JSON format) # Example: {"ai.txt": "AI Usage: This service uses AI for tool orchestration...", "dnt-policy.txt": "We respect DNT headers..."} WELL_KNOWN_CUSTOM_FILES="{}" # Cache control for well-known files (seconds) - 3600 = 1 hour WELL_KNOWN_CACHE_MAX_AGE=3600 ##################################### # Well-Known URI Examples ##################################### # Example 1: Basic security.txt # WELL_KNOWN_SECURITY_TXT="Contact: mailto:security@mycompany.com\nContact: https://mycompany.com/security\nEncryption: https://mycompany.com/pgp-key.txt\nPreferred-Languages: en, es\nCanonical: https://api.mycompany.com/.well-known/security.txt" # Example 2: Custom AI policy # WELL_KNOWN_CUSTOM_FILES={"ai.txt": "# AI Usage Policy\n\nThis MCP Gateway uses AI for:\n- Tool orchestration\n- Response generation\n- Error handling\n\nWe do not use AI for:\n- User data analysis\n- Behavioral tracking\n- Decision making without human oversight"} # Example 3: Allow specific crawlers # WELL_KNOWN_ROBOTS_TXT="User-agent: internal-monitor\nAllow: /health\nAllow: /metrics\n\nUser-agent: *\nDisallow: /" # Example 4: Multiple custom files # WELL_KNOWN_CUSTOM_FILES={"ai.txt": "# AI Usage Policy\n\nThis MCP Gateway uses AI for:\n- Tool orchestration\n- Response generation\n- Error handling\n\nWe do not use AI for:\n- User data analysis\n- Behavioral tracking\n- Decision making without human oversight", "dnt-policy.txt": "# Do Not Track Policy\n\nWe respect the DNT header.\nNo tracking cookies are used.\nOnly essential session data is stored.", "change-password": "https://mycompany.com/account/password"} ##################################### # Validation Settings ##################################### # These settings control input validation and security patterns # Most users won't need to change these defaults # HTML/JavaScript injection patterns (regex) # Used to detect potentially dangerous HTML/JS content # VALIDATION_DANGEROUS_HTML_PATTERN - Pattern to detect dangerous HTML tags # VALIDATION_DANGEROUS_JS_PATTERN - Pattern to detect JavaScript injection attempts # Allowed URL schemes for external requests # Controls which URL schemes are permitted for gateway operations # Default: ["http://", "https://", "ws://", "wss://"] # VALIDATION_ALLOWED_URL_SCHEMES=["http://", "https://", "ws://", "wss://"] # Character validation patterns (regex) # Used to validate various input fields # VALIDATION_NAME_PATTERN - Pattern for validating names (allows spaces) # VALIDATION_IDENTIFIER_PATTERN - Pattern for validating IDs (no spaces) # VALIDATION_SAFE_URI_PATTERN - Pattern for safe URI characters # VALIDATION_UNSAFE_URI_PATTERN - Pattern to detect unsafe URI characters # VALIDATION_TOOL_NAME_PATTERN - MCP tool naming pattern # VALIDATION_TOOL_METHOD_PATTERN - MCP tool method naming pattern # Size limits for various inputs (in characters or bytes) # VALIDATION_MAX_NAME_LENGTH=255 # VALIDATION_MAX_DESCRIPTION_LENGTH=8192 # VALIDATION_MAX_TEMPLATE_LENGTH=65536 # VALIDATION_MAX_CONTENT_LENGTH=1048576 # VALIDATION_MAX_JSON_DEPTH=10 # VALIDATION_MAX_URL_LENGTH=2048 # VALIDATION_MAX_RPC_PARAM_SIZE=262144 # VALIDATION_MAX_METHOD_LENGTH=128 # Rate limiting for validation operations # Maximum requests per minute for validation endpoints # VALIDATION_MAX_REQUESTS_PER_MINUTE=60 # Allowed MIME types for validation (JSON array) # Controls which content types pass validation checks # VALIDATION_ALLOWED_MIME_TYPES=["text/plain", "text/html", "text/css", "text/markdown", "text/javascript", "application/json", "application/xml", "application/pdf", "image/png", "image/jpeg", "image/gif", "image/svg+xml", "application/octet-stream"] ##################################### # Gunicorn Production Server Configuration ##################################### # These settings are used by run-gunicorn.sh for production deployments. # They control the Gunicorn WSGI server behavior. # Number of worker processes # Options: "auto" (default, 2*CPU+1 capped at 16), or any positive integer # Recommendation: 2-4 x $(NUM_CORES) for CPU-bound, more for I/O-bound workloads # GUNICORN_WORKERS=auto # Worker timeout in seconds # Workers that don't respond within this time are killed and restarted # Increase for long-running requests (e.g., LLM streaming, large file uploads) # Default: 600 (10 minutes) # GUNICORN_TIMEOUT=600 # Maximum requests per worker before automatic restart # Helps prevent memory leaks by periodically recycling workers # Default: 100000 # GUNICORN_MAX_REQUESTS=100000 # Random jitter added to max requests (prevents thundering herd on restart) # Default: 100 # GUNICORN_MAX_REQUESTS_JITTER=100 # Preload application before forking workers # Options: true (default), false # true: Saves memory (shared code), runs migrations once before forking # false: Each worker loads app independently (more memory, better isolation) # GUNICORN_PRELOAD_APP=true # Developer mode with hot reload # Options: true, false (default) # Enables --reload flag and reduces workers for easier debugging # WARNING: Disables preload_app. Not for production! # GUNICORN_DEV_MODE=false # Force start even if another instance is running # Options: true, false (default) # Bypasses the lock file check at /tmp/mcpgateway-gunicorn.lock # FORCE_START=false ##################################### # Gunicorn SSL/TLS Configuration ##################################### # Enable HTTPS for production deployments # Enable TLS/SSL # Options: true, false (default) # SSL=false # Path to SSL certificate file (PEM format) # CERT_FILE=certs/cert.pem # Path to SSL private key file (PEM format) # KEY_FILE=certs/key.pem # Passphrase for encrypted private key (optional) # If your key is passphrase-protected, set this value # The key will be decrypted by the SSL key manager before Gunicorn starts # KEY_FILE_PASSWORD= # CERT_PASSPHRASE= ##################################### # Development Configuration ##################################### # Enable development mode (relaxed security, verbose logging) # Options: true, false (default) # WARNING: Never use in production! DEV_MODE=false # Enable auto-reload on code changes (for development) # Options: true, false (default) # Requires: Running with uvicorn directly (not gunicorn) RELOAD=false # Enable Jinja2 template auto-reload (for development) # Options: true, false (default) # Set to true for development to auto-detect template changes # Performance impact: Disabling reduces CPU usage for admin UI under load # Note: `make dev` automatically sets this to true TEMPLATES_AUTO_RELOAD=false # Enable debug mode (verbose error messages, stack traces) # Options: true, false (default) # WARNING: May expose sensitive information! DEBUG=false # Header Passthrough (WARNING: Security implications) ENABLE_HEADER_PASSTHROUGH=false ENABLE_OVERWRITE_BASE_HEADERS=false DEFAULT_PASSTHROUGH_HEADERS=["X-Tenant-Id", "X-Trace-Id"] # Authorization Header Conflict Resolution: # When gateway uses auth, use X-Upstream-Authorization header to pass # authorization to upstream servers (automatically renamed to Authorization) # GlobalConfig In-Memory Cache TTL (Issue #1715) # Caches GlobalConfig (passthrough headers settings) in memory to reduce DB queries. # Under load (1000+ concurrent users), this eliminates 42,000+ redundant DB queries. # Trade-off: Config changes take up to TTL seconds to propagate (use admin API to force refresh). # Default: 60 seconds. Range: 5-3600 seconds. GLOBAL_CONFIG_CACHE_TTL=300 # A2A Stats In-Memory Cache TTL # Caches A2A agent counts (total, active) to avoid redundant COUNT queries on /metrics calls. # Trade-off: Agent count changes take up to TTL seconds to propagate (use admin API to force refresh). # Default: 30 seconds. Range: 5-3600 seconds. A2A_STATS_CACHE_TTL=30 # Enable auto-completion for plugins CLI PLUGINS_CLI_COMPLETION=false MCPGATEWAY_UI_TOOL_TEST_TIMEOUT=120000 # Set markup mode for plugins CLI # Valid options: # rich: use rich markup # markdown: allow markdown in help strings # disabled: disable markup # If unset (commented out), uses "rich" if rich is detected, otherwise disables it. PLUGINS_CLI_MARKUP_MODE=rich ##################################### # Security Validation Settings ##################################### # Minimum length for secret keys (JWT, encryption) MIN_SECRET_LENGTH=32 # Minimum length for passwords MIN_PASSWORD_LENGTH=12 # Enforce strong secrets (set to true to fail startup on critical issues) # Default is false to maintain backward compatibility REQUIRE_STRONG_SECRETS=false # Security validation thresholds # Set to false to allow startup with security warnings # NOT RECOMMENDED for production! # REQUIRE_STRONG_SECRETS=false ##################################### # ToolOps Configuration ##################################### # Enable the ToolOps functionality (true/false) # When disabled, ToolOps features will be completely hidden from UI and APIs # Default: false (must be explicitly enabled) TOOLOPS_ENABLED=false ##################################### # LLM Chat MCP Client Configuration ##################################### # Enable the LLM Chat functionality (true/false) # When disabled, LLM chat features will be completely hidden from UI and APIs # Default: false (must be explicitly enabled) LLMCHAT_ENABLED=false # LLM Provider Configuration # All LLM providers are now configured via Admin UI -> Settings -> LLM Settings. # Add providers (OpenAI, Azure OpenAI, Anthropic, AWS Bedrock, Ollama, watsonx) # and their models through the Admin UI. API keys and credentials are securely # stored in the database. # Redis Configuration for chat session storage and maintaining history # CACHE_TYPE should be set to "redis" and REDIS_URL configured appropriately as mentioned in the caching section. # Seconds for active_session key TTL LLMCHAT_SESSION_TTL=300 # Seconds for lock expiry LLMCHAT_SESSION_LOCK_TTL=30 # How many times to poll while waiting LLMCHAT_SESSION_LOCK_RETRIES=10 # Seconds between polls LLMCHAT_SESSION_LOCK_WAIT=0.2 # Seconds for chat history expiry LLMCHAT_CHAT_HISTORY_TTL=3600 # Maximum message history to store per user LLMCHAT_CHAT_HISTORY_MAX_MESSAGES=50 ##################################### # LLM Settings (Internal API) ##################################### # These settings control the internal LLM API that allows the gateway to # act as a unified LLM provider. Configure external providers in the Admin UI. # API prefix for internal LLM endpoints (OpenAI-compatible) # Default: /v1 LLM_API_PREFIX=/v1 # Request timeout in seconds for LLM API calls # Default: 120 LLM_REQUEST_TIMEOUT=120 # Enable streaming responses for LLM Chat # Default: true LLM_STREAMING_ENABLED=true # Provider health check interval in seconds # Default: 300 (5 minutes) LLM_HEALTH_CHECK_INTERVAL=300 ##################################### # Pagination Configuration ##################################### # Default number of items per page for paginated endpoints # Applies to: tools, resources, prompts, servers, gateways, users, teams, tokens, etc. # Default: 50, Min: 1, Max: 1000 PAGINATION_DEFAULT_PAGE_SIZE=50 # Maximum allowed items per page (prevents abuse) # Default: 500, Min: 1, Max: 10000 PAGINATION_MAX_PAGE_SIZE=500 # Minimum items per page # Default: 1 PAGINATION_MIN_PAGE_SIZE=1 # Threshold for switching from offset to cursor-based pagination # When result set exceeds this count, use cursor-based pagination for performance # Default: 10000 PAGINATION_CURSOR_THRESHOLD=10000 # Enable cursor-based pagination globally # Options: true (default), false # When false, only offset-based pagination is used PAGINATION_CURSOR_ENABLED=true # Default sort field for paginated queries # Default: created_at PAGINATION_DEFAULT_SORT_FIELD=created_at # Default sort order for paginated queries # Options: asc, desc (default) PAGINATION_DEFAULT_SORT_ORDER=desc # Maximum offset allowed for offset-based pagination (prevents abuse) # Default: 100000 (100K records) PAGINATION_MAX_OFFSET=100000 # Cache pagination counts for performance (seconds) # Set to 0 to disable caching # Default: 300 (5 minutes) PAGINATION_COUNT_CACHE_TTL=300 # Enable pagination links in API responses # Options: true (default), false PAGINATION_INCLUDE_LINKS=true # Base URL for pagination links (defaults to request URL) # PAGINATION_BASE_URL=https://api.example.com ##################################### # gRPC Support Settings (EXPERIMENTAL) ##################################### # Enable gRPC to MCP translation support (disabled by default) # Requires: pip install mcp-contextforge-gateway[grpc] # MCPGATEWAY_GRPC_ENABLED=false # Enable gRPC server reflection by default for service discovery # MCPGATEWAY_GRPC_REFLECTION_ENABLED=true # Maximum gRPC message size in bytes (4MB default) # MCPGATEWAY_GRPC_MAX_MESSAGE_SIZE=4194304 # Default gRPC call timeout in seconds # MCPGATEWAY_GRPC_TIMEOUT=30 # Enable TLS for gRPC connections by default # MCPGATEWAY_GRPC_TLS_ENABLED=false ##################################### # Audit Trail Logging ##################################### # Enable audit trail logging for compliance (CRUD operations on all resources) # Default: false (disabled for performance - causes a DB write on EVERY API request) # When enabled, logs all create, read, update, delete operations to the audit_trails table # WARNING: This can generate millions of rows during load testing! # Only enable for production compliance requirements (SOC2, HIPAA, etc.) # AUDIT_TRAIL_ENABLED=false ##################################### # Security Event Logging ##################################### # Enable security event logging (authentication attempts, authorization failures, etc.) # Default: false (disabled for performance - can cause high DB write load) # When enabled, the AuthContextMiddleware will log authentication events to the database # This is INDEPENDENT of observability settings # SECURITY_LOGGING_ENABLED=false # Security logging level - controls what events are logged to the database # Options: # - "all" : Log ALL events including successful auth (WARNING: high DB load!) # - "failures_only" : Log only authentication/authorization failures (recommended) # - "high_severity" : Log only high/critical severity events # Default: failures_only # SECURITY_LOGGING_LEVEL=failures_only ##################################### # Observability Settings ##################################### # Enable observability tracing and metrics collection # When enabled, all HTTP requests will be traced with detailed timing, status codes, and context # OBSERVABILITY_ENABLED=false # Automatically trace HTTP requests # OBSERVABILITY_TRACE_HTTP_REQUESTS=true # Number of days to retain trace data # OBSERVABILITY_TRACE_RETENTION_DAYS=7 # Maximum number of traces to retain (prevents unbounded growth) # OBSERVABILITY_MAX_TRACES=100000 # Trace sampling rate (0.0-1.0) - 1.0 means trace everything, 0.1 means trace 10% # OBSERVABILITY_SAMPLE_RATE=1.0 # Paths to include for tracing (JSON array of regex patterns) # OBSERVABILITY_INCLUDE_PATHS=["^/rpc/?$","^/sse$","^/message$","^/mcp(?:/|$)","^/servers/[^/]+/mcp/?$","^/servers/[^/]+/sse$","^/servers/[^/]+/message$","^/a2a(?:/|$)"] # Paths to exclude from tracing (JSON array of regex patterns, applied after include patterns) # OBSERVABILITY_EXCLUDE_PATHS=["/health", "/healthz", "/ready", "/metrics", "/static/.*"] # Enable metrics collection # OBSERVABILITY_METRICS_ENABLED=true # Enable event logging within spans # OBSERVABILITY_EVENTS_ENABLED=true ##################################### # Performance Monitoring Settings ##################################### # Enable performance tracking tab in admin UI (default: false) # Shows real-time CPU, memory, disk, network, worker, and request metrics # MCPGATEWAY_PERFORMANCE_TRACKING=false # Metric collection interval in seconds (default: 10) # How often to sample system metrics for historical data # MCPGATEWAY_PERFORMANCE_COLLECTION_INTERVAL=10 # Snapshot retention period in hours (default: 24) # How long to keep individual metric snapshots # MCPGATEWAY_PERFORMANCE_RETENTION_HOURS=24 # Aggregate retention period in days (default: 90) # How long to keep hourly/daily aggregated metrics # MCPGATEWAY_PERFORMANCE_RETENTION_DAYS=90 # Maximum performance snapshots to retain (default: 10000) # Prevents unbounded database growth # MCPGATEWAY_PERFORMANCE_MAX_SNAPSHOTS=10000 # Enable distributed mode for multi-container deployments (default: false) # Uses Redis to aggregate metrics from multiple workers/containers # MCPGATEWAY_PERFORMANCE_DISTRIBUTED=false # Enable network connections counting (default: true) # psutil.net_connections() can be CPU intensive under heavy load # Disable to skip network connection counting entirely # MCPGATEWAY_PERFORMANCE_NET_CONNECTIONS_ENABLED=true # Cache TTL for net_connections in seconds (default: 15) # Throttles expensive psutil.net_connections() calls # Higher values reduce CPU usage but report stale connection counts # MCPGATEWAY_PERFORMANCE_NET_CONNECTIONS_CACHE_TTL=15 ##################################### # Ed25519 Key Support ##################################### # Ed25519 private key for signing ENABLE_ED25519_SIGNING=false PREV_ED2519_PRIVATE_KEY="" ED25519_PRIVATE_KEY=private_key

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/IBM/mcp-context-forge'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

.env.example•76.3 KiB