# CodeGraph Configuration System
## Overview
The CodeGraph configuration system provides a flexible, extensible framework for managing embedding models and performance modes. It supports runtime configuration switching, hot reloading, environment variable overrides, and predefined templates for common use cases.
## Architecture
### Core Components
1. **AdvancedConfig**: Main configuration structure containing all settings
2. **EmbeddingModelConfig**: Configuration for embedding models (OpenAI, Local, etc.)
3. **PerformanceModeConfig**: Performance tuning settings (accuracy vs speed tradeoffs)
4. **ConfigurationManager**: Runtime configuration management with hot reload support
5. **CLI Integration**: Command-line interface for configuration management
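The examples later in this document show each of these pieces in use; as a quick orientation, here is a minimal sketch of how they fit together. The tokio runtime and the exact field names are assumptions inferred from the snippets below, not a verified API:
```rust
use codegraph_core::{AdvancedConfig, ConfigurationManager};

#[tokio::main]
async fn main() -> Result<(), Box<dyn std::error::Error>> {
    // AdvancedConfig aggregates the embedding and performance sections
    let config = AdvancedConfig::from_file("config.toml")?;
    println!("embedding dimension: {}", config.embedding.dimension);

    // ConfigurationManager wraps an AdvancedConfig and layers runtime
    // switching and hot reload on top of it
    let manager = ConfigurationManager::new(config);
    manager.start_hot_reload().await?;
    Ok(())
}
```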
### Module Structure
```
crates/codegraph-core/src/
├── advanced_config.rs # Main configuration module
├── embedding_config.rs # Embedding model configurations
├── performance_config.rs # Performance mode configurations
└── cli_config.rs # CLI integration interface
```
## Configuration Schema
### Embedding Configuration
```toml
[embedding]
provider = "openai" # Options: openai, local, cohere, huggingface, custom
dimension = 1536 # Vector dimension size
cache_enabled = true # Enable embedding cache
cache_ttl_secs = 3600 # Cache TTL in seconds
normalize_embeddings = true # Normalize vectors
# Provider-specific settings
[embedding.openai]
model = "text-embedding-3-small"
api_key_env = "OPENAI_API_KEY"
api_base = "https://api.openai.com/v1"
max_retries = 3
timeout_secs = 30
[embedding.local]
model_path = "./models/all-MiniLM-L6-v2"
model_type = "sentence-transformers"
device = "cpu" # Options: cpu, cuda
batch_size = 32
max_sequence_length = 512
```
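The crate presumably deserializes this section with serde. As an illustration of how the table above maps onto Rust types, here is a self-contained sketch using the `toml` crate; the struct names and the `Option` nesting are assumptions, not the crate's actual definitions:
```rust
use serde::Deserialize;

// Illustrative mirror of the [embedding] table above; the real
// EmbeddingModelConfig may differ in naming and nesting.
#[derive(Debug, Deserialize)]
struct EmbeddingSection {
    provider: String,
    dimension: usize,
    cache_enabled: bool,
    cache_ttl_secs: u64,
    normalize_embeddings: bool,
    openai: Option<OpenAiSettings>,
}

#[derive(Debug, Deserialize)]
struct OpenAiSettings {
    model: String,
    api_key_env: String,
    api_base: String,
    max_retries: u32,
    timeout_secs: u64,
}

#[derive(Debug, Deserialize)]
struct Root {
    embedding: EmbeddingSection,
}

fn main() -> Result<(), Box<dyn std::error::Error>> {
    let raw = std::fs::read_to_string("config.toml")?;
    let root: Root = toml::from_str(&raw)?;
    println!("{} ({} dims)", root.embedding.provider, root.embedding.dimension);
    Ok(())
}
```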
### Performance Configuration
```toml
[performance]
mode = "balanced" # Options: high_accuracy, balanced, high_speed, ultra_fast, custom
auto_tune = true # Enable automatic performance tuning
profile_enabled = false # Enable profiling
[performance.index]
index_type = "IVFFlat" # FAISS index type
nprobe = 16 # Number of probes for search
nlist = 100 # Number of clusters
m = 32 # HNSW graph degree (max connections per node)
ef_construction = 200 # HNSW build-time candidate list size
ef_search = 64 # HNSW search-time candidate list size
use_gpu = false # Use GPU acceleration
quantization = "PQ8" # Optional quantization
[performance.cache]
enabled = true
max_size_mb = 256
ttl_secs = 3600
eviction_policy = "lru" # Options: lru, lfu, fifo
preload_common = false
[performance.processing]
batch_size = 32
parallel_workers = 8
chunk_size = 512
overlap_size = 50
max_queue_size = 1000
timeout_secs = 30
```
### Runtime Configuration
```toml
[runtime]
allow_runtime_switching = true
hot_reload = true
config_watch_interval_secs = 30
fallback_configs = ["config/fallback.toml"]
[runtime.environment_overrides]
CODEGRAPH_CACHE_SIZE = "performance.cache.max_size_mb"
CODEGRAPH_BATCH_SIZE = "performance.processing.batch_size"
```
### Monitoring Configuration
```toml
[monitoring]
enabled = true
metrics_enabled = true
trace_enabled = false
profile_enabled = false
metrics_interval_secs = 60
export_targets = ["prometheus://localhost:9090"]
```
## Presets and Templates
### Embedding Presets
| Preset | Provider | Model | Dimension | Use Case |
|--------|----------|-------|-----------|----------|
| openai-small | OpenAI | text-embedding-3-small | 1536 | Fast, cost-effective |
| openai-large | OpenAI | text-embedding-3-large | 3072 | High quality |
| openai-ada | OpenAI | text-embedding-ada-002 | 1536 | Legacy, stable |
| local-minilm | Local | all-MiniLM-L6-v2 | 384 | Fast, lightweight |
| local-mpnet | Local | all-mpnet-base-v2 | 768 | High quality |
### Performance Profiles
| Profile | Mode | Use Cases |
|---------|------|-----------|
| research | HighAccuracy | Academic research, precision-critical analysis |
| production | Balanced | Web applications, API services |
| realtime | HighSpeed | Interactive applications, live search |
| edge | UltraFast | Edge devices, IoT, resource-limited |
### Quick Configurations
| Template | Embedding | Performance | Monitoring | Description |
|----------|-----------|-------------|------------|-------------|
| development | local-minilm | research | Yes | Development with fast iteration |
| staging | openai-small | production | Yes | Staging with balanced performance |
| production | openai-large | production | Yes | Production with high performance |
| edge | local-minilm | edge | No | Edge deployment with minimal resources |
## CLI Usage
### Installation
The configuration CLI is integrated into the codegraph-core crate:
```bash
cargo install --path crates/codegraph-core
```
### Basic Commands
```bash
# Initialize configuration
codegraph-config init --template production
# Show current configuration
codegraph-config show
codegraph-config show --json --pretty
# Validate configuration
codegraph-config validate --strict
# Apply template
codegraph-config apply staging --dry-run
codegraph-config apply production
# List available presets
codegraph-config list --detailed
```
### Embedding Configuration
```bash
# Set embedding provider
codegraph-config embedding set openai --model text-embedding-3-small --dimension 1536
# Apply embedding preset
codegraph-config embedding preset openai-large
# Show embedding configuration
codegraph-config embedding show
# List available presets
codegraph-config embedding list
```
### Performance Configuration
```bash
# Set performance mode
codegraph-config performance set high_speed --auto-tune
# Apply performance profile
codegraph-config performance profile realtime
# Show performance configuration
codegraph-config performance show
# Run auto-tuning
codegraph-config performance auto-tune --iterations 20
```
### Advanced Commands
```bash
# Generate configuration schema
codegraph-config schema --format json --output schema.json
codegraph-config schema --format yaml
# Test configuration
codegraph-config test --quick
codegraph-config test --profile
# Use custom configuration file
codegraph-config --config custom.toml show
```
## Runtime Management
### Configuration Manager API
```rust
use codegraph_core::{AdvancedConfig, ConfigurationManager, PerformanceMode};

// The calls below are async; run them inside an async context
// (e.g. a tokio runtime).

// Create manager from file
let manager = ConfigurationManager::from_file("config.toml")?;
// Get current configuration
let config = manager.get_config().await;
// Switch performance mode at runtime
manager.switch_performance_mode(PerformanceMode::HighSpeed).await?;
// Switch embedding preset
manager.switch_embedding_preset("openai-large").await?;
// Update configuration
let mut config = manager.get_config().await;
config.performance.cache.max_size_mb = 512;
manager.update_config(config).await?;
// Enable hot reload
manager.start_hot_reload().await?;
```
### Environment Variable Overrides
Set environment variables to override configuration values:
```bash
# Override embedding provider
export CODEGRAPH_EMBEDDING_PROVIDER=openai
# Override performance mode
export CODEGRAPH_PERFORMANCE_MODE=high_speed
# Custom overrides (defined in config)
export CODEGRAPH_CACHE_SIZE=1024
export CODEGRAPH_BATCH_SIZE=64
```
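Each entry in `[runtime.environment_overrides]` pairs an environment variable with a dotted path into the configuration tree. The crate's actual resolution code isn't shown in this document; the following is a hypothetical sketch of that mechanism over a generic `toml::Value`:
```rust
use std::env;

// Hypothetical: apply one env-var override to a dotted config path.
// The real ConfigurationManager presumably does something equivalent
// when runtime.environment_overrides is set.
fn apply_override(config: &mut toml::Value, path: &str, var: &str) {
    let Ok(raw) = env::var(var) else { return };
    let mut node = config;
    let mut parts = path.split('.').peekable();
    while let Some(key) = parts.next() {
        if parts.peek().is_none() {
            // Leaf key: store as an integer when it parses, else a string
            let value = raw
                .parse::<i64>()
                .map(toml::Value::Integer)
                .unwrap_or_else(|_| toml::Value::String(raw.clone()));
            if let Some(table) = node.as_table_mut() {
                table.insert(key.to_string(), value);
            }
            return;
        }
        match node.get_mut(key) {
            Some(next) => node = next,
            None => return, // path does not exist; ignore the override
        }
    }
}

fn main() {
    let mut config: toml::Value =
        toml::from_str("[performance.cache]\nmax_size_mb = 256").unwrap();
    apply_override(&mut config, "performance.cache.max_size_mb", "CODEGRAPH_CACHE_SIZE");
    println!("{config}");
}
```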
### Auto-Tuning
The system supports automatic performance tuning based on available resources:
```rust
// Start from balanced defaults, then tune for the host machine
let mut config = PerformanceModeConfig::balanced();
// Resource limits for this host (hardcoded here for illustration)
let available_memory_mb = 8192;
let cpu_cores = 16;
config.apply_auto_tuning(available_memory_mb, cpu_cores);
```
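The memory and core counts above are hardcoded for illustration. The core count can be detected with the standard library; total memory has no portable std probe, so a crate such as sysinfo or a deployment-supplied value would be needed. A sketch, using the same assumed `apply_auto_tuning` signature as above:
```rust
use std::thread;

// Detect the logical core count; fall back to 1 if detection fails
let cpu_cores = thread::available_parallelism().map(|n| n.get()).unwrap_or(1);

// No portable std API for total RAM; a real deployment would read this
// from sysinfo or an environment-provided limit
let available_memory_mb = 8192;

let mut config = PerformanceModeConfig::balanced();
config.apply_auto_tuning(available_memory_mb, cpu_cores);
```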
## Best Practices
### Development Environment
1. Use local embedding models for faster iteration
2. Enable hot reload for configuration changes
3. Use the "development" quick config template
4. Enable all monitoring features for debugging
### Production Environment
1. Use OpenAI large models for best quality
2. Apply the "production" profile
3. Enable metrics but disable profiling
4. Set up proper API key management
5. Configure fallback configurations
### Performance Optimization
1. **High Accuracy**: Use for research and precision-critical tasks
- Flat index for exact search
- Smaller batch sizes
- More overlap in chunking
2. **Balanced**: Default for most applications
- IVFFlat index for good accuracy/speed balance
- Moderate batch sizes
- Standard worker count
3. **High Speed**: For real-time applications
- IVFPQ index with quantization
- Larger batch sizes
- More parallel workers
4. **Ultra Fast**: For edge deployment
- Aggressive quantization
- Minimal cache
- Maximum parallelization
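Only `PerformanceModeConfig::balanced()` appears in the snippets elsewhere in this document; assuming sibling constructors exist for the other modes (an unverified assumption), choosing a mode per deployment target might look like this sketch:
```rust
use codegraph_core::PerformanceModeConfig;

// Hypothetical deployment targets for this sketch
enum Target {
    Research,
    Production,
    Realtime,
    Edge,
}

fn performance_for(target: Target) -> PerformanceModeConfig {
    // Constructors other than balanced() are assumed, mirroring the
    // balanced() example in the Auto-Tuning section
    match target {
        Target::Research => PerformanceModeConfig::high_accuracy(),
        Target::Production => PerformanceModeConfig::balanced(),
        Target::Realtime => PerformanceModeConfig::high_speed(),
        Target::Edge => PerformanceModeConfig::ultra_fast(),
    }
}
```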
## Validation Rules
The configuration system enforces the following validation rules (a code sketch of these checks follows the list):
1. **Embedding Dimension**: Must be between 1 and 8192
2. **Provider Configuration**: Provider-specific config must exist when provider is selected
3. **Batch Size**: Must be greater than 0
4. **Worker Count**: Must be greater than 0
5. **Chunk Size**: Must be greater than overlap size
6. **Cache Size**: Must be greater than 0 when enabled
7. **Index Parameters**: nprobe must be between 1 and nlist
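As an illustration (not the crate's actual validator), the seven rules above can be checked in a few lines. Field paths follow the TOML schema earlier in this document and are assumptions about the struct layout:
```rust
use codegraph_core::AdvancedConfig;

// Illustrative check of the validation rules; the real implementation
// lives in codegraph-core and may differ in structure and error types.
fn validate(cfg: &AdvancedConfig) -> Result<(), String> {
    let e = &cfg.embedding;
    if e.dimension == 0 || e.dimension > 8192 {
        return Err("embedding dimension must be in 1..=8192".into());
    }
    if e.provider == "openai" && e.openai.is_none() {
        return Err("provider 'openai' selected but [embedding.openai] is missing".into());
    }
    let p = &cfg.performance.processing;
    if p.batch_size == 0 {
        return Err("batch_size must be greater than 0".into());
    }
    if p.parallel_workers == 0 {
        return Err("parallel_workers must be greater than 0".into());
    }
    if p.chunk_size <= p.overlap_size {
        return Err("chunk_size must be greater than overlap_size".into());
    }
    let c = &cfg.performance.cache;
    if c.enabled && c.max_size_mb == 0 {
        return Err("cache max_size_mb must be greater than 0 when enabled".into());
    }
    let i = &cfg.performance.index;
    if i.nprobe == 0 || i.nprobe > i.nlist {
        return Err("nprobe must be between 1 and nlist".into());
    }
    Ok(())
}
```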
## Examples
### Example 1: Production Configuration
```toml
# config/production.toml
[embedding]
provider = "openai"
dimension = 3072
[embedding.openai]
model = "text-embedding-3-large"
api_key_env = "OPENAI_API_KEY"
max_retries = 5
timeout_secs = 60
[performance]
mode = "balanced"
auto_tune = true
[runtime]
allow_runtime_switching = false
hot_reload = false
[monitoring]
enabled = true
metrics_enabled = true
export_targets = ["prometheus://metrics.example.com:9090"]
```
### Example 2: Edge Configuration
```toml
# config/edge.toml
[embedding]
provider = "local"
dimension = 384
[embedding.local]
model_path = "/opt/models/minilm"
device = "cpu"
batch_size = 16
[performance]
mode = "ultra_fast"
[performance.cache]
enabled = true
max_size_mb = 32
[performance.processing]
batch_size = 128
parallel_workers = 4
chunk_size = 2048
[monitoring]
enabled = false
```
## Integration with MCP Server
The configuration system integrates with the MCP (Model Context Protocol) server:
```rust
use codegraph_core::{AdvancedConfig, ConfigurationManager, PerformanceMode};

// In MCP server initialization (async context assumed)
let config = AdvancedConfig::from_file("config.toml")?;
let manager = ConfigurationManager::new(config);
// Use configuration in MCP handlers
let embedding_config = manager.get_config().await.embedding;
let performance_config = manager.get_config().await.performance;
// Dynamic reconfiguration via MCP, e.g. in response to a client request
manager.switch_performance_mode(PerformanceMode::HighSpeed).await?;
```
## Troubleshooting
### Common Issues
1. **Configuration not loading**: Check file path and TOML syntax
2. **Validation errors**: Review error messages for specific constraints
3. **Environment overrides not working**: Ensure variables are exported
4. **Hot reload not detecting changes**: Check file permissions and watch interval
5. **Performance issues**: Run auto-tuning or adjust worker counts
### Debug Commands
```bash
# Validate configuration with verbose output
codegraph-config validate --strict --verbose
# Test configuration with profiling
codegraph-config test --profile
# Show parsed configuration as JSON
codegraph-config show --json --pretty
# Generate schema for validation
codegraph-config schema --format json
```
## Future Enhancements
1. **Dynamic Model Selection**: Automatic model selection based on query characteristics
2. **A/B Testing Support**: Built-in support for comparing configurations
3. **Cloud Configuration**: Remote configuration management
4. **Performance Analytics**: Detailed performance metrics and recommendations
5. **Configuration Migration**: Tools for migrating between configuration versions
6. **Multi-Tenant Support**: Per-tenant configuration overrides
7. **Configuration Versioning**: Track and rollback configuration changes
8. **Automated Optimization**: ML-based configuration optimization