# CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

## Project Overview

This is the **AutoDocs MCP Server** project - a **fully implemented** Model Context Protocol (MCP) server that automatically provides AI assistants with contextual, version-specific documentation for Python project dependencies. The goal is to eliminate manual package documentation lookup and provide more accurate AI coding assistance.

## Architecture

The project follows a modular, layered architecture with comprehensive separation of concerns. The system is organized into several key layers:

### Core Services Layer (`src/autodocs_mcp/core/`)

- **dependency_parser.py**: PyProject.toml parsing with graceful degradation for malformed dependencies
- **dependency_resolver.py**: Enhanced dependency resolution with conflict detection
- **version_resolver.py**: Version constraint resolution using the PyPI API
- **doc_fetcher.py**: PyPI documentation fetching with concurrent request handling
- **cache_manager.py**: High-performance JSON file-based caching with version-specific keys
- **context_fetcher.py**: Phase 4 comprehensive context fetching with dependency analysis
- **context_formatter.py**: AI-optimized documentation formatting with token management
- **network_client.py**: HTTP client abstraction with retry logic and connection pooling
- **network_resilience.py**: Advanced network reliability with circuit breakers and backoff
- **error_formatter.py**: Structured error handling with user-friendly messages

### Infrastructure Layer (`src/autodocs_mcp/`)

- **main.py**: FastMCP server with 8 MCP tools, graceful shutdown, and async lifecycle management
- **config.py**: Comprehensive configuration management with environment validation
- **security.py**: Input validation and security controls
- **observability.py**: Metrics collection, performance tracking, and monitoring
- **health.py**: Health checks for orchestration and load balancing
- **models.py**: Pydantic data models for type safety
- **exceptions.py**: Custom exception hierarchy with error context

## MCP Tools Available

The server exposes eight MCP tools organized into three categories:

### Core Documentation Tools

1. **scan_dependencies**: Parse pyproject.toml and extract dependencies with graceful error handling
2. **get_package_docs**: Legacy single-package documentation tool with version-specific caching
3. **get_package_docs_with_context**: 🚀 **Primary Phase 4 Tool** - Comprehensive documentation context including dependencies with smart scoping

### Cache Management Tools

4. **refresh_cache**: Clear the entire documentation cache
5. **get_cache_stats**: View cache statistics and cached packages

### System Health & Monitoring Tools

6. **health_check**: Comprehensive health status for monitoring and load balancers
7. **ready_check**: Kubernetes-style readiness check for deployment orchestration
8. **get_metrics**: Performance statistics and system metrics for monitoring
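As a rough illustration of how tools like these are exposed, here is a minimal FastMCP sketch. It assumes the standalone `fastmcp` package's `FastMCP` class and `@mcp.tool()` decorator (the official MCP SDK exposes an equivalent class); the tool bodies, parameter names, and return shapes below are placeholders, not the real implementations in `main.py`.

```python
from fastmcp import FastMCP  # assumption: the standalone `fastmcp` package is installed

mcp = FastMCP("AutoDocs MCP Server")

@mcp.tool()
async def scan_dependencies(project_path: str = ".") -> dict:
    """Placeholder: the real tool parses pyproject.toml with graceful error handling."""
    return {"project_path": project_path, "dependencies": []}

@mcp.tool()
async def get_package_docs_with_context(package_name: str) -> dict:
    """Placeholder: the real tool returns docs for the package plus scoped dependencies."""
    return {"package": package_name, "context": []}

if __name__ == "__main__":
    mcp.run()  # stdio transport by default, matching the MCP integration notes below
```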
### Tool Usage Recommendations

- **For AI contexts**: Use `get_package_docs_with_context` for rich dependency-aware documentation
- **For simple lookups**: Use `get_package_docs` for single-package documentation
- **For development**: Use `scan_dependencies` to understand project structure
- **For operations**: Use health/ready checks and metrics for deployment monitoring

## Key Technical Decisions

### Caching & Performance

- **Version-Based Caching**: Immutable cache keys `{package_name}-{version}` with no time expiration
- **Concurrent Processing**: Parallel dependency fetching with configurable limits
- **Connection Pooling**: HTTP connection reuse with automatic cleanup
- **Circuit Breakers**: Network resilience with exponential backoff

### Architecture & Reliability

- **Layered Architecture**: Clear separation between core services and infrastructure
- **Graceful Degradation**: Continues processing with partial failures and malformed data
- **Comprehensive Error Handling**: Structured error responses with recovery suggestions
- **Production Readiness**: Health checks, metrics, graceful shutdown, and configuration validation

### Integration & Protocol

- **MCP Transport**: stdio protocol compliance with stderr-only logging
- **PyPI Integration**: `https://pypi.org/pypi/{package_name}/json` for version resolution and documentation
- **Input Validation**: Security-focused validation for all user inputs
- **Async Architecture**: Full async support with proper resource management

### Dependencies & Ecosystem

- **Core Framework**: FastMCP for MCP server implementation
- **HTTP Client**: httpx for async HTTP operations
- **Data Validation**: Pydantic v2 for type safety and serialization
- **Logging**: structlog for structured, production-ready logging
- **Configuration**: Environment-aware configuration with validation
- **Testing**: Comprehensive pytest ecosystem (mock, asyncio, httpx, cov, xdist)

### Phase 4 Context Features

- **Smart Dependency Scoping**: Intelligent selection of relevant dependencies
- **Token Budget Management**: Automatic context truncation for AI model limits
- **Dependency Analysis**: Runtime vs dev dependency classification
- **Context Prioritization**: Most relevant packages selected first
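To make the token budget management and context prioritization above concrete, the sketch below greedily packs the most relevant dependency docs into a fixed budget. The 4-characters-per-token estimate and the function shape are illustrative assumptions, not the server's actual context formatter.

```python
def build_context(candidates: list[tuple[str, str, float]], token_budget: int) -> list[str]:
    """Select docs from (package, docs_text, relevance) tuples, highest relevance first."""
    selected: list[str] = []
    used = 0
    for name, docs, _relevance in sorted(candidates, key=lambda c: c[2], reverse=True):
        cost = max(1, len(docs) // 4)  # crude token estimate: ~4 characters per token
        if used + cost > token_budget:
            continue  # skip anything that would exceed the budget; smaller docs may still fit
        selected.append(f"## {name}\n{docs}")
        used += cost
    return selected
```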
## Implementation Status: PHASE 4 COMPLETE ✅

**Priority 1 (Core Validation)**: ✅ Complete
- Dependency parsing with graceful degradation and security validation
- MCP protocol integration with 8 comprehensive tools
- Extensive test coverage with the pytest ecosystem

**Priority 2 (Documentation Fetching)**: ✅ Complete
- PyPI API integration with concurrent processing and version resolution
- High-performance version-based caching system
- AI-optimized documentation formatting with query filtering
- Production-tested with major packages (requests, pydantic, fastapi, django, etc.)

**Priority 3 (Graceful Degradation)**: ✅ Complete
- Network resilience with circuit breakers and exponential backoff
- Comprehensive error handling with structured, user-friendly messages
- Partial failure recovery with detailed performance metrics
- Production-ready health checks and monitoring

**Priority 4 (Dependency Context)**: ✅ Complete
- Smart dependency context fetching with configurable scoping
- Token budget management for AI model limits
- Runtime vs development dependency classification
- Concurrent processing with performance optimization
- Context prioritization and truncation strategies

## Installation & Usage

### Install the MCP Server

```bash
# Install with uv (recommended)
uv sync

# Or install with pip
pip install -e .
```

### Development & Testing

```bash
# Test dependency scanning
uv run python scripts/dev.py test-scan

# Test documentation fetching
uv run python scripts/dev.py test-docs requests ">=2.28.0"

# Check cache stats
uv run python scripts/dev.py cache-stats

# Clear cache
uv run python scripts/dev.py clear-cache
```

### Run as MCP Server

```bash
# Start the MCP server (stdio protocol)
uv run python -m autodoc_mcp.main
```

## MCP Integration

### For Claude Code Sessions

To install this MCP server in Claude Code sessions:

```bash
# 1. Install the server in the current session
uv sync

# 2. Start the MCP server (in background)
uv run python -m autodoc_mcp.main &

# 3. The server will be available with these tools:
# - scan_dependencies: Parse project dependencies
# - get_package_docs: Fetch package documentation
# - refresh_cache: Clear documentation cache
# - get_cache_stats: View cache statistics
```

### For Other AI Clients (Cursor, etc.)

MCP server configuration:

```json
{
  "mcpServers": {
    "autodoc-mcp": {
      "command": "uv",
      "args": ["run", "python", "-m", "autodocs_mcp.main"],
      "cwd": "/Users/bradleyfay/autodocs",
      "env": {
        "CACHE_DIR": "~/.cache/autodoc-mcp"
      }
    }
  }
}
```

## Cache Configuration

- **Default Location**: `~/.cache/autodoc-mcp/`
- **Format**: JSON files named `{package}-{version}.json`
- **Expiration**: No time-based expiration (versions are immutable)
- **Performance**: Instant cache hits for previously fetched versions
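A minimal sketch of the cache layout described above: version-specific JSON files under the default cache directory, with no time-based expiration because released versions are immutable. The `fetch` callable and the stored structure are stand-ins for the real cache manager, not its actual API.

```python
import json
from pathlib import Path
from typing import Callable

CACHE_DIR = Path.home() / ".cache" / "autodoc-mcp"  # default location

def cache_path(package: str, version: str) -> Path:
    # Immutable key: {package}-{version}.json, so entries never need to expire.
    return CACHE_DIR / f"{package}-{version}.json"

def load_or_fetch(package: str, version: str, fetch: Callable[[str, str], dict]) -> dict:
    """Return cached docs if present; otherwise fetch, store, and return them."""
    path = cache_path(package, version)
    if path.exists():
        return json.loads(path.read_text())
    docs = fetch(package, version)  # caller-supplied fetcher (stand-in for doc_fetcher.py)
    CACHE_DIR.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(docs))
    return docs
```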
## Development Workflow & Standards

### Git Branching Strategy

This project uses **GitFlow with Release Branches** for flexible development and bundled releases:

#### Branch Hierarchy:

- **`main`** - Production-ready code, source of truth, tagged releases only
- **`develop`** - Integration branch for ongoing development work
- **`feature/*`** - Individual features/changes, branched from `develop`
- **`release/v0.x.x`** - Release preparation, branched from `develop`

#### Daily Development Workflow:

1. **Work on develop**: Most development happens directly on the `develop` branch
2. **Feature branches (optional)**: For larger changes, create `feature/description` from `develop`
3. **Merge features**: `feature/*` → `develop` (via PR or direct merge)
4. **Rapid iteration**: Small changes can be committed directly to `develop`

#### Release Workflow:

1. **Prepare release**: `git checkout -b release/v0.x.x` from `develop`
2. **Version bump**: Update version in `pyproject.toml` on the release branch
3. **Final tweaks**: Bug fixes, documentation updates in the release branch
4. **Deploy**: Push `release/v0.x.x` → triggers CI/CD → PyPI deployment
5. **Complete release**:
   - Merge `release/v0.x.x` → `main`
   - Tag release on `main`: `git tag v0.x.x`
   - Merge `release/v0.x.x` → `develop` (to sync any release fixes)

#### Benefits:

- **Rapid development** on `develop` without deployment pressure
- **Bundle changes** by choosing what goes into each release
- **`main` stays clean** with only completed, tested releases
- **No accidental deployments** from development work

### Commit Message Standards

**REQUIRED**: All commits MUST follow the [Conventional Commits](https://www.conventionalcommits.org/) specification:

```
<type>[optional scope]: <description>

[optional body]

[optional footer(s)]
```

#### Commit Types:

- **feat**: New feature (triggers MINOR version bump)
- **fix**: Bug fix (triggers PATCH version bump)
- **docs**: Documentation changes
- **style**: Code style changes (formatting, etc.)
- **refactor**: Code refactoring without feature/bug changes
- **test**: Adding or updating tests
- **chore**: Build process, dependency updates, etc.
- **ci**: CI/CD pipeline changes

#### Breaking Changes:

- Add `!` after the type: `feat!: redesign API interface` (triggers MAJOR version bump)
- Or include `BREAKING CHANGE:` in the footer

#### Examples:

```bash
feat: add semantic search for package documentation
fix: resolve cache corruption on network timeouts
docs: update MCP integration examples
chore: bump dependencies to latest versions
ci: add automated security scanning
```

### Semantic Versioning (SemVer)

This project strictly follows [Semantic Versioning 2.0.0](https://semver.org/):

- **MAJOR** (`1.0.0`): Breaking changes, incompatible API changes
- **MINOR** (`0.1.0`): New features, backward-compatible functionality
- **PATCH** (`0.0.1`): Bug fixes, backward-compatible fixes

#### Version Bumping Rules:

- `feat:` commits → MINOR version bump
- `fix:` commits → PATCH version bump
- `feat!:` or `BREAKING CHANGE:` → MAJOR version bump
- Other commit types → No automatic version bump

#### Release Process:

1. Update `CHANGELOG.md` with all changes since the last version
2. Bump the version in `pyproject.toml`
3. Commit: `chore: bump version to v1.2.3`
4. Create and push tag: `git tag v1.2.3 && git push --tags`
5. GitHub Actions automatically deploys to PyPI

### Changelog Management

**REQUIRED**: Update `CHANGELOG.md` with every version bump using the [Keep a Changelog](https://keepachangelog.com/) format:

```markdown
## [1.2.3] - 2025-08-07

### Added
- New features and capabilities

### Changed
- Changes in existing functionality

### Deprecated
- Soon-to-be removed features

### Removed
- Now removed features

### Fixed
- Bug fixes

### Security
- Vulnerability fixes
```
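The version bumping rules above can be expressed as a small helper. This is an illustrative sketch of the mapping only, not the project's actual release tooling.

```python
import re

_SUBJECT = re.compile(r"^(?P<type>[a-z]+)(\([^)]*\))?(?P<bang>!)?:\s+")

def bump_level(commit_message: str) -> str | None:
    """Map a conventional commit message to 'major', 'minor', 'patch', or None."""
    match = _SUBJECT.match(commit_message)
    if not match:
        return None  # non-conventional commits trigger no automatic bump
    if match.group("bang") or "BREAKING CHANGE:" in commit_message:
        return "major"
    if match.group("type") == "feat":
        return "minor"
    if match.group("type") == "fix":
        return "patch"
    return None  # docs, style, refactor, test, chore, ci

assert bump_level("feat: add semantic search for package documentation") == "minor"
assert bump_level("fix(cache): resolve corruption on network timeouts") == "patch"
assert bump_level("feat!: redesign API interface") == "major"
assert bump_level("docs: update MCP integration examples") is None
```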
## Claude Code Instructions

### Agent Selection Protocol

**MANDATORY**: Before using any agent, Claude MUST:

1. **Analyze task type** against available agent descriptions
2. **Match task to specialized expertise** rather than defaulting to general-purpose
3. **Consider multi-agent coordination** when tasks span multiple domains
4. **Use agent-design-architect for**:
   - Multi-agent system design
   - Agent workflow optimization
   - Agent collaboration patterns
   - Agent architecture decisions

#### Task-to-Agent Mapping Decision Tree

**Multi-agent systems, agent workflows, agent architecture** → `agent-design-architect`
- Designing new agents or agent systems
- Optimizing agent collaboration patterns
- Multi-agent workflow architecture
- Agent system performance analysis

**Documentation, APIs, integration guides** → `docs-integration`
- Writing API documentation
- Creating integration guides
- Technical documentation structure
- User onboarding materials

**MCP protocol, tool schemas, server development** → `mcp-protocol`
- MCP server implementation
- MCP tool schema design
- Protocol compliance issues
- MCP client integrations

**Testing, pytest, coverage, test automation** → `testing-specialist`
- Writing comprehensive tests
- Test infrastructure improvements
- Coverage analysis and improvement
- Test automation frameworks

**Production deployment, monitoring, configuration** → `production-ops`
- Deployment pipeline configuration
- Production monitoring setup
- Performance optimization
- Infrastructure configuration

**Core business logic, dependency resolution** → `core-services`
- Business logic implementation
- Service architecture design
- Dependency management
- Core feature development

**Technical writing, documentation frameworks** → `technical-writer`
- User-facing documentation
- Tutorial and guide creation
- Documentation strategy
- Content structure optimization

**Product strategy, feature prioritization** → `product-manager`
- Feature requirement analysis
- Product roadmap planning
- User feedback integration
- Strategic alignment

**General coding, search, multi-step tasks** → `general-purpose`
- Code implementation tasks
- File searching and analysis
- Multi-step development work
- General problem solving

#### Agent Selection Validation

Before proceeding with any task, ask yourself:

- Does this task involve designing or optimizing agent systems? → `agent-design-architect`
- Does this task require domain-specific expertise? → Use the appropriate specialist
- Is this a complex, multi-faceted task? → Consider multiple agents or `general-purpose`
- Would a specialist provide significantly better results? → Use the specialist

### Technical Debt Management

**MANDATORY**: All technical debt MUST be documented using disciplined tracking strategies:

#### When to Log Technical Debt

- **During Development**: When making compromises due to time constraints
- **During Refactoring**: When identifying improvement opportunities
- **During Code Review**: When spotting inconsistencies or anti-patterns
- **During Bug Fixes**: When finding root causes that indicate deeper issues

#### Technical Debt Documentation Requirements

1. **Location**: Document in `planning/technical_debt.md`
2. **Classification**: High/Medium/Low priority based on impact
3. **Required Fields**: Description, Impact, Root Cause, Solution, Effort Estimate, Dependencies
4. **Dating**: Always include creation date
5. **Ownership**: Assign to specific areas (testing, infrastructure, code quality, etc.)
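The required fields above could be captured in a structured record like the one below. The log itself lives in `planning/technical_debt.md` as markdown; this dataclass is only an illustration of the schema, not part of the codebase.

```python
from dataclasses import dataclass, field
from datetime import date

@dataclass
class TechnicalDebtEntry:
    """Illustrative schema for one entry in planning/technical_debt.md."""
    description: str
    impact: str
    root_cause: str
    solution: str
    effort_estimate: str
    dependencies: list[str]
    priority: str    # "High" | "Medium" | "Low"
    area: str        # e.g. "testing", "infrastructure", "code quality"
    created: date = field(default_factory=date.today)
```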
#### Standard Technical Debt Categories

- **Code Quality**: Inconsistent patterns, deprecated usage, code smells
- **Testing Infrastructure**: Missing tests, inconsistent mocking, test reliability
- **Documentation**: Missing or outdated docs, unclear APIs
- **Infrastructure**: Build issues, deployment problems, environment inconsistencies
- **Performance**: Known bottlenecks, scalability concerns
- **Security**: Vulnerabilities, insecure patterns, missing validations

#### Technical Debt Review Process

- **Monthly Reviews**: Assess priority and status
- **Before Major Releases**: Address high-priority items
- **During Refactoring**: Target related debt items
- **Completion Logging**: Move completed items to the completion log

### Third-Party Dependencies Philosophy

**PRINCIPLE**: Lean heavily on widely-accepted third-party packages to build robust software. Do NOT reinvent the wheel.

#### Dependency Guidelines

1. **Favor Established Ecosystems**: Prefer packages from well-known ecosystems (pytest plugins, FastAPI ecosystem, etc.)
2. **Quality Indicators**: Look for packages with:
   - Active maintenance (recent commits/releases)
   - Good documentation
   - Strong community adoption
   - Comprehensive test coverage
   - Clear versioning/changelog
3. **Testing Dependencies**: Be especially liberal with testing dependencies - they don't affect production
4. **Development Dependencies**: Add tooling that improves developer experience without hesitation

#### Pytest Plugin Philosophy

The pytest ecosystem is mature and battle-tested. **Always prefer pytest plugins** over custom solutions:

- **pytest-mock**: For all mocking needs (never use unittest.mock directly)
- **pytest-asyncio**: For async test handling
- **pytest-cov**: For coverage reporting
- **pytest-xdist**: For parallel test execution
- **pytest-timeout**: For test timeout management
- **pytest-httpx**: For HTTP client mocking
- **pytest-randomly**: For test order randomization
- **pytest-sugar**: For better test output
- **pytest-benchmark**: For performance testing

#### When to Add Dependencies

- **Testing Improvements**: Always acceptable
- **Developer Experience**: Tools that speed up development
- **Standard Patterns**: Libraries that implement well-established patterns
- **Security**: Libraries that improve security posture
- **Performance**: Well-maintained libraries that solve performance issues

#### When to be Cautious

- **Core Runtime Dependencies**: Evaluate carefully but don't avoid unnecessarily
- **Bleeding Edge**: Avoid alpha/beta packages for core functionality
- **Unmaintained**: Check maintenance status before adding
- **License Conflicts**: Ensure license compatibility

### Mandatory Workflow Requirements

When working in this repository, you MUST:

1. **Work primarily on `develop` branch** - Most development happens on `develop`, not `main`
2. **Always use conventional commits** - Every commit message must follow the conventional commits specification
3. **Use release branches for deployment** - Only `release/v0.x.x` branches trigger PyPI deployment
4. **Follow SemVer** - Version bumps must match the type of changes made
5. **Update CHANGELOG.md with releases** - Add entries when creating release branches
6. **Document technical debt** - Log any compromises or future improvements in `planning/technical_debt.md`
### Branch Management for Development

**MANDATORY**: Before making any code changes, you MUST:

1. **Check current branch**: Always run `git branch` or `git status` first
2. **Default to develop branch**: Most work should happen on the `develop` branch
3. **Create feature branch for large changes**: For significant features, create `feature/description` from `develop`
4. **Create feature branch if on main**: If currently on the `main` branch, immediately create and switch to a feature branch before any code modifications:
   ```bash
   git checkout -b feature/description-of-changes
   ```
5. **Use descriptive branch names**: `feature/add-search`, `fix/cache-corruption`, `docs/update-readme`
6. **Never commit directly to main** - `main` is only for completed releases; all development work must happen on `develop` or feature branches

This ensures we follow the mandatory PR workflow from the start of development.

### Commit Message Examples

```bash
# ✅ GOOD - Conventional commits
feat: add query filtering for documentation search
fix: resolve cache corruption on concurrent access
docs: update installation instructions for Claude Code
chore: bump dependencies to latest versions

# ❌ BAD - Non-conventional commits
Add new feature
Fixed bug
Updated docs
```

### Version Bump Protocol

When bumping versions:

1. **Update CHANGELOG.md first** with all changes since the last version
2. **Bump version in pyproject.toml** following SemVer rules
3. **Commit with conventional message**: `chore: bump version to v1.2.3`
4. **Create and push tag**: `git tag v1.2.3 && git push --tags`

### Pre-commit Hook Integration

All commits automatically run:

- Ruff linting and formatting
- MyPy type checking
- File hygiene checks
- YAML validation

**CRITICAL RULE**: If pre-commit hooks fail, you MUST fix the issues and commit again.

**STRICTLY FORBIDDEN**: Never use `git commit --no-verify` or `git commit -n` to bypass pre-commit hooks. This defeats the purpose of code quality enforcement and can introduce bugs, formatting inconsistencies, and security vulnerabilities.

**Required Process**:

1. Run `pre-commit run --all-files` to identify all issues
2. Fix each identified issue properly in the code
3. Commit normally with all hooks passing
4. If you accidentally bypassed hooks, amend the commit with fixes immediately

### Testing Standards and Consistency

**MANDATORY**: All testing must follow consistent patterns and use pytest ecosystem tools:

#### Mock Usage Standards

- **ALWAYS use pytest-mock**: Use the `mocker` fixture, never `from unittest.mock import`
- **Use mock_services fixture**: For MCP tool tests, use the provided `mock_services` fixture from conftest.py
- **Pattern**: `def test_something(self, mocker):` or `def test_something(self, mock_services, mocker):`
- **No unittest.mock**: The following are **FORBIDDEN** in new code:

```python
# FORBIDDEN
from unittest.mock import patch, AsyncMock, MagicMock

with patch("module.service") as mock:
    pass

# CORRECT
def test_something(self, mocker):
    mock_service = mocker.patch("module.service")
    mock_async = mocker.AsyncMock()
    mock_regular = mocker.MagicMock()
```
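As a fuller example of these standards, the sketch below tests a small async helper using only the `mocker` fixture and `@pytest.mark.asyncio`, covering both success and failure paths with mock verification. The `fetch_summary` helper is a stand-in written for this example, not part of the project's codebase.

```python
import httpx
import pytest

async def fetch_summary(client: httpx.AsyncClient, package: str) -> str:
    """Toy helper for illustration only; the real logic lives in the core services layer."""
    response = await client.get(f"https://pypi.org/pypi/{package}/json")
    response.raise_for_status()
    return response.json()["info"]["summary"]

@pytest.mark.asyncio
async def test_fetch_summary_success(mocker):
    # pytest-mock's `mocker` fixture replaces any direct unittest.mock imports.
    response = mocker.MagicMock()
    response.json.return_value = {"info": {"summary": "HTTP for Humans"}}
    client = mocker.MagicMock()
    client.get = mocker.AsyncMock(return_value=response)

    assert await fetch_summary(client, "requests") == "HTTP for Humans"
    client.get.assert_awaited_once_with("https://pypi.org/pypi/requests/json")

@pytest.mark.asyncio
async def test_fetch_summary_network_failure(mocker):
    # Failure path: the underlying HTTP call raises, and the error propagates.
    client = mocker.MagicMock()
    client.get = mocker.AsyncMock(side_effect=httpx.ConnectError("connection refused"))

    with pytest.raises(httpx.ConnectError):
        await fetch_summary(client, "requests")
```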
#### Test Fixture Usage

- **Use mock_services**: For tests involving MCP tools (scan_dependencies, get_package_docs, etc.)
- **Consistent Setup**: Don't duplicate service mocking when fixtures exist
- **Proper Isolation**: Each test should be independent

#### Testing Patterns

- **Async Tests**: Always use `@pytest.mark.asyncio` for async test functions
- **Error Testing**: Test both success and failure paths
- **Mock Verification**: Verify mock calls when testing interactions
- **Clear Assertions**: Use descriptive assertions with clear failure messages

#### When Adding New Tests

1. Check if existing fixtures can be reused
2. Use pytest-mock patterns exclusively
3. Follow the established naming conventions
4. Include both positive and negative test cases
5. Document any new fixtures in conftest.py

## Future Development & Enhancement Opportunities

### Immediate Robustness Improvements

1. **Enhanced Input Validation**: Expand security validation patterns for edge cases in package names and version constraints
2. **Cache Corruption Recovery**: Add automatic cache validation and repair mechanisms for malformed cache entries
3. **Memory Management**: Implement memory pressure monitoring and automatic cleanup for long-running instances
4. **Dependency Graph Analysis**: Add circular dependency detection and resolution strategies

### Performance & Scalability Enhancements

1. **Streaming Context Delivery**: Implement incremental context streaming for very large dependency trees
2. **Predictive Caching**: Pre-fetch commonly requested packages based on usage patterns
3. **Delta Documentation**: Cache and deliver only documentation changes between package versions
4. **Distributed Caching**: Support for Redis or similar backends for multi-instance deployments

### AI Integration Improvements

1. **Semantic Documentation Filtering**: Use embedding models to identify the most relevant documentation sections
2. **Context Ranking**: ML-based relevance scoring for dependency prioritization
3. **Documentation Quality Scoring**: Automated assessment of documentation completeness and usefulness
4. **Custom Context Templates**: User-configurable templates for different AI use cases

### Enterprise & Production Features

1. **Authentication & Authorization**: Support for API keys, JWT tokens, and role-based access
2. **Rate Limiting**: Per-client request throttling and quota management
3. **Multi-tenant Support**: Isolated caching and configuration per client/organization
4. **Audit Logging**: Comprehensive request/response logging for compliance
5. **Configuration Hot-reload**: Dynamic configuration updates without service restart

### Monitoring & Observability

1. **OpenTelemetry Integration**: Full tracing and spans for distributed system visibility
2. **Custom Metrics**: Business-specific metrics (cache hit rates, package popularity, etc.)
3. **Alerting Integration**: Proactive monitoring with configurable alert thresholds
4. **Performance Profiling**: Built-in profiling for identifying bottlenecks

### Documentation & Developer Experience

1. **Interactive API Documentation**: OpenAPI/Swagger integration for MCP tools
2. **SDK Generation**: Client libraries for popular programming languages
3. **CLI Tool Enhancement**: Rich CLI with interactive modes and better output formatting
4. **Integration Examples**: More comprehensive examples for different AI platforms
