Enables direct execution of the MCP server from GitHub repositories, offering multiple configuration options for running the server through Git integration.
Provides report generation capabilities in Markdown format, allowing formatted documentation output from system monitoring and data analysis.
Integrates pandas for CSV file analysis, enabling statistical summaries, correlations, distribution analysis, and trend detection with automated visualization.
Leverages Python for executing server operations, with support for loading Python dependencies and running Python scripts.
Implements a built-in SQLite database for data persistence, offering custom SQL query execution, data export capabilities, and automatic schema management.
Advanced MCP Server
A comprehensive Model Context Protocol (MCP) server that provides advanced capabilities for web scraping, data analysis, system monitoring, file operations, API integrations, and report generation. Built with Python and designed for seamless integration with MCP-compatible clients like Claude Desktop.
🚀 Quick Start
The fastest way to get started is using the GitHub-hosted version:
Add this configuration to your MCP client (e.g., Claude Desktop) and start using the server immediately!
✨ Key Features
🌐 Web Scraping & Content Analysis
- Advanced HTML parsing with BeautifulSoup4
- Smart content extraction - links, images, and clean text
- Automatic data persistence in SQLite database
- Content analytics and comprehensive statistics
- Duplicate detection and content deduplication
📊 Data Analysis & Visualization
- CSV file analysis with pandas integration
- Statistical insights - summaries, correlations, distributions
- Trend detection and pattern analysis
- Professional visualizations with matplotlib and seaborn
- Export capabilities in multiple formats
🖥️ System Monitoring & Performance
- Real-time monitoring of CPU, memory, and disk usage
- Configurable intervals and monitoring duration
- Historical data tracking with trend analysis
- Performance alerts and threshold monitoring
- Resource optimization recommendations
📁 Advanced File Operations
- Intelligent file search with pattern matching
- Content analysis and metadata extraction
- Automated backup creation and management
- Cleanup operations for temporary and cache files
- File comparison and diff analysis
- Batch operations support
🔌 API Integration & Caching
- Full HTTP method support (GET, POST, PUT, DELETE, etc.)
- Intelligent caching system with configurable TTL
- Automatic retry logic with exponential backoff
- Response validation and error handling
- Rate limiting and request throttling
🗄️ Database Operations & Management
- Built-in SQLite database for data persistence
- Custom SQL query execution with safety checks
- Data export utilities (CSV, JSON, XML)
- Schema management and migrations
- Query optimization and performance monitoring
📋 Comprehensive Report Generation
- System health reports with actionable recommendations
- Web scraping analytics and content summaries
- Data analysis reports with statistical insights
- Multiple output formats (Markdown, JSON, HTML, PDF)
- Integrated charts and interactive visualizations
- Automated scheduling and delivery options
📦 Installation
Prerequisites
- Python 3.8+ (recommended: Python 3.11+)
- pip package manager
- Git (for GitHub installation)
Method 1: Direct GitHub Installation (Recommended)
No local setup required! The server runs directly from GitHub:
Method 2: Local Installation
For development or customization:
- Clone the repository:
- Install dependencies:
- Make executable (Unix/Linux/macOS):
- Add to MCP configuration:
Method 3: Package Installation
Install as a Python package:
Then use in your MCP configuration:
- Add to your MCP configuration:
🛠️ Available Tools
🌐 scrape_website
Advanced web scraping with content analysis and database storage.
Parameters:
url
(required): Target URL to scrapeextract_links
(optional, default: false): Extract all hyperlinksextract_images
(optional, default: false): Extract image URLs and metadatasave_to_db
(optional, default: true): Persist content to database
Real-world Examples:
📊 analyze_data
Comprehensive statistical analysis with professional visualizations.
Parameters:
file_path
(required): Path to CSV fileanalysis_type
(required): Analysis type - "summary", "correlation", "distribution", "trends", "outliers"create_visualization
(optional, default: true): Generate charts and graphscolumns
(optional): Specific columns to analyzesave_results
(optional, default: true): Save analysis to database
Real-world Examples:
🖥️ system_monitor
Real-time system performance monitoring with alerting.
Parameters:
duration_minutes
(optional, default: 1): Monitoring durationinterval_seconds
(optional, default: 10): Data collection intervalsave_metrics
(optional, default: true): Store metrics in databasealert_thresholds
(optional): CPU/Memory/Disk usage alert levelsgenerate_report
(optional, default: false): Create performance report
Real-world Examples:
📁 file_operations
Advanced file system operations with intelligent search and analysis.
Parameters:
operation
(required): Operation type - "search", "analyze", "backup", "cleanup", "compare", "organize"path
(required): Target file or directory pathpattern
(optional): Search pattern (supports regex and glob)options
(optional): Operation-specific configurationrecursive
(optional, default: true): Include subdirectoriesfile_types
(optional): Filter by file extensions
Real-world Examples:
🔌 api_integration
Robust API integration with intelligent caching and error handling.
Parameters:
endpoint
(required): API endpoint URLmethod
(optional, default: "GET"): HTTP methodheaders
(optional): Request headers dictionarydata
(optional): Request body (JSON, form data, or raw)params
(optional): URL query parametersuse_cache
(optional, default: true): Enable response cachingcache_duration_hours
(optional, default: 1): Cache TTLtimeout_seconds
(optional, default: 30): Request timeoutretry_attempts
(optional, default: 3): Number of retry attempts
Real-world Examples:
🗄️ database_query
Execute SQL queries with safety checks and result formatting.
Parameters:
query
(required): SQL query to executetable
(optional): Target table for simple operationslimit
(optional, default: 100): Maximum number of resultsformat
(optional, default: "json"): Output format - "json", "csv", "html"parameters
(optional): Query parameters for prepared statementsexport_file
(optional): Save results to file
Real-world Examples:
📋 generate_report
Create comprehensive reports with visualizations and insights.
Parameters:
report_type
(required): Report type - "system_health", "web_analysis", "data_summary", "api_usage", "custom"format
(optional, default: "markdown"): Output format - "markdown", "json", "html", "pdf"include_charts
(optional, default: true): Generate visualizationstime_range
(optional): Data time range filtersave_to_file
(optional): Save report to specified fileemail_recipients
(optional): Email addresses for report deliverytemplate
(optional): Custom report template
Real-world Examples:
🗄️ Database Schema
The server automatically creates and manages an SQLite database with optimized tables:
📊 web_scrapes
Stores web scraping results with full-text search capabilities.
id
: Primary key (INTEGER)url
: Scraped URL (TEXT, INDEXED)title
: Page title (TEXT)content
: Extracted text content (TEXT)links_count
: Number of extracted links (INTEGER)images_count
: Number of extracted images (INTEGER)word_count
: Content word count (INTEGER)scraped_at
: Timestamp (TIMESTAMP, INDEXED)
📈 system_metrics
Real-time and historical system performance data.
id
: Primary key (INTEGER)cpu_percent
: CPU usage percentage (REAL)memory_percent
: Memory usage percentage (REAL)disk_usage
: Disk usage percentage (REAL)network_io
: Network I/O statistics (TEXT)process_count
: Active process count (INTEGER)recorded_at
: Timestamp (TIMESTAMP, INDEXED)
🔄 api_cache
Intelligent API response caching with TTL management.
id
: Primary key (INTEGER)endpoint
: API endpoint URL (TEXT, INDEXED)method
: HTTP method (TEXT)response_data
: Cached response (TEXT)response_status
: HTTP status code (INTEGER)response_size
: Response size in bytes (INTEGER)cached_at
: Cache timestamp (TIMESTAMP, INDEXED)expires_at
: Cache expiration (TIMESTAMP, INDEXED)
📁 file_operations
File operation history and metadata.
id
: Primary key (INTEGER)operation_type
: Operation performed (TEXT)file_path
: Target file/directory path (TEXT)file_size
: File size in bytes (INTEGER)operation_result
: Operation outcome (TEXT)execution_time
: Time taken in seconds (REAL)created_at
: Timestamp (TIMESTAMP, INDEXED)
🚨 Troubleshooting
Common Installation Issues
Python Version Compatibility
Missing Dependencies
Permission Errors (Unix/Linux/macOS)
MCP Configuration Issues
Server Not Starting
- Check MCP client logs for error messages
- Verify file paths in configuration are absolute
- Test server manually:
Import Errors in GitHub Mode
Tool Approval Issues
Make sure tools are in the autoApprove
list:
Runtime Issues
Memory Usage
- Large datasets: Process in chunks using
limit
parameters - Long monitoring: Use shorter intervals or duration
- Database size: Regular cleanup of old data
Network Timeouts
- Increase timeout values in API calls
- Check firewall settings for outbound connections
- Use smaller batch sizes for bulk operations
File Permission Errors
- Windows: Run as administrator if needed
- Unix/Linux: Check file/directory permissions
- Use absolute paths to avoid relative path issues
Performance Optimization
Database Performance
Memory Management
- Use streaming for large file operations
- Clear visualization plots after generation
- Implement result pagination for large queries
🔒 Security Considerations
Data Protection
- Database encryption: Sensitive data is stored in local SQLite
- API keys: Store in environment variables, not in code
- File access: Server respects system file permissions
- Input validation: All inputs are sanitized and validated
Network Security
- HTTPS only: API calls use secure connections
- Rate limiting: Built-in request throttling
- Error handling: Sensitive information not exposed in errors
- Caching: Sensitive API responses can be excluded from cache
Best Practices
- Regular updates: Keep dependencies updated
- Access control: Limit MCP server access appropriately
- Monitoring: Use system monitoring for security events
- Backups: Regular database and configuration backups
🤝 Contributing
We welcome contributions! Here's how you can help improve the Advanced MCP Server:
🚀 Quick Start for Contributors
- Fork the repository on GitHub
- Clone your fork locally:
- Create a feature branch:
- Set up development environment:
🛠️ Development Guidelines
Code Style
- Follow PEP 8 Python style guidelines
- Use type hints for function parameters and returns
- Add docstrings for all functions and classes
- Keep functions focused and single-purpose
Testing
Adding New Tools
- Define the tool in
handle_list_tools()
:
- Implement the handler in
handle_call_tool()
:
- Create the method:
- Update documentation in README.md
Database Changes
- Create migration scripts for schema changes
- Maintain backward compatibility when possible
- Document schema changes in commit messages
📝 Commit Guidelines
Use conventional commits:
🐛 Bug Reports
When reporting bugs, please include:
- Operating System and Python version
- MCP client being used (Claude Desktop, etc.)
- Error messages and stack traces
- Steps to reproduce the issue
- Expected vs actual behavior
💡 Feature Requests
For new features, please:
- Check existing issues to avoid duplicates
- Describe the use case and problem you're solving
- Provide examples of how the feature would be used
- Consider backward compatibility
🔄 Pull Request Process
- Update documentation as needed
- Add/update tests for new functionality
- Ensure all tests pass
- Update CHANGELOG.md if applicable
- Request review from maintainers
📋 Development Roadmap
Planned features and improvements:
- Multi-format data export (Excel, Parquet, XML)
- Advanced scheduling for automated reports
- Plugin system for custom extensions
- Web dashboard for server management
- Docker containerization for easy deployment
- Distributed caching with Redis support
- Machine learning integration for data analysis
- Real-time notifications and alerting
- Authentication and authorization system
- API rate limiting per client
- Comprehensive test suite with CI/CD
- Performance benchmarking tools
🏗️ Repository Structure
📁 Key Files Description
advanced_mcp_server.py
: Core server implementation with all toolsrequirements.txt
: All Python dependencies with version constraintssetup.py
: Package metadata and installation configurationgithub_mcp_runner.py
: Wrapper for running server from GitHub- Config files: Ready-to-use MCP configurations for different scenarios
🌟 Use Cases & Examples
📊 Business Intelligence Dashboard
🔍 Content Research & Analysis
🔧 DevOps & System Administration
📈 Data Science Workflow
📚 Related Projects & Resources
🔗 MCP Ecosystem
- Model Context Protocol - Official MCP documentation
- MCP Python SDK - Python SDK for MCP
- Claude Desktop - Popular MCP client
🛠️ Tools & Libraries Used
- aiohttp - Async HTTP client/server
- pandas - Data manipulation and analysis
- matplotlib - Data visualization
- BeautifulSoup - HTML parsing
- psutil - System monitoring
🎓 Learning Resources
- Python Async Programming - Asyncio documentation
- SQLite Tutorial - SQLite learning guide
- Data Analysis with Python - Pandas user guide
🆕 Changelog
Version 1.0.0 (Current)
- ✅ Initial release with core functionality
- ✅ Web scraping with content analysis
- ✅ Data analysis with visualizations
- ✅ System monitoring with metrics storage
- ✅ File operations with search capabilities
- ✅ API integration with caching
- ✅ Database operations with SQL support
- ✅ Report generation in multiple formats
Planned for Version 1.1.0
- 🔄 Enhanced error handling and recovery
- 🔄 Plugin architecture for extensibility
- 🔄 Web dashboard for server management
- 🔄 Docker support for containerized deployment
- 🔄 Advanced authentication and authorization
Planned for Version 1.2.0
- 🔄 Machine learning integration
- 🔄 Real-time notifications and alerts
- 🔄 Distributed caching with Redis
- 🔄 Performance optimization and benchmarking
- 🔄 Comprehensive test suite with CI/CD
⚖️ License
This project is licensed under the MIT License - see the LICENSE file for details.
MIT License Summary
- ✅ Commercial use allowed
- ✅ Modification allowed
- ✅ Distribution allowed
- ✅ Private use allowed
- ❌ No warranty provided
- ❌ No liability assumed
🙏 Acknowledgments
💡 Inspiration & Thanks
- Anthropic for creating Claude and the MCP standard
- Model Context Protocol community for the excellent framework
- Open source contributors of all the amazing libraries used in this project
- Early adopters and testers who provided valuable feedback
🌟 Special Recognition
- Python Software Foundation for the incredible Python ecosystem
- SQLite team for the reliable embedded database
- Matplotlib/Seaborn teams for powerful visualization tools
- pandas team for making data analysis accessible
📞 Support & Community
🆘 Getting Help
- 📖 Documentation: Start with this README and the
/docs
folder - 🐛 Issues: Report bugs and request features on GitHub Issues
- 💬 Discussions: Join the conversation on GitHub Discussions
- 📧 Email: For private inquiries, contact the maintainers
🌍 Community Guidelines
- Be respectful and inclusive
- Help others learn and succeed
- Share knowledge and experiences
- Contribute positively to the ecosystem
- Follow the code of conduct
🔮 Future Vision
Our goal is to make the Advanced MCP Server the go-to solution for:
- Data professionals needing comprehensive analysis tools
- Developers requiring robust system integration
- Businesses seeking automated reporting and monitoring
- Researchers conducting data-driven investigations
- DevOps teams managing complex infrastructures
⭐ If this project helps you, please give it a star! ⭐
🚀 Ready to supercharge your MCP experience? Get started now! 🚀
Made with ❤️ by the MCP Server community
This server cannot be installed
hybrid server
The server is able to function both locally and remotely, depending on the configuration or use case.
A comprehensive Model Context Protocol server providing capabilities for web scraping, data analysis, system monitoring, file operations, API integrations, and report generation.
Related MCP Servers
- -securityAlicense-qualityA Model Context Protocol server that allows LLMs to interact with web content through standardized tools, currently supporting web scraping functionality.Last updated -PythonMIT License
- AsecurityFlicenseAqualityA Model Context Protocol server that intelligently fetches and processes web content, transforming websites and documentation into clean, structured markdown with nested URL crawling capabilities.Last updated -25521TypeScript
- AsecurityFlicenseAqualityA Model Context Protocol server that provides tools for fetching and posting HTTP data, with built-in prompts for URL summarization and API analysis.Last updated -2TypeScript
- AsecurityAlicenseAqualityA comprehensive Model Context Protocol server providing access to 70+ IT tools for developers and system administrators, including encoding/decoding, text manipulation, hashing, and network utilities.Last updated -764006TypeScriptMIT License