Used for HTTP requests in the web scraping functionality to extract AI use case data from websites.
Provides HTML parsing capabilities for extracting structured data from scraped webpages containing AI use cases.
Enables headless browser capabilities for scraping JavaScript-heavy pages that contain AI use case information.
Provides database storage for collected AI use cases and scraping configurations.
Handles schema validation for all user inputs and data structures in the AI use cases server.
AI Use Cases MCP Server
A Model Context Protocol (MCP) server for collecting, analyzing, and managing AI use case data from various information sources.
Features
🔍 Web Scraping Tools
- URL Scraping: Extract AI use case data from specified URLs with customizable selectors
- Information Source Management: Add, edit, and manage scraping configurations for different websites
- Automatic Data Extraction: Extract titles, summaries, categories, and publication dates
📊 Data Analysis Tools
- Keyword Extraction: Automatically extract relevant AI and technology keywords from content
- Use Case Categorization: Automatically categorize AI use cases by technology and industry
- Search & Filter: Search through collected use cases with various filters
📈 Resources
- AI Use Cases Data: Access structured AI use case data by category and limit
- Statistics: Get overview statistics of collected data including categories and industries
🤖 Prompts
- Summarize Use Cases: Create concise summaries of AI use cases
- Suggest Sources: Get recommendations for new information sources
- Analyze Trends: Analyze trends in AI use cases over time
Installation
Usage
As an MCP Server
The server can be used with any MCP-compatible client (Claude Desktop, Cursor, etc.) by adding it to your MCP configuration:
Available Tools
1. Web Scraping Tool (scrape-url
)
Scrape AI use case data from a specified URL.
Parameters:
url
(string): The URL to scrapeselectors
(object, optional): CSS selectors for data extractionextractKeywords
(boolean, default: true): Whether to extract keywords
Example:
2. Add Information Source (add-source
)
Add a new information source for automated data collection.
Parameters:
name
(string): Source nameurl
(string): Source URLselectors
(object): CSS selectors for data extraction
3. Search Use Cases (search-use-cases
)
Search through collected AI use cases with filters.
Parameters:
query
(string): Search querycategory
(string, optional): Filter by categoryindustry
(string, optional): Filter by industrytechnology
(string, optional): Filter by technologylimit
(number, default: 20): Maximum results
4. Extract Keywords (extract-keywords
)
Extract relevant keywords from text content.
Parameters:
text
(string): Text to analyzemaxKeywords
(number, default: 10): Maximum keywords to extractcategory
(string, optional): Category for context-specific extraction
5. Categorize Use Case (categorize-use-case
)
Automatically categorize an AI use case based on its content.
Parameters:
title
(string): Use case titlesummary
(string): Use case summarycontent
(string, optional): Additional content
Available Resources
1. AI Use Cases (ai-use-cases://{category}/{limit}
)
Access AI use case data by category and limit.
Parameters:
category
(string): Category filter (optional)limit
(number): Maximum number of results
2. Statistics (statistics://overview
)
Get overview statistics of collected data.
Available Prompts
1. Summarize Use Case (summarize-use-case
)
Create a concise summary of an AI use case.
Parameters:
title
(string): Use case titlecontent
(string): Use case contentmaxLength
(number, default: 200): Maximum summary length
2. Suggest Sources (suggest-sources
)
Get recommendations for new information sources.
Parameters:
industry
(string, optional): Target industrytechnology
(string, optional): Target technologycategory
(string, optional): Target category
3. Analyze Trends (analyze-trends
)
Analyze trends in AI use cases over time.
Parameters:
timeframe
(string): Analysis timeframecategory
(string, optional): Category filterindustry
(string, optional): Industry filter
Data Structure
AI Use Case
Scraping Configuration
Supported Categories
The system automatically categorizes AI use cases into the following categories:
- Natural Language Processing: NLP, text analysis, chatbots, translation
- Computer Vision: Image recognition, video analysis, object detection
- Machine Learning: Algorithms, predictive analytics, classification
- Robotics & Automation: Industrial automation, autonomous systems
- Data Analytics: Business intelligence, reporting, visualization
- Healthcare & Medical: Medical diagnosis, drug discovery, telemedicine
- Finance & Banking: Fraud detection, risk assessment, trading
- E-commerce & Retail: Recommendation systems, personalization
- Transportation & Logistics: Route optimization, autonomous vehicles
- Education & Training: Personalized learning, adaptive systems
Development
Project Structure
Building
Adding New Features
- New Tools: Add to
src/mcp/server.ts
in thesetupTools()
method - New Resources: Add to
src/mcp/server.ts
in thesetupResources()
method - New Prompts: Add to
src/mcp/server.ts
in thesetupPrompts()
method - New Types: Add to
src/types/index.ts
Configuration
Database
The server uses SQLite for data storage. The database file (ai_use_cases.db
) is created automatically in the project root.
Scraping Settings
- Default timeout: 10 seconds
- Retry attempts: 3
- Delay between requests: 1 second
- User agent: Standard browser user agent
External Dependencies
- @modelcontextprotocol/sdk: MCP protocol implementation
- axios: HTTP client for web scraping
- cheerio: HTML parsing
- puppeteer: Headless browser for JavaScript-heavy pages
- sqlite3: Database storage
- natural: Natural language processing for keyword extraction
- zod: Schema validation
Security Considerations
- The server includes DNS rebinding protection for HTTP transport
- All user inputs are validated using Zod schemas
- Web scraping respects robots.txt and includes delays between requests
- No sensitive data is stored or transmitted
Contributing
- Fork the repository
- Create a feature branch
- Make your changes
- Add tests if applicable
- Submit a pull request
License
MIT License - see LICENSE file for details.
Support
For issues and questions:
- Check the documentation
- Search existing issues
- Create a new issue with detailed information
Note: This server is designed for educational and research purposes. Please respect website terms of service and robots.txt when scraping data.
This server cannot be installed
A Model Context Protocol server that collects, analyzes, and manages AI use case data from various information sources with features for web scraping, data analysis, and trend identification.
Related MCP Servers
- AsecurityAlicenseAqualityThis server implements the Model Context Protocol to facilitate meaningful interaction and understanding development between humans and AI through structured tools and progressive interaction patterns.Last updated -1351TypeScriptMIT License
- AsecurityFlicenseAqualityA Model Context Protocol server that enables AI models to interact with SourceSync.ai's knowledge management platform for managing documents, ingesting content from various sources, and performing semantic searches.Last updated -25543
- AsecurityAlicenseAqualityA Model Context Protocol server enabling AI assistants to scrape web content with high accuracy and flexibility, supporting multiple scraping modes and content formatting options.Last updated -42562TypeScriptMIT License
- -securityAlicense-qualityA server that implements the Model Context Protocol, providing a standardized way to connect AI models to different data sources and tools.Last updated -28TypeScriptMIT License