Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Spider MCPsearch for latest AI developments in news from past week"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Spider MCP - Web Search Crawler Service
A web search MCP service based on pure crawler technology, built with Node.js.
Features
β No Official API Required: Completely based on crawler technology, no dependency on third-party official APIs
π Intelligent Search: Supports Bing web and news search
π° News Search: Built-in news search with time filtering
π·οΈ Pure Crawler: No official API dependency, uses Puppeteer for web scraping
π High Performance: Supports batch web scraping
π Health Monitoring: Complete health check and metrics monitoring
π Structured Logging: Uses Winston for structured logs
π Anti-Detection: Supports User-Agent rotation and other anti-bot measures
π Smart URL Cleaning: Automatically cleans promotional parameters while preserving essential information
Related MCP server: Puppeteer MCP Server
Tech Stack
Node.js (>= 18.0.0)
Express.js - Web framework
Puppeteer - Browser automation
Cheerio - HTML parsing
Axios - HTTP client
Winston - Logging
@modelcontextprotocol/sdk - MCP protocol support
Quick Start
1. Install dependencies
npm installor use pnpm
pnpm install2. Download Puppeteer browser
npx puppeteer browsers install chrome3. Environment configuration
Copy and configure the environment variables file:
cp .env.example .envEdit the .env file according to your needs.
4. Start the service
Development mode:
npm run devProduction mode:
npm startThe service will start at http://localhost:3000.
MCP Tools
web_search
Unified search tool supporting both web and news search:
Web Search:
searchType: "web"News Search:
searchType: "news"with time filteringNote:
searchTypeis a required parameter and must be explicitly specified
Usage Examples:
# Web search
Use web_search tool to search "Node.js tutorial" with searchType set to web, return 10 results
# News search
Use web_search tool to search "tech news" with searchType set to news, return 5 results from past 24 hoursOther Tools
get_webpage_content: Get webpage content and convert to specified formatget_webpage_source: Get raw HTML source code of webpagebatch_webpage_scrape: Batch scrape multiple webpages
MCP Configuration
Chatbox Configuration
Create mcp-config.json file in Chatbox:
{
"mcpServers": {
"spider-mcp": {
"command": "node",
"args": ["src/mcp/server.js"],
"env": {
"NODE_ENV": "production"
},
"description": "Spider MCP - Web search and webpage scraping tools",
"capabilities": {
"tools": {}
}
}
}
}Other MCP Clients
{
"mcpServers": {
"spider-mcp": {
"command": "node",
"args": ["path/to/spider-mcp/src/mcp/server.js"]
}
}
}Important Notes
Anti-bot Measures: This service uses various techniques to avoid detection, but still needs to comply with robots.txt and terms of use
Rate Limiting: It's recommended to control request frequency reasonably to avoid putting pressure on target websites
Legal Compliance: Please ensure compliance with local laws and website terms of use when using this service
Resource Consumption: Puppeteer will start Chrome browser, please pay attention to memory and CPU usage
URL Cleaning: Automatically cleans promotional parameters but may affect some special link functionality
Development
Project Structure
spider-mcp/
βββ src/
β βββ index.js # Main entry file
β βββ mcp/
β β βββ server.js # MCP server
β βββ routes/ # Route definitions
β β βββ search.js # Search routes
β β βββ health.js # Health check routes
β βββ services/ # Business logic
β β βββ searchService.js # Search service
β βββ utils/ # Utility functions
β βββ logger.js # Logging utility
βββ logs/ # Log files directory
βββ tests/ # Test files
βββ package.json # Project configuration
βββ .env.example # Environment variables example
βββ mcp-config.json # MCP configuration example
βββ README.md # Project documentationLicense
MIT License
Contributing
Issues and Pull Requests are welcome!