The UseScraper MCP Server enables web scraping with customizable extraction options:
- Scrape webpage content: Extract content from any webpage by providing its URL
- Choose output format: Save scraped content as text, HTML, or markdown (default is markdown)
- Bypass restrictions: Use the advanced proxy option to circumvent bot detection
- Custom extraction: Specify the data to extract using the extract_object parameter
- AI-optimized: The default markdown format is well suited to processing by AI systems
UseScraper MCP Server
This is a TypeScript-based MCP server that provides web scraping capabilities using the UseScraper API. It exposes a single tool 'scrape' that can extract content from web pages in various formats.
Features
Tools
- scrape: Extract content from a webpage
  - Parameters:
    - url (required): The URL of the webpage to scrape
    - format (optional): The format to save the content (text, html, markdown). Default: markdown
    - advanced_proxy (optional): Use advanced proxy to circumvent bot detection. Default: false
    - extract_object (optional): Object specifying data to extract
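The parameter list above can be sketched as a TypeScript type with a small helper that applies the documented defaults. This is only an illustration: the names ScrapeArgs, NormalizedScrapeArgs, and normalizeScrapeArgs are hypothetical and not taken from the server's source, and the shape of extract_object is assumed to be a free-form object because the README does not specify it.

```typescript
// Hypothetical types mirroring the documented parameters of the 'scrape' tool.
type ScrapeFormat = "text" | "html" | "markdown";

interface ScrapeArgs {
  url: string;                               // required
  format?: ScrapeFormat;                     // optional, default: markdown
  advanced_proxy?: boolean;                  // optional, default: false
  extract_object?: Record<string, unknown>;  // optional, shape assumed free-form
}

interface NormalizedScrapeArgs {
  url: string;
  format: ScrapeFormat;
  advanced_proxy: boolean;
  extract_object?: Record<string, unknown>;
}

const SUPPORTED_FORMATS: ScrapeFormat[] = ["text", "html", "markdown"];

// Apply the documented defaults and reject obviously invalid input.
function normalizeScrapeArgs(args: ScrapeArgs): NormalizedScrapeArgs {
  if (!/^https?:\/\//i.test(args.url)) {
    throw new Error(`url must start with http:// or https://, got: ${args.url}`);
  }
  const format: ScrapeFormat = args.format ?? "markdown";
  if (SUPPORTED_FORMATS.indexOf(format) === -1) {
    throw new Error(`Unsupported format: ${format}`);
  }
  const normalized: NormalizedScrapeArgs = {
    url: args.url,
    format,
    advanced_proxy: args.advanced_proxy ?? false,
  };
  if (args.extract_object !== undefined) {
    normalized.extract_object = args.extract_object;
  }
  return normalized;
}

const example = normalizeScrapeArgs({ url: "https://example.com" });
console.log(example);
```

Calling the helper with only a url yields format set to markdown and advanced_proxy set to false, matching the defaults listed above.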
Installation
Installing via Smithery
To install UseScraper for Claude Desktop automatically via Smithery:
Manual Installation
- Clone the repository:
- Install dependencies:
- Build the server:
Configuration
To use with Claude Desktop, add the server config:
On macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
On Windows: %APPDATA%/Claude/claude_desktop_config.json
Replace /path/to/usescraper-server with the actual path to the server and your-api-key-here with your UseScraper API key.
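As a rough sketch of what such a config entry might look like, the snippet below builds an object in the general mcpServers shape that Claude Desktop reads and prints it as JSON. Several details are assumptions rather than facts from this README: the server label usescraper-server, the build/index.js entry point, and the environment variable name USESCRAPER_API_KEY are all hypothetical, so check the repository before copying them.

```typescript
// One entry under "mcpServers" in claude_desktop_config.json.
interface McpServerEntry {
  command: string;
  args: string[];
  env?: Record<string, string>;
}

interface ClaudeDesktopConfig {
  mcpServers: Record<string, McpServerEntry>;
}

function buildClaudeConfig(serverPath: string, apiKey: string): ClaudeDesktopConfig {
  return {
    mcpServers: {
      // "usescraper-server" is an arbitrary label chosen for this sketch.
      "usescraper-server": {
        command: "node",
        // Assumed build output location; verify against the repository's build script.
        args: [`${serverPath}/build/index.js`],
        // Assumed environment variable name for the UseScraper API key.
        env: { USESCRAPER_API_KEY: apiKey },
      },
    },
  };
}

// Print the config using the same placeholders as the README text.
const config = buildClaudeConfig("/path/to/usescraper-server", "your-api-key-here");
console.log(JSON.stringify(config, null, 2));
```

The printed JSON could then be merged into the config file at the platform-specific path given above, with both placeholders replaced.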
Usage
Once configured, you can use the 'scrape' tool through the MCP interface. Example usage:
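A minimal sketch of a call to the 'scrape' tool is shown below, written as the JSON-RPC tools/call request that an MCP client sends to a server. The helper name buildScrapeCall and the example URL are hypothetical; in practice an MCP client such as Claude Desktop constructs this request for you, and extract_object is left out because its shape is not documented here.

```typescript
// Shape of an MCP "tools/call" request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: {
    name: string;
    arguments: Record<string, unknown>;
  };
}

// Hypothetical helper that fills in the documented defaults for 'scrape'.
function buildScrapeCall(
  id: number,
  url: string,
  format: "text" | "html" | "markdown" = "markdown",
  advancedProxy: boolean = false,
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: {
      name: "scrape",
      arguments: { url, format, advanced_proxy: advancedProxy },
    },
  };
}

// Example: request the page as markdown without the advanced proxy.
const request = buildScrapeCall(1, "https://example.com");
console.log(JSON.stringify(request, null, 2));
```

Passing "html" or "text" as the third argument switches the output format, and passing true as the fourth enables the advanced proxy option.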
Development
For development with auto-rebuild:
Debugging
Since MCP servers communicate over stdio, debugging can be challenging. We recommend using the MCP Inspector, which is available as a package script:
The Inspector will provide a URL to access debugging tools in your browser.