Hyperbrowser
remote-capable server
The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.
Integrations
Enables scraping webpage content and converting it to Markdown format for better readability in AI contexts.
Hyperbrowser MCP Server
This project is a Model Context Protocol (MCP) server implementation for Hyperbrowser. The server provides tools to scrape webpages, extract structured data from them, and crawl them.
More information about Hyperbrowser can be found here. The Hyperbrowser API supports a superset of the features present in the MCP server.
More information about the Model Context Protocol can be found here.
Installation
To install the server, run:
Development
For development purposes, you can run the server directly from the source code.
- Clone the repository:
- Install dependencies:
- Run the server:
Example config
This is an example config for the Hyperbrowser MCP server for the Claude Desktop client.
Other clients (such as Cursor) do not support the env field in the config or as part of the command parameter. In that case, you can use a shell script to run the server with the correct environment variable. An example shell script is provided in the repository as run_server.sh.
If for some reason you can't provide the API key in the config or in a shell script, you can set it within whatever prompt you are using. It will be up to the MCP client to pass it to the server.
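As a rough illustration of the config shape described above, the following Python sketch builds a Claude Desktop style entry and prints it as JSON. The mcpServers layout with command, args, and env is the usual Claude Desktop structure. The server label, the npx command, and the hyperbrowser-mcp argument are assumptions made for this sketch, not values taken from this README. HB_API_KEY is the variable named in the Configuration section.

```python
import json


def build_client_config(api_key: str) -> dict:
    """Return a Claude Desktop style config entry for one MCP server."""
    return {
        "mcpServers": {
            # "hyperbrowser" is an arbitrary label chosen for this sketch.
            "hyperbrowser": {
                # Assumed launch command and argument; replace with the real ones.
                "command": "npx",
                "args": ["hyperbrowser-mcp"],
                # The env field passes the API key to the server process.
                "env": {"HB_API_KEY": api_key},
            }
        }
    }


if __name__ == "__main__":
    print(json.dumps(build_client_config("YOUR_API_KEY"), indent=2))
```

For clients without env support, the same variable would be exported by a wrapper script before the server starts.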
Tools
Scrape Webpage
This tool allows you to scrape a webpage and retrieve content in various formats such as markdown, HTML, links, and screenshots.
Parameters:
- url: The URL of the webpage to scrape.
- apiKey: (Optional) The API key to use for the scrape.
- sessionOptions: (Optional) Options for the browser session.
- outputFormat: The format of the output (from a list of markdown, html, links, screenshot).
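The parameters above could be assembled into a request payload along these lines. This is a minimal sketch: build_scrape_args is a hypothetical helper, and treating outputFormat as a non-empty list of format names is an assumption based on the wording "from a list of".

```python
ALLOWED_FORMATS = {"markdown", "html", "links", "screenshot"}


def build_scrape_args(url, output_format, api_key=None, session_options=None):
    """Build the argument object for a scrape call (hypothetical helper)."""
    formats = list(output_format)
    if not formats:
        raise ValueError("outputFormat needs at least one format")
    unknown = [f for f in formats if f not in ALLOWED_FORMATS]
    if unknown:
        raise ValueError(f"unsupported output format(s): {unknown}")

    args = {"url": url, "outputFormat": formats}
    # Optional fields are only included when the caller supplies them.
    if api_key is not None:
        args["apiKey"] = api_key
    if session_options is not None:
        args["sessionOptions"] = session_options
    return args
```

For example, build_scrape_args("https://example.com", ["markdown", "links"]) yields a payload with only url and outputFormat set.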
Extract Structured Data
This tool extracts structured information from a list of webpages using a specified prompt and JSON schema.
Parameters:
- urls: The list of URLs of the webpages to extract structured information from.
- apiKey: (Optional) The API key to use for the extraction.
- sessionOptions: (Optional) Options for the browser session.
- prompt: (Optional - if not provided, the tool will try to infer the prompt from the schema) The prompt to use for the extraction.
- schema: (Optional - if not provided, the tool will try to infer the schema from the prompt) The JSON schema to use for the extraction.
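To make the prompt and schema relationship concrete, here is a small sketch of an extraction payload. build_extract_args and the product schema are hypothetical. Requiring at least one of prompt or schema is an assumption drawn from the descriptions above, since each is inferred from the other when missing.

```python
def build_extract_args(urls, prompt=None, schema=None, api_key=None,
                       session_options=None):
    """Build the argument object for an extraction call (hypothetical helper)."""
    if prompt is None and schema is None:
        # Assumption: with neither present there is nothing to infer from.
        raise ValueError("provide a prompt, a schema, or both")

    args = {"urls": list(urls)}
    optional = {
        "prompt": prompt,
        "schema": schema,
        "apiKey": api_key,
        "sessionOptions": session_options,
    }
    # Keep only the optional fields the caller actually set.
    args.update({k: v for k, v in optional.items() if v is not None})
    return args


# Illustrative JSON schema describing one product record.
PRODUCT_SCHEMA = {
    "type": "object",
    "properties": {
        "title": {"type": "string"},
        "price": {"type": "number"},
    },
    "required": ["title"],
}
```

Passing only PRODUCT_SCHEMA leaves the prompt to be inferred by the tool, and passing only a prompt does the reverse.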
Crawl Webpages
This tool crawls webpages starting from a given URL, optionally following links and limiting the number of pages.
Parameters:
- url: The URL of the webpage to crawl.
- apiKey: (Optional) The API key to use for the crawl.
- sessionOptions: (Optional) Options for the browser session.
- outputFormat: The format of the output (from a list of markdown, html, links, screenshot).
- followLinks: Whether to follow links on the crawled webpages.
- maxPages: The maximum number of pages to crawl.
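A crawl request differs from a scrape mainly in followLinks and maxPages. The sketch below shows one way to assemble and sanity-check those two fields. build_crawl_args is a hypothetical helper, and the rule that maxPages must be a positive integer is an assumption, not documented behavior.

```python
def build_crawl_args(url, output_format, follow_links, max_pages,
                     api_key=None, session_options=None):
    """Build the argument object for a crawl call (hypothetical helper)."""
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(max_pages, bool) or not isinstance(max_pages, int) or max_pages < 1:
        raise ValueError("maxPages must be a positive integer")

    args = {
        "url": url,
        "outputFormat": list(output_format),
        "followLinks": bool(follow_links),
        "maxPages": max_pages,
    }
    if api_key is not None:
        args["apiKey"] = api_key
    if session_options is not None:
        args["sessionOptions"] = session_options
    return args
```

Setting follow_links to False with max_pages at 1 would, under these assumptions, amount to fetching just the starting page.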
Session Options
The sessionOptions parameter allows you to configure various aspects of the browser session. It is an optional parameter and can include the following fields:
- useProxy: (Optional) Whether to use a proxy.
- useStealth: (Optional) Whether to use stealth mode.
- solveCaptchas: (Optional) Whether to solve captchas.
- acceptCookies: (Optional) Whether to automatically close the accept cookies popup.
These options help in customizing the behavior of the browser session to suit your specific needs.
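Because every field is optional, one convenient way to model sessionOptions is a small dataclass that emits only the fields that were set. The SessionOptions class here is a hypothetical client-side helper, and the field names mirror the list above.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class SessionOptions:
    """Client-side model of the sessionOptions object (hypothetical)."""
    useProxy: Optional[bool] = None
    useStealth: Optional[bool] = None
    solveCaptchas: Optional[bool] = None
    acceptCookies: Optional[bool] = None

    def to_dict(self) -> dict:
        # Drop unset fields so server-side defaults stay in effect.
        return {k: v for k, v in asdict(self).items() if v is not None}


if __name__ == "__main__":
    print(SessionOptions(useStealth=True, acceptCookies=False).to_dict())
```

An explicit False is kept in the output, while an unset field is omitted entirely. That distinction matters if the server treats "not provided" differently from "disabled".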
Configuration
The server can be configured using environment variables or by modifying the source code directly. Ensure that the HB_API_KEY environment variable is set if you are not providing an API key directly in the requests.
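The fallback described above can be sketched as a small lookup. resolve_api_key is a hypothetical helper, and giving the per-request key precedence over HB_API_KEY is an assumption about ordering. The text only says the variable is needed when no key is sent with the request.

```python
import os


def resolve_api_key(request_key=None, environ=None):
    """Pick the API key for a call (hypothetical helper).

    Assumed order: key passed with the request, then HB_API_KEY.
    """
    env = os.environ if environ is None else environ
    key = request_key or env.get("HB_API_KEY")
    if not key:
        raise RuntimeError(
            "no API key: pass apiKey in the request or set HB_API_KEY"
        )
    return key
```

Passing a plain dict as environ makes the lookup easy to exercise without touching the real process environment.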
License
This project is licensed under the MIT License.