MCP Server
MCP (Model Context Protocol) server that provides browser automation capabilities through a simple API.
Description
This server creates an API endpoint that allows you to run browser automation tasks using natural language commands. It utilizes:
- FastMCP: For creating a simple API server
- browser-use: For browser automation capabilities
- OpenAI's GPT models: To interpret natural language commands
Prerequisites
- Python 3.11+
- Poetry for dependency management
- OpenAI API key
Setup
1. Install dependencies
2. Configure environment variables
Create a .env file in the root directory containing your OpenAI API key.
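As a quick sanity check on the configuration step, the sketch below parses a .env-style file and confirms that a key is present. It assumes the variable is named OPENAI_API_KEY (the name the OpenAI SDK conventionally reads); the project's actual variable names are not stated here, and the helper names parse_env and require_api_key are hypothetical. Real projects often use a library such as python-dotenv instead of hand-rolled parsing.

```python
import os


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values: dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip("\"'")
    return values


def require_api_key(env: dict[str, str]) -> str:
    """Return the key, or raise a clear error if it is missing or empty."""
    # Assumption: the key is stored under OPENAI_API_KEY.
    key = env.get("OPENAI_API_KEY", "")
    if not key:
        raise RuntimeError("OPENAI_API_KEY is not set; add it to .env")
    return key


if __name__ == "__main__":
    file_env: dict[str, str] = {}
    if os.path.exists(".env"):
        with open(".env", encoding="utf-8") as fh:
            file_env = parse_env(fh.read())
    # Values already in the process environment override the file.
    require_api_key({**file_env, **os.environ})
    print("OPENAI_API_KEY found")
```

Running this from the project root before starting the server surfaces a missing key early, rather than at the first browser task.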
Run
The server will start with SSE (Server-Sent Events) transport on the default port.
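To make the moving parts concrete, here is a minimal sketch of how a server of this kind could be wired together. It is not the project's actual source. It assumes the FastMCP class bundled with the official MCP Python SDK (the standalone fastmcp package exposes a similar interface), an older browser-use release that accepts a LangChain ChatOpenAI model (newer releases ship their own chat model classes), and a hypothetical tool parameter named task. The summarize helper and the gpt-4o model choice are also illustrative.

```python
def summarize(result: object, limit: int = 2000) -> str:
    """Convert the agent's result to a bounded string for the client."""
    text = "" if result is None else str(result)
    return text if len(text) <= limit else text[:limit] + "..."


def build_server(name: str = "browser-automation"):
    """Create a FastMCP server exposing one browser automation tool."""
    # Local import so this module loads even without the SDK installed.
    # Assumption: FastMCP from the official `mcp` Python SDK.
    from mcp.server.fastmcp import FastMCP

    mcp = FastMCP(name)

    @mcp.tool()
    async def run_browser_task(task: str) -> str:
        """Run a natural-language browser task (parameter name is hypothetical)."""
        # Assumption: browser-use Agent API with a LangChain chat model;
        # the LLM import differs between browser-use versions.
        from browser_use import Agent
        from langchain_openai import ChatOpenAI

        agent = Agent(task=task, llm=ChatOpenAI(model="gpt-4o"))
        history = await agent.run()
        return summarize(history.final_result())

    return mcp


if __name__ == "__main__":
    # Start the server with the SSE transport on the SDK's default port.
    build_server().run(transport="sse")
```

Keeping the third-party imports inside the functions means a missing dependency fails only when the server is actually built or a task is executed, which can make setup problems easier to locate.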
Features
Browser Automation
Use the /run_browser_task endpoint to execute browser automation tasks described in natural language.
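As a rough illustration of a request, the sketch below calls the server from a Python client. It assumes that /run_browser_task corresponds to an MCP tool named run_browser_task, that the tool takes a single argument called task, and that the server listens at http://localhost:8000/sse; all three are assumptions rather than documented details. It uses the SSE client from the official mcp Python SDK.

```python
import asyncio


def build_call(task: str) -> tuple[str, dict[str, str]]:
    """Return the tool name and arguments; the 'task' key is hypothetical."""
    task = task.strip()
    if not task:
        raise ValueError("task must be a non-empty instruction")
    return "run_browser_task", {"task": task}


async def call_server(task: str, url: str = "http://localhost:8000/sse"):
    """Open an SSE session and invoke the browser automation tool."""
    # Local imports: these require the official `mcp` Python SDK.
    from mcp import ClientSession
    from mcp.client.sse import sse_client

    name, arguments = build_call(task)
    async with sse_client(url) as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()
            return await session.call_tool(name, arguments)


if __name__ == "__main__":
    # Assumption: the server is already running locally with SSE transport.
    result = asyncio.run(call_server("Open example.com and report the page title"))
    print(result.content)
```

The natural-language instruction is passed through unchanged; interpreting it is left to the model on the server side.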