MCP Server

by jonnyhoff

Integrations

  • Uses .env files to manage configuration settings including API keys for secure environment variable management

  • Leverages OpenAI's GPT models to interpret natural language commands for browser automation tasks

  • Handles dependency management and packaging for the MCP server

MCP Server

MCP (Machine Control Protocol) Server that provides browser automation capabilities through a simple API.

Description

This server creates an API endpoint that allows you to run browser automation tasks using natural language commands. It utilizes:

  • FastMCP: For creating a simple API server
  • browser-use: For browser automation capabilities
  • OpenAI's GPT models: To interpret natural language commands

Prerequisites

  • Python 3.11+
  • Poetry for dependency management
  • OpenAI API key

Setup

1. Install dependencies

poetry install

2. Configure environment variables

Create a .env file in the root directory with:

OPENAI_API_KEY=your_openai_api_key_here

Run

poetry run python main.py

The server will start with SSE (Server-Sent Events) transport on the default port.

Features

Browser Automation

Use the /run_browser_task endpoint to execute browser automation tasks using natural language. For example:

-
security - not tested
F
license - not found
-
quality - not tested

remote-capable server

The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.

Provides browser automation capabilities through an API endpoint that interprets natural language commands to perform web tasks using OpenAI's GPT models.

  1. Description
    1. Prerequisites
      1. Setup
        1. 1. Install dependencies
        2. 2. Configure environment variables
      2. Run
        1. Features
          1. Browser Automation

        Related MCP Servers

        • A
          security
          F
          license
          A
          quality
          Enables AI agents to interact with web browsers using natural language, featuring automated browsing, form filling, vision-based element detection, and structured JSON responses for systematic browser control.
          Last updated -
          1
          38
          Python
          • Linux
          • Apple
        • A
          security
          F
          license
          A
          quality
          Enables browser automation using Python scripts, offering operations like taking webpage screenshots, retrieving HTML content, and executing JavaScript.
          Last updated -
          4
          18
          Python
          • Linux
        • -
          security
          F
          license
          -
          quality
          A web browser automation server that allows AI assistants to control Chrome with persistent state management, enabling complex browsing tasks through asynchronous browser operations.
          Last updated -
          1
          Python
          • Apple
        • -
          security
          A
          license
          -
          quality
          AI-driven browser automation server that implements the Model Context Protocol to enable natural language control of web browsers for tasks like navigation, form filling, and visual interaction.
          Last updated -
          1
          Python
          MIT License
          • Apple

        View all related MCP servers

        ID: d2sek4qalu