Tools and Techniques for Browser Automation

  • Why this server?

    Enables browser automation without screenshots or visually tuned models.

    security: -, license: A, quality: -
    A Model Context Protocol server that lets LLMs interact with web pages through structured accessibility snapshots, providing browser automation without screenshots or visually tuned models.
    TypeScript
    Apache 2.0
    • Apple
    • Linux
  • Why this server?

    Enables AI agents to control web browsers via a standardized interface.

    security: -, license: F, quality: -
    Enables AI agents to control web browsers via a standardized interface for operations like launching, interacting with, and closing browsers.
    0
    JavaScript
  • Why this server?

    Provides Playwright capabilities for controlling web browsers, capturing screenshots, extracting content, and performing complex interactions through an MCP interface.

    security: -, license: A, quality: -
    A browser automation server providing Playwright capabilities for controlling web browsers, capturing screenshots, extracting content, and performing complex interactions through an MCP interface.
    Python
    Apache 2.0
    • Apple
  • Why this server?

    Enables browser automation and real-time computer vision tasks through AI-driven commands.

    security: -, license: A, quality: -
    Enables browser automation and real-time computer vision tasks through AI-driven commands, offering zero-cost digital navigation and interaction for enhanced web experiences.
    0
    1
    JavaScript
    MIT License
  • Why this server?

    An MCP server paired with a Firefox extension that enables LLM clients to control the user's browser, supporting tab management, history search, and content reading.

    security: -, license: A, quality: -
    17
    TypeScript
    MIT License
  • Why this server?

    Enables AI agents to interact with web browsers using natural language, featuring automated browsing, form filling, vision-based element detection, and structured JSON responses for systematic browser control.

    security: A, license: F, quality: A
    1
    34
    Python
    • Linux
    • Apple
  • Why this server?

    A FastMCP server that enables browser automation through natural language commands, allowing language models to browse the web, fill out forms, click buttons, and perform other web-based tasks via a simple API.

    security: -, license: -, quality: -
    1
    Python
  • Why this server?

    This server enables users to fetch and process web content for use as context in LLMs directly from Cline or Claude Desktop.

    security: A, license: A, quality: A
    This MCP server provides tools for interacting with Cloudflare Browser Rendering, allowing you to fetch and process web content for use as context in LLMs directly from Cline or Claude Desktop.
    5
    1
    TypeScript
    MIT License
    • Apple
  • Why this server?

    An open-source self-hosted browser agent that provides a dockerized browser environment for AI automation.

    security: -, license: F, quality: -
    An open-source self-hosted browser agent that provides a dockerized browser environment for AI automation, allowing other AI apps and agents to perform human-like web browsing tasks through natural language instructions.
    119
    16
    TypeScript
  • Why this server?

    Enables browser automation using the Selenium WebDriver through MCP, supporting browser management, element location, and both basic and advanced user interactions.

    security: -, license: A, quality: -
    Allows AI agents to control web browser sessions via Selenium WebDriver, enabling web automation tasks like scraping, testing, and form filling through the Model Context Protocol.
    TypeScript
    MIT License
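
All of the servers listed above expose browser operations as MCP tools, and MCP clients invoke tools with JSON-RPC 2.0 `tools/call` requests. The following is a minimal sketch of what such requests might look like for a launch, interact, and close sequence. The tool names (`browser_navigate`, `browser_click`, `browser_close`) and their arguments are hypothetical placeholders, not taken from any server in this list; each real server publishes its own tool names and argument schemas through `tools/list`.

```python
import json
from itertools import count

# Monotonically increasing ids so each request can be matched to its response.
_request_ids = count(1)


def make_tool_call(tool_name: str, arguments: dict) -> str:
    """Serialize an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    request = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(request)


# Hypothetical tool names and arguments, for illustration only.
messages = [
    make_tool_call("browser_navigate", {"url": "https://example.com"}),
    make_tool_call("browser_click", {"selector": "#submit"}),
    make_tool_call("browser_close", {}),
]

for message in messages:
    print(message)
```

In practice an MCP client library handles this framing and the transport (stdio or HTTP); the sketch only shows the shape of the messages an agent ends up sending.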