WebEvalAgent MCP Server

Official

Overview Schema Related Servers Score Discussions

setup_browser_state

Configure and save browser authentication state for automated web testing by launching an interactive browser session to complete login processes.

Instructions

Sets up and saves browser state for future use.

This tool should only be called in one scenario:

The user explicitly requests to set up browser state/authentication

Launches a non-headless browser for user interaction, allows login/authentication, and saves the browser state (cookies, local storage, etc.) to a local file.

Args: url: Optional URL to navigate to upon opening the browser. ctx: The MCP context (used for progress reporting, not directly here).

Returns: list[TextContent]: Confirmation of state saving or error messages.

Input Schema

TableJSON Schema

Name	Required	Description	Default
`url`	No

Implementation Reference

webEvalAgent/src/tool_handlers.py:609-764 (handler)

Core handler function that initializes Playwright, launches non-headless Chromium with persistent user data directory (~/.operative/browser_user_data), navigates to provided URL or blank, waits for user interaction (tab close or 180s timeout), saves browser storage state to ~/.operative/browser_state/state.json and cookies to cookies.json. Includes anti-detection measures like hiding webdriver property.

async def handle_setup_browser_state(arguments: Dict[str, Any], ctx: Context, api_key: str) -> list[TextContent]:
    """Handle setup_browser_state tool calls
    
    This function launches a non-headless browser for user interaction, allows login/authentication,
    and saves the browser state (cookies, local storage, etc.) to a local file.
    
    Args:
        arguments: The tool arguments, may contain 'url' to navigate to
        ctx: The MCP context for reporting progress
        api_key: The API key for authentication (not used directly here)
        
    Returns:
        list[TextContent]: Confirmation of state saving or error messages
    """
    # Initialize log server
    try:
        start_log_server()
        await asyncio.sleep(1)
        open_log_dashboard()
        send_log("Log dashboard initialized for browser state setup", "🚀")
    except Exception as log_server_error:
        print(f"Warning: Could not start log dashboard: {log_server_error}")
    
    # Get the URL if provided
    url = arguments.get("url", "about:blank")
    
    # Ensure URL has a protocol (add https:// if missing)
    if url != "about:blank" and not url.startswith(("http://", "https://", "file://", "data:", "chrome:", "javascript:")):
        url = "https://" + url
        send_log(f"Added https:// protocol to URL: {url}", "🔗")
    
    # Ensure the state directory exists
    state_dir = os.path.expanduser("~/.operative/browser_state")
    os.makedirs(state_dir, exist_ok=True)
    state_file = os.path.join(state_dir, "state.json")
    
    send_log("🚀 Starting interactive login session", "🚀")
    send_log(f"Browser state will be saved to {state_file}", "💾")
    
    # Create a user data directory if it doesn't exist
    user_data_dir = os.path.expanduser("~/.operative/browser_user_data")
    os.makedirs(user_data_dir, exist_ok=True)
    send_log(f"Using browser user data directory: {user_data_dir}", "📁")
    
    playwright = None
    context = None
    page = None
    
    try:
        # Initialize Playwright
        playwright = await async_playwright().start()
        send_log("Playwright initialized", "🎭")
        
        # Launch browser with a persistent context using the user_data_dir parameter
        # This replaces the previous browser.launch() + context.new_context() approach
        context = await playwright.chromium.launch_persistent_context(
            user_data_dir=user_data_dir,  # Use as a direct parameter instead of an arg
            headless=False,  # Non-headless for user interaction
            args=[
                "--no-sandbox",
                "--disable-blink-features=AutomationControlled"
            ],
            ignore_default_args=["--enable-automation"],
            # Include the context options directly
            user_agent="Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/123.0.0.0 Safari/537.36",
            viewport={"width": 1280, "height": 800},
            device_scale_factor=2,
            is_mobile=False,
            has_touch=False,
            locale="en-US",
            timezone_id="America/Los_Angeles",
            permissions=["geolocation", "notifications"]
        )
        
        send_log("Browser launched in interactive mode with persistent context", "🎭")
        
        # Modify the navigator.webdriver property to avoid detection
        await context.add_init_script("""
            Object.defineProperty(navigator, 'webdriver', {
                get: () => false,
            });
        """)
        send_log("Browser context created with anti-detection measures", "🛡️")
        
        # Create a new page and navigate to the URL
        page = await context.new_page()
        await page.goto(url)
        send_log(f"🔗 Navigated to: {url}", "🔗")
        
        send_log("Waiting for user interaction (close browser tab or 180s timeout)...", "🖱️")
        
        # Set up an event listener for page close
        page_close_event = asyncio.Event()
        
        # Define the handler function that will be called when page closes
        async def on_page_close():
            send_log("Page close event detected", "👁️")
            page_close_event.set()
        
        # Register the handler to be called when page closes
        page.once("close", lambda: asyncio.create_task(on_page_close()))
        
        # Wait for either the event to be set or timeout
        try:
            # Wait for 180 seconds (3 minutes) or until the page is closed
            await asyncio.wait_for(page_close_event.wait(), timeout=180)
            # If we get here without a timeout exception, the page was closed
            send_log("User closed the browser tab, saving state", "🖱️")
        except asyncio.TimeoutError:
            # If we get a timeout, the 180 seconds elapsed
            send_log("Timeout reached (180s), saving current state", "⚠️")
        
        # Save the browser state to a file
        await context.storage_state(path=state_file)
        
        # Also save cookies for debugging purposes
        cookies = await context.cookies()
        cookies_file = os.path.join(state_dir, "cookies.json")
        with open(cookies_file, 'w') as f:
            json.dump(cookies, f, indent=2)
            
        send_log(f"Saved browser state to {state_file}", "💾")
        send_log(f"Saved cookies to {cookies_file} for reference", "🍪")
        
        return [TextContent(
            type="text",
            text=f"✅ Browser state saved successfully to {state_file}. This state will be used automatically in future web_eval_agent sessions."
        )]
        
    except Exception as e:
        error_msg = f"Error during browser state setup: {e}\n{traceback.format_exc()}"
        send_log(error_msg, "❌")
        return [TextContent(
            type="text",
            text=f"❌ Failed to save browser state: {e}"
        )]
    finally:
        # Close resources in reverse order
        if page:
            try:
                await page.close()
            except Exception:
                pass
        if context:
            try:
                await context.close()
            except Exception:
                pass
        if playwright:
            try:
                await playwright.stop()
            except Exception:
                pass
            
        send_log("Browser session completed", "🏁")

webEvalAgent/mcp_server.py:103-142 (registration)

MCP tool registration for 'setup_browser_state' using @mcp.tool decorator. This wrapper function handles API key validation and delegates execution to the core handler in tool_handlers.py.

@mcp.tool(name=BrowserTools.SETUP_BROWSER_STATE)
async def setup_browser_state(url: str = None, ctx: Context = None) -> list[TextContent]:
    """Sets up and saves browser state for future use.

    This tool should only be called in one scenario:
    1. The user explicitly requests to set up browser state/authentication

    Launches a non-headless browser for user interaction, allows login/authentication,
    and saves the browser state (cookies, local storage, etc.) to a local file.

    Args:
        url: Optional URL to navigate to upon opening the browser.
        ctx: The MCP context (used for progress reporting, not directly here).

    Returns:
        list[TextContent]: Confirmation of state saving or error messages.
    """
    is_valid = await validate_api_key(api_key)

    if not is_valid:
        error_message_str = "❌ Error: API Key validation failed when running the tool.\n"
        error_message_str += "   Reason: Free tier limit reached.\n"
        error_message_str += "   👉 Please subscribe at https://operative.sh to continue."
        return [TextContent(type="text", text=error_message_str)]
    try:
        # Generate a new tool_call_id for this specific tool call
        tool_call_id = str(uuid.uuid4())
        send_log(f"Generated new tool_call_id for setup_browser_state: {tool_call_id}")
        return await handle_setup_browser_state(
            {"url": url, "tool_call_id": tool_call_id},
            ctx,
            api_key
        )
    except Exception as e:
        tb = traceback.format_exc()
        return [TextContent(
            type="text",
            text=f"Error executing setup_browser_state: {str(e)}\n\nTraceback:\n{tb}"
        )]

webEvalAgent/mcp_server.py:37-40 (registration)
Enum definition providing the tool name 'setup_browser_state' used in the @mcp.tool decorator.
```
class BrowserTools(str, Enum):
    WEB_EVAL_AGENT = "web_eval_agent"
    SETUP_BROWSER_STATE = "setup_browser_state"  # Add new tool enum
```
webEvalAgent/mcp_server.py:25-25 (registration)
Import of the core handler function from tool_handlers.py.
```
from webEvalAgent.src.tool_handlers import handle_web_evaluation, handle_setup_browser_state
```

Tool Definition Quality

A4.1/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It describes key behaviors: launches a non-headless browser for user interaction, allows login/authentication, saves browser state to a local file, and returns confirmation/error messages. However, it lacks details on permissions needed, file storage location, rate limits, or error conditions. It compensates somewhat but not fully for the annotation gap.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured and concise: it starts with a clear purpose statement, provides specific usage guidelines, details the tool's behavior, and lists parameters and returns. Each sentence adds value without redundancy. The formatting with sections (Args, Returns) enhances readability without unnecessary length.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (involving browser automation and state saving) and lack of annotations or output schema, the description does a good job covering key aspects: purpose, usage, behavior, parameters, and returns. It explains the return type ('list[TextContent]') and what it contains. However, it could improve by detailing error handling or state file management to be fully complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The description adds meaningful context for the 'url' parameter: 'Optional URL to navigate to upon opening the browser.' This clarifies its purpose beyond the schema's basic title. With 0% schema description coverage and only one parameter, the description adequately compensates. The mention of 'ctx' as an MCP context for progress reporting adds useful semantic insight, though it's not in the input schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'Sets up and saves browser state for future use' with specific actions like launching a non-headless browser, allowing login/authentication, and saving state to a file. It distinguishes from the sibling 'web_eval_agent' by focusing on state setup rather than evaluation. However, it doesn't explicitly contrast with the sibling beyond implied different use cases.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit usage guidelines: 'This tool should only be called in one scenario: 1. The user explicitly requests to set up browser state/authentication.' This clearly defines when to use it and implicitly suggests alternatives (e.g., not for general browsing). It offers strong guidance without misleading information.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
open source
OpenAI
Tool Definition Quality Score (TDQS)
By punkpeye on April 3, 2026.
mcp
The Hackers Who Tracked My Sleep Cycle
By punkpeye on March 26, 2026.
security

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/refreshdotdev/web-eval-agent'

If you have feedback or need assistance with the MCP directory API, please join our Discord server