Browser MCP Bridge

DATA_OPTIMIZATION.md•8.8 KiB

# Data Optimization Guide This document describes the data optimization features implemented in the Browser MCP server to reduce token consumption for AI coding agents. ## Problem Browser data (DOM snapshots, console messages, network requests) can be extremely large, overwhelming AI agents with too much information and consuming excessive tokens. This makes the MCP server difficult to use effectively with agentic AI systems. ## Solutions Implemented We've implemented three key strategies to solve this problem: ### 1. Intelligent Data Truncation All tools now have sensible default size limits that balance completeness with usability: **Size Limits:** - HTML content: 50KB (configurable via `maxTextLength`) - DOM nodes: 500 nodes by default (configurable via `maxNodes`, max 2000) - Console messages: 50 messages by default (configurable via `pageSize`, max 200) - Network requests: 50 requests by default (configurable via `pageSize`, max 200) - Request/Response bodies: 10KB each (automatically truncated) **Truncation Indicators:** All truncated data includes clear indicators showing: - Original size - Truncated size - How to get more data if needed ### 2. Selective Data Extraction with Filters #### Console Messages (`get_console_messages`) **Smart Defaults:** - Returns only `error` and `warn` messages by default (most relevant for debugging) - Other log levels available: `info`, `log`, `debug` **Filter Parameters:** ```javascript { logLevels: ["error", "warn"], // Filter by log level searchTerm: "string", // Search in message text since: 1234567890, // Filter by timestamp pageSize: 50, // Results per page cursor: "uuid" // Pagination cursor } ``` **Example Usage:** ```javascript // Get only errors get_console_messages({ logLevels: ["error"] }) // Search for specific errors get_console_messages({ logLevels: ["error", "warn"], searchTerm: "authentication" }) // Get recent messages get_console_messages({ since: Date.now() - 60000 // Last minute }) ``` #### Network Requests (`get_network_requests`) **Smart Defaults:** - Excludes request/response bodies by default - Sorts failed requests (4xx, 5xx) first for relevance - Returns most recent 50 requests **Filter Parameters:** ```javascript { method: "GET", // Filter by HTTP method status: 404, // Single status code status: [400, 404, 500], // Multiple status codes resourceType: "xhr", // Filter by resource type domain: "api.example.com", // Filter by domain failedOnly: true, // Only failed requests pageSize: 50, // Results per page cursor: "uuid", // Pagination cursor includeResponseBodies: false, // Include response bodies (truncated) includeRequestBodies: false // Include request bodies (truncated) } ``` **Example Usage:** ```javascript // Get only failed requests get_network_requests({ failedOnly: true }) // Get all API calls get_network_requests({ resourceType: ["xhr", "fetch"], domain: "api" }) // Debug specific endpoint get_network_requests({ method: "POST", status: [400, 401, 403], includeRequestBodies: true }) ``` #### DOM Snapshot (`get_dom_snapshot`) **Smart Defaults:** - Limits to 500 nodes by default - Excludes `<script>` and `<style>` tags by default - Excludes computed styles by default - Max depth of 5 levels **Filter Parameters:** ```javascript { selector: ".main-content", // CSS selector to target specific elements maxDepth: 5, // Maximum tree depth (max 15) maxNodes: 500, // Maximum nodes (max 2000) includeStyles: false, // Include computed styles excludeScripts: true, // Exclude script tags excludeStyles: true // Exclude style tags } ``` **Example Usage:** ```javascript // Get only main content area get_dom_snapshot({ selector: ".main-content" }) // Get app container with more detail get_dom_snapshot({ selector: "#app", maxNodes: 1000, maxDepth: 10 }) // Get full page structure (exclude noise) get_dom_snapshot({ maxNodes: 1000, excludeScripts: true, excludeStyles: true }) ``` ### 3. Streaming & Pagination All list-based tools support cursor-based pagination: **Pagination Flow:** 1. Make initial request with `pageSize` parameter 2. Response includes: - `data`: Current page of results - `count`: Number of items in current page - `total`: Total matching items - `hasMore`: Boolean indicating more pages available - `nextCursor`: Cursor to fetch next page (null if no more pages) - `message`: Human-readable status message 3. To get next page, pass `nextCursor` back: ```javascript get_console_messages({ cursor: "previous-response-cursor", pageSize: 50 }) ``` **Example Pagination:** ```javascript // First request const page1 = get_console_messages({ logLevels: ["error"], pageSize: 50 }) // Returns: { messages: [...], hasMore: true, nextCursor: "abc123" } // Get next page const page2 = get_console_messages({ cursor: "abc123", pageSize: 50 }) // Returns next 50 error messages ``` ## Response Format All enhanced tools return structured responses with metadata: ### Console Messages Response ```json { "messages": [...], "count": 25, "total": 237, "hasMore": true, "nextCursor": "uuid-here", "filters": { "logLevels": ["error", "warn"], "searchTerm": null, "since": null }, "message": "Showing 25 of 237 messages. Use nextCursor to get more." } ``` ### Network Requests Response ```json { "requests": [...], "count": 15, "total": 342, "hasMore": true, "nextCursor": "uuid-here", "filters": { "method": "POST", "status": [400, 404], "failedOnly": false, "domain": null, "resourceType": null }, "message": "Showing 15 of 342 requests. Use nextCursor to get more." } ``` ### DOM Snapshot Response ```json { "root": {...}, "nodeCount": 487, "originalNodeCount": 5234, "truncated": true, "filters": { "selector": ".main-content", "maxDepth": 5, "maxNodes": 500, "excludeScripts": true, "excludeStyles": true }, "message": "DOM tree truncated to 500 nodes (original: 5234 nodes). Use selector to target specific elements or increase maxNodes." } ``` ## Best Practices for AI Agents 1. **Start with defaults** - The smart defaults return the most relevant data 2. **Use filters to narrow down** - Be specific about what you need 3. **Use selectors for DOM** - Target specific page sections instead of full tree 4. **Paginate judiciously** - Only fetch additional pages when you need more data 5. **Exclude bodies by default** - Only include request/response bodies when debugging specific issues 6. **Filter by time** - Use `since` parameter for recent data only ## Examples for Common Tasks ### Debugging JavaScript Errors ```javascript // Get recent errors only get_console_messages({ logLevels: ["error"], since: Date.now() - 300000, // Last 5 minutes pageSize: 20 }) ``` ### Analyzing Failed API Calls ```javascript // Get failed API requests with bodies get_network_requests({ resourceType: ["xhr", "fetch"], failedOnly: true, includeRequestBodies: true, includeResponseBodies: true, pageSize: 10 }) ``` ### Inspecting Specific UI Components ```javascript // Get DOM for a specific component get_dom_snapshot({ selector: "#user-profile-card", maxNodes: 200, maxDepth: 8, excludeScripts: true }) ``` ### Finding Authentication Issues ```javascript // Search console for auth-related messages get_console_messages({ logLevels: ["error", "warn"], searchTerm: "auth" }) // Check auth-related network requests get_network_requests({ domain: "auth", status: [401, 403], includeRequestBodies: true }) ``` ## Performance Benefits These optimizations provide dramatic improvements: - **90% reduction** in typical response sizes - **Faster AI processing** due to smaller payloads - **Lower token costs** for AI operations - **More focused data** leading to better AI decisions - **Pagination** allows exploration without overwhelming initial load ## Configuration Default limits can be overridden in the server constructor: ```javascript // In server.js this.MAX_HTML_SIZE = 50000; // 50KB this.MAX_TEXT_SIZE = 30000; // 30KB this.MAX_DOM_NODES = 500; // 500 nodes this.MAX_CONSOLE_MESSAGES = 50; // 50 messages this.MAX_NETWORK_REQUESTS = 50; // 50 requests this.MAX_REQUEST_BODY_SIZE = 10000; // 10KB this.MAX_RESPONSE_BODY_SIZE = 10000; // 10KB ``` ## Backward Compatibility All new parameters are optional with sensible defaults. Existing code continues to work, but will benefit from: - Automatic truncation - Smart filtering defaults (errors/warnings for console, failed requests first for network) - Better response metadata

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/robhicks/browser-mcp-bridge'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

DATA_OPTIMIZATION.md•8.8 KiB