Glama

scraping_browser_get_html

Extract HTML content from web pages for data collection and analysis, with options for full-page capture when script tags are needed.

Instructions

Get the HTML content of the current page. Avoid using the full_page option unless you need to see things like script tags, since the full-page HTML can be large.
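As a rough illustration of the instruction above, the sketch below builds the JSON-RPC `tools/call` request an MCP client might send for this tool. The helper name `buildGetHtmlRequest` and the request ids are hypothetical; only the tool name and the `full_page` argument come from this page.

```javascript
// Hypothetical helper: builds a JSON-RPC 2.0 "tools/call" request for this tool.
// full_page is omitted unless explicitly requested, so the smaller body-only
// HTML is returned by default.
function buildGetHtmlRequest(id, fullPage = false) {
  const args = fullPage ? { full_page: true } : {};
  return {
    jsonrpc: '2.0',
    id,
    method: 'tools/call',
    params: { name: 'scraping_browser_get_html', arguments: args },
  };
}

// Body-only request (the recommended default).
const bodyOnly = buildGetHtmlRequest(1);
// Full-page request, for cases where head and script tags matter.
const withScripts = buildGetHtmlRequest(2, true);

console.log(JSON.stringify(bodyOnly));
console.log(JSON.stringify(withScripts));
```

Leaving `full_page` out entirely relies on the handler treating a missing value as false, which matches the implementation shown in the reference section.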

Input Schema

Name | Required | Description | Default
full_page | No | Whether to get the full page HTML including head and script tags. Avoid this if you only need the extra HTML, since it can be quite large. | (none)
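To make the schema concrete, here is a minimal, dependency-free sketch of the check it implies: `full_page` may be absent, but if present it must be a boolean. The function `parseGetHtmlArgs` is hypothetical and only approximates what the real Zod schema does (for example, it drops unknown keys, which is an assumption about the desired behavior).

```javascript
// Hypothetical validator approximating the input schema: an object with an
// optional boolean full_page. This is an illustration, not the Zod schema itself.
function parseGetHtmlArgs(input = {}) {
  if (typeof input !== 'object' || input === null || Array.isArray(input)) {
    throw new TypeError('arguments must be an object');
  }
  const { full_page } = input;
  if (full_page !== undefined && typeof full_page !== 'boolean') {
    throw new TypeError('full_page must be a boolean when provided');
  }
  // Keep only the known key; omit it when the caller did not supply it.
  return full_page === undefined ? {} : { full_page };
}

console.log(parseGetHtmlArgs({}));
console.log(parseGetHtmlArgs({ full_page: true }));
```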

Implementation Reference

  • Handler function that retrieves HTML content from the current browser page, either body innerHTML or full page content based on full_page parameter.
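    execute: async ({ full_page = false }) => {
      const page = await (await require_browser()).get_page();
      try {
        // Default: return only the markup inside <body>.
        if (!full_page)
          return await page.$eval('body', body => body.innerHTML);
        // full_page: return the whole serialized document, including head and script tags.
        const html = await page.content();
        // Never taken: full_page is always true here because of the early return above.
        if (!full_page && html)
          return html.split('<body>')[1].split('</body>')[0];
        return html;
      } catch (e) {
        throw new UserError(`Error getting HTML content: ${e}`);
      }
    },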
  • Zod schema defining the input parameters for the tool: optional full_page boolean.
    parameters: z.object({
      full_page: z.boolean().optional().describe([
        'Whether to get the full page HTML including head and script tags',
        'Avoid this if you only need the extra HTML, since it can be '
          + 'quite large',
      ].join('\n')),
    }),
  • Registration of the tool in the exported tools array. The browser tools are exported only when API_TOKEN is set; otherwise the array contains only the activation instructions.
    export const tools = process.env.API_TOKEN
      ? [
          scraping_browser_navigate,
          scraping_browser_go_back,
          scraping_browser_go_forward,
          scraping_browser_links,
          scraping_browser_click,
          scraping_browser_type,
          scraping_browser_wait_for,
          scraping_browser_screenshot,
          scraping_browser_get_text,
          scraping_browser_get_html,
          scraping_browser_scroll,
          scraping_browser_scroll_to,
        ]
      : [scraping_browser_activation_instructions];
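The handler's two return paths can be exercised without a real browser. The sketch below uses a hypothetical fake page object (`makeFakePage`) that implements only the two methods the handler calls, `$eval` and `content`; `getHtml` is a simplified stand-in for the handler's branching, not the actual implementation.

```javascript
// Hypothetical stand-in for a browser page, exposing only $eval and content.
function makeFakePage(fullHtml, bodyHtml) {
  return {
    // Mimics page.$eval(selector, fn) by running fn against a minimal element.
    $eval: async (selector, fn) => fn({ innerHTML: bodyHtml }),
    // Mimics page.content() by resolving to the serialized document.
    content: async () => fullHtml,
  };
}

// Simplified version of the handler's branching (assumption: error handling
// and browser acquisition are left out for brevity).
async function getHtml(page, { full_page = false } = {}) {
  if (!full_page) return page.$eval('body', (body) => body.innerHTML);
  return page.content();
}

const page = makeFakePage(
  '<html><head><script>var a = 1;</script></head><body><p>Hi</p></body></html>',
  '<p>Hi</p>'
);

getHtml(page).then((html) => console.log('body only:', html));
getHtml(page, { full_page: true }).then((html) => console.log('full page:', html));
```

The body-only path never serializes the head, which is why it stays small on script-heavy pages.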

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/dsouza-anush/brightdata-mcp-heroku'

If you have feedback or need assistance with the MCP directory API, please join our Discord server.