render.extract_dom
Extract the DOM structure from webpages for security analysis and vulnerability testing within bug bounty hunting workflows.
Instructions
Extract and return the DOM structure of a webpage
Input Schema
TableJSON Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | URL to extract DOM from | |
| waitTime | No | Wait time in ms |
Implementation Reference
- src/tools/render.ts:84-131 (handler)The handler function for the 'render.extract_dom' tool. It uses Puppeteer to launch a browser, navigate to the URL, wait for the specified time, extract the page's HTML (truncated), title, forms (with input details), and links (top 100), then returns a formatted ToolResult.async ({ url, waitTime = 2000 }: any): Promise<ToolResult> => { let page: Page | null = null; try { const browserInstance = await getBrowser(); page = await browserInstance.newPage(); await page.goto(url, { waitUntil: 'networkidle2', timeout: 30000 }); await new Promise(resolve => setTimeout(resolve, waitTime)); const html = await page.content(); const title = await page.title(); const forms = await page.$$eval('form', (forms) => forms.map((form) => ({ action: form.action, method: form.method, inputs: Array.from(form.querySelectorAll('input')).map((input: any) => ({ name: input.name, type: input.type, id: input.id, })), })) ); const links = await page.$$eval('a', (links) => links.map((link: any) => ({ href: link.href, text: link.textContent?.trim(), })) ); await page.close(); return formatToolResult(true, { url, title, html: html.substring(0, 50000), // Limit size forms, links: links.slice(0, 100), // Limit links summary: { formsCount: forms.length, linksCount: links.length, }, }); } catch (error: any) { if (page) await page.close().catch(() => {}); return formatToolResult(false, null, error.message); } }
- src/tools/render.ts:74-83 (schema)The input schema definition for the 'render.extract_dom' tool, specifying the required 'url' parameter and optional 'waitTime'.description: 'Extract and return the DOM structure of a webpage', inputSchema: { type: 'object', properties: { url: { type: 'string', description: 'URL to extract DOM from' }, waitTime: { type: 'number', description: 'Wait time in ms', default: 2000 }, }, required: ['url'], }, },
- src/tools/render.ts:72-132 (registration)The registration of the 'render.extract_dom' tool via server.tool(), including the tool name, schema, and handler function within the registerRenderTools function.'render.extract_dom', { description: 'Extract and return the DOM structure of a webpage', inputSchema: { type: 'object', properties: { url: { type: 'string', description: 'URL to extract DOM from' }, waitTime: { type: 'number', description: 'Wait time in ms', default: 2000 }, }, required: ['url'], }, }, async ({ url, waitTime = 2000 }: any): Promise<ToolResult> => { let page: Page | null = null; try { const browserInstance = await getBrowser(); page = await browserInstance.newPage(); await page.goto(url, { waitUntil: 'networkidle2', timeout: 30000 }); await new Promise(resolve => setTimeout(resolve, waitTime)); const html = await page.content(); const title = await page.title(); const forms = await page.$$eval('form', (forms) => forms.map((form) => ({ action: form.action, method: form.method, inputs: Array.from(form.querySelectorAll('input')).map((input: any) => ({ name: input.name, type: input.type, id: input.id, })), })) ); const links = await page.$$eval('a', (links) => links.map((link: any) => ({ href: link.href, text: link.textContent?.trim(), })) ); await page.close(); return formatToolResult(true, { url, title, html: html.substring(0, 50000), // Limit size forms, links: links.slice(0, 100), // Limit links summary: { formsCount: forms.length, linksCount: links.length, }, }); } catch (error: any) { if (page) await page.close().catch(() => {}); return formatToolResult(false, null, error.message); } } );
- src/tools/render.ts:9-17 (helper)Helper function to lazily initialize and return the shared Puppeteer browser instance used by render tools.async function getBrowser(): Promise<Browser> { if (!browser) { browser = await puppeteer.launch({ headless: true, args: ['--no-sandbox', '--disable-setuid-sandbox'], }); } return browser; }
- src/index.ts:42-42 (registration)Invocation of registerRenderTools(server) in the main index file, which registers the render tools including 'render.extract_dom'.registerRenderTools(server);