spa_read
Extract content from JavaScript-heavy Single Page Applications by rendering pages with a headless browser and converting them to LLM-ready Markdown.
Instructions
Render a JavaScript SPA page and extract its content as LLM-ready Markdown. Uses a headless browser to execute JavaScript, then extracts the main article content.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | The URL of the SPA page to read | |
| waitForSelector | No | CSS selector to wait for before extraction | |
| waitTimeout | No | Navigation timeout in ms | 30000 |
| includeMetadata | No | Include title/author/excerpt as YAML frontmatter | true |
| cookies | No | Cookies to inject before page load (e.g., session tokens) | |
| headers | No | Custom HTTP headers (e.g., Authorization) | |
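
As a rough illustration of how the parameters in the table fit together, the sketch below assembles an arguments object for a `spa_read` call. The helper name `build_spa_read_args` is hypothetical, and several details are assumptions not stated in the schema: that `cookies` is a list of name/value objects, that `headers` is a flat string-to-string mapping, and that only absolute `http(s)` URLs are accepted. How the arguments are actually sent depends on the client you use.

```python
import json
from typing import Any, Optional
from urllib.parse import urlparse


def build_spa_read_args(
    url: str,
    wait_for_selector: Optional[str] = None,
    wait_timeout: int = 30000,       # default from the table, in milliseconds
    include_metadata: bool = True,   # default from the table
    cookies: Optional[list[dict[str, str]]] = None,  # ASSUMED shape: [{"name": ..., "value": ...}]
    headers: Optional[dict[str, str]] = None,        # ASSUMED shape: {"Header-Name": "value"}
) -> dict[str, Any]:
    """Hypothetical helper: build the arguments for a spa_read call."""
    # ASSUMPTION: the tool expects an absolute http(s) URL.
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"url must be an absolute http(s) URL, got {url!r}")
    if wait_timeout <= 0:
        raise ValueError("waitTimeout must be a positive number of milliseconds")

    args: dict[str, Any] = {
        "url": url,
        "waitTimeout": wait_timeout,
        "includeMetadata": include_metadata,
    }
    # Optional fields are left out entirely when they are not provided.
    if wait_for_selector is not None:
        args["waitForSelector"] = wait_for_selector
    if cookies:
        args["cookies"] = cookies
    if headers:
        args["headers"] = headers
    return args


# Placeholder URL, selector, and token, for illustration only.
example_args = build_spa_read_args(
    "https://example.com/app/article/42",
    wait_for_selector="main article",
    headers={"Authorization": "Bearer <token>"},
)
print(json.dumps(example_args, indent=2))
```

Setting `waitForSelector` is likely to matter most for pages that fetch content after the initial load, since extraction would otherwise run against whatever has rendered by the time navigation completes.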