web_content
Extract fully rendered web content from dynamic pages, SPAs, and infinite scroll sites by handling JavaScript execution and capturing dynamically loaded elements with configurable scrolling options.
Instructions
Fetch fully rendered DOM content using browserless.io. Handles AJAX/JavaScript dynamic loading. Optimized for SPAs and infinite scroll pages. Returns the complete rendered HTML after all JavaScript execution, including dynamically loaded content. Each scroll waits for page height changes and network activity to settle.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | The URL to fetch | |
| initialWaitTime | No | Time to wait (in milliseconds) after loading the page before scrolling | 3000 |
| scrollCount | No | Number of times to scroll down the page | 0 |
| scrollWaitTime | No | Time to wait (in milliseconds) between each scroll action | 3000 |
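The three timing parameters combine into a rough lower bound on how long a call spends waiting. The sketch below is illustrative only: `min_wait_ms` is a hypothetical helper, not part of the tool, and it assumes one `scrollWaitTime` pause per scroll. Actual calls may take longer, since each scroll also waits for page height changes and network activity to settle.

```python
def min_wait_ms(scroll_count: int = 0,
                initial_wait_time: int = 3000,
                scroll_wait_time: int = 3000) -> int:
    """Lower bound on configured waiting, in milliseconds.

    Assumption: one scroll_wait_time pause per scroll. Defaults mirror the
    tool's schema defaults (3000, 0, 3000).
    """
    if min(scroll_count, initial_wait_time, scroll_wait_time) < 0:
        raise ValueError("all values must be non-negative")
    return initial_wait_time + scroll_count * scroll_wait_time


print(min_wait_ms())                # 3000
print(min_wait_ms(scroll_count=5))  # 18000
```

With the defaults and no scrolling, the configured wait is just the initial 3 seconds; five scrolls add at least another 15 seconds under this assumption.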
Input Schema (JSON Schema)
{
  "properties": {
    "initialWaitTime": {
      "default": 3000,
      "description": "Time to wait (in milliseconds) after loading the page before scrolling",
      "type": "number"
    },
    "scrollCount": {
      "default": 0,
      "description": "Number of times to scroll down the page",
      "type": "number"
    },
    "scrollWaitTime": {
      "default": 3000,
      "description": "Time to wait (in milliseconds) between each scroll action",
      "type": "number"
    },
    "url": {
      "description": "The URL to fetch",
      "type": "string"
    }
  },
  "required": [
    "url"
  ],
  "type": "object"
}
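As a rough illustration of how a caller might assemble arguments that satisfy this schema, the following sketch fills in the schema defaults and checks types before sending. `build_arguments` and `DEFAULTS` are hypothetical names introduced here, the example URL is a placeholder, and the non-empty check on `url` is an extra safeguard that the schema itself does not require.

```python
import json

# Defaults copied from the JSON Schema above.
DEFAULTS = {"initialWaitTime": 3000, "scrollCount": 0, "scrollWaitTime": 3000}


def build_arguments(url: str, **overrides) -> dict:
    """Return a web_content argument object with schema defaults applied."""
    if not isinstance(url, str) or not url:
        raise ValueError("url is required and must be a non-empty string")
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    args = {"url": url, **DEFAULTS, **overrides}
    for name in DEFAULTS:
        value = args[name]
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, (int, float)):
            raise TypeError(f"{name} must be a number")
    return args


# Placeholder URL; scroll five times with the default wait times.
print(json.dumps(build_arguments("https://example.com/feed", scrollCount=5), indent=2))
```

Only `url` has to be supplied; omitting the other three fields is equivalent to passing the defaults shown in the schema.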