fetch_webpage
Extract the text content of a web page. Optional parameters control resource blocking, custom request headers, basic authentication, the navigation timeout, and which character range of the extracted text is returned.
Instructions
Retrieve text content from a web page
Input Schema
Name | Required | Description | Default |
---|---|---|---|
blockResources | No | Whether to block images, stylesheets, and fonts to improve performance | true |
headers | No | Custom headers to include in the request | |
maxLength | No | Maximum number of characters to return for content extraction | 2000 |
password | No | Password for basic authentication | |
resourceTypesToBlock | No | List of resource types to block (e.g., "image", "stylesheet", "font") | |
startIndex | No | Start character index for content extraction | 0 |
timeout | No | Navigation timeout in milliseconds | 60000 |
url | Yes | The URL of the webpage to fetch | |
username | No | Username for basic authentication | |
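Because maxLength caps each response and startIndex sets where extraction begins, a long page can be read in successive windows. The sketch below only builds the argument dictionaries for such calls; it does not invoke the tool. The helper names `build_arguments` and `page_arguments`, the example URL, and the `total_chars` value are assumptions made for illustration. In practice the page length is usually unknown in advance, so a caller would more likely keep advancing startIndex until a response comes back shorter than maxLength.

```python
from typing import Any, Dict, Iterator

# Defaults as documented in the parameter table; the other optional
# parameters have no documented default and are omitted unless overridden.
DOCUMENTED_DEFAULTS: Dict[str, Any] = {
    "blockResources": True,
    "maxLength": 2000,
    "startIndex": 0,
    "timeout": 60000,
}


def build_arguments(url: str, **overrides: Any) -> Dict[str, Any]:
    """Hypothetical helper: merge caller overrides onto the documented defaults."""
    if not url:
        raise ValueError("url is required")
    args: Dict[str, Any] = {"url": url, **DOCUMENTED_DEFAULTS}
    args.update(overrides)
    return args


def page_arguments(
    url: str, total_chars: int, max_length: int = 2000
) -> Iterator[Dict[str, Any]]:
    """Hypothetical helper: yield one arguments dict per window of max_length characters."""
    for start in range(0, total_chars, max_length):
        yield build_arguments(url, startIndex=start, maxLength=max_length)


# Assumed example: a page whose extracted text is 4500 characters long.
pages = list(page_arguments("https://example.com/article", total_chars=4500))
print([p["startIndex"] for p in pages])  # [0, 2000, 4000]
```

Each dictionary can then be passed as the arguments of one fetch_webpage call, and the returned text fragments concatenated in order.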