crawl_url_content
Extract content from a single web page, including main text, links, and images, with customizable options for maximum content length, using the Open Search MCP server.
Instructions
Crawl and extract content from a single web page
Input Schema
Name | Required | Description | Default |
---|---|---|---|
extractImages | No | Extract all images from the page | |
extractLinks | No | Extract all links from the page | |
extractText | No | Extract main text content | |
maxContentLength | No | Maximum length of extracted content | |
url | Yes | URL of the web page to crawl |
Input Schema (JSON Schema)
{
"properties": {
"extractImages": {
"default": false,
"description": "Extract all images from the page",
"type": "boolean"
},
"extractLinks": {
"default": false,
"description": "Extract all links from the page",
"type": "boolean"
},
"extractText": {
"default": true,
"description": "Extract main text content",
"type": "boolean"
},
"maxContentLength": {
"default": 5000,
"description": "Maximum length of extracted content",
"maximum": 20000,
"minimum": 100,
"type": "number"
},
"url": {
"description": "URL of the web page to crawl",
"type": "string"
}
},
"required": [
"url"
],
"type": "object"
}