crw_scrape
Extract content from any web page and convert it to markdown, HTML, or links for data analysis and processing.
Instructions
Scrape a single URL and return its content as markdown, HTML, or links. Use this to extract content from any web page.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| excludeTags | No | CSS selectors to exclude from the output | |
| formats | No | Output formats | `["markdown"]` |
| includeTags | No | CSS selectors to include (only content matching these selectors is returned) | |
| onlyMainContent | No | Extract only the main content, removing nav, footer, and similar elements | `true` |
| url | Yes | The URL to scrape | |
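
As a rough illustration of how the parameters above fit together, the following sketch assembles an arguments object for `crw_scrape`. The helper `build_scrape_args` is hypothetical and not part of the tool. The format identifiers `"html"` and `"links"` are assumed spellings (only `"markdown"` is confirmed by the schema), and the selector parameters are assumed to be lists of strings. Optional fields are omitted when they match the documented defaults, so the tool applies those defaults itself.

```python
import json

# Assumed identifiers: only "markdown" appears verbatim in the schema;
# "html" and "links" are guesses based on the tool description.
ASSUMED_FORMATS = {"markdown", "html", "links"}


def build_scrape_args(url, formats=None, only_main_content=True,
                      include_tags=None, exclude_tags=None):
    """Hypothetical helper: build the arguments dict for a crw_scrape call."""
    if not url:
        # url is the only required parameter in the schema.
        raise ValueError("url is required")

    args = {"url": url}

    if formats is not None:
        unknown = set(formats) - ASSUMED_FORMATS
        if unknown:
            raise ValueError(f"unrecognized formats: {sorted(unknown)}")
        args["formats"] = list(formats)

    # Send onlyMainContent only when it differs from the documented default (true).
    if not only_main_content:
        args["onlyMainContent"] = False

    # Selector lists are assumed to be arrays of CSS selector strings.
    if include_tags:
        args["includeTags"] = list(include_tags)
    if exclude_tags:
        args["excludeTags"] = list(exclude_tags)

    return args


if __name__ == "__main__":
    example = build_scrape_args(
        "https://example.com/blog/post",
        formats=["markdown", "links"],
        exclude_tags=["nav", ".sidebar"],
    )
    print(json.dumps(example, indent=2))
```

How this object is delivered to the tool depends on the client in use, so the transport is left out here.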