parse
Extract webpage content into clean, LLM-optimized Markdown by removing ads, navigation, and other non-essential elements. Retrieve the article title, main content, excerpt, byline, and site name using Mozilla's Readability algorithm.
Instructions
Extracts webpage content and transforms it into clean, LLM-optimized Markdown. Returns the article title, main content, excerpt, byline, and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers, and other non-essential elements while preserving the structure of the core content.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
url | Yes | The website URL to parse | |
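
As a rough illustration of the schema above, the sketch below builds the arguments object a caller might pass to the tool. The helper name `build_parse_arguments` is hypothetical. The restriction to absolute `http`/`https` URLs is an assumption rather than something the schema states. The schema itself only says that `url` is required.

```python
from urllib.parse import urlparse


def build_parse_arguments(url: str) -> dict:
    """Return an arguments object for the parse tool (hypothetical helper)."""
    parts = urlparse(url)
    # Assumption: only absolute http(s) URLs are meaningful for a webpage parser.
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"not an absolute http(s) URL: {url!r}")
    # `url` is the single required field in the input schema.
    return {"url": url}


if __name__ == "__main__":
    print(build_parse_arguments("https://example.com/article"))
```

Checking the URL on the client side lets a caller catch obvious mistakes, such as a missing scheme, before the request reaches the tool.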