parse
Extract and clean webpage content into structured Markdown using Mozilla's Readability algorithm. Retrieve article title, main text, excerpt, byline, and site name while removing ads and non-essential elements.
Instructions
Extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
url | Yes | The website URL to parse |