extract_article
Extract structured article content from news and blog URLs, including title, text, author, and date, handling paywalls and JavaScript-rendered pages.
Instructions
Extract the main article content from a news article or blog post. Returns title, body text, author, and publish date as structured JSON. Handles paywalled and JavaScript-rendered articles. Optimised for editorial content — use extract_url for product pages, listings, or generic data. Read-only — makes no changes to any external system. Requires HAUNT_API_KEY environment variable. Free tier: 100 requests/month. Returns an error if rate limit or API key is invalid.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | The full URL of the article or blog post to extract content from. |