diffbot.articles.extract
Extract article text, author, date, tags, sentiment, and images from blog or news URLs, including multi-page articles.
Instructions
Extract article text, author, date, tags, sentiment, and images from any blog or news URL with multi-page support (Diffbot)
Input Schema
TableJSON Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | Article or blog post URL to extract text, author, and metadata | |
| paging | No | Follow multi-page articles and concatenate text (default true) | |
| maxTags | No | Maximum number of topic tags to return (1-50, default 10) | |
| timeout | No | Request timeout in milliseconds (5000-30000, default 15000) |