Extract Article Text
diffbot.articles.extractExtract article text, author, date, tags, sentiment, and images from any blog or news URL. Follows multi-page articles to concatenate complete content.
Instructions
Extract article text, author, date, tags, sentiment, and images from any blog or news URL with multi-page support (Diffbot)
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | Article or blog post URL to extract text, author, and metadata | |
| paging | No | Follow multi-page articles and concatenate text (default true) | |
| maxTags | No | Maximum number of topic tags to return (1-50, default 10) | |
| timeout | No | Request timeout in milliseconds (5000-30000, default 15000) |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | No | Tool response payload. Shape varies per tool — consult the tool description and inputSchema. May be an object, array, string, or number depending on the upstream provider response. | |
| error | No | Present only when the call failed. Includes error code, message, request_id, and any provider-specific extras. |