Skip to main content
Glama

read_article

Extract clean article content from web URLs by converting pages to Markdown format. Remove ads and navigation noise to prepare text for AI analysis, summarization, or translation.

Instructions

读取指定 URL 的文章内容,返回 LLM 友好的 Markdown 格式

通过 Jina AI Reader 将网页转换为干净的 Markdown,自动去除广告、导航栏等噪音内容。 适合用于:阅读新闻正文、获取文章详情、分析文章内容。

典型使用流程:

  1. 先用 search_news(include_url=True) 搜索新闻获取链接

  2. 再用 read_article(url=链接) 读取正文内容

  3. AI 对 Markdown 正文进行分析、摘要、翻译等

Args: url: 文章链接(必需),以 http:// 或 https:// 开头 timeout: 请求超时时间(秒),默认 30,最大 60

Returns: JSON格式的文章内容,包含完整 Markdown 正文

Examples: - read_article(url="https://example.com/news/123")

Note: - 使用 Jina AI Reader 免费服务(100 RPM 限制) - 每次请求间隔 5 秒(内置速率控制) - 部分付费墙/登录墙页面可能无法完整获取

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
urlYes
timeoutNo

Output Schema

TableJSON Schema
NameRequiredDescriptionDefault
resultYes
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It effectively describes key traits: the tool uses Jina AI Reader for conversion, removes noise content, has rate limits (100 RPM, 5-second intervals), and may fail on restricted pages. It also specifies the return format (JSON with Markdown). However, it lacks details on error handling or timeout behavior beyond the default.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is appropriately sized and well-structured, with a clear purpose statement upfront, followed by usage guidelines, workflow examples, parameter details, and notes. Every section adds value without redundancy, and it uses bullet points and formatting (like **典型使用流程**) for readability.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (web scraping with rate limits), no annotations, and an output schema (which handles return values), the description is complete. It covers purpose, usage, parameters, behavioral traits (like rate limits and limitations), and integration with sibling tools, leaving no significant gaps for an AI agent to understand and invoke the tool correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, so the description must compensate fully. It adds significant meaning beyond the bare schema: it explains that 'url' is required and must start with http:// or https://, and that 'timeout' has a default of 30 seconds and a maximum of 60 seconds. This provides clear semantic context that the schema alone does not.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose with specific verbs ('读取' - read, '返回' - return) and resources ('文章内容' - article content, 'Markdown 格式' - Markdown format). It distinguishes from siblings by focusing on reading individual articles rather than searching, aggregating, or analyzing news, which are handled by other tools like search_news or analyze_data_insights.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit guidance on when to use this tool versus alternatives, including a '典型使用流程' (typical workflow) that recommends using search_news first to get URLs and then read_article for content extraction. It also lists suitable scenarios ('适合用于' - suitable for) like reading news content or analyzing articles, and notes exclusions for paywalled or login-required pages.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/LeePresident/TrendRadar'

If you have feedback or need assistance with the MCP directory API, please join our Discord server