Skip to main content
Glama

deep_read_topic

Search multiple academic databases, download real PDFs, extract full text and evidence chunks, and provide local file paths for direct paper inspection.

Instructions

Search, download, extract full text, and return evidence chunks plus local PDF paths for direct inspection.

By default the result is just [result_dict]: it carries every downloaded PDF's local path in pdf_paths (and in deep_reads[].pdf_path / downloads[].pdf_path), so a client can open the files when needed without any base64 in the payload. Opt in to inline content when your client benefits: render_top_pages=True appends the top paper's relevant pages as images (vision models); attach_top_pdf=True embeds its PDF as an application/pdf resource (Claude API style). include_scihub=True adds a Sci-Hub fallback; write_graph=True also renders a citation graph (path in graph_path).

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
topicYes
research_questionNo
limit_per_sourceNo
related_limitNo
download_top_nNo
top_chunks_per_paperNo
chunk_size_charsNo
chunk_overlap_charsNo
include_scihubNo
scihub_fallback_limitNo
from_yearNo
to_yearNo
open_access_onlyNo
write_to_zoteroNo
existing_collection_keyNo
existing_collection_nameNo
create_collection_nameNo
attach_pdfsNo
write_graphNo
render_top_pagesNo
max_render_pagesNo
render_scaleNo
attach_top_pdfNo
attach_pdf_max_mbNo
attach_pdf_max_pagesNo
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are present, so the description carries the full burden. It explains default behavior (result dict with pdf_paths), side effects (write_graph, write_to_zotero), fallback (include_scihub), and output format details (local paths, no base64). This provides substantial transparency, though it could mention rate limits or authentication requirements if applicable.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is divided into two focused paragraphs: the first states the high-level purpose, and the second details optional behaviors. Every sentence provides useful information, though it could be slightly shortened without losing key details. The structure supports quick scanning.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (25 parameters, no output schema, no annotations), the description covers the main output structure and key options but omits explanations for many parameters. It provides enough context for basic usage but may leave agents uncertain about lower-level settings. Return values beyond paths are not detailed.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, so the description must add meaning. It explains several key parameters (render_top_pages, attach_top_pdf, include_scihub, write_graph, etc.) and their effects. However, many parameters (e.g., limit_per_source, related_limit, download_top_n, chunk settings) are not described, leaving gaps. The description adds value but incompletely compensates for the lack of schema descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description starts with a clear verb and resource: 'Search, download, extract full text, and return evidence chunks plus local PDF paths for direct inspection.' It distinguishes itself from sibling tools like research_topic or search_literature by emphasizing local file paths and optional inline content, making its unique value proposition evident.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides guidance on when to opt into inline content (e.g., 'when your client benefits: render_top_pages=True appends top paper's relevant pages as images...'), but it does not explicitly state when to prefer this tool over its siblings (e.g., research_topic, search_literature) or when to avoid it. The usage context is partially clear but lacks comparative guidance.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/aytzey/paper-pilot'

If you have feedback or need assistance with the MCP directory API, please join our Discord server