ArxivSearcher MCP Server

by emi-dm

search_by_author

Find academic papers by a specific author on arXiv, with options to filter by category, date range, and limit results.

Instructions

Search papers by a specific author.

:param author_name: Name of the author to search for
:param max_results: Maximum number of results
:param category: Optional category filter (e.g., 'cs.SE', 'cs.AI')
:param start_date: Optional start date filter (YYYY-MM-DD or YYYY)
:param end_date: Optional end date filter (YYYY-MM-DD or YYYY)

Input Schema

Name         Required  Description  Default
author_name  Yes       -            -
max_results  No        -            -
category     No        -            -
start_date   No        -            -
end_date     No        -            -
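
To illustrate how these arguments combine, here is a minimal sketch of the query construction, mirroring the handler listed under Implementation Reference but without any network call. The names `parse_bound` and `build_author_query`, the `today` parameter, and the author "Jane Doe" are hypothetical and exist only for this example.

```python
from datetime import date, datetime


def parse_bound(value: str, end_of_year: bool = False) -> str:
    """Convert 'YYYY-MM-DD' or 'YYYY' into the YYYYMMDD form used in the query."""
    try:
        dt = datetime.strptime(value, "%Y-%m-%d")
    except ValueError:
        dt = datetime.strptime(value, "%Y")
        if end_of_year:
            # A bare year used as an end bound is stretched to December 31.
            dt = dt.replace(month=12, day=31)
    return dt.strftime("%Y%m%d")


def build_author_query(author_name, category=None, start_date=None,
                       end_date=None, today=None):
    """Hypothetical helper: assemble the arXiv query string from the tool arguments."""
    parts = [f'au:"{author_name}"']
    if category:
        parts.append(f"cat:{category}")
    if start_date or end_date:
        # Missing bounds fall back to 1991-08-14 and the current date.
        start = parse_bound(start_date) if start_date else "19910814"
        if end_date:
            end = parse_bound(end_date, end_of_year=True)
        else:
            end = (today or date.today()).strftime("%Y%m%d")
        parts.append(f"submittedDate:[{start} TO {end}]")
    return " AND ".join(parts)


query = build_author_query("Jane Doe", category="cs.AI",
                           start_date="2020", end_date="2022")
print(query)
# au:"Jane Doe" AND cat:cs.AI AND submittedDate:[20200101 TO 20221231]
```

A bare year as start_date resolves to January 1 of that year, while a bare year as end_date resolves to December 31, so "2020" to "2022" spans three full years.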

Output Schema

No arguments

Implementation Reference

  • Primary synchronous handler for the 'search_by_author' MCP tool. Constructs arXiv query using 'au:"author"' and optional filters, fetches results, and formats them into a dictionary.
    from datetime import datetime

    import arxiv


    def search_by_author(
        author_name: str,
        max_results: int = 20,
        category: str | None = None,
        start_date: str | None = None,
        end_date: str | None = None,
    ) -> dict:
        """
        Search papers by a specific author.
    
        :param author_name: Name of the author to search for
        :param max_results: Maximum number of results
        :param category: Optional category filter (e.g., 'cs.SE', 'cs.AI')
        :param start_date: Optional start date filter (YYYY-MM-DD or YYYY)
        :param end_date: Optional end date filter (YYYY-MM-DD or YYYY)
        """
        query_parts = [f'au:"{author_name}"']
    
        if category:
            query_parts.append(f"cat:{category}")
    
        # Add date range if specified
        if start_date or end_date:
            start = "19910814"
            if start_date:
                try:
                    dt = datetime.strptime(start_date, "%Y-%m-%d")
                except ValueError:
                    dt = datetime.strptime(start_date, "%Y")
                start = dt.strftime("%Y%m%d")
    
            end = datetime.now().strftime("%Y%m%d")
            if end_date:
                try:
                    dt = datetime.strptime(end_date, "%Y-%m-%d")
                except ValueError:
                    dt = datetime.strptime(end_date, "%Y")
                    dt = dt.replace(month=12, day=31)
                end = dt.strftime("%Y%m%d")
    
            query_parts.append(f"submittedDate:[{start} TO {end}]")
    
        final_query = " AND ".join(query_parts)
        print(f"[arxiv-search] Author query: {final_query}")
    
        search = arxiv.Search(
            query=final_query,
            max_results=max_results,
            sort_by=arxiv.SortCriterion.SubmittedDate,
            sort_order=arxiv.SortOrder.Descending,
        )
    
        results = []
        for r in search.results():
            results.append(
                {
                    "title": r.title,
                    "authors": [a.name for a in r.authors],
                    "summary": r.summary,
                    "pdf_url": r.pdf_url,
                    "published_date": r.published.strftime("%Y-%m-%d"),
                    "arxiv_id": r.entry_id.split("/")[-1],
                    "categories": r.categories,
                }
            )
    
        return {
            "author": author_name,
            "query_used": final_query,
            "total_results": len(results),
            "results": results,
        }
  • Asynchronous handler for the 'search_by_author' MCP tool in the remote version. Identical logic to the synchronous version but async.
    from datetime import datetime

    import arxiv


    async def search_by_author(
        author_name: str,
        max_results: int = 20,
        category: str | None = None,
        start_date: str | None = None,
        end_date: str | None = None,
    ) -> dict:
        """
        Search papers by a specific author.
    
        :param author_name: Name of the author to search for
        :param max_results: Maximum number of results
        :param category: Optional category filter (e.g., 'cs.SE', 'cs.AI')
        :param start_date: Optional start date filter (YYYY-MM-DD or YYYY)
        :param end_date: Optional end date filter (YYYY-MM-DD or YYYY)
        """
        query_parts = [f'au:"{author_name}"']
    
        if category:
            query_parts.append(f"cat:{category}")
    
        # Add date range if specified
        if start_date or end_date:
            start = "19910814"
            if start_date:
                try:
                    dt = datetime.strptime(start_date, "%Y-%m-%d")
                except ValueError:
                    dt = datetime.strptime(start_date, "%Y")
                start = dt.strftime("%Y%m%d")
    
            end = datetime.now().strftime("%Y%m%d")
            if end_date:
                try:
                    dt = datetime.strptime(end_date, "%Y-%m-%d")
                except ValueError:
                    dt = datetime.strptime(end_date, "%Y")
                    dt = dt.replace(month=12, day=31)
                end = dt.strftime("%Y%m%d")
    
            query_parts.append(f"submittedDate:[{start} TO {end}]")
    
        final_query = " AND ".join(query_parts)
        print(f"[arxiv-search] Author query: {final_query}")
    
        search = arxiv.Search(
            query=final_query,
            max_results=max_results,
            sort_by=arxiv.SortCriterion.SubmittedDate,
            sort_order=arxiv.SortOrder.Descending,
        )
    
        results = []
        for r in search.results():
            results.append(
                {
                    "title": r.title,
                    "authors": [a.name for a in r.authors],
                    "summary": r.summary,
                    "pdf_url": r.pdf_url,
                    "published_date": r.published.strftime("%Y-%m-%d"),
                    "arxiv_id": r.entry_id.split("/")[-1],
                    "categories": r.categories,
                }
            )
    
        return {
            "author": author_name,
            "query_used": final_query,
            "total_results": len(results),
            "results": results,
        }
  • Helper prompt generator named 'search_by_author' that creates a natural language prompt string for searching papers by author.
    def search_by_author(author_name: str) -> str:
        """Generates a prompt to search for papers by an author."""
        return f"Search for the latest papers by '{author_name}' on Arxiv."
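
The asynchronous handler listed above iterates `search.results()` directly inside an `async def`, which, as far as the listing shows, is a blocking call. One possible way to keep the event loop responsive, not taken from the source, is to move the blocking work to a worker thread with `asyncio.to_thread`. In this sketch `blocking_search` is a hypothetical stand-in for the arXiv fetch loop, and `search_async` is an illustrative name.

```python
import asyncio
import time


def blocking_search(query: str) -> list[str]:
    """Hypothetical stand-in for the blocking arXiv fetch loop."""
    time.sleep(0.1)  # simulates network latency
    return [f"result for {query}"]


async def search_async(query: str) -> list[str]:
    # Run the blocking call in a worker thread so the event loop stays free.
    return await asyncio.to_thread(blocking_search, query)


async def main() -> list[list[str]]:
    # The two searches overlap instead of running back to back.
    return await asyncio.gather(search_async("a"), search_async("b"))


results = asyncio.run(main())
print(results)
```

`asyncio.gather` returns results in the order the awaitables were passed, so the output order does not depend on which thread finishes first.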
Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. While 'search' implies a read-only operation, the description doesn't address important behavioral aspects: whether authentication is required, rate limits, pagination behavior, what happens when no results are found, or the format/structure of returned results. The description only states what the tool does, not how it behaves.
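
Judging from the handler listed under Implementation Reference, a search with no matches appears to return the usual dictionary with an empty list rather than raising. A minimal sketch of that shape follows. `format_response` is a hypothetical helper that mirrors the handler's final return statement, and "Jane Doe" is a placeholder author.

```python
def format_response(author_name: str, final_query: str, results: list[dict]) -> dict:
    """Hypothetical helper mirroring the dictionary the handler returns."""
    return {
        "author": author_name,
        "query_used": final_query,
        "total_results": len(results),
        "results": results,
    }


# No matching papers: same keys, zero count, empty list.
empty = format_response("Jane Doe", 'au:"Jane Doe"', [])
print(empty["total_results"], empty["results"])
```

Each entry in "results", when present, carries the keys shown in the handler: title, authors, summary, pdf_url, published_date, arxiv_id, and categories.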

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is perfectly structured and concise. The purpose is stated clearly in the first sentence, followed by a well-organized parameter documentation section. Every sentence earns its place - no redundant information, no unnecessary elaboration. The parameter documentation uses a consistent format that's easy to parse.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given that there's an output schema (which handles return values), no annotations, and the description provides excellent parameter documentation, the main gap is behavioral context. The description adequately covers what the tool does and how to use its parameters, but doesn't address operational aspects like authentication, error conditions, or performance characteristics that would help an agent use it effectively.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 0% schema description coverage, the description provides excellent parameter semantics that fully compensate. Each parameter is clearly documented with its purpose and format examples (e.g., 'YYYY-MM-DD or YYYY' for dates, 'cs.SE', 'cs.AI' for categories). The description adds substantial value beyond what the bare schema provides, explaining what each parameter means and how to format values.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'Search papers by a specific author.' This is a specific verb ('search') with a clear resource ('papers') and target ('by a specific author'). However, it doesn't explicitly differentiate from sibling tools like 'search_papers' - the agent must infer that this tool is author-specific while 'search_papers' might be more general.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives. With sibling tools like 'search_papers' and 'find_related_papers' available, the agent receives no explicit direction about when author-based searching is preferred over other search methods or when this tool should be avoided.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/emi-dm/Arxiv-MCP'
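
The same request can be sketched in Python with only the standard library. This is an illustration under assumptions: the `Accept` header and the JSON response format are guesses rather than documented behaviour, and `build_request` and `fetch_server_info` are hypothetical names.

```python
import json
import urllib.request

URL = "https://glama.ai/api/mcp/v1/servers/emi-dm/Arxiv-MCP"


def build_request(url: str = URL) -> urllib.request.Request:
    """Build the GET request without sending it."""
    return urllib.request.Request(
        url, method="GET", headers={"Accept": "application/json"}
    )


def fetch_server_info(url: str = URL) -> dict:
    """Send the request and decode the body, assuming it is JSON."""
    with urllib.request.urlopen(build_request(url), timeout=10) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Usage (performs a network call):
#     info = fetch_server_info()
```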

If you have feedback or need assistance with the MCP directory API, please join our Discord server.