annota

by dengls24

Overview Schema Related Servers Score Discussions

Python

Local

search_zotero_items

Locate Zotero items by title, author, or item key. Use keyword or key queries to directly find specific papers.

Instructions

按标题、作者或 key 搜索 Zotero 条目。

比 list_zotero_items 更高效，可直接定位目标论文。

Args: query: 搜索关键词（标题/作者的部分文字，或 Zotero item key） limit: 最多返回条目数（默认 20）

Input Schema

TableJSON Schema

Name	Required	Description	Default
`query`	Yes
`limit`	No

Output Schema

TableJSON Schema

Name	Required	Description	Default
`result`	Yes

Implementation Reference

annota/server.py:188-199 (handler)

The MCP tool handler for search_zotero_items. Decorated with @mcp.tool(), it accepts a query string and optional limit (default 20), then delegates to zotero_db.search_items() and returns JSON results.

@mcp.tool()
def search_zotero_items(query: str, limit: int = 20) -> str:
    """按标题、作者或 key 搜索 Zotero 条目。

    比 list_zotero_items 更高效，可直接定位目标论文。

    Args:
        query: 搜索关键词（标题/作者的部分文字，或 Zotero item key）
        limit: 最多返回条目数（默认 20）
    """
    items = zotero_db.search_items(query=query, limit=limit)
    return json.dumps(items, ensure_ascii=False, indent=2)

annota/zotero_db.py:235-288 (handler)

The actual implementation of search_items() in the Zotero database layer. It connects to the Zotero SQLite database (read-only copy) and performs a fuzzy search across item titles, creator last/first names, and item keys, returning a list of items with optional PDF attachment info.

def search_items(query: str, limit: int = 20) -> list[dict]:
    """按标题或作者模糊搜索 Zotero 条目。"""
    conn = _connect(readonly=True)
    try:
        pattern = f"%{query}%"
        rows = conn.execute("""
            SELECT DISTINCT
                i.itemID,
                i.key,
                idv_title.value AS title
            FROM items i
            LEFT JOIN itemData id_title
                ON i.itemID = id_title.itemID
                AND id_title.fieldID = (
                    SELECT fieldID FROM fields WHERE fieldName = 'title'
                )
            LEFT JOIN itemDataValues idv_title
                ON id_title.valueID = idv_title.valueID
            LEFT JOIN itemCreators ic ON i.itemID = ic.itemID
            LEFT JOIN creators c ON ic.creatorID = c.creatorID
            WHERE i.itemTypeID NOT IN (1, 3, 28)
                AND i.libraryID = ?
                AND (
                    idv_title.value LIKE ?
                    OR c.lastName LIKE ?
                    OR c.firstName LIKE ?
                    OR i.key = ?
                )
            ORDER BY i.dateModified DESC
            LIMIT ?
        """, (LIBRARY_ID, pattern, pattern, pattern, query, limit)).fetchall()

        result = []
        for row in rows:
            item = {
                "itemID": row["itemID"],
                "key": row["key"],
                "title": row["title"] or "(无标题)",
            }
            att = conn.execute("""
                SELECT ia.itemID AS attID, ia.path
                FROM itemAttachments ia
                WHERE ia.parentItemID = ?
                    AND ia.contentType = 'application/pdf'
                LIMIT 1
            """, (row["itemID"],)).fetchone()
            if att:
                item["pdf_attachment_id"] = att["attID"]
                item["pdf_path"] = att["path"]
            result.append(item)

        return result
    finally:
        conn.close()

annota/server.py:45-58 (registration)

FastMCP server instantiation where the tool is implicitly registered via the @mcp.tool() decorator on line 188.

# ── MCP Server ──────────────────────────────────────────────────
mcp = FastMCP(
    name="annota",
    instructions=(
        "Annota: 读取本地 Zotero 库中的 PDF 文件，提取带坐标的文本，"
        "并精准回写高亮批注和笔记。所有坐标均为 PDF user space（原点左下角）。\n\n"
        "【大 PDF 两阶段工作流】处理超过 10 页的论文时，请遵循：\n"
        "  Phase 1 — 理解：先用 get_pdf_text_bulk 批量提取纯文本（无坐标），让 LLM 理解全文内容，"
        "确定哪些页面的哪些句子需要标注。\n"
        "  Phase 2 — 定位：只对目标页调用 get_pdf_layout_text 获取精确坐标，"
        "然后用 create_pdf_annotation 写入标注。\n"
        "  切勿对每一页都调用 get_pdf_layout_text，这会导致上下文溢出。"
    ),
)

annota/server.py:10-10 (schema)
Tool description in the module docstring: 'search_zotero_items: 按标题/作者/key 搜索条目'
```
- search_zotero_items:   按标题/作者/key 搜索条目
```

annota/zotero_db.py:39-74 (helper)

The _connect() helper function used by search_items to establish a read-only database connection via copying the Zotero SQLite database.

def _connect(readonly: bool = False) -> sqlite3.Connection:
    """创建数据库连接。

    Zotero 使用 EXCLUSIVE 锁模式，运行时外部无法直接读写。
    解决方案：
    - 只读操作：复制数据库到临时文件再读取（毫秒级，安全并发）
    - 写操作：直接连接 + 重试机制（等 Zotero 释放锁的间隙写入）
    """
    if readonly:
        return _connect_readonly_copy()

    conn = sqlite3.connect(
        f"file:{ZOTERO_DB_PATH}",
        uri=True,
        timeout=DB_TIMEOUT,
    )
    conn.row_factory = sqlite3.Row
    conn.execute(f"PRAGMA busy_timeout={DB_TIMEOUT * 1000}")
    conn.execute("PRAGMA journal_mode=WAL")
    return conn


def _connect_readonly_copy() -> sqlite3.Connection:
    """复制数据库文件后连接，避免与 Zotero 的锁冲突。

    Zotero 在运行时持有排他锁，即使 WAL 模式也无法并发读。
    复制 .sqlite 文件（通常 <100MB，耗时 <100ms）到临时目录后读取。
    """
    import shutil
    import tempfile

    tmp_dir = Path(tempfile.gettempdir()) / "annota"
    tmp_dir.mkdir(exist_ok=True)
    tmp_db = tmp_dir / "zotero_readonly.sqlite"

    shutil.copy2(ZOTERO_DB_PATH, tmp_db)

Tool Definition Quality

A4.2/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It explains the search criteria and limit parameter but does not disclose any behavioral traits such as authentication needs or side effects. The output schema exists, so return format is covered.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is extremely concise: one sentence for the purpose, one for comparison, and two lines for arguments. No redundant information; every part earns its place.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a tool with 2 parameters and an existing output schema, the description covers the essential aspects: what it does, how it differs from a sibling, and parameter details. It does not mention pagination beyond the limit, but that is acceptable.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, so the description compensates well. It adds meaning to both parameters: 'query' is described as search keyword (partial title/author or item key), and 'limit' as max results (default 20). This provides sufficient guidance beyond the bare schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool searches Zotero items by title, author, or key. It explicitly distinguishes itself from the sibling tool 'list_zotero_items' by claiming higher efficiency and direct targeting of papers.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear guidance by comparing to list_zotero_items, indicating this tool is more efficient for targeted searches. However, it does not explicitly state when not to use it.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly
Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
OpenAI
open source

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/dengls24/annota'

If you have feedback or need assistance with the MCP directory API, please join our Discord server