Skip to main content
Glama
wuruiqi

academic-workflow-mcp

by wuruiqi

workflow_find_duplicates

Scan Zotero for duplicate items grouped by identical DOI or highly similar title to prevent duplicate annotation work.

Instructions

Scan Zotero for duplicate items grouped by identical DOI or highly similar title.

Run this before starting a reading session or after a batch import to catch entries that were added twice, preventing duplicate annotation work.

Args: collections: Collection names to scan (e.g. ["1_SLAM与退化导航", "2_3DGS与三维重建"]). Leave empty to scan the entire library. title_threshold: Word-overlap ratio for title similarity (default 0.85). 0.85 catches near-identical titles; lower values cast a wider net.

Returns: { "duplicate_groups": [ { "reason": "Same DOI" | "Similar title", "doi": str, "suggested_keep": str, # item key recommended to keep "items": [ {key, title, year, authors, journal, doi, zotero_link}, ... ], } ], "total_duplicates": int, # total number of items that are duplicates "group_count": int, "tip": str, }

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
collectionsNo
title_thresholdNo
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Describes that it scans entire library if collections empty, uses title threshold, and returns grouped duplicates with suggested keep. No annotations provided, but description covers key behavioral aspects for a scan tool.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Concise, front-loaded purpose, efficient use of words. Includes detailed return structure despite no output schema. No redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Provides return value structure, input semantics, and usage context. Sibling tool names indicate this is the only duplicate finder, so completeness is high.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema has 0% description coverage, but description fully explains both parameters: collections (names or empty for all) and title_threshold (word-overlap ratio, default 0.85, effect of lower values). Adds significant meaning beyond schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states it scans Zotero for duplicate items by DOI or similar title. Distinct from sibling tools which deal with notes, papers, and imports.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly advises running before a reading session or after batch import to prevent duplicate annotation work. Does not explicitly mention when not to use, but context is clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/wuruiqi/academic-workflow-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server