workflow_import_pdfs
Extract real titles from PDFs and import into Zotero. Smart deduplication attaches PDFs to existing metadata entries to prevent duplicates.
Instructions
Import PDF files into Zotero with automatic title extraction and smart deduplication.
Improvements over zotero-mcp's zotero_add_from_file:
Extracts the real title from the PDF (metadata field, then largest-font first-page text) instead of using the filename stem.
Detects title mismatches between the filename and the PDF content — useful for catching mislabeled downloads (e.g. wrong paper saved under a different name).
Smart deduplication:
If an identical file already has a PDF in Zotero → skip.
If a matching metadata-only entry exists (no PDF yet) → attach PDF to it rather than creating a duplicate parent item.
Copies the PDF into /storage// so Zotero can open it immediately without any manual "Locate File" step.
Args: pdf_paths: List of absolute file paths to PDFs. collection: Zotero collection name (exact, case-sensitive). Leave empty for library root. item_type: Zotero item type for new parent items. Default "preprint". Other values: "journalArticle", "conferencePaper".
Returns: { "imported": [ {item_key, attachment_key, title, title_source, filename_stem, title_mismatch, attached_to_existing, storage_copied, path} ], "skipped": [ {path, item_key, title, reason} ], "failed": [ {path, error} ], "warnings": [ {path, filename_stem, extracted_title, title_source, message} ], # title mismatches needing user review "count": int, # newly imported (excludes skipped) }
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| item_type | No | preprint | |
| pdf_paths | Yes | ||
| collection | No |