MinIO (or another S3-compatible service) provides object storage for PDF documents and related files in the academic literature management system.
PostgreSQL stores document metadata, vector embeddings, and knowledge graph entities and relations, and supports hybrid search that combines full-text search with pgvector semantic search.
Paperlib MCP
An MCP server for academic literature management and retrieval. It supports PDF import, hybrid search, knowledge graph construction, and literature review generation.
✨ Features
| Feature | Description |
| --- | --- |
| PDF Import | Auto-extract text, chunk by page, generate vector embeddings |
| Hybrid Search | FTS full-text search + pgvector semantic search |
| Knowledge Graph | LLM-driven entity/relation/claim extraction, Leiden community detection |
| Review Generation | Structured literature review auto-generation based on evidence packs |
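
The Hybrid Search feature combines a full-text ranking with a pgvector semantic ranking. This README does not say how the two result lists are merged; one common approach is reciprocal rank fusion (RRF). The sketch below is an illustration under that assumption only: the function name `rrf_fuse`, the constant `k=60`, and the chunk ids are hypothetical and are not taken from Paperlib MCP.

```python
from collections import defaultdict


def rrf_fuse(fts_ranked, vector_ranked, k=60):
    """Merge two ranked lists of chunk ids with reciprocal rank fusion.

    Each list is ordered best-first. An id's fused score is the sum of
    1 / (k + rank) over the lists it appears in (rank starts at 1).
    """
    scores = defaultdict(float)
    for ranked in (fts_ranked, vector_ranked):
        for rank, chunk_id in enumerate(ranked, start=1):
            scores[chunk_id] += 1.0 / (k + rank)
    # Sort by score descending; break ties by id for a stable order.
    return sorted(scores, key=lambda cid: (-scores[cid], cid))


# Hypothetical chunk ids, best match first in each list.
fts_hits = ["c3", "c1", "c7"]
vector_hits = ["c1", "c9", "c3"]
fused = rrf_fuse(fts_hits, vector_hits)
print(fused)
```

Chunks that rank well in both lists rise to the top, while chunks found by only one search still appear further down. RRF needs only ranks, so the full-text and vector scores never have to be put on a common scale.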
📋 Prerequisites
PostgreSQL 16+ with pgvector extension
MinIO or S3-compatible storage
OpenRouter API Key
🚀 Installation & Usage
Method 1: Docker Compose (Recommended for Beginners)
Launch the complete environment (PostgreSQL + MinIO + MCP) with a single command:
Configure in Cursor/Claude Desktop
Add to claude_desktop_config.json:
Method 2: uvx Install (Recommended)
Prerequisites: a running PostgreSQL instance (with pgvector) and a MinIO or other S3-compatible storage service.
Configure in Cursor/Claude Desktop, adjusting the environment variables to match your actual service addresses:
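
As a rough sketch of what such an entry can look like: Claude Desktop reads server definitions from an `mcpServers` object in `claude_desktop_config.json`, where each server has a `command`, `args`, and an optional `env` map. The Python below prints an entry of that shape. The package name `paperlib-mcp`, the server key `paperlib`, and every environment variable name and value are assumptions for illustration; check the project's own documentation for the real ones.

```python
import json


def build_mcp_entry(openrouter_key, db_host, s3_endpoint):
    """Return a dict shaped like a Claude Desktop `mcpServers` config."""
    return {
        "mcpServers": {
            "paperlib": {  # hypothetical server key
                "command": "uvx",
                "args": ["paperlib-mcp"],  # assumed package name
                "env": {
                    # All variable names below are placeholders.
                    "OPENROUTER_API_KEY": openrouter_key,
                    "POSTGRES_HOST": db_host,
                    "S3_ENDPOINT": s3_endpoint,
                },
            }
        }
    }


config = build_mcp_entry("your-api-key", "localhost", "http://localhost:9000")
print(json.dumps(config, indent=2))
```

The printed JSON can be pasted into the client configuration once the placeholder names and addresses are replaced with the real ones.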
Method 3: pip Install
Prerequisites: same as Method 2 (a running PostgreSQL instance and a MinIO/S3 service).
Configure the MCP client (adjust to match your actual service addresses):
Method 4: Local Development
📖 Available Tools
Basic Tools
| Tool | Description |
| --- | --- |
| | System health check |
| | Import PDF documents |
| | Hybrid search (recommended) |
| | Get document metadata |
| | List all documents |
Graph Tools
| Tool | Description |
| --- | --- |
| | Extract knowledge graph |
| | Build topic communities |
| | Generate community summaries |
Writing Tools
| Tool | Description |
| --- | --- |
| | Build evidence pack |
| | Generate review draft |
The full tool list (48+) is available at docs/MCP_TOOLS_REFERENCE.md.
💡 Usage Examples
📚 Documentation
| Document | Description |
| --- | --- |
| Deployment Guide | |
| System Architecture | |
| Embedding & Retrieval | |
| Knowledge Graph | |
| Database Schema | |
| Tools API Reference | |
🛠️ Tech Stack
| Component | Technology |
| --- | --- |
| MCP Protocol | FastMCP |
| Database | PostgreSQL 16 + pgvector |
| Object Storage | MinIO (S3 Compatible) |
| PDF Processing | PyMuPDF4LLM |
| Embedding Model | OpenRouter (text-embedding-3-small) |
| Graph Clustering | igraph + Leiden |
Environment Variables
| Variable | Required | Default | Description |
| --- | --- | --- | --- |
| | ✅ | - | OpenRouter API key |
| | ❌ | | Database host |
| | ❌ | | Database user |
| | ❌ | | Database password |
| | ❌ | | Database name |
| | ❌ | | MinIO endpoint |
| | ❌ | | MinIO user |
| | ❌ | | MinIO password |
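
The table above distinguishes one required variable (the OpenRouter API key) from optional ones that fall back to defaults. A minimal sketch of that pattern follows. The variable names and default values are hypothetical placeholders, since the real ones are defined by the project, and `load_settings` is an illustrative helper rather than part of Paperlib MCP.

```python
import os

# (name, required, default) -- names and defaults are placeholders.
SETTINGS_SPEC = [
    ("OPENROUTER_API_KEY", True, None),
    ("POSTGRES_HOST", False, "localhost"),
    ("S3_ENDPOINT", False, "localhost:9000"),
]


def load_settings(environ=None, spec=SETTINGS_SPEC):
    """Read settings from the environment, applying defaults.

    Raises KeyError listing every required variable that is missing.
    """
    environ = os.environ if environ is None else environ
    missing = [name for name, required, _ in spec
               if required and not environ.get(name)]
    if missing:
        raise KeyError("missing required variables: " + ", ".join(missing))
    # Unset or empty optional variables fall back to their default.
    return {name: environ.get(name) or default for name, _, default in spec}


settings = load_settings({"OPENROUTER_API_KEY": "example-key"})
print(settings)
```

Failing fast on a missing required key surfaces configuration mistakes at startup instead of at the first embedding request.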
📄 License
MIT