Alexandria MCP
Allows searching and retrieving academic papers from arXiv, a repository of preprints in physics, mathematics, computer science, and related fields.
Enables retrieval of documentation for the Fastify web framework.
Provides access to Islamicate texts from the OpenITI collection hosted on GitHub.
Allows searching and accessing book metadata and full text from Google Books.
Enables searching of the NASA Astrophysics Data System (ADS) for astronomy, astrophysics, and physics literature.
Powers natural language search routing using OpenAI's GPT-4o-mini model to determine which sources to query.
Allows searching over 200 million academic papers with AI-powered recommendations and citation graphs.
Provides vector storage for ingesting and retrieving full-text content via a Supabase database.
Enables searching of digitized newspapers, books, and other collections from the National Library of Australia.
Allows searching and retrieving research outputs from CERN's open repository with higher rate limits.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Alexandria MCPfind recent papers on transformer models"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Alexandria
A Model Context Protocol (MCP) server for querying, reading, and ingesting texts from 61 public digital libraries. Works with any MCP-compatible client (Claude Desktop, Cursor, VS Code Copilot, etc.).
Tools
Tool | Description |
| List all 61 sources with descriptions and full-text capabilities |
| Natural language search — routes your query to the best sources, searches in parallel, returns unified deduplicated results |
| Search a specific source by title, author, or keywords |
| Fetch full text or metadata for an item (200k char limit) |
| Dry run: chunk and score text quality without writing anything |
| Chunk → embed → store in your vector database. Idempotent. |
| Get similar papers via Semantic Scholar's recommendation engine (up to 500) |
library_ask is the primary entry point. library_search is for targeted queries against a known source. library_index / library_ingest are for building a vector knowledge base from retrieved texts.
Sources (61)
Public Domain Literature (29)
Source | Coverage | Full Text |
| 76k+ public domain books | Yes |
| 30M+ records | Metadata only |
| 41M+ texts, newspapers, scanned books | Yes |
| Curated registry: Quran, Sufi corpus, Vedanta, Buddhism, Taoism, Hermeticism, Christian mysticism | Yes (scraped) |
| Free-content library: historical documents, literary works | Yes |
| Carefully formatted, public domain ebooks | Yes |
| Classical Greek and Latin texts with translations | Yes |
| Chinese Text Project — pre-modern Chinese literature | Yes |
| Bibliothèque nationale de France — French heritage texts | Yes |
| Library of Congress — US historical collections | Metadata only |
| 17M+ volumes from research libraries | Metadata only |
| Digital Public Library of America — US cultural heritage | Metadata only |
| National Diet Library Japan | Metadata only |
| European cultural heritage — 50M+ objects | Metadata only |
| National Library of Australia — newspapers, books, images | Yes |
| Biodiversity Heritage Library — natural history literature | Yes |
| National Library of New Zealand | Metadata only |
| Internet Classics Archive — 441 classical works | Yes |
| Marxists Internet Archive — political theory, philosophy | Yes |
| Nordic literature and history | Yes |
| Biblioteca Virtual Miguel de Cervantes — Spanish literature | Yes |
| Directory of Open Access Books — 70k+ peer-reviewed OA books | Metadata only |
| Open Access Publishing in European Networks — humanities & social sciences | Yes |
| Google Books — metadata and preview snippets | Metadata only |
| Library of Congress — US historic newspapers 1770–1963 | Yes |
| Christian Classics Ethereal Library | Yes |
| Public domain and self-published ebooks | Yes |
| World Digital Library — international manuscripts and maps | Metadata only |
| Data.gov — US government open data catalog | Metadata only |
Academic & Science (11)
Source | Coverage | Full Text |
| 2M+ preprints: physics, math, CS, biology, economics | Yes |
| 57M+ open access research papers across all disciplines | Yes |
| Europe PubMed Central — life sciences literature | Yes |
| NASA Technical Reports Server | Yes |
| DOE Office of Scientific and Technical Information | Yes |
| Education Resources Information Center | Yes |
| NSF Award Search — funded research abstracts | Yes |
| US federal and state court opinions (Free Law Project). 125 req/day. | Yes |
| bioRxiv preprints — biology | Yes |
| CERN open repository — papers, datasets, software. 2M+ records. | Yes |
| Semantic Scholar — 200M+ papers with AI-powered metadata | Yes |
Government, Law & International (5)
Source | Coverage | Full Text |
| US Government Publishing Office — laws, regulations, congressional records | Yes |
| NIH Office of Portfolio Analysis | Yes |
| National Library of Norway | Metadata only |
| legislation.gov.uk — UK Acts and Statutory Instruments | Yes |
| Open Science Framework — preprints and research data | Yes |
Specialized Corpora (3)
Source | Coverage | Full Text |
| Early English print 1473–1700 | Yes |
| OpenITI — Arabic/Persian Islamic texts (GitHub-based) | Yes |
| Scottish legislation | Yes |
Research Aggregators (8)
Source | Coverage | Full Text |
| OpenAlex — 240M+ scholarly works, open catalog | Metadata only |
| PLOS journals — open access science | Yes |
| Crossref — 150M+ DOI metadata records | Metadata only |
| NASA Astrophysics Data System | Yes |
| Smithsonian Institution — collections and research | Metadata only |
| Directory of Open Access Journals — 20k+ journals | Metadata only |
| National Archives — US federal records | Metadata only |
| SpringerNature — OA and metadata | Metadata only |
Institutional Repositories (4)
Source | Coverage | Full Text |
| Harvard Library Digital Collections | Metadata only |
| Cambridge University repository | Yes |
| Oxford Research Archive | Yes |
| Bielefeld Academic Search Engine — 300M+ documents (pending IP whitelist) | Metadata only |
Software Documentation (1)
Source | Coverage | Full Text |
| Google Code Wiki — open source project documentation | Yes |
Credentials
Most tools query external library APIs directly and need no credentials at all. The two optional dependencies are scoped to specific tools:
OpenAI — optional (platform.openai.com)
Required by two tools only:
library_ask— usesgpt-4o-minito route your natural language query to the right sources and generate optimized per-source search terms. Without this key, uselibrary_searchto query sources directly.library_ingest— usestext-embedding-3-smallto embed chunked text before writing to the vector store.
library_list_sources, library_search, library_read, library_index, and library_recommend all work without an OpenAI key.
Supabase — optional (supabase.com)
Required by one tool only:
library_ingest— writes chunked, embedded text into a pgvector table for semantic search. Without this, retrieved texts stay in-context and are not persisted anywhere.
Everything else — searching, reading, browsing, getting recommendations — queries external sources in real time and needs no database.
Source-specific keys
Some sources require their own API key. These are free registrations. Sources without a key listed here work without any credentials.
Env Var | Source(s) | Get It |
|
| |
|
| |
|
| api.data.gov/signup — one key covers both |
|
| Google Cloud Console → APIs & Services → Books API |
|
| |
|
| |
|
| |
|
| apis.europeana.eu — test key immediate, personal ~1 week |
|
| github.com/settings/tokens — public repo read scope, optional but prevents rate limiting |
|
| |
|
| dev.springernature.com — same registration, two keys |
|
| zenodo.org/account/settings/applications/tokens/new — optional, increases rate limits |
|
| semanticscholar.org/product/api — optional, increases rate limits |
|
| trove.nla.gov.au/about/create-something/using-api — ~1 week approval |
|
| base-search.net/about/en/contact — requires IP whitelist |
Setup
git clone https://github.com/suavecito585/alexandria-mcp
cd alexandria-mcp
npm install
npm run buildCopy .env.example to .env. Minimum configuration to run with no credentials (search and read only):
TRANSPORT=stdioTo enable library_ask:
TRANSPORT=stdio
OPENAI_API_KEY=sk-...To enable library_ingest:
TRANSPORT=stdio
OPENAI_API_KEY=sk-...
SUPABASE_URL=https://your-project.supabase.co
SUPABASE_SERVICE_ROLE_KEY=eyJ...Supabase Schema
Required only if using library_ingest:
create table if not exists knowledge_chunks (
id bigserial primary key,
content text not null,
embedding vector(1536),
mcp_name text,
metadata jsonb,
created_at timestamptz default now()
);
create table if not exists source_docs (
id bigserial primary key,
source_url text not null,
mcp_name text not null,
title text,
source text,
chunk_count int,
indexed_at timestamptz,
unique (source_url, mcp_name)
);
create index if not exists knowledge_chunks_embedding_idx
on knowledge_chunks using ivfflat (embedding vector_cosine_ops)
with (lists = 100);Claude Desktop (stdio)
Minimum config (search and read only):
{
"mcpServers": {
"library": {
"command": "node",
"args": ["/path/to/alexandria-mcp/dist/index.js"],
"env": {
"TRANSPORT": "stdio"
}
}
}
}With library_ask and library_ingest enabled:
{
"mcpServers": {
"library": {
"command": "node",
"args": ["/path/to/alexandria-mcp/dist/index.js"],
"env": {
"TRANSPORT": "stdio",
"OPENAI_API_KEY": "sk-...",
"SUPABASE_URL": "https://your-project.supabase.co",
"SUPABASE_SERVICE_ROLE_KEY": "eyJ..."
}
}
}
}Railway (HTTP)
Set env vars in the Railway dashboard and deploy:
railway upRegister in Claude Desktop:
{
"mcpServers": {
"library": {
"url": "https://your-service.up.railway.app/mcp"
}
}
}Health check: GET /health returns { status: "ok", sources: 61 }.
Adding Custom Providers
The pipeline is provider-agnostic. To add a new embedding model or vector store:
Implement
EmbeddingProviderorVectorStoreProviderfromsrc/types.tsAdd your implementation to
src/pipeline/providers/Register it in
src/pipeline/providers/index.tsSet
EMBEDDING_PROVIDERorVECTOR_STORE_PROVIDERin your env
// Example: Ollama embedding provider
import type { EmbeddingProvider } from '../../types.js';
export class OllamaEmbeddingProvider implements EmbeddingProvider {
readonly dimensions = 768;
async embed(texts: string[]): Promise<number[][]> {
// your implementation
}
}This server cannot be installed
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/The-40-Thieves/alexandria-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server