mcp-server-3gpp
This server provides tools to search, navigate, and extract content from 3GPP and IETF RFC specifications.
Search 3GPP documents (
search_3gpp_docs): Keyword-based search across specs (e.g., TS 24.008, TS 24.301, TS 24.501, TS 36.300), with optional filtering by spec name and configurable result limits.Get EMM/5GMM cause codes (
get_emm_cause): Look up detailed info (number, name, description, source spec) for EMM (LTE) or 5GMM (5G) cause codes by cause number.List available specifications (
list_specs): Get all indexed 3GPP specs with total indexed chunk counts.Detailed spec catalog (
get_spec_catalog): Access a comprehensive catalog including titles, versions, series, descriptions, and content statistics.Navigate chapter hierarchy (
get_spec_toc): Retrieve the Table of Contents for a specification, with optional depth or section prefix limits.Retrieve exact section text (
get_section): Fetch precise text for a section using its unique ID or a spec ID + section number combination.Expand related sections (
search_related_sections): Discover sections related to an anchor section — parents, children, siblings, and neighbors.Trace cross-spec references (
get_spec_references): Explore incoming and outgoing citations between different specifications.Access ingestion guidelines: Retrieve operational instructions for tasks like ETSI download, RFC ingestion, or managing the extraction pipeline.
Provides query expansion for semantic search using NVIDIA NIM API to generate hypothetical 3GPP-style answers, bridging vocabulary gap between queries and documents.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@mcp-server-3gppExplain 5GMM cause 27 from TS 24.501"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
mcp-server-3gpp
MCP server for 3GPP and IETF RFC specifications, backed by a prebuilt SQLite corpus.
The current v2 server is built around AI-guided chapter navigation, not hard-coded protocol lookup logic. The intended workflow is:
Discover relevant specs with
get_spec_catalog,search_etsi_catalog, orsearch_3gpp_docs.Walk the chapter structure with
get_spec_toc.Retrieve exact text with
get_section.Expand locally with
search_related_sections.Jump across documents with
get_spec_references.Extract test case structure with
get_test_case_structureandlist_test_cases.
Search is a starting point, not the whole product. The model is expected to browse and choose chapters deliberately.
What ships today
DB-backed v2 server with 12 MCP tools
Prebuilt corpus in
data/corpus/3gpp.db207 specs total: 113 TS, 1 TR, 93 RFC
66,109 full sections and 63,376 TOC rows
45,162 cross-spec reference edges
Stdio MCP entrypoint in
src/index.jsOptional Streamable HTTP transport in
src/http.jsHybrid search with 4-way RRF (Reciprocal Rank Fusion)
Optional AnyTXT Searcher integration for Windows
Optional HyDE (Hypothetical Document Embeddings) query expansion via NVIDIA NIM API
Search behavior
Search uses a multi-stage retrieval pipeline with up to four parallel retrievers fused by RRF (Reciprocal Rank Fusion, k=60):
Query -> [FTS5/BM25] (keyword, always active)
-> [sqlite-vec] (semantic, when embeddings ready)
-> [AnyTXT API] (full-document, when AnyTXT running on Windows)
-> [HyDE + LLM] (query expansion, when NVIDIA API key configured)
-> RRF fusion -> Structure-aware ranking -> ResultsKey properties:
Baseline
npm installgives you the keyword-ready server path: BM25/FTS search, TOC navigation, exact section retrieval, and cross-spec references.search_3gpp_docssupports quoted phrases,spec:filters,section:hints, and negation in that baseline path.RRF fusion replaces raw score combination with rank-based scoring, eliminating the need for score normalization across retrievers with incompatible distributions. Use
fusion: 'rrf'(default) orfusion: 'linear'to switch.The database and runtime can host
sqlite-vecembeddings viavec_sections, but that only makes the corpus vector-capable.Semantic or hybrid retrieval should be treated as an optional readiness state. It is active only when the runtime has semantic prerequisites and the smoke path actually returns
mode_actual=hybridormode_actual=semanticwith semantic evidence in results.HyDE expands short/vague queries by generating a hypothetical 3GPP-style answer via NVIDIA LLM, then embedding that answer for vector search. This bridges the vocabulary gap between queries and documents.
Requirements
Node.js 20.x, 22.x, and 24.x are the supported, CI-tested runtimes.
The project uses
better-sqlite312.x so installs can use prebuilt native binaries across the supported Node versions, including Node 24 on Windows.If you expand the Node version range later, update the native dependency and CI matrix together.
Quick start
git lfs install
git clone https://github.com/Lee-SiHyeon/mcp-server-3gpp.git
cd mcp-server-3gpp
npm install
npm run validate
npm startThe bundled database is tracked with Git LFS. A healthy startup looks like:
[3GPP MCP] Database ready: .../data/corpus/3gpp.db
[3GPP MCP] Features - FTS: true, Vector: true
[3GPP MCP] Registered 12 tools (v2 DB mode)npm run validate now reports two separate states:
Baseline keyword readiness: the DB-backed 12-tool server is healthy and search/navigation work in keyword mode.Optional semantic readiness: whether semantic prerequisites are installed and whether the live tool smoke test actually activated semantic/hybrid retrieval.
MCP client configuration
Claude Desktop
{
"mcpServers": {
"3gpp": {
"command": "node",
"args": ["/absolute/path/to/mcp-server-3gpp/src/index.js"]
}
}
}VS Code / GitHub Copilot
{
"servers": {
"3gpp": {
"type": "stdio",
"command": "node",
"args": ["/absolute/path/to/mcp-server-3gpp/src/index.js"]
}
}
}Optional custom DB path
{
"env": {
"THREEGPP_DB_PATH": "/custom/path/to/3gpp.db"
}
}The server checks these DB locations in order:
THREEGPP_DB_PATHdata/corpus/3gpp.dbdata/3gpp.db
Tool surface
Tool | Purpose |
| List indexed specs with title, version, series, description, section count, and page count. |
| Return the chapter hierarchy for a spec, optionally limited by depth or section prefix. |
| Fetch the exact section text by |
| Rank candidate sections for a query and return section IDs for follow-up retrieval. |
| Expand from an anchor section through parent, child, sibling, and search-derived neighbors. |
| Traverse incoming and outgoing cross-spec citations. |
| Search cataloged ETSI delivery metadata, including documents not yet downloaded or embedded. |
| Inspect one ETSI catalog document with versions and optional file URLs. |
| Return operational instructions for ETSI download, RFC ingest, or the extraction pipeline. |
| Compatibility alias with a smaller output shape; prefer |
| Extract structured test case data (Test Purpose, Conformance Requirements, Test Procedure) from conformance specs. |
| List all test case sections within a spec. |
Recommended prompting pattern
Use prompts that encourage structure-first navigation:
Find the chapter in TS 24.301 that defines attach reject causes.
Start by locating the spec, then inspect the TOC, then fetch the most relevant section.I need the exact wording for the NAS registration timer behavior in 5G.
Search for likely sections, then read the chapter text and nearby sections.Show which RFCs and 3GPP specs TS 29.500 cites most often.Corpus statistics
Metric | Value |
Total specs | 207 |
TS specs | 113 |
TR specs | 1 |
RFC specs | 93 |
TOC rows | 63,376 |
Section rows | 66,109 |
Cross-spec references | 45,162 |
Ingestion runs recorded | 535 |
Architecture at a glance
LLM client
-> MCP transport (stdio or HTTP)
-> tool registry + validation
-> tool handlers
-> SQLite corpus (specs, toc, sections, sections_fts, spec_references, ingestion_runs)
-> ETSI catalog (etsi_publication_types, etsi_ranges, etsi_documents, etsi_versions, etsi_files)
-> optional vec_sections table and guide resources
-> search pipeline:
keywordSearch.js (FTS5 + BM25)
semanticSearch.js (sqlite-vec cosine)
anytxtSearch.js (AnyTXT JSON-RPC API, Windows only)
hydeExpander.js (NVIDIA NIM LLM for query expansion)
hybridRanker.js (RRF fusion + structure-aware ranking)More detail lives in docs/architecture.md and docs/data-model.md.
Validation and tests
npm run validate
npm testnpm run validate checks the package metadata, resolves the DB path, verifies the core schema and counts, confirms the v2 12-tool surface, runs the navigation smoke path, and reports semantic readiness separately from baseline keyword readiness.
Optional semantic readiness
Semantic retrieval is not part of the baseline install contract. Treat it as an operator opt-in layer on top of the keyword server.
Current prerequisites:
sqlite-vecmust load successfully at runtime.vec_sectionsmust be populated with embeddings for the active corpus.A compatible transformers runtime must be present for local embeddings. The repository now ships with
@huggingface/transformers, and the runtime still accepts@xenova/transformersfor compatibility.The live
search_3gpp_docssmoke path must actually returnmode_actual=hybridormode_actual=semantic.
scripts/generate_embeddings.js is now the real local corpus-population workflow for vec_sections. It can build or rebuild the embedding index, but semantic-active readiness still requires a fresh full-corpus index. Partial runs (--spec or --limit) are useful for smoke tests and controlled backfills, but they intentionally do not mark semantic retrieval as globally ready.
Manual smoke workflows
ETSI catalog smoke:
npm run catalog:smokeThis crawls a tiny ETSI TS range into the catalog tables only. It does not
download PDFs, extract text, or generate embeddings. For broader cataloging,
use npm run catalog:crawl -- --all-publication-types --depth versions and add
limits such as --max-ranges, --max-docs, --max-versions, or
--max-requests for controlled backfills.
Catalog crawler safety controls:
python3 scripts/crawl_etsi_catalog.py --publication-types etsi_ts etsi_tr --depth documents --plan-only
python3 scripts/crawl_etsi_catalog.py --publication-types etsi_ts etsi_tr --depth documents --max-ranges 10
python3 scripts/crawl_etsi_catalog.py --publication-types etsi_ts etsi_tr --depth documents --resume
python3 scripts/select_etsi_ingest.py --policy priority --format download-listThe crawler records progress in catalog_crawl_runs and
catalog_crawl_progress. Long catalog writes remain CLI-only; MCP catalog tools
are read-only. Download, extraction, and embedding status lives separately from
document identity in etsi_document_status. select_etsi_ingest.py marks
priority 3GPP-mapped ETSI documents for later download/extract/embed work and
can emit a tab-separated download plan without downloading any PDFs.
Degraded-path smoke:
npm install
npm run validateExpected result:
Baseline keyword readiness: trueOptional semantic prerequisites met: falseorSemantic-active tool smoke: falseSearch mode actual: keyword
Semantic-active smoke:
Run
npm install.Ensure
sqlite-vecloads andvec_sectionsis populated with a fresh full-corpus embedding index that matches the active 384-dim model/prefix contract. Example:
node scripts/generate_embeddings.js --rebuildRun
npm run validate.
Expected result:
Optional semantic prerequisites met: trueSemantic-active tool smoke: trueSemantic smoke mode actual: hybridorsemantic
Optional AnyTXT integration (Windows)
AnyTXT Searcher provides an additional search signal and document parsing for formats beyond PDF. When running, its JSON-RPC API at localhost:9920 is auto-detected.
Prerequisites:
Install AnyTXT Searcher on Windows.
Enable the HTTP API: menu Help -> API.
Sync the
raw/directory into AnyTXT's index (Tools -> Index Manager).
When available, AnyTXT adds a third retriever to the RRF fusion, searching raw file content including DOCX, XLSX, and scanned PDF (OCR). When unavailable, the pipeline degrades to 2-way RRF (keyword + semantic).
CLI extraction tool:
node scripts/extract_anytxt.js --check # verify API availability
node scripts/extract_anytxt.js --sync raw/ # sync raw dir to AnyTXT index
node scripts/extract_anytxt.js raw/ts_38_523_1.pdf # extract text from a file
node scripts/extract_anytxt.js --batch raw/ # batch extract all supported filesOptional HyDE query expansion
HyDE (Hypothetical Document Embeddings) improves recall for short or vague queries by generating a hypothetical 3GPP-style answer via LLM, then using that answer's embedding for vector search. This bridges the vocabulary gap between queries and technical documents.
Configure via environment variable or programmatic API:
NVIDIA_API_KEY=nvapi-... npm startimport { configureHyde } from './src/search/hybridRanker.js';
configureHyde({ apiKey: 'nvapi-...' });Defaults to nvidia/llama-3.1-nemotron-nano-8b-v1 on the NVIDIA NIM endpoint. Any OpenAI-compatible chat completion URL can be used via the apiUrl option.
Project structure
mcp-server-3gpp/
├── src/
│ ├── index.js
│ ├── http.js
│ ├── db/
│ ├── search/
│ │ ├── hybridRanker.js (RRF fusion + structure-aware ranking)
│ │ ├── keywordSearch.js (FTS5 + BM25)
│ │ ├── semanticSearch.js (sqlite-vec)
│ │ ├── anytxtSearch.js (AnyTXT JSON-RPC retriever)
│ │ ├── hydeExpander.js (HyDE query expansion via NVIDIA LLM)
│ │ └── queryParser.js
│ ├── anytxt/
│ │ ├── client.js (JSON-RPC 2.0 client)
│ │ └── parser.js (document text extraction)
│ ├── tools/
│ └── ingest/
├── docs/
├── data/
│ └── corpus/
│ └── 3gpp.db
├── test/
├── validate.js
└── package.jsonNotes
The documented operating model is the DB-backed v2 server.
There is still a legacy fallback path in
src/index.jsif no SQLite DB is found, but that is a bootstrap escape hatch, not the primary interface this repository documents.get_sectionandget_spec_tocare the core deterministic retrieval tools. Search should feed them, not replace them.
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/Lee-SiHyeon/mcp-server-3gpp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server