What can you do with this server?

This server provides tools to search, navigate, and extract content from 3GPP and IETF RFC specifications. * Search 3GPP documents (search_3gpp_docs): Keyword-based search across specs (e.g., TS 24.008, TS 24.301, TS 24.501, TS 36.300), with optional filtering by spec name and configurable result limits. * Get EMM/5GMM cause codes (get_emm_cause): Look up detailed info (number, name, description, source spec) for EMM (LTE) or 5GMM (5G) cause codes by cause number. * List available specifications (list_specs): Get all indexed 3GPP specs with total indexed chunk counts. * Detailed spec catalog (get_spec_catalog): Access a comprehensive catalog including titles, versions, series, descriptions, and content statistics. * Navigate chapter hierarchy (get_spec_toc): Retrieve the Table of Contents for a specification, with optional depth or section prefix limits. * Retrieve exact section text (get_section): Fetch precise text for a section using its unique ID or a spec ID + section number combination. * Expand related sections (search_related_sections): Discover sections related to an anchor section — parents, children, siblings, and neighbors. * Trace cross-spec references (get_spec_references): Explore incoming and outgoing citations between different specifications. * Access ingestion guidelines: Retrieve operational instructions for tasks like ETSI download, RFC ingestion, or managing the extraction pipeline.

Which integrations are available for this server?

Provides query expansion for semantic search using NVIDIA NIM API to generate hypothetical 3GPP-style answers, bridging vocabulary gap between queries and documents.

How do I use mcp-server-3gpp?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@mcp-server-3gpp Explain 5GMM cause 27 from TS 24.501" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

de en es ja ko ru zh

mcp-server-3gpp

by Lee-SiHyeon

Overview Schema Related Servers Score Discussions

JavaScript

Hybrid

mcp-server-3gpp

MCP server for 3GPP and IETF RFC specifications, backed by a prebuilt SQLite corpus.

The current v2 server is built around AI-guided chapter navigation, not hard-coded protocol lookup logic. The intended workflow is:

Discover relevant specs with get_spec_catalog, search_etsi_catalog, or search_3gpp_docs.
Walk the chapter structure with get_spec_toc.
Retrieve exact text with get_section.
Expand locally with search_related_sections.
Jump across documents with get_spec_references.
Extract test case structure with get_test_case_structure and list_test_cases.

Search is a starting point, not the whole product. The model is expected to browse and choose chapters deliberately.

What ships today

DB-backed v2 server with 12 MCP tools
Prebuilt corpus in data/corpus/3gpp.db
207 specs total: 113 TS, 1 TR, 93 RFC
66,109 full sections and 63,376 TOC rows
45,162 cross-spec reference edges
Stdio MCP entrypoint in src/index.js
Optional Streamable HTTP transport in src/http.js
Hybrid search with 4-way RRF (Reciprocal Rank Fusion)
Optional AnyTXT Searcher integration for Windows
Optional HyDE (Hypothetical Document Embeddings) query expansion via NVIDIA NIM API

Search behavior

Search uses a multi-stage retrieval pipeline with up to four parallel retrievers fused by RRF (Reciprocal Rank Fusion, k=60):

Query -> [FTS5/BM25]    (keyword, always active)
      -> [sqlite-vec]    (semantic, when embeddings ready)
      -> [AnyTXT API]    (full-document, when AnyTXT running on Windows)
      -> [HyDE + LLM]    (query expansion, when NVIDIA API key configured)
      -> RRF fusion -> Structure-aware ranking -> Results

Key properties:

Baseline npm install gives you the keyword-ready server path: BM25/FTS search, TOC navigation, exact section retrieval, and cross-spec references.
search_3gpp_docs supports quoted phrases, spec: filters, section: hints, and negation in that baseline path.
RRF fusion replaces raw score combination with rank-based scoring, eliminating the need for score normalization across retrievers with incompatible distributions. Use fusion: 'rrf' (default) or fusion: 'linear' to switch.
The database and runtime can host sqlite-vec embeddings via vec_sections, but that only makes the corpus vector-capable.
Semantic or hybrid retrieval should be treated as an optional readiness state. It is active only when the runtime has semantic prerequisites and the smoke path actually returns mode_actual=hybrid or mode_actual=semantic with semantic evidence in results.
HyDE expands short/vague queries by generating a hypothetical 3GPP-style answer via NVIDIA LLM, then embedding that answer for vector search. This bridges the vocabulary gap between queries and documents.

Requirements

Node.js 20.x, 22.x, and 24.x are the supported, CI-tested runtimes.
The project uses better-sqlite3 12.x so installs can use prebuilt native binaries across the supported Node versions, including Node 24 on Windows.
If you expand the Node version range later, update the native dependency and CI matrix together.

Quick start

git lfs install
git clone https://github.com/Lee-SiHyeon/mcp-server-3gpp.git
cd mcp-server-3gpp
npm install
npm run validate
npm start

The bundled database is tracked with Git LFS. A healthy startup looks like:

[3GPP MCP] Database ready: .../data/corpus/3gpp.db
[3GPP MCP] Features - FTS: true, Vector: true
[3GPP MCP] Registered 12 tools (v2 DB mode)

npm run validate now reports two separate states:

Baseline keyword readiness: the DB-backed 12-tool server is healthy and search/navigation work in keyword mode.
Optional semantic readiness: whether semantic prerequisites are installed and whether the live tool smoke test actually activated semantic/hybrid retrieval.

MCP client configuration

Claude Desktop

{
  "mcpServers": {
    "3gpp": {
      "command": "node",
      "args": ["/absolute/path/to/mcp-server-3gpp/src/index.js"]
    }
  }
}

VS Code / GitHub Copilot

{
  "servers": {
    "3gpp": {
      "type": "stdio",
      "command": "node",
      "args": ["/absolute/path/to/mcp-server-3gpp/src/index.js"]
    }
  }
}

Optional custom DB path

{
  "env": {
    "THREEGPP_DB_PATH": "/custom/path/to/3gpp.db"
  }
}

The server checks these DB locations in order:

THREEGPP_DB_PATH
data/corpus/3gpp.db
data/3gpp.db

Tool surface

Tool	Purpose
`get_spec_catalog`	List indexed specs with title, version, series, description, section count, and page count.
`get_spec_toc`	Return the chapter hierarchy for a spec, optionally limited by depth or section prefix.
`get_section`	Fetch the exact section text by `sectionId` or `specId + sectionNumber`.
`search_3gpp_docs`	Rank candidate sections for a query and return section IDs for follow-up retrieval.
`search_related_sections`	Expand from an anchor section through parent, child, sibling, and search-derived neighbors.
`get_spec_references`	Traverse incoming and outgoing cross-spec citations.
`search_etsi_catalog`	Search cataloged ETSI delivery metadata, including documents not yet downloaded or embedded.
`get_etsi_document`	Inspect one ETSI catalog document with versions and optional file URLs.
`get_ingest_guide`	Return operational instructions for ETSI download, RFC ingest, or the extraction pipeline.
`list_specs`	Compatibility alias with a smaller output shape; prefer `get_spec_catalog`.
`get_test_case_structure`	Extract structured test case data (Test Purpose, Conformance Requirements, Test Procedure) from conformance specs.
`list_test_cases`	List all test case sections within a spec.

Recommended prompting pattern

Use prompts that encourage structure-first navigation:

Find the chapter in TS 24.301 that defines attach reject causes.
Start by locating the spec, then inspect the TOC, then fetch the most relevant section.

I need the exact wording for the NAS registration timer behavior in 5G.
Search for likely sections, then read the chapter text and nearby sections.

Show which RFCs and 3GPP specs TS 29.500 cites most often.

Corpus statistics

Metric	Value
Total specs	207
TS specs	113
TR specs	1
RFC specs	93
TOC rows	63,376
Section rows	66,109
Cross-spec references	45,162
Ingestion runs recorded	535

Architecture at a glance

LLM client
  -> MCP transport (stdio or HTTP)
  -> tool registry + validation
  -> tool handlers
  -> SQLite corpus (specs, toc, sections, sections_fts, spec_references, ingestion_runs)
  -> ETSI catalog (etsi_publication_types, etsi_ranges, etsi_documents, etsi_versions, etsi_files)
  -> optional vec_sections table and guide resources
  -> search pipeline:
       keywordSearch.js  (FTS5 + BM25)
       semanticSearch.js  (sqlite-vec cosine)
       anytxtSearch.js    (AnyTXT JSON-RPC API, Windows only)
       hydeExpander.js    (NVIDIA NIM LLM for query expansion)
       hybridRanker.js    (RRF fusion + structure-aware ranking)

More detail lives in docs/architecture.md and docs/data-model.md.

Validation and tests

npm run validate
npm test

npm run validate checks the package metadata, resolves the DB path, verifies the core schema and counts, confirms the v2 12-tool surface, runs the navigation smoke path, and reports semantic readiness separately from baseline keyword readiness.

Optional semantic readiness

Semantic retrieval is not part of the baseline install contract. Treat it as an operator opt-in layer on top of the keyword server.

Current prerequisites:

sqlite-vec must load successfully at runtime.
vec_sections must be populated with embeddings for the active corpus.
A compatible transformers runtime must be present for local embeddings. The repository now ships with @huggingface/transformers, and the runtime still accepts @xenova/transformers for compatibility.
The live search_3gpp_docs smoke path must actually return mode_actual=hybrid or mode_actual=semantic.

scripts/generate_embeddings.js is now the real local corpus-population workflow for vec_sections. It can build or rebuild the embedding index, but semantic-active readiness still requires a fresh full-corpus index. Partial runs (--spec or --limit) are useful for smoke tests and controlled backfills, but they intentionally do not mark semantic retrieval as globally ready.

Manual smoke workflows

ETSI catalog smoke:

npm run catalog:smoke

This crawls a tiny ETSI TS range into the catalog tables only. It does not download PDFs, extract text, or generate embeddings. For broader cataloging, use npm run catalog:crawl -- --all-publication-types --depth versions and add limits such as --max-ranges, --max-docs, --max-versions, or --max-requests for controlled backfills.

Catalog crawler safety controls:

python3 scripts/crawl_etsi_catalog.py --publication-types etsi_ts etsi_tr --depth documents --plan-only
python3 scripts/crawl_etsi_catalog.py --publication-types etsi_ts etsi_tr --depth documents --max-ranges 10
python3 scripts/crawl_etsi_catalog.py --publication-types etsi_ts etsi_tr --depth documents --resume
python3 scripts/select_etsi_ingest.py --policy priority --format download-list

The crawler records progress in catalog_crawl_runs and catalog_crawl_progress. Long catalog writes remain CLI-only; MCP catalog tools are read-only. Download, extraction, and embedding status lives separately from document identity in etsi_document_status. select_etsi_ingest.py marks priority 3GPP-mapped ETSI documents for later download/extract/embed work and can emit a tab-separated download plan without downloading any PDFs.

Degraded-path smoke:

npm install
npm run validate

Expected result:

Baseline keyword readiness: true
Optional semantic prerequisites met: false or Semantic-active tool smoke: false
Search mode actual: keyword

Semantic-active smoke:

Run npm install.
Ensure sqlite-vec loads and vec_sections is populated with a fresh full-corpus embedding index that matches the active 384-dim model/prefix contract. Example:

node scripts/generate_embeddings.js --rebuild

Run npm run validate.

Expected result:

Optional semantic prerequisites met: true
Semantic-active tool smoke: true
Semantic smoke mode actual: hybrid or semantic

Optional AnyTXT integration (Windows)

AnyTXT Searcher provides an additional search signal and document parsing for formats beyond PDF. When running, its JSON-RPC API at localhost:9920 is auto-detected.

Prerequisites:

Install AnyTXT Searcher on Windows.
Enable the HTTP API: menu Help -> API.
Sync the raw/ directory into AnyTXT's index (Tools -> Index Manager).

When available, AnyTXT adds a third retriever to the RRF fusion, searching raw file content including DOCX, XLSX, and scanned PDF (OCR). When unavailable, the pipeline degrades to 2-way RRF (keyword + semantic).

CLI extraction tool:

node scripts/extract_anytxt.js --check                # verify API availability
node scripts/extract_anytxt.js --sync raw/            # sync raw dir to AnyTXT index
node scripts/extract_anytxt.js raw/ts_38_523_1.pdf    # extract text from a file
node scripts/extract_anytxt.js --batch raw/           # batch extract all supported files

Optional HyDE query expansion

HyDE (Hypothetical Document Embeddings) improves recall for short or vague queries by generating a hypothetical 3GPP-style answer via LLM, then using that answer's embedding for vector search. This bridges the vocabulary gap between queries and technical documents.

Configure via environment variable or programmatic API:

NVIDIA_API_KEY=nvapi-... npm start

import { configureHyde } from './src/search/hybridRanker.js';
configureHyde({ apiKey: 'nvapi-...' });

Defaults to nvidia/llama-3.1-nemotron-nano-8b-v1 on the NVIDIA NIM endpoint. Any OpenAI-compatible chat completion URL can be used via the apiUrl option.

Project structure

mcp-server-3gpp/
├── src/
│   ├── index.js
│   ├── http.js
│   ├── db/
│   ├── search/
│   │   ├── hybridRanker.js      (RRF fusion + structure-aware ranking)
│   │   ├── keywordSearch.js     (FTS5 + BM25)
│   │   ├── semanticSearch.js    (sqlite-vec)
│   │   ├── anytxtSearch.js      (AnyTXT JSON-RPC retriever)
│   │   ├── hydeExpander.js      (HyDE query expansion via NVIDIA LLM)
│   │   └── queryParser.js
│   ├── anytxt/
│   │   ├── client.js            (JSON-RPC 2.0 client)
│   │   └── parser.js            (document text extraction)
│   ├── tools/
│   └── ingest/
├── docs/
├── data/
│   └── corpus/
│       └── 3gpp.db
├── test/
├── validate.js
└── package.json

Notes

The documented operating model is the DB-backed v2 server.
There is still a legacy fallback path in src/index.js if no SQLite DB is found, but that is a bootstrap escape hatch, not the primary interface this repository documents.
get_section and get_spec_toc are the core deterministic retrieval tools. Search should feed them, not replace them.

Install Server

license - permissive license

quality

maintenance

How are these scores calculated?

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Tools

Latest Blog Posts

Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
open source
OpenAI
Tool Definition Quality Score (TDQS)
By punkpeye on April 3, 2026.
mcp
The Hackers Who Tracked My Sleep Cycle
By punkpeye on March 26, 2026.
security

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Lee-SiHyeon/mcp-server-3gpp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server