Which integrations are available for this server?

Provides structured search over Linux kernel mailing list archives at lore.kernel.org, enabling AI agents to query discussions, patches, and metadata from all kernel lists. Allows searching the WireGuard-specific mailing list archive on lore.kernel.org as part of the broader Linux kernel archive.

How do I use kernel-lore-mcp?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@kernel-lore-mcp find recent patches for the ext4 filesystem" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

kernel-lore-mcp

by mjbommar

Overview Schema Related Servers Score Discussions

Rust

Hybrid

kernel-lore-mcp

PyPI version Release License: MIT

Free (MIT) MCP server exposing structured search over the Linux kernel mailing list archives at lore.kernel.org to LLM-backed developer tools — Claude Code, Codex, Cursor, Zed, anything else that speaks the Model Context Protocol.

No authentication, ever. No API keys, no OAuth, no login flow. Same anonymous posture on every deployment — local, hosted, everywhere. Every agent that asks us a question is one fewer agent scraping lore directly; fanout-to-one is the value proposition.

Quick start

Install is one command. The first sync is where real time goes — budget honestly depending on what you want to cover:

Shape	Disk	First-sync wall-clock
1–2 small lists (`wireguard`, `xdp-newbies`)	~1 GB	1–5 min
Subsystem slice (lkml + netdev + linux-cifs)	~25 GB	15–60 min
Full lore (390 shards, every list)	~100 GB	4–12 h

Steady-state syncs on the 5-min timer after cold-start are seconds.

# 1. install — one command, pre-built abi3 wheel, no Rust toolchain required
uv tool install kernel-lore-mcp

# 2. first sync — manifest fetch + gix fetch + ingest in one process
#    under one writer lock. Pick a small slice for a first experiment:
export KLMCP_DATA_DIR=~/klmcp-data
mkdir -p "$KLMCP_DATA_DIR"
kernel-lore-sync \
    --data-dir "$KLMCP_DATA_DIR" \
    --with-over \
    --include '/wireguard/*' --include '/linux-cifs/*'
# Drop --include to mirror all ~390 lists. Plan the disk + time.

# 3. confirm freshness + which capabilities are provisioned
kernel-lore-mcp status --data-dir "$KLMCP_DATA_DIR"
# Look at `capabilities`: each over_db / bm25 / path_vocab / embedding /
# maintainers / git_sidecar boolean tells you which tools will actually
# return data on this deployment. While a sync is active, the same
# status output also shows `writer_lock_present`, `sync_active`, and
# the current sync stage.

# 3b. inspect shard/index health; add --heal to repair unborn shard HEADs
#     and remove unrecoverable shard repos so the next sync reclones them
kernel-lore-doctor --data-dir "$KLMCP_DATA_DIR"

# 4. verify the MCP surface — zero API cost
git clone --depth 1 https://github.com/mjbommar/kernel-lore-mcp.git
cd kernel-lore-mcp && ./scripts/agentic_smoke.sh local
# PASS: 7/7 tools, 5/5 resource templates, 5/5 prompts (the
# `REQUIRED_*` subset from src/kernel_lore_mcp/_surface_manifest.py;
# the live server registers 25 tools in total).

Then pick your agent and copy its snippet from docs/mcp/client-config.md. All four clients (Claude Code, Codex, Cursor, Zed) work over stdio against the exact same server binary.

Optional capabilities — opt in when you need them

The baseline sync gives you everything a typical query asks for. Three tiers are explicitly opt-in because they cost disk or time and not every deployment wants them:

Capability	Build	When you want it
BM25 prose search (`b:` / free text)	`kernel-lore-reindex --data-dir $KLMCP_DATA_DIR --tier bm25`	semantic-free text search over prose bodies
Semantic embeddings (`lore_nearest`, `lore_similar`)	`kernel-lore-embed --data-dir $KLMCP_DATA_DIR`	"more like this" / free-text → vector ANN
Git-sidecar (authoritative `merged` + `picked_up`)	`kernel-lore-build-git-sidecar --repo linux-stable --path /path/to/linux-stable.git`	upgrades `lore_stable_backport_status` + `lore_thread_state` from lore heuristic to git-history truth
MAINTAINERS snapshot	drop a `MAINTAINERS` file into `$KLMCP_DATA_DIR` or point `$KLMCP_MAINTAINERS_FILE` at one	`lore_maintainer_profile` declared-vs-observed ownership

kernel-lore-mcp status reports which are ready via the capabilities field, and tools that need an un-provisioned tier return a setup_required error naming the exact command to fix it (no silent empty results).

Install from source

Contributing? Building a custom binary?

curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | \
    sh -s -- -y --default-toolchain stable
git clone https://github.com/mjbommar/kernel-lore-mcp.git
cd kernel-lore-mcp
uv sync
uv run maturin develop --release
cargo build --release \
    --bin kernel-lore-sync \
    --bin kernel-lore-reindex \
    --bin kernel-lore-doctor
./target/release/kernel-lore-sync --data-dir $KLMCP_DATA_DIR --with-over
./target/release/kernel-lore-reindex --data-dir $KLMCP_DATA_DIR
./target/release/kernel-lore-doctor --data-dir $KLMCP_DATA_DIR

Going bigger

Want fuller coverage? Drop --include flags to mirror all ~390 lists (~100+ GB first run).

Want production-grade systemd deployment (single klmcp-sync.timer plus the long-lived MCP server)? docs/ops/runbook.md §1 onwards.

Related MCP server: email-insights

Status — v0.3.5 (2026-04-23)

Current release: v0.3.5. The 0.3.x line hardened hosted operation, made sync live-safe by default, added explicit derived-tier rebuilds via kernel-lore-reindex, and improved packaging so the wheel-shipped helper CLIs (kernel-lore-sync, kernel-lore-reindex, kernel-lore-doctor) work from clean uv tool install / uvx installs.

Shipped:

Ingest pipeline — gix + mail-parser + metadata / over.db / trigram / BM25 / embedding tiers. Incremental; dangling-OID safe; single-writer flock.
kernel-lore-sync — one Rust binary that internalized the legacy grokmirror + separate-ingest two-process chain. HTTPS manifest fetch, gix smart-HTTP clone-or-fetch (rayon-fanned across shards), ingest, and generation bump — all under one writer lock so there's no trigger/debounce race.
kernel-lore-reindex — rebuilds slower derived tiers from the already-downloaded local corpus. Defaults to tid + path_vocab; --tier bm25 rebuilds prose search explicitly and off the hot path.
kernel-lore-doctor — inspects shard + tier health and can repair unborn shard HEADs or remove broken shard repos so the next sync reclones them cleanly.
Full MCP surface: 25 tools (search, primitives, sampling- backed summarize/classify/explain, authoritative merged / picked_up verdicts via git-sidecar, lore_corpus_stats for coverage transparency, lore_author_footprint for address- mention search), 5 RFC-6570 resource templates, 2 static resources (blind-spots://coverage, stats://coverage), 5 slash-command prompts, populated KWIC snippets, freshness marker + capability booleans on every response.
HMAC-signed pagination cursors live on lore_search, lore_patch_search, lore_regex, lore_activity, lore_author_footprint. Query-scoped, tamper-detected.
stdio + Streamable HTTP transports; no SSE.
/status + /metrics (Prometheus) with freshness_ok + per-tier capabilities flags so clients distinguish "no results" from "feature not provisioned."
systemd units for hosted deploy; 5-min klmcp-sync.timer cadence, machine-readable sync progress, and exported writer_lock_present / sync_active metrics + status fields.
Live-tested against real claude --print and codex exec every commit via scripts/agentic_smoke.sh.

Near-term work is focused on production hardening and better continuous-sync ergonomics. The active execution list lives in TODO.md; dated plans under docs/plans/ remain as design history.

Deferred past v0.3: trained kernel-specific retrieval model (docs/research/training-retriever.md), snapshot-bundle reciprocity, Patchwork state integration, CVE-chain tool (all planned; see docs/plans/2026-04-14-best-in-class-kernel-mcp.md).

Why

Linux kernel development lives on ~390 public mailing lists. lei and b4 work well for humans with terminals, but LLM-backed developer tools have no equivalent: they can't answer "who touched fs/smb/server/smbacl.c in the last 90 days, grouped by series, with trailers" or "has this XDR overflow pattern been reported before" without being fed curated context by hand.

This project closes that gap. One MCP server over the full corpus, so an agent working on kernel code has the same research surface a senior maintainer has. And because it's all mirrored + indexed once, every agent query is zero HTTP load on lore.kernel.org.

Architecture in one paragraph

Four-tier index plus an embedding tier, purpose-built per query class: columnar metadata (Arrow/Parquet) for analytical scans; SQLite over.db (public-inbox pattern) for sub-millisecond metadata point lookups and predicate scans; trigram (fst + roaring) for patch/diff content with DFA-only regex confirmation; BM25 (tantivy) for prose; semantic (HNSW via instant-distance) for "more like this." Rust core via PyO3 0.28 does the heavy lifting; Python + FastMCP 3.2 serves MCP over stdio + Streamable HTTP. Ingestion is incremental from public-inbox git shards pulled via kernel-lore-sync (gix smart- HTTP + lore manifest-diff), replacing the pre-v0.2.0 grokmirror dependency. The zstd-compressed raw store is the source of truth; all four tiers rebuild from it.

North star: a trained kernel retriever

The Parquet metadata tier captures the training signal for free — subject/body pairs, series version chains, Fixes: → target SHA, reply graphs via in_reply_to / references, trailer co-occurrence. A future phase trains a <200 MB int8-quantized CPU-inferable retriever on that self-supervised signal. Recipe: docs/research/training-retriever.md.

Documentation

CLAUDE.md — authoritative project state + non-negotiable product constraints
CHANGELOG.md — release history
CONTRIBUTING.md — dev loop, PR discipline
SECURITY.md — disclosure posture
docs/ops/runbook.md — local dev (§0A)
- hosted deploy (§1+)
docs/ops/update-frequency.md — 5-min cadence policy + fanout-to-one cost analysis
docs/ops/production-hardening.md — threat model, cost-class caps, capability flags, systemd layout
docs/ops/public-launch-checklist.md — pre-launch hosted-box gate: shard health, metrics, harness, log readability
docs/mcp/client-config.md — copy-paste snippets for Claude Code, Codex, Cursor, Zed
docs/mcp/transport-auth.md — transport + why no auth
docs/architecture/ — design rationale
TODO.md — current execution contract
docs/plans/2026-04-14-best-in-class-kernel-mcp.md — 6-month roadmap (north star)
docs/research/ — dated investigations that fed the plan

License

MIT. See LICENSE.

Data from lore.kernel.org is re-hosted under the same terms as lore itself (public archive). Attribution preserved in every response. Redaction policy: LEGAL.md.

This server cannot be installed

license - permissive license

quality - not tested

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

0dRelease cycle

10Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Related MCP Servers

EMBA-MCP
Security Penetration Testing Embedded system
0xbuz3R
F
license
-
quality
D
maintenance
An MCP server for EMBA firmware analysis that exposes structured security findings and tools to LLMs. It enables users to programmatically query, reason over, and correlate firmware analysis results such as kernel details, SBOMs, and attack paths.
Last updated 2026-06-08
6
email-insights
Databases Search Research & Data
Shubby98
F
license
-
quality
D
maintenance
An MCP server that provides structured analytics for email data by extracting signals like topic, tone, and urgency using a local LLM. It allows users to query email distributions, sender patterns, and specific signals through a SQLite-backed interface.
Last updated 2026-03-26
nexus-mcp-ci
Code Analysis Knowledge & Memory RAG Systems
jaggernaut007
A
license
A
quality
B
maintenance
Unified MCP server combining hybrid search (vector + BM25 + code graph), structural code analysis, and persistent semantic memory. 15 tools, 25+ languages, <350MB RAM, fully local.
Last updated 2026-07-23
10
MIT
mcp-context
RAG Systems Search
ericlimabr
F
license
-
quality
D
maintenance
Local MCP server that provides semantic search (RAG) over code repositories, enabling AI clients like Claude and Gemini to access project context without manual re-upload.
Last updated 2026-05-09

View all related MCP servers

Related MCP Connectors

XMemo
User-owned memory for AI agents, Copilot, Claude, IDEs, CLIs, and chat apps over remote MCP.
nyc-property-intel
MCP server giving Claude AI access to 22+ NYC public-record databases for real estate due diligence
gread
An MCP server that gives your AI access to the source code and docs of all public github repos

View all MCP Connectors

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/mjbommar/kernel-lore-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

kernel-lore-mcp

Quick start

Optional capabilities — opt in when you need them

Install from source

Going bigger

Status — v0.3.5 (2026-04-23)

Why

Architecture in one paragraph

North star: a trained kernel retriever

Documentation

License

Maintenance

Resources

Looking for Admin?

Related MCP Servers

EMBA-MCP

email-insights

nexus-mcp-ci

mcp-context

Related MCP Connectors

Latest Blog Posts

MCP directory API