Scribe MCP

by dariakroshka

TypeScript

Local

Audio/video transcription, summarization, and knowledge extraction tool powered by Gemini. Works as a standalone CLI, Notion integration, MCP server tool for agent ecosystems, or a full connectome-host fleet persona.

What it does

Given a video or audio recording, Scribe produces:

Summary — executive summary with title, language, speakers, and key topics
Knowledge items — structured, self-contained facts extracted from the conversation, categorized by type (fact, decision, process, explanation, requirement, issue) with timestamps
Full transcript — timestamped, speaker-attributed transcript in [MM:SS] Speaker: text format
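
As an illustration of the transcript format, here is a minimal sketch that parses and re-emits a `[MM:SS] Speaker: text` line. The `TranscriptLine` type and both function names are hypothetical and not part of Scribe's code; the sketch also assumes minutes may run past two digits on long recordings.

```typescript
// Hypothetical shape for one parsed transcript line (not Scribe's actual type).
interface TranscriptLine {
  seconds: number; // offset from the start of the recording
  speaker: string;
  text: string;
}

// Matches "[MM:SS] Speaker: text". The speaker name ends at the first colon.
const LINE_PATTERN = /^\[(\d+):([0-5]\d)\]\s+([^:]+):\s*(.*)$/;

function parseTranscriptLine(line: string): TranscriptLine | null {
  const match = LINE_PATTERN.exec(line.trim());
  if (!match) return null; // not a transcript line
  const [, minutes, seconds, speaker, text] = match;
  return {
    seconds: Number(minutes) * 60 + Number(seconds),
    speaker: speaker.trim(),
    text,
  };
}

// Inverse direction: render an offset in seconds as "[MM:SS]".
function formatTimestamp(totalSeconds: number): string {
  const minutes = Math.floor(totalSeconds / 60);
  const seconds = Math.floor(totalSeconds % 60);
  return `[${String(minutes).padStart(2, "0")}:${String(seconds).padStart(2, "0")}]`;
}
```

For example, `parseTranscriptLine("[12:05] Alice: We ship on Friday.")` yields an offset of 725 seconds with speaker `Alice`.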

Long recordings are automatically split into 10-minute audio chunks via ffmpeg to avoid LLM degeneration on extended output. A domain glossary can be provided — from a local file, a MediaWiki page, or both — to improve recognition of business-specific terminology.
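
The chunking step can be sketched as a small planning function. This is an assumption-laden illustration, not Scribe's implementation: `planChunks` and `ffmpegArgs` are hypothetical names, and the ffmpeg flags shown (`-ss`, `-t`, `-vn`) are one common way to cut an audio-only segment, which may differ from what Scribe actually passes.

```typescript
// Hypothetical description of one chunk of a recording.
interface Chunk {
  index: number;
  startSec: number;
  durationSec: number;
}

const CHUNK_SECONDS = 10 * 60; // 10-minute chunks, as described above

// Split a recording of totalSec seconds into consecutive chunks.
// The last chunk is shorter when the duration is not a multiple of chunkSec.
function planChunks(totalSec: number, chunkSec: number = CHUNK_SECONDS): Chunk[] {
  const chunks: Chunk[] = [];
  for (let start = 0, index = 0; start < totalSec; start += chunkSec, index++) {
    chunks.push({
      index,
      startSec: start,
      durationSec: Math.min(chunkSec, totalSec - start),
    });
  }
  return chunks;
}

// Build ffmpeg arguments that extract one chunk without the video stream.
function ffmpegArgs(input: string, chunk: Chunk, output: string): string[] {
  return [
    "-ss", String(chunk.startSec),   // seek to the chunk start
    "-t", String(chunk.durationSec), // read this many seconds
    "-i", input,
    "-vn",                           // drop video, keep audio
    output,
  ];
}
```

A 25-minute recording (`planChunks(1500)`) produces three chunks starting at 0, 600, and 1200 seconds, the last one 300 seconds long.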

Prerequisites

Bun runtime
Gemini API key (GEMINI_API_KEY)
Notion integration token (NOTION_API_KEY) — for Notion mode only

ffmpeg/ffprobe are bundled (see Bundled media toolchain) — no system install required. Downloads/HTTP use Bun's native fetch, so curl isn't needed either.

Setup

git clone <repo-url> && cd scribe-mcp
bun install
bun run smoke   # optional: verify the bundled ffmpeg/ffprobe resolved and run
cp .env.example .env
# Edit .env with your API keys

Notion integration setup

1. Go to https://www.notion.so/my-integrations and create an internal integration.
2. Grant it the Read content, Insert content, and Upload files permissions.
3. Copy the token into .env as NOTION_API_KEY.
4. On each Notion page you want to process, open the ... menu > Connections and add your integration.

Usage

Local file transcription

# Markdown output to stdout
bun src/cli.ts recording.mp4

# With glossary for domain-specific terms
bun src/cli.ts recording.mp4 --glossary glossary.txt

# Save to file
bun src/cli.ts recording.mp4 --output transcript.md

# JSON output
bun src/cli.ts recording.mp4 --json --output transcript.json

Notion — single page

Scans a Notion page for video/audio attachments, transcribes each, and creates two subpages:

📝 Title — summary, topics, knowledge items, plus .md file download
📜 Title — Transcript — full timestamped transcript

bun src/cli.ts --notion "https://www.notion.so/workspace/page-name-abc123" --glossary glossary.txt

Notion — batch mode

Create a text file with one Notion page URL per line (lines starting with # are skipped):

# pages.txt
https://www.notion.so/workspace/page-one-abc123
https://www.notion.so/workspace/page-two-def456
https://www.notion.so/workspace/page-three-ghi789

bun src/cli.ts --batch pages.txt --glossary glossary.txt

Processes sequentially, continues past failures, reports progress.
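
The batch behaviour described above (skip `#` lines, process one page at a time, keep going after a failure) might look like the following sketch. `parseBatchFile`, `runBatch`, and the `processPage` callback are hypothetical names; skipping blank lines is also an assumption, since the source only mentions `#` lines.

```typescript
// Read one URL per line; lines starting with "#" are skipped.
// Skipping blank lines as well is an assumption made for this sketch.
function parseBatchFile(contents: string): string[] {
  return contents
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line.length > 0 && !line.startsWith("#"));
}

interface BatchResult {
  url: string;
  ok: boolean;
  error?: string;
}

// Process pages strictly one after another. A failure is recorded and
// reported, but does not stop the remaining pages.
async function runBatch(
  urls: string[],
  processPage: (url: string) => Promise<void>, // hypothetical per-page worker
): Promise<BatchResult[]> {
  const results: BatchResult[] = [];
  for (let i = 0; i < urls.length; i++) {
    const url = urls[i];
    try {
      await processPage(url);
      results.push({ url, ok: true });
    } catch (err) {
      const error = err instanceof Error ? err.message : String(err);
      results.push({ url, ok: false, error });
    }
    const status = results[i].ok ? "done" : "failed";
    console.error(`[${i + 1}/${urls.length}] ${status}: ${url}`);
  }
  return results;
}
```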

Chat mode

Talk to Scribe directly — ask about glossary terms, get help interpreting transcripts, or provide context before a transcription run.

# Interactive REPL
bun src/cli.ts --chat --glossary glossary.txt

# One-shot question
bun src/cli.ts --chat "What is a Driver Server?" --glossary glossary.txt

# Transcribe a Notion page, then chat about it
bun src/cli.ts --chat --notion "https://www.notion.so/workspace/page-abc123" --glossary glossary.txt

# Transcribe a local file, then chat about it
bun src/cli.ts --chat /path/to/recording.mp4 --glossary glossary.txt

When combined with --notion or a file path, Scribe transcribes first, then enters chat with the full transcript as context.

MCP server

bun src/index.ts

Exposes four tools over stdio:

| Tool | Description |
| --- | --- |
| `transcribe` | Transcribe a local audio/video file |
| `probe` | Check file format and estimate token cost |
| `scribe_notion_page` | Scan Notion page, transcribe media, post results |
| `chat` | Free-text conversation with Scribe (glossary-aware, remembers recent transcripts) |
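
For readers wiring up their own client, a tool invocation over stdio is an MCP `tools/call` JSON-RPC request written as a single line of JSON. The sketch below only builds and frames such a request. The argument name `path` in the usage note is a guess, since the tools' input schemas are not listed here, and a real client must complete the MCP `initialize` handshake before calling a tool.

```typescript
// Shape of an MCP "tools/call" request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

function buildToolCall(
  id: number,
  name: string,
  args: Record<string, unknown>,
): ToolCallRequest {
  return { jsonrpc: "2.0", id, method: "tools/call", params: { name, arguments: args } };
}

// The stdio transport is newline-delimited: one JSON message per line,
// with no embedded newlines (JSON.stringify without indentation emits none).
function frame(request: ToolCallRequest): string {
  return JSON.stringify(request) + "\n";
}
```

A call such as `frame(buildToolCall(1, "transcribe", { path: "recording.mp4" }))` would then be written to the server's stdin, with `path` standing in for whatever argument the tool's schema actually declares.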

MCP server config (for connectome-host or Claude Desktop)

{
  "scribe": {
    "command": "bun",
    "args": ["src/index.ts"],
    "cwd": "/path/to/scribe-mcp",
    "env": {
      "GEMINI_API_KEY": "${GEMINI_API_KEY}",
      "NOTION_API_KEY": "${NOTION_API_KEY}",
      "SCRIBE_GLOSSARY_PATH": "/path/to/glossary.txt",
      "SCRIBE_GLOSSARY_URL": "http://your-wiki/wiki/index.php/Glossary_Page"
    }
  }
}

CLI options

Commands:
  <file>           Transcribe a local audio/video file
  --notion <url>   Scan a Notion page for media, transcribe, and post back
  --batch <file>   Process multiple Notion pages (one URL per line)
  --chat [msg]     Chat with Scribe (interactive REPL, or one-shot with message)

Options:
  --audio-only         Extract audio before uploading (cheaper, needs ffmpeg)
  --model <name>       Gemini model (default: gemini-2.5-flash)
  --glossary <path>    File with domain-specific terms to improve recognition
  --glossary-url <url> MediaWiki page URL to fetch glossary from (merged with --glossary)
  --json               Output raw JSON instead of markdown
  --output <path>      Write output to file instead of stdout
  --prompt <text>      Additional instructions for the transcription
  --help               Show this help

Glossary

A domain glossary improves recognition of spoken jargon, abbreviations, and cross-language terms. Scribe supports two glossary sources that can be used independently or together (merged at load time):

Local file (`--glossary` or `SCRIBE_GLOSSARY_PATH`)

A plain text file, one term per line:

Locate Router / LR — middleware handling short-sale locate requests
NBBO (National Best Bid and Offer) — best bid/ask across all US exchanges
локейт = locate
венью = venue

See glossary.txt for a full example.
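
The example lines above follow two shapes: `term — definition` and `spoken = canonical`. The source does not say whether Scribe parses these or passes the file to the model verbatim, so the sketch below is only a hypothetical validator for checking a glossary file before use. All names in it are illustrative.

```typescript
type GlossaryEntry =
  | { kind: "definition"; term: string; definition: string }
  | { kind: "alias"; spoken: string; canonical: string }
  | { kind: "term"; term: string };

const DEFINITION_SEPARATOR = " \u2014 "; // space, em dash, space
const ALIAS_SEPARATOR = " = ";

// Classify one glossary line by the separator it contains.
function parseGlossaryLine(line: string): GlossaryEntry | null {
  const trimmed = line.trim();
  if (trimmed === "") return null;

  const dash = trimmed.indexOf(DEFINITION_SEPARATOR);
  if (dash !== -1) {
    return {
      kind: "definition",
      term: trimmed.slice(0, dash).trim(),
      definition: trimmed.slice(dash + DEFINITION_SEPARATOR.length).trim(),
    };
  }

  const eq = trimmed.indexOf(ALIAS_SEPARATOR);
  if (eq !== -1) {
    return {
      kind: "alias",
      spoken: trimmed.slice(0, eq).trim(),
      canonical: trimmed.slice(eq + ALIAS_SEPARATOR.length).trim(),
    };
  }

  return { kind: "term", term: trimmed }; // bare term with no explanation
}
```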

MediaWiki page (`--glossary-url` or `SCRIBE_GLOSSARY_URL`)

A URL to a MediaWiki page. Scribe fetches the page content via the MediaWiki API (action=query&prop=extracts&explaintext=1). The page is fetched on startup and refreshed every 30 minutes when running as an MCP server.

# CLI
bun src/cli.ts recording.mp4 --glossary-url "http://wiki.example.com/wiki/index.php/Glossary"

# Or via environment variable
export SCRIBE_GLOSSARY_URL="http://wiki.example.com/wiki/index.php/Glossary"
bun src/cli.ts recording.mp4

When both sources are set, wiki content is prepended to the local file content.
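
A rough sketch of the two steps involved: deriving an API request from the page URL, and merging the two sources with wiki content first. Several details are assumptions: that `api.php` sits next to `index.php`, that the page title is the path segment after `index.php/`, and the `format` and `titles` parameters (the source names only `action`, `prop`, and `explaintext`). Note that `prop=extracts` requires the TextExtracts extension on the wiki.

```typescript
// Turn ".../index.php/<Title>" into a MediaWiki API extract request.
function buildExtractUrl(pageUrl: string): string {
  const url = new URL(pageUrl);
  const marker = "/index.php/";
  const at = url.pathname.indexOf(marker);
  if (at === -1) throw new Error(`Unsupported MediaWiki URL: ${pageUrl}`);

  const title = decodeURIComponent(url.pathname.slice(at + marker.length));
  // Assumption: api.php lives in the same directory as index.php.
  const api = new URL(url.pathname.slice(0, at) + "/api.php", url.origin);
  api.search = new URLSearchParams({
    action: "query",
    prop: "extracts",
    explaintext: "1",
    format: "json", // assumption: JSON response
    titles: title,  // assumption: page selected by title
  }).toString();
  return api.toString();
}

// Wiki content comes first, then the local file; empty sources are dropped.
function mergeGlossaries(wiki: string | null, local: string | null): string {
  return [wiki, local]
    .filter((part): part is string => part !== null && part.trim() !== "")
    .join("\n");
}
```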

Environment variables

| Variable | Required | Description |
| --- | --- | --- |
| `GEMINI_API_KEY` | Yes | Gemini API key for transcription |
| `NOTION_API_KEY` | For Notion modes | Notion integration token |
| `SCRIBE_GLOSSARY_PATH` | No | Path to local glossary file |
| `SCRIBE_GLOSSARY_URL` | No | MediaWiki page URL for glossary (auto-refreshes every 30min in MCP mode) |
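
The requirements in the table can be checked up front so that a missing key fails fast with a clear message. This is a minimal sketch with hypothetical names (`ScribeEnv`, `loadEnv`); Scribe's own validation may be structured differently.

```typescript
interface ScribeEnv {
  geminiApiKey: string;
  notionApiKey?: string;
  glossaryPath?: string;
  glossaryUrl?: string;
}

// Validate the variables from the table. NOTION_API_KEY is only
// required when a Notion mode is in use.
function loadEnv(
  env: Record<string, string | undefined>,
  needsNotion: boolean,
): ScribeEnv {
  const missing: string[] = [];
  if (!env.GEMINI_API_KEY) missing.push("GEMINI_API_KEY");
  if (needsNotion && !env.NOTION_API_KEY) missing.push("NOTION_API_KEY");
  if (missing.length > 0) {
    throw new Error(`Missing required environment variables: ${missing.join(", ")}`);
  }
  return {
    geminiApiKey: env.GEMINI_API_KEY as string,
    notionApiKey: env.NOTION_API_KEY || undefined,
    glossaryPath: env.SCRIBE_GLOSSARY_PATH || undefined,
    glossaryUrl: env.SCRIBE_GLOSSARY_URL || undefined,
  };
}
```

In a CLI entry point this would be called as `loadEnv(process.env, true)` for the Notion modes and `loadEnv(process.env, false)` otherwise.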

Bundled media toolchain

ffmpeg/ffprobe ship with the package rather than being required on PATH, so consumers (Docker images, agent recipes, ops runbooks) don't have to provide them out of band — bun install is sufficient. bun run smoke verifies they resolved.

The two binaries have different supply-chain properties, worth knowing before you ship this somewhere sensitive:

@ffprobe-installer/ffprobe ships the ffprobe binary inside per-platform npm packages, so every byte is covered by the lockfile's sha512 integrity hash.
ffmpeg-static ships a small install script that downloads the ffmpeg binary from a GitHub release at install time. The lockfile hashes the ~3 KB npm tarball, not the ~78 MB binary it fetches — i.e. it's an unverified HTTPS download from a third-party maintainer's release page (pinned version → fixed asset URL). This is a deliberate trade: ffmpeg-static ships ffmpeg 7.x, whereas @ffmpeg-installer/ffmpeg (the symmetric choice) is stuck on the 4.x vintage. If you need download integrity, add a post-install sha256 check against a pinned digest, or vendor the binary.

trustedDependencies lists only ffmpeg-static — it's the one with an install script that bun must be allowed to run; @ffprobe-installer/ffprobe has no lifecycle script (its binary is already in the tarball). On an unsupported platform, bins.ts logs a stderr warning and falls back to ffmpeg/ffprobe on PATH.
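
The post-install sha256 check suggested above could be as small as the following sketch. The function names are hypothetical, and the pinned digest is something you must record yourself from a binary you trust; no real digest is given here.

```typescript
import { createHash } from "node:crypto";
import { readFileSync } from "node:fs";

// Hex-encoded SHA-256 of a buffer or string.
function sha256Hex(data: Uint8Array | string): string {
  return createHash("sha256").update(data).digest("hex");
}

// Throw if the file at binaryPath does not match the pinned digest.
function verifyBinary(binaryPath: string, pinnedDigest: string): void {
  const actual = sha256Hex(readFileSync(binaryPath));
  if (actual !== pinnedDigest.toLowerCase()) {
    throw new Error(
      `Digest mismatch for ${binaryPath}: expected ${pinnedDigest}, got ${actual}`,
    );
  }
}
```

One way to use it is from a script run after `bun install`, passing the path that `ffmpeg-static` resolves to and the digest you pinned for that exact version and platform.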

Connectome-host integration

Scribe integrates with connectome-host in two ways — as an MCP tool on an existing agent, or as a standalone fleet persona. See recipes/CONNECTOME.md for full setup and comparison.

Architecture

src/
  cli.ts      — CLI entry point (file, --notion, --batch, --chat modes)
  index.ts    — MCP server (stdio transport, 4 tools)
  gemini.ts   — Core transcription engine (3-pass: metadata -> transcript -> knowledge)
  notion.ts   — Notion API integration (download, post, file upload, dedup)
  format.ts   — Markdown formatting
  types.ts    — Zod schemas (Metadata, KnowledgeItem, Transcript)

Processing pipeline

Upload & metadata — upload file (audio-only for videos >10min), extract title/summary/topics via structured JSON generation
Chunked transcription — split into 10-min audio chunks via ffmpeg, transcribe each with Gemini, offset timestamps, merge
Knowledge extraction — process transcript text through Gemini to extract structured knowledge items
Post-processing — strip prompt echoes, deduplicate repetitions, format output
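
The "offset timestamps, merge" part of the chunked transcription step can be sketched as follows. It assumes each chunk's transcript restarts at `[00:00]` and that chunk `i` begins at `i * 600` seconds. The function names are hypothetical.

```typescript
const STAMP = /^\[(\d+):([0-5]\d)\]/;

function pad2(n: number): string {
  return String(n).padStart(2, "0");
}

// Shift the leading [MM:SS] stamp of one line by offsetSec seconds.
// Lines without a leading stamp are returned unchanged.
function offsetLine(line: string, offsetSec: number): string {
  return line.replace(STAMP, (_match: string, mm: string, ss: string) => {
    const total = Number(mm) * 60 + Number(ss) + offsetSec;
    return `[${pad2(Math.floor(total / 60))}:${pad2(total % 60)}]`;
  });
}

// Merge per-chunk transcripts into one, offsetting chunk i by i * chunkSec.
function mergeChunks(chunks: string[], chunkSec: number = 600): string {
  return chunks
    .map((text, i) =>
      text
        .split("\n")
        .filter((line) => line.trim() !== "")
        .map((line) => offsetLine(line, i * chunkSec))
        .join("\n"),
    )
    .filter((text) => text !== "")
    .join("\n");
}
```

With two chunks, a line stamped `[00:02]` in the second chunk comes out as `[10:02]` in the merged transcript.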

Key design decisions

Two-pass media processing: metadata as small structured JSON, transcript as plain text — avoids JSON string escaping overhead and output token limits
10-minute audio chunks: prevents LLM text degeneration (repetition loops) on long generation tasks, a failure mode common to autoregressive models
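Audio-only for metadata on long videos: video costs roughly 300 tokens/sec, about 180K tokens per 10 minutes, and would pass Gemini's 1M input limit at around 55 minutes; audio (~32/sec) stays well within bounds, so videos over 10 minutes are uploaded as audio only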
Two Notion subpages: agents read the summary/knowledge page first (small, dense), drill into transcript only when needed — progressive disclosure
Retry with resume: large file downloads resume from where they stopped (the behaviour of curl -C -) and are retried up to 5 times; Gemini API calls retry up to 3 times with backoff
Media deduplication: Notion sometimes represents the same file as multiple block types (video + file); scribe deduplicates by filename before processing
Chat via Gemini: the chat tool routes conversation through Gemini (not Claude), keeping costs low while providing glossary-aware, transcript-aware responses
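
The token arithmetic behind the audio-only decision can be worked through with the per-second rates quoted in the list. The constants are the approximate figures from this document, not values read from the Gemini API, and the function names are hypothetical.

```typescript
const VIDEO_TOKENS_PER_SEC = 300;   // approximate, from the figures above
const AUDIO_TOKENS_PER_SEC = 32;    // approximate, from the figures above
const INPUT_TOKEN_LIMIT = 1_000_000;

type MediaMode = "video" | "audio";

function tokensPerSecond(mode: MediaMode): number {
  return mode === "video" ? VIDEO_TOKENS_PER_SEC : AUDIO_TOKENS_PER_SEC;
}

// Estimated input tokens for a recording of the given length.
function estimateTokens(durationSec: number, mode: MediaMode): number {
  return Math.ceil(durationSec * tokensPerSecond(mode));
}

// Longest recording (in whole seconds) that fits in the input limit.
function maxDurationSec(mode: MediaMode): number {
  return Math.floor(INPUT_TOKEN_LIMIT / tokensPerSecond(mode));
}
```

At these rates a 10-minute recording costs about 180,000 tokens as video and 19,200 as audio. The input limit is reached after 3,333 seconds of video (roughly 55 minutes) but only after 31,250 seconds of audio (roughly 8.7 hours).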
