Gladia MCP

Features

Audio transcription with speaker diarization
Real-time speech-to-text
Audio intelligence capabilities:
- Translation
- Summarization
- Named Entity Recognition
- Sentiment Analysis
- Content Moderation
- Chapterization
- Audio to LLM integration
Async API with FastAPI
Easy-to-use CLI interface
Configurable logging
CORS support
Health check endpoint

Quickstart with Claude Desktop

Get your API key from Gladia. There is a free tier available.
Install uv (Python package manager), install with curl -LsSf https://astral.sh/uv/install.sh | sh or see the uv repo for additional install methods.
Go to Claude > Settings > Developer > Edit Config > claude_desktop_config.json to include the following:

{
  "mcpServers": {
    "Gladia": {
      "command": "uvx",
      "args": ["gladia-mcp"],
      "env": {
        "GLADIA_API_KEY": "<insert-your-api-key-here>"
      }
    }
  }
}

If you're using Windows, you will have to enable "Developer Mode" in Claude Desktop to use the MCP server. Click "Help" in the hamburger menu at the top left and select "Enable Developer Mode".

Other MCP clients

For other clients like Cursor and Windsurf, run:

pip install gladia-mcp
python -m gladia_mcp --api-key={{PUT_YOUR_API_KEY_HERE}} --print to get the configuration. Paste it into appropriate configuration directory specified by your MCP client.

Example usage

Try asking Claude:

"Transcribe this audio file and identify different speakers"
"Convert this recording to text and translate it to Spanish"
"Analyze the sentiment and emotions in this speech"
"Extract key topics and create chapters from this long audio file"
"Transcribe this conversation and summarize the main points"

Optional features

You can add the GLADIA_MCP_BASE_PATH environment variable to the claude_desktop_config.json to specify the base path MCP server should look for and output files specified with relative paths.

Contributing

If you want to contribute or run from source:

Clone the repository:

git clone https://github.com/gladia/gladia-mcp
cd gladia-mcp

Create a virtual environment and install dependencies using uv:

uv venv
source .venv/bin/activate
uv pip install -e ".[dev]"

Copy .env.example to .env and add your Gladia API key:

cp .env.example .env
# Edit .env and add your API key

Run the tests to make sure everything is working:

./scripts/test.sh
# Or with options
./scripts/test.sh --verbose --fail-fast

Install the server in Claude Desktop: mcp install gladia_mcp/server.py
Debug and test locally with MCP Inspector: mcp dev gladia_mcp/server.py

API Endpoints

Health Check

GET /health

Transcribe Audio

POST /transcribe

Parameters:

file: Audio file (multipart/form-data)
diarization: Enable speaker diarization (boolean, optional)
language: Language code (string, optional)

Example using curl:

curl -X POST "http://localhost:8000/transcribe" \
  -H "accept: application/json" \
  -H "Content-Type: multipart/form-data" \
  -F "file=@audio.wav" \
  -F "diarization=true"

Troubleshooting

Logs when running with Claude Desktop can be found at:

Windows: %APPDATA%\Claude\logs\mcp-server-gladia.log
macOS: ~/Library/Logs/Claude/mcp-server-gladia.log

MCP Gladia: spawn uvx ENOENT

If you encounter the error "MCP Gladia: spawn uvx ENOENT", confirm its absolute path by running this command in your terminal:

which uvx

Once you obtain the absolute path (e.g., /usr/local/bin/uvx), update your configuration to use that path (e.g., "command": "/usr/local/bin/uvx"). This ensures that the correct executable is referenced.

Development

Running Tests

pytest

Code Style

The project follows PEP 8 style guide. Use flake8 for linting:

flake8 gladia_mcp

License

MIT License

This server cannot be installed

security - not tested

license - permissive license

quality - not tested

How are these scores calculated?

remote-capable server

The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.

Official Model Context Protocol server that enables interaction with powerful Speech-to-Text and Audio Intelligence APIs, allowing clients like Claude Desktop to transcribe audio, analyze speech, translate content, and more.

Related MCP Servers

API Tester MCP Server
Vikrant-Khedkar
-
security
F
license
-
quality
A Model Context Protocol server that allows Claude to make API requests on your behalf, providing tools for testing various APIs including HTTP requests and OpenAI integrations without sharing your API keys in the chat.
Last updated -
Python
TTS-MCP
nakamurau1
-
security
A
license
-
quality
A Model Context Protocol server that integrates high-quality text-to-speech capabilities with Claude Desktop and other MCP-compatible clients, supporting multiple voice options and audio formats.
Last updated -
14
1
TypeScript
MIT License
Mobvoi TTS MCP Server
mobvoi
A
security
A
license
A
quality
Model Context Protocol server that enables interaction with Mobvoi's Text to Speech and Voice Clone APIs, allowing MCP clients like Cursor, Claude Desktop, and Cline to generate speech and clone voices.
Last updated -
4
1
Python
MIT License
AllVoiceLab-MCP
Ruxo0
-
security
A
license
-
quality
A Model Context Protocol server that enables developers to integrate advanced text-to-speech and video translation capabilities into their applications through simple API calls.
Last updated -
Python
MIT License

View all related MCP servers

Appeared in Searches

Search for official information or resources

Gladia MCP

Gladia MCP

Features

Quickstart with Claude Desktop

Other MCP clients

Example usage

Optional features

Contributing

API Endpoints

Health Check

Transcribe Audio

Troubleshooting

MCP Gladia: spawn uvx ENOENT

Development

Running Tests

Code Style

License

Related MCP Servers

API Tester MCP Server

TTS-MCP

Mobvoi TTS MCP Server

AllVoiceLab-MCP

Appeared in Searches

New MCP Servers

MCP directory API