Which integrations are available for this server?

Converts text summaries to speech using OpenAI's TTS API, with multiple voice options and custom speaking instructions.

How do I use summarize-mcp?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@summarize-mcp Summarize this article and play it as audio" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

summarize-mcp

by FiveOhhWon

Overview Schema Related Servers Score Discussions

Python

Local

summarize-mcp

🤖 Co-authored with Claude Code - Making AI summaries audible since 2025! 🔊

A Model Context Protocol (MCP) server that converts text summaries to speech using OpenAI's TTS API and plays them in the background across all major platforms (macOS, Windows, Linux).

🌟 Overview

summarize-mcp enables LLMs to convert any text summary into natural-sounding speech using OpenAI's state-of-the-art text-to-speech models. Perfect for creating audio summaries of documents, articles, or any content that benefits from an auditory presentation.

Related MCP server: voice-status-report-mcp-server

🚀 Key Features

🎯 Simple & Focused: One tool that does one thing exceptionally well
🎤 Multiple Voices: Choose from 10 distinct OpenAI voices (alloy, ash, ballad, coral, echo, fable, nova, onyx, sage, shimmer)
🎨 Custom Instructions: Control how the text should be spoken
🔧 Background Playback: Audio plays in the background without blocking
🌍 Cross-Platform: Works on macOS, Windows, and Linux
💾 Persistent Preferences: Save your favorite voice and tone settings
🎯 Multiple Tools: Set voice, set tone, and play summaries
🧹 Automatic Cleanup: Temporary files are cleaned up automatically
🛡️ Type-Safe: Full Python type hints with Pydantic validation
📊 Comprehensive Logging: Debug mode for troubleshooting
⚡ Performance Optimized: Efficient file handling and cleanup

📋 Prerequisites

Python 3.8 or higher
OpenAI API Key with access to TTS models
Audio Player (automatically detected):
- macOS: Built-in afplay (no installation needed)
- Windows: Built-in Windows Media Player (no installation needed)
- Linux: One of: mpg123, sox (play), ffmpeg (ffplay), vlc (cvlc), or alsa-utils (aplay)

📦 Installation

git clone https://github.com/FiveOhhWon/summarize-mcp.git
cd summarize-mcp
pip install -e .

🏃 Configuration

Claude Desktop

Add this configuration to your Claude Desktop config file:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json
Linux: ~/.config/Claude/claude_desktop_config.json

Configuration:

{
  "mcpServers": {
    "summarize": {
      "command": "python",
      "args": ["/absolute/path/to/summarize-mcp/src/summarize_mcp/server.py"],
      "env": {
        "OPENAI_API_KEY": "your-openai-api-key"
      }
    }
  }
}

Environment Variables

OPENAI_API_KEY (required): Your OpenAI API key
DEBUG (optional): Set to "true" for verbose logging

🛠️ Available Tools

play_summary

Converts text to speech and plays it in the background. Uses saved voice and tone preferences unless overridden.

Parameters:

summary (required): The text to convert to speech
voice (optional): Voice to use - alloy, ash, ballad, coral, echo, fable, nova, onyx, sage, or shimmer (uses saved preference if not specified)
instructions (optional): Instructions for how the text should be spoken (uses saved tone if not specified)

Example:

{
  "summary": "The quick brown fox jumps over the lazy dog. This pangram contains all letters of the alphabet.",
  "voice": "nova",
  "instructions": "Speak slowly and clearly, emphasizing each word."
}

set_voice

Set the default voice for all future text-to-speech conversions.

Parameters:

voice (required): The voice to use - alloy, ash, ballad, coral, echo, fable, nova, onyx, sage, or shimmer

Example:

{
  "voice": "nova"
}

set_tone

Set the default tone/instructions for how text should be spoken in all future TTS requests.

Parameters:

tone (required): The tone/instructions to use (e.g., "Speak slowly and calmly", "Be enthusiastic and energetic")

Example:

{
  "tone": "Speak in a warm, friendly manner with moderate pacing"
}

📖 Usage Examples

Basic Summary

"Please summarize this article and play it as audio"

The LLM will:

Generate a summary of the content
Use the play_summary tool to convert it to speech
The audio will play in the background with saved preferences

Set Default Voice

"Set the default voice to nova"

This will save "nova" as your preferred voice for all future summaries.

Set Default Tone

"Set the tone to be warm and conversational with a slower pace"

This will save your tone preference for all future summaries.

Custom Voice (One-time)

"Summarize this document and play it using the 'sage' voice"

This will use "sage" for this summary only, without changing your default.

With Custom Instructions (One-time)

"Create an audio summary of this text. Make it sound enthusiastic and energetic."

This will use custom instructions for this summary only.

🎯 Voice Options

Voice	Description
`alloy`	Neutral and balanced
`ash`	Warm and engaging
`ballad`	Expressive and dramatic
`coral`	Clear and professional (default)
`echo`	Smooth and reflective
`fable`	Expressive and animated
`nova`	Friendly and upbeat
`onyx`	Deep and authoritative
`sage`	Wise and measured
`shimmer`	Soft and gentle

🧪 Development

# Install dependencies
pip install -r requirements.txt

# Install in development mode
pip install -e .

# Run the server
python -m summarize_mcp

# Run tests
python test.py

# Run with debug logging
DEBUG=true python -m summarize_mcp

🏗️ Architecture

summarize-mcp/
├── src/
│   └── summarize_mcp/
│       ├── __init__.py      # Package initialization
│       ├── __main__.py      # Entry point for python -m
│       └── server.py        # Main MCP server implementation
├── pyproject.toml           # Python project metadata
├── requirements.txt         # Python dependencies
├── test.py                  # Test script
└── README.md               # This file

🔧 Technical Details

Audio Format: MP3 (OpenAI TTS output format)
Temporary Files: Stored in system temp directory
File Cleanup: Automatic cleanup after 10 seconds (configurable)
Old File Purge: Files older than 1 hour are cleaned on startup
Platform Support:
- macOS: Uses built-in afplay
- Windows: Uses PowerShell with Windows Media Player
- Linux: Auto-detects available player (mpg123, sox, ffmpeg, vlc, alsa)
- Fallback: Opens with system default audio application
State Management:
- Preferences saved to ~/.summarize-mcp-state.json
- Persists voice and tone settings between sessions
- Automatic loading on startup
Error Handling: Comprehensive error handling with specific error types
Validation: Input validation using Pydantic models

🚨 Troubleshooting

"OPENAI_API_KEY environment variable is not set"

Set your OpenAI API key in the Claude Desktop configuration.

"No audio player available"

Linux users: Install one of the supported audio players:

# Ubuntu/Debian
sudo apt-get install mpg123
# or
sudo apt-get install sox
# or
sudo apt-get install ffmpeg
# or
sudo apt-get install vlc

# Fedora/RHEL
sudo dnf install mpg123
# or similar for other players

# Arch
sudo pacman -S mpg123
# or similar for other players

Windows/macOS: Audio playback should work out of the box.

Audio doesn't play

Check system volume
Ensure no other audio issues on your system
Enable debug logging with DEBUG=true
Check the logs for any errors

📝 Changelog

v2.0.0 (Python Rewrite)

🐍 Complete rewrite in Python for better cross-platform support
🔧 Improved async handling with Python's asyncio
📦 Simplified installation with pip
🛡️ Enhanced type safety with Pydantic
🚀 Better performance and reliability

v1.2.0 (Persistent Preferences)

💾 Added persistent state management for voice and tone preferences
🎯 Added set_voice tool to set default voice
🎯 Added set_tone tool to set default speaking instructions
🎆 Added support for new OpenAI voices: ash, ballad, and sage
🔄 play_summary now uses saved preferences unless overridden
📝 State saved to ~/.summarize-mcp-state.json

v1.1.0 (Cross-Platform Support)

🌍 Added Windows support using PowerShell/Windows Media Player
🐧 Added Linux support with auto-detection of audio players
🔄 Added fallback to system default audio player
📝 Updated documentation for multi-platform usage

v1.0.0 (Initial Release)

🎉 Initial release
✨ Core TTS functionality with OpenAI integration
✨ Support for 7 different voices
✨ Custom speaking instructions
✨ Background audio playback on macOS
✨ Automatic file cleanup
✨ TypeScript implementation
✨ Comprehensive error handling

💰 Estimated Costs

This tool uses OpenAI's gpt-4o-mini-tts model for text-to-speech conversion. Here's the pricing breakdown:

Model	Audio Output Price	Estimated Cost
`gpt-4o-mini-tts`	$12.00 per 1M tokens	$0.015 per minute of audio

Cost Examples:

100-word summary (~30 seconds): ~$0.0075
500-word summary (~2.5 minutes): ~$0.0375
1000-word summary (~5 minutes): ~$0.075

The actual cost depends on:

Length of your summaries
Speaking speed (instructions can affect this)
How frequently you use the tool

For current pricing details, see OpenAI's pricing page.

🔮 Roadmap

Cross-platform audio playback (Windows, Linux)
Python implementation for better cross-platform support
Additional TTS providers (ElevenLabs, Amazon Polly)
Audio format options (WAV, OGG)
Playback control (pause, resume, stop)
Queue management for multiple summaries
Audio file caching
Speed and pitch controls
SSML support for advanced speech control

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.

Fork the repository
Create your feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Built on the Model Context Protocol specification by Anthropic
Powered by OpenAI's TTS API
Special thanks to the MCP community for inspiration and support

This server cannot be installed

license - permissive license

quality - not tested

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Related MCP Servers

mcp-tts-server
Text-to-Speech Audio Processing AI & Machine Learning
kaichen
A
license
-
quality
C
maintenance
Enables text-to-speech generation using the Groq API, supporting multiple audio formats and optional local playback.
Last updated 2025-03-30
51
1
MIT
voice-status-report-mcp-server
Text-to-Speech Speech Processing
tomekkorbak
A
license
A
quality
F
maintenance
Enables LLMs to provide voice status updates via OpenAI TTS, ideal for background task progress reports.
Last updated 2025-06-05
1
8
MIT
Chatty MCP
Text-to-Speech Speech Processing
stphtt
A
license
-
quality
D
maintenance
Provides voice summaries after each AI request in Cursor, Cline, or any MCP-supported editor, allowing users to hear what was done and stay informed without staring at the screen.
Last updated 2025-04-19
9
MIT
oto
Text-to-Speech Audio Processing
jonymoney
A
license
-
quality
B
maintenance
Enables text-to-speech conversion using OpenAI's TTS API, with inline audio playback and history within MCP hosts like Claude.
Last updated 2026-06-18
18
BSD 2-Clause "Simplified"

View all related MCP servers

Related MCP Connectors

Audio Delivery Network
AI-manageable audio CDN: upload, transcode, normalize, stream & deliver audio, plus grounded docs.
MCP Automations
Summarize URLs, repurpose content, daily news digests, find competitors. Cost telemetry built in.
tts
Hosted pay-per-use TTS: 54 neural voices, 9 languages incl. Brazilian Portuguese. $10 free credits.

View all MCP Connectors

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/FiveOhhWon/summarize-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

summarize-mcp

🌟 Overview

🚀 Key Features

📋 Prerequisites

📦 Installation

🏃 Configuration

Claude Desktop

Configuration:

Environment Variables

🛠️ Available Tools

play_summary

set_voice

set_tone

📖 Usage Examples

Basic Summary

Set Default Voice

Set Default Tone

Custom Voice (One-time)

With Custom Instructions (One-time)

🎯 Voice Options

🧪 Development

🏗️ Architecture

🔧 Technical Details

🚨 Troubleshooting

"OPENAI_API_KEY environment variable is not set"

"No audio player available"

Audio doesn't play

📝 Changelog

v2.0.0 (Python Rewrite)

v1.2.0 (Persistent Preferences)

v1.1.0 (Cross-Platform Support)

v1.0.0 (Initial Release)

💰 Estimated Costs

Cost Examples:

🔮 Roadmap

🤝 Contributing

📄 License

🙏 Acknowledgments

Maintenance

Resources

Looking for Admin?

Related MCP Servers

mcp-tts-server

voice-status-report-mcp-server

Chatty MCP

oto

Related MCP Connectors

Latest Blog Posts

MCP directory API