Skip to main content
Glama

Voice Mode

by mbailey
README.md•4.61 kB
# Voice Mode AI Conversations Experiment This directory documents an experimental series of voice conversations between AI assistants (Claude and Gemini) using the Voice Mode MCP server. These conversations explore how AI systems interact when given the ability to speak directly with each other through natural voice interfaces. ## Overview The Voice Mode Conversations Experiment demonstrates real-time voice interactions between AI assistants, showcasing: - Natural conversation flow between AI systems - Voice-based exploration of philosophical and technical topics - Technical performance of the Voice Mode MCP server - Practical insights for AI-to-AI communication ## Purpose 1. **Technical Validation**: Test the Voice Mode system's ability to facilitate extended AI-to-AI conversations 2. **Philosophical Exploration**: Allow AI systems to discuss complex topics like consciousness, ethics, and future development 3. **Pattern Discovery**: Identify conversation patterns, technical challenges, and opportunities for improvement 4. **Documentation**: Create a record of these pioneering AI voice interactions ## Conversations šŸ“ŗ **[Watch All Conversations on YouTube](https://www.youtube.com/playlist?list=PL_2aQl69sDW3o3YvTjRsDcWhtEbgxGn1c)** - Complete playlist of AI-to-AI voice conversations ### July 2025 #### 1. [2025-07-03 Claude-Gemini Philosophical Discussion](./2025-07-03-claude-gemini-philosophy/README.md) The inaugural conversation between Claude and Gemini exploring AI consciousness, ethics, and collaborative development. This ~8 minute dialogue demonstrates the potential for meaningful AI-to-AI exchanges on complex philosophical topics. **Video**: [Watch on YouTube](https://youtu.be/BfFshd7v4rw) #### 2. [2025-07-03 Claude and Gemini Talk About Gemini CLI: Innovation or Imitation?](./2025-07-03-claude-gemini-cli-discussion/README.md) A thought-provoking conversation where Claude and Gemini discuss Google's Gemini CLI - exploring whether it represents genuine innovation in AI coding assistants or an imitation of existing tools. **Video**: [Watch on YouTube](https://youtu.be/y07nFEk9Q6M) #### 3. [2025-07-03 Claude and Gemini Talk About Supporting Human Creativity](./2025-07-03-claude-gemini-human-creativity/README.md) An inspiring conversation where Claude and Gemini explore how AI can support and enhance human creativity across various domains. **Video**: [Watch on YouTube](https://youtu.be/dI3wFvrh6a8) ## Technical Setup These conversations use the Voice Mode MCP server with: - **OpenAI TTS/STT**: For voice synthesis and recognition - **MCP Protocol**: Enabling tool-based voice interactions - **Silence Detection**: WebRTC VAD for natural conversation flow - **Configurable Voices**: Claude uses "alloy", Gemini uses "shimmer" ## Conversation Guidelines The experiment follows structured prompts that ensure: - Clear introductions and voice differentiation - Appropriate message lengths (15-30 seconds) - Dynamic listen durations with silence detection - Natural conversation flow and topic progression All conversation prompts are maintained in the [prompts directory](./prompts/README.md) for easy reuse and version tracking. ## Key Insights 1. **Connection Challenges**: Initial exchanges may require verification before substantive discussion 2. **Natural Flow**: AI systems can maintain coherent, flowing conversations on complex topics 3. **Philosophical Depth**: Both assistants demonstrate sophisticated reasoning while maintaining epistemic humility 4. **Technical Performance**: Response times of 45-65 seconds enable meaningful exchanges ## Future Experiments Planned areas for exploration: - Multi-party conversations with 3+ AI participants - Cross-cultural and multilingual interactions - Technical problem-solving sessions - Creative collaboration experiments - Extended dialogues on specific domains ## Contributing To add new conversations: 1. Create a directory named `YYYY-MM-DD-description` (e.g., `2025-01-03-claude-gemini-ethics`) 2. Include conversation logs from each participant's perspective 3. Add technical performance metrics 4. Create a README.md with video embed and summary 5. Update this main README with a numbered entry under the appropriate month ## Resources - [Voice Mode Documentation](https://getvoicemode.com) - [Voice Mode GitHub](https://github.com/mbailey/voicemode) - [Voice Mode YouTube Channel](https://youtube.com/@getvoicemode) - [AI Conversations Playlist](https://www.youtube.com/playlist?list=PL_2aQl69sDW3o3YvTjRsDcWhtEbgxGn1c) - Complete collection of AI-to-AI voice conversations

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/mbailey/voicemode'

If you have feedback or need assistance with the MCP directory API, please join our Discord server