voice_bridge_say
Inject audio into an open Voice Bridge call using text-to-speech in any of 602 languages or custom pre-rendered audio. Automatic muting prevents echo.
Instructions
Inject audio into an open Voice Bridge call. Two modes: (1) text — we synthesize via OmniVoice TTS in any of 602 languages; (2) audio_base64 + encoding — bring your own audio (mulaw_8000 or pcm_l16_16000 for MVP). STT is automatically muted while we inject, so the agent doesn't hear itself. No additional payment — covered by the session deposit.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| sessionId | Yes | Session ID from open_voice_bridge | |
| text | No | Text to speak (mode 1). Uses OmniVoice TTS. | |
| language | No | Language override for this utterance (default: session language) | |
| voiceDescription | No | Free-form voice description for TTS (e.g., 'calm female voice') | |
| audioBase64 | No | Pre-rendered audio bytes, base64 (mode 2). Use with 'encoding'. | |
| encoding | No | Encoding of audioBase64. mp3/opus require ffmpeg (not yet wired in MVP). |