Skip to main content
Glama

create_lipsync

Generate lip-sync videos by aligning mouth movements with audio. Use custom audio files or text-to-speech with multiple voice options. Works for videos featuring clear, steady human faces. Supports real, 3D, or 2D characters. Limited to 10-second clips.

Instructions

Create a lip-sync video by synchronizing mouth movements with audio. Supports both text-to-speech (TTS) with various voice options or custom audio upload. The original video must contain a clear, steady human face with visible mouth. Works with real, 3D, or 2D human characters (not animals). Video length limited to 10 seconds.

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
audio_urlNoURL of custom audio file (mp3, wav, flac, ogg; max 20MB, 60s). If provided, TTS parameters are ignored
model_nameNoModel version to use (default: kling-v2-master)
tts_speedNoSpeech speed for TTS (0.5-2.0, default: 1.0)
tts_textNoText for text-to-speech synthesis (used only if audio_url is not provided)
tts_voiceNoVoice style for TTS (default: male-warm). Includes Chinese and English voice options
video_urlYesURL of the video to apply lip-sync to (must contain clear human face)

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/199-mcp/mcp-kling'

If you have feedback or need assistance with the MCP directory API, please join our Discord server