Skip to main content
Glama
gaudiolab-jp

gaudio-developers-mcp

Official

@gaudiolab/mcp-developers

npm version Glama score License: MIT

MCP server for Gaudio Lab Audio AI API. Separate vocals, instruments, dialogue, music, effects from any audio/video — or sync lyrics to timestamps — all through natural language in your AI tools.

Works with Claude, ChatGPT, Cursor, VS Code, GitHub Copilot, and any MCP-compatible client.

Get Your API Key

  1. Sign up at Gaudio Developers

  2. Create a project and get your API key from the dashboard

Quick Start

Add to your MCP client config:

{
  "mcpServers": {
    "gaudio": {
      "command": "npx",
      "args": ["-y", "@gaudiolab/mcp-developers"],
      "env": {
        "GAUDIO_API_KEY": "your-api-key-here"
      }
    }
  }
}

Then just ask in natural language:

  • "Separate the vocals from this file"

  • "Extract the dialogue from this video"

  • "Sync these lyrics to this song"

  • "What models are available?"

  • "How many credits do I have left?"

Tools

Tool

Description

gaudio_get_key_info

Get API key info: credits, project, permitted models

gaudio_list_models

List available AI models by category

gaudio_upload_file

Upload audio/video/text file (multipart, auto-chunked)

gaudio_create_job

Create a processing job

gaudio_get_job

Check job status and get download URLs

gaudio_separate_audio

All-in-one: upload → process → download URLs

gaudio_sync_lyrics

All-in-one lyrics sync with timestamps

Models

Stem Separation

Model

Description

Type Options

gsep_music_hq_v1

Multi-instrument separation

vocal, drum, bass, electric_guitar, acoustic_piano

gsep_music_shq_v1

Super HQ vocal + accompaniment

vocal

gsep_speech_hq_v1

Speech / noise removal

speech

Max: 1GB / 20 min per file. Types can be combined (e.g. vocal,drum).

DME Separation (Dialogue, Music, Effects)

Model

Description

gsep_dme_dtrack_v1

Dialogue extraction

gsep_dme_d2track_v1

Dialogue + vocals

gsep_dme_metrack_v1

Music + effects

gsep_dme_me2track_v1

Music + effects v1

gsep_dme_me2track_v2

Music + effects v2 (high quality)

gsep_dme_mtrack_v1

Music only

gsep_dme_etrack_v1

Effects only

Max: 10GB / 200 min per file.

AI Text Sync

Model

Description

Languages

gts_lyrics_line_v1

Lyrics line sync

en, ko, ja, zh-cn

Max: 1GB / 10 min. Text: .txt (UTF-8), min 2 lines, max 60 chars/line.

Output: CSV (timestamp, lyric_text, confidence_score) + JSON report.

Supported Formats

Type

Formats

Audio

WAV, FLAC, MP3, M4A

Video

MOV, MP4 (audio auto-extracted)

Text

TXT (UTF-8)

Output: MP3 (48kHz/320kbps) + WAV (same as input). Download URLs valid for 48 hours.

How It Works

Upload file → Create job → Poll status → Get download URLs

The high-level tools (gaudio_separate_audio, gaudio_sync_lyrics) handle this entire flow automatically. Upload IDs are valid for 72 hours and can be reused across multiple jobs.

License

MIT

Install Server
A
license - permissive license
A
quality
B
maintenance

Maintenance

Maintainers
Response time
Release cycle
1Releases (12mo)

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/gaudiolab-jp/gaudio-developers-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server