gaudio_list_models

List available Gaudio AI models for audio processing. Filter by category to find stem separation, DME separation, or text sync models.

Instructions

List available Gaudio AI models. Filter by category: 'stem' (instrument separation), 'dme' (dialogue/music/effects separation), 'text_sync' (lyrics sync).

Input Schema

TableJSON Schema

Name	Required	Description	Default
`category`	No	Filter by category: all (default), stem, dme, or text_sync	all

Implementation Reference

src/index.ts:27-27 (registration)
Registers the 'gaudio_list_models' tool on the MCP server via registerListModels()
```
registerListModels(server);
```

src/tools/list-models.ts:5-38 (handler)

Full handler implementation for 'gaudio_list_models'. Registers the tool with Zod schema for optional 'category' filter (all/stem/dme/text_sync) and returns formatted model info from the registry.

export function registerListModels(server: McpServer) {
  server.tool(
    "gaudio_list_models",
    "List available Gaudio AI models. Filter by category: 'stem' (instrument separation), 'dme' (dialogue/music/effects separation), 'text_sync' (lyrics sync).",
    {
      category: z
        .enum(["all", "stem", "dme", "text_sync"])
        .default("all")
        .describe("Filter by category: all (default), stem, dme, or text_sync"),
    },
    async ({ category }) => {
      const models = getModelsByCategory(category === "all" ? undefined : category);
      const formatted = models.map((m) => ({
        name: m.name,
        category: m.category,
        description: m.description,
        typeOptions: m.typeOptions ?? null,
        typeRequired: m.typeRequired,
        maxFileSize: m.maxFileSize,
        maxDuration: m.maxDuration,
        outputFormat: m.outputFormat,
      }));

      return {
        content: [
          {
            type: "text" as const,
            text: JSON.stringify(formatted, null, 2),
          },
        ],
      };
    },
  );
}

src/tools/list-models.ts:9-14 (schema)

Zod schema for the 'category' input parameter: enum of 'all', 'stem', 'dme', 'text_sync', defaulting to 'all'.

{
  category: z
    .enum(["all", "stem", "dme", "text_sync"])
    .default("all")
    .describe("Filter by category: all (default), stem, dme, or text_sync"),
},

src/models/registry.ts:124-127 (helper)

getModelsByCategory helper function used by the handler to filter MODEL_REGISTRY by category.

export function getModelsByCategory(category?: string): ModelInfo[] {
  if (!category) return MODEL_REGISTRY;
  return MODEL_REGISTRY.filter((m) => m.category === category);
}

src/models/registry.ts:1-118 (helper)

ModelInfo interface and MODEL_REGISTRY array with all available model definitions used by gaudio_list_models.

export interface ModelInfo {
  name: string;
  category: "stem" | "dme" | "text_sync";
  description: string;
  typeOptions?: string[];
  typeRequired: boolean;
  maxFileSize: string;
  maxDuration: string;
  outputFormat: string;
}

export const MODEL_REGISTRY: ModelInfo[] = [
  // Stem Separation
  {
    name: "gsep_music_hq_v1",
    category: "stem",
    description: "High-quality multi-instrument separation. Separates vocals, drums, bass, electric guitar, and acoustic piano.",
    typeOptions: ["vocal", "drum", "bass", "electric_guitar", "acoustic_piano"],
    typeRequired: true,
    maxFileSize: "1GB",
    maxDuration: "20 minutes",
    outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
  },
  {
    name: "gsep_music_shq_v1",
    category: "stem",
    description: "Super high-quality vocal + accompaniment separation.",
    typeOptions: ["vocal"],
    typeRequired: true,
    maxFileSize: "1GB",
    maxDuration: "20 minutes",
    outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
  },
  {
    name: "gsep_speech_hq_v1",
    category: "stem",
    description: "Speech separation / noise removal.",
    typeOptions: ["speech"],
    typeRequired: true,
    maxFileSize: "1GB",
    maxDuration: "20 minutes",
    outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
  },
  // DME Separation
  {
    name: "gsep_dme_dtrack_v1",
    category: "dme",
    description: "Extract dialogue track from audio/video.",
    typeRequired: false,
    maxFileSize: "10GB",
    maxDuration: "200 minutes",
    outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
  },
  {
    name: "gsep_dme_d2track_v1",
    category: "dme",
    description: "Extract dialogue + vocals track from audio/video.",
    typeRequired: false,
    maxFileSize: "10GB",
    maxDuration: "200 minutes",
    outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
  },
  {
    name: "gsep_dme_metrack_v1",
    category: "dme",
    description: "Extract music + effects track (paired with dtrack).",
    typeRequired: false,
    maxFileSize: "10GB",
    maxDuration: "200 minutes",
    outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
  },
  {
    name: "gsep_dme_me2track_v1",
    category: "dme",
    description: "Extract music + effects track v1 (paired with d2track).",
    typeRequired: false,
    maxFileSize: "10GB",
    maxDuration: "200 minutes",
    outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
  },
  {
    name: "gsep_dme_me2track_v2",
    category: "dme",
    description: "Extract music + effects track v2 (high quality, paired with d2track).",
    typeRequired: false,
    maxFileSize: "10GB",
    maxDuration: "200 minutes",
    outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
  },
  {
    name: "gsep_dme_mtrack_v1",
    category: "dme",
    description: "Extract music-only track from audio/video.",
    typeRequired: false,
    maxFileSize: "10GB",
    maxDuration: "200 minutes",
    outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
  },
  {
    name: "gsep_dme_etrack_v1",
    category: "dme",
    description: "Extract effects-only track from audio/video.",
    typeRequired: false,
    maxFileSize: "10GB",
    maxDuration: "200 minutes",
    outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
  },
  // AI Text Sync
  {
    name: "gts_lyrics_line_v1",
    category: "text_sync",
    description: "AI lyrics line sync. Aligns lyrics text to audio timestamps. Outputs CSV (timestamp, lyric_text, confidence_score) + JSON report.",
    typeRequired: false,
    maxFileSize: "1GB",
    maxDuration: "10 minutes",
    outputFormat: "CSV (lyrics) + JSON (report)",
  },
];

gaudio-developers-mcp

gaudio_list_models

Instructions

Input Schema

Implementation Reference

Tool Definition Quality

Other Tools

Latest Blog Posts

MCP directory API