Skip to main content
Glama
gaudiolab-jp

gaudio-developers-mcp

Official

gaudio_list_models

List available Gaudio AI models for audio processing. Filter by category to find stem separation, DME separation, or text sync models.

Instructions

List available Gaudio AI models. Filter by category: 'stem' (instrument separation), 'dme' (dialogue/music/effects separation), 'text_sync' (lyrics sync).

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
categoryNoFilter by category: all (default), stem, dme, or text_syncall

Implementation Reference

  • src/index.ts:27-27 (registration)
    Registers the 'gaudio_list_models' tool on the MCP server via registerListModels()
    registerListModels(server);
  • Full handler implementation for 'gaudio_list_models'. Registers the tool with Zod schema for optional 'category' filter (all/stem/dme/text_sync) and returns formatted model info from the registry.
    export function registerListModels(server: McpServer) {
      server.tool(
        "gaudio_list_models",
        "List available Gaudio AI models. Filter by category: 'stem' (instrument separation), 'dme' (dialogue/music/effects separation), 'text_sync' (lyrics sync).",
        {
          category: z
            .enum(["all", "stem", "dme", "text_sync"])
            .default("all")
            .describe("Filter by category: all (default), stem, dme, or text_sync"),
        },
        async ({ category }) => {
          const models = getModelsByCategory(category === "all" ? undefined : category);
          const formatted = models.map((m) => ({
            name: m.name,
            category: m.category,
            description: m.description,
            typeOptions: m.typeOptions ?? null,
            typeRequired: m.typeRequired,
            maxFileSize: m.maxFileSize,
            maxDuration: m.maxDuration,
            outputFormat: m.outputFormat,
          }));
    
          return {
            content: [
              {
                type: "text" as const,
                text: JSON.stringify(formatted, null, 2),
              },
            ],
          };
        },
      );
    }
  • Zod schema for the 'category' input parameter: enum of 'all', 'stem', 'dme', 'text_sync', defaulting to 'all'.
    {
      category: z
        .enum(["all", "stem", "dme", "text_sync"])
        .default("all")
        .describe("Filter by category: all (default), stem, dme, or text_sync"),
    },
  • getModelsByCategory helper function used by the handler to filter MODEL_REGISTRY by category.
    export function getModelsByCategory(category?: string): ModelInfo[] {
      if (!category) return MODEL_REGISTRY;
      return MODEL_REGISTRY.filter((m) => m.category === category);
    }
  • ModelInfo interface and MODEL_REGISTRY array with all available model definitions used by gaudio_list_models.
    export interface ModelInfo {
      name: string;
      category: "stem" | "dme" | "text_sync";
      description: string;
      typeOptions?: string[];
      typeRequired: boolean;
      maxFileSize: string;
      maxDuration: string;
      outputFormat: string;
    }
    
    export const MODEL_REGISTRY: ModelInfo[] = [
      // Stem Separation
      {
        name: "gsep_music_hq_v1",
        category: "stem",
        description: "High-quality multi-instrument separation. Separates vocals, drums, bass, electric guitar, and acoustic piano.",
        typeOptions: ["vocal", "drum", "bass", "electric_guitar", "acoustic_piano"],
        typeRequired: true,
        maxFileSize: "1GB",
        maxDuration: "20 minutes",
        outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
      },
      {
        name: "gsep_music_shq_v1",
        category: "stem",
        description: "Super high-quality vocal + accompaniment separation.",
        typeOptions: ["vocal"],
        typeRequired: true,
        maxFileSize: "1GB",
        maxDuration: "20 minutes",
        outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
      },
      {
        name: "gsep_speech_hq_v1",
        category: "stem",
        description: "Speech separation / noise removal.",
        typeOptions: ["speech"],
        typeRequired: true,
        maxFileSize: "1GB",
        maxDuration: "20 minutes",
        outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
      },
      // DME Separation
      {
        name: "gsep_dme_dtrack_v1",
        category: "dme",
        description: "Extract dialogue track from audio/video.",
        typeRequired: false,
        maxFileSize: "10GB",
        maxDuration: "200 minutes",
        outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
      },
      {
        name: "gsep_dme_d2track_v1",
        category: "dme",
        description: "Extract dialogue + vocals track from audio/video.",
        typeRequired: false,
        maxFileSize: "10GB",
        maxDuration: "200 minutes",
        outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
      },
      {
        name: "gsep_dme_metrack_v1",
        category: "dme",
        description: "Extract music + effects track (paired with dtrack).",
        typeRequired: false,
        maxFileSize: "10GB",
        maxDuration: "200 minutes",
        outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
      },
      {
        name: "gsep_dme_me2track_v1",
        category: "dme",
        description: "Extract music + effects track v1 (paired with d2track).",
        typeRequired: false,
        maxFileSize: "10GB",
        maxDuration: "200 minutes",
        outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
      },
      {
        name: "gsep_dme_me2track_v2",
        category: "dme",
        description: "Extract music + effects track v2 (high quality, paired with d2track).",
        typeRequired: false,
        maxFileSize: "10GB",
        maxDuration: "200 minutes",
        outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
      },
      {
        name: "gsep_dme_mtrack_v1",
        category: "dme",
        description: "Extract music-only track from audio/video.",
        typeRequired: false,
        maxFileSize: "10GB",
        maxDuration: "200 minutes",
        outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
      },
      {
        name: "gsep_dme_etrack_v1",
        category: "dme",
        description: "Extract effects-only track from audio/video.",
        typeRequired: false,
        maxFileSize: "10GB",
        maxDuration: "200 minutes",
        outputFormat: "mp3 (48kHz/320kbps) + wav (same as input)",
      },
      // AI Text Sync
      {
        name: "gts_lyrics_line_v1",
        category: "text_sync",
        description: "AI lyrics line sync. Aligns lyrics text to audio timestamps. Outputs CSV (timestamp, lyric_text, confidence_score) + JSON report.",
        typeRequired: false,
        maxFileSize: "1GB",
        maxDuration: "10 minutes",
        outputFormat: "CSV (lyrics) + JSON (report)",
      },
    ];
Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the burden. It mentions listing and filtering but does not disclose read-only nature, authentication, or rate limits. The behavior is implied but not explicit.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

A single, front-loaded sentence that efficiently conveys purpose and parameter usage with no wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given low complexity (one optional parameter, no output schema), the description adequately covers purpose and filtering. It does not explain output format, but this is acceptable for a simple list tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with descriptions for each enum value. The description adds real-world meaning ('stem' = instrument separation) beyond the schema, aiding selection.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool lists Gaudio AI models and explains the filter categories. The verb 'list' and resource 'models' are precise, and it distinguishes from sibling tools like gaudio_create_job or gaudio_separate_audio.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage for viewing models by category but lacks explicit when-to-use or when-not-to-use guidance. However, the context of sibling tools provides clarity on distinct purposes.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/gaudiolab-jp/gaudio-developers-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server