Glama

transcribe_audio

Convert speech in audio files to text for transcription, supporting multiple languages and optional webhook notifications.

Instructions

Transcribe speech from audio to text

Input Schema

Name         Required  Description                              Default
audio_url    Yes       URL of the audio file to transcribe
language     No        Language code (e.g., 'en', 'es', 'fr')
webhook_url  No        URL for callback upon completion
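
As an illustration of arguments that fit this schema, here is a minimal sketch. The `TranscribeAudioArgs` type and the `checkTranscribeArgs` helper are hypothetical names introduced for this example and are not part of the server; the URL is a placeholder. The check treats an empty `audio_url` as missing, on the assumption that the server rejects falsy values.

```typescript
// Hypothetical type mirroring the input schema; not taken from the server source.
interface TranscribeAudioArgs {
  audio_url: string; // required
  language?: string; // optional, e.g. "en", "es", "fr"
  webhook_url?: string; // optional callback URL
}

// Hypothetical helper: returns a list of problems, empty when the arguments fit the schema.
function checkTranscribeArgs(
  args: Partial<Record<keyof TranscribeAudioArgs, unknown>>,
): string[] {
  const problems: string[] = [];
  if (typeof args.audio_url !== "string" || args.audio_url.length === 0) {
    problems.push("audio_url is required and must be a string");
  }
  for (const key of ["language", "webhook_url"] as const) {
    const value = args[key];
    if (value !== undefined && typeof value !== "string") {
      problems.push(`${key} must be a string when provided`);
    }
  }
  return problems;
}

// Example arguments; the URL is a placeholder.
const exampleArgs: TranscribeAudioArgs = {
  audio_url: "https://example.com/interview.mp3",
  language: "en",
};
```

Calling `checkTranscribeArgs(exampleArgs)` returns an empty list, while `checkTranscribeArgs({})` reports the missing `audio_url`.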

Implementation Reference

  • The handler function that implements the core logic of the transcribe_audio tool by making an API call to the /audiotranscribe endpoint.
    private async handleTranscribeAudio(args: any) {
      if (!args.audio_url) {
        throw new McpError(ErrorCode.InvalidParams, "audio_url is required");
      }
    
      const response = await this.axiosInstance.post("/audiotranscribe", {
        audio_url: args.audio_url,
        language: args.language,
        webhook_url: args.webhook_url,
      });
    
      return {
        content: [
          {
            type: "text",
            text: `Audio transcription started!\n\n${JSON.stringify(response.data, null, 2)}\n\nUse get_conversion_by_id with the task_id to check the status.`,
          },
        ],
      };
    }
  • Input schema defining parameters for the transcribe_audio tool including audio_url (required), language, and webhook_url.
    inputSchema: {
      type: "object" as const,
      properties: {
        audio_url: {
          type: "string",
          description: "URL of the audio file to transcribe",
        },
        language: {
          type: "string",
          description: "Language code (e.g., 'en', 'es', 'fr')",
        },
        webhook_url: {
          type: "string",
          description: "URL for callback upon completion",
        },
      },
      required: ["audio_url"],
    },
  • src/index.ts:522-542 (registration)
    Tool registration object in the TOOLS array that defines the tool's metadata and schema for MCP list tools request.
      name: "transcribe_audio",
      description: "Transcribe speech from audio to text",
      inputSchema: {
        type: "object" as const,
        properties: {
          audio_url: {
            type: "string",
            description: "URL of the audio file to transcribe",
          },
          language: {
            type: "string",
            description: "Language code (e.g., 'en', 'es', 'fr')",
          },
          webhook_url: {
            type: "string",
            description: "URL for callback upon completion",
          },
        },
        required: ["audio_url"],
      },
    },
  • src/index.ts:713-714 (registration)
    Switch case in the CallTool request handler that routes transcribe_audio calls to the handler function.
    case "transcribe_audio":
      return await this.handleTranscribeAudio(args);
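
The handler flow described above (validate, POST to /audiotranscribe, wrap the response as text) can be tried in isolation with a stand-in HTTP client. This is a sketch under assumptions, not the server's code: `PostFn`, `ToolResult`, `transcribeAudioSketch`, and `fakePost` are hypothetical names, a plain `Error` stands in for `McpError`, and the `task_id` value in the fake response is invented.

```typescript
// Minimal stand-in for the HTTP client; the real handler uses an axios instance.
type PostFn = (
  path: string,
  body: Record<string, unknown>,
) => Promise<{ data: unknown }>;

interface ToolResult {
  content: { type: "text"; text: string }[];
}

// Sketch of the handler flow: validate, POST, then wrap the response as text content.
async function transcribeAudioSketch(
  post: PostFn,
  args: { audio_url?: string; language?: string; webhook_url?: string },
): Promise<ToolResult> {
  if (!args.audio_url) {
    // The real handler throws McpError(ErrorCode.InvalidParams, ...); a plain Error stands in here.
    throw new Error("audio_url is required");
  }
  const response = await post("/audiotranscribe", {
    audio_url: args.audio_url,
    language: args.language,
    webhook_url: args.webhook_url,
  });
  return {
    content: [
      {
        type: "text",
        text: `Audio transcription started!\n\n${JSON.stringify(response.data, null, 2)}`,
      },
    ],
  };
}

// Fake client that records each call and returns a made-up payload.
const calls: { path: string; body: Record<string, unknown> }[] = [];
const fakePost: PostFn = async (path, body) => {
  calls.push({ path, body });
  return { data: { task_id: "example-task-id" } };
};
```

Injecting the client as a parameter keeps the sketch runnable without network access; in the server itself the client is a class member rather than an argument.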

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/pasie15/mcp-server-musicgpt'

If you have feedback or need assistance with the MCP directory API, please join our Discord server.