transcribe
Transcribe audio or video files into text, identifying speakers and adding timestamps.
Instructions
Transcribe audio or video with speaker labels and timestamps. Cost: 3 credits.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| audio_url | Yes | URL to audio or video file | |
| speaker_labels | No | Enable speaker diarization | true |
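As a rough illustration of how the two parameters combine, the sketch below applies the documented default for `speaker_labels` when the caller omits it. The `TranscribeArgs` types and the `normalizeTranscribeArgs` helper are hypothetical names introduced for this example; the server itself relies on its Zod schema for this.

```typescript
// Hypothetical types and helper for illustration; not part of the server source.
interface TranscribeArgs {
  audio_url: string;
  speaker_labels?: boolean;
}

interface NormalizedTranscribeArgs {
  audio_url: string;
  speaker_labels: boolean;
}

function normalizeTranscribeArgs(args: TranscribeArgs): NormalizedTranscribeArgs {
  // audio_url is the only required field and must be a string.
  if (typeof args.audio_url !== "string") {
    throw new Error("audio_url is required and must be a string");
  }
  return {
    audio_url: args.audio_url,
    // Mirrors the documented default: diarization stays on unless explicitly disabled.
    speaker_labels: args.speaker_labels ?? true,
  };
}

// The URL is a placeholder, not a real file.
const withDefault = normalizeTranscribeArgs({ audio_url: "https://example.com/meeting.mp3" });
const withoutLabels = normalizeTranscribeArgs({
  audio_url: "https://example.com/meeting.mp3",
  speaker_labels: false,
});
```

Here `withDefault.speaker_labels` is `true` and `withoutLabels.speaker_labels` is `false`, matching the Default column above.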
Implementation Reference
- src/index.ts:131-137 (schema): Schema definition for the 'transcribe' tool: accepts audio_url (string) and speaker_labels (boolean, default true).

```typescript
{
  name: "transcribe",
  description: "Transcribe audio or video with speaker labels and timestamps. Cost: 3 credits.",
  inputSchema: {
    audio_url: z.string().describe("URL to audio or video file"),
    speaker_labels: z.boolean().optional().default(true).describe("Enable speaker diarization"),
  },
},
```

- src/index.ts:247-259 (registration): Tools are registered dynamically in a loop over CAPABILITIES. The 'transcribe' tool is registered via server.registerTool at line 249 when cap.name === 'transcribe'.

```typescript
for (const cap of CAPABILITIES) {
  // Cast inputSchema to avoid TS2589 (excessively deep type instantiation from Zod chains)
  server.registerTool(
    cap.name,
    {
      description: cap.description,
      inputSchema: cap.inputSchema as any,
    },
    async (args: any): Promise<CallToolResult> => {
      return callSuprsonic(cap.name, args as Record<string, unknown>);
    },
  );
}
```

- src/index.ts:183-234 (handler): Generic handler function callSuprsonic that executes all tool logic. It sends a POST request to the Suprsonic API with the capability name ('transcribe') and params, then returns the result.

```typescript
async function callSuprsonic(capability: string, params: Record<string, unknown>): Promise<CallToolResult> {
  if (!API_KEY) {
    return {
      content: [{ type: "text", text: "Error: SUPRSONIC_API_KEY environment variable is not set. Get your key at https://suprsonic.ai/app/apis" }],
      isError: true,
    };
  }

  try {
    const resp = await fetch(`${BASE_URL}/v1/agent`, {
      method: "POST",
      headers: {
        "Authorization": `Bearer ${API_KEY}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({ capability, params }),
    });

    const result = await resp.json() as any;

    // Handle non-envelope responses (401, 429, etc. return {"detail": ...})
    if (result.detail && result.success === undefined) {
      const msg = typeof result.detail === "object"
        ? (result.detail.title || result.detail.detail || JSON.stringify(result.detail))
        : String(result.detail);
      return {
        content: [{ type: "text", text: `Error (HTTP ${resp.status}): ${msg}` }],
        isError: true,
      };
    }

    if (!result.success) {
      const errMsg = result.error?.detail || result.error?.title || "Request failed";
      return {
        content: [{ type: "text", text: `Error: ${errMsg}` }],
        isError: true,
      };
    }

    const text = JSON.stringify(result.data, null, 2);
    const meta = result.metadata
      ? `\n\n[Provider: ${(result.metadata as any).provider_used || "unknown"}, ${(result.metadata as any).response_time_ms || 0}ms, ${result.credits_used || 0} credits]`
      : "";

    return {
      content: [{ type: "text", text: text + meta }],
    };
  } catch (err) {
    return {
      content: [{ type: "text", text: `Network error: ${err instanceof Error ? err.message : String(err)}` }],
      isError: true,
    };
  }
}
```
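The handler's response branching can be easier to follow as a pure function that runs without a network call. The sketch below is a minimal restatement of that branch order under stated assumptions: `TextResult`, `errorResult`, and `toToolResult` are hypothetical names (the server returns the MCP SDK's `CallToolResult`), and the sample payloads are invented for illustration rather than captured from the Suprsonic API.

```typescript
// Hypothetical local type standing in for the SDK's CallToolResult.
interface TextResult {
  content: { type: "text"; text: string }[];
  isError?: boolean;
}

function errorResult(text: string): TextResult {
  return { content: [{ type: "text", text }], isError: true };
}

// Maps an HTTP status and a parsed JSON body to a tool result,
// following the same branch order as callSuprsonic.
function toToolResult(status: number, result: any): TextResult {
  // Non-envelope responses carry "detail" but no "success" field.
  if (result.detail && result.success === undefined) {
    const msg = typeof result.detail === "object"
      ? (result.detail.title || result.detail.detail || JSON.stringify(result.detail))
      : String(result.detail);
    return errorResult(`Error (HTTP ${status}): ${msg}`);
  }

  // Envelope responses that report failure.
  if (!result.success) {
    const errMsg = result.error?.detail || result.error?.title || "Request failed";
    return errorResult(`Error: ${errMsg}`);
  }

  // Success: pretty-printed data plus an optional metadata footer.
  const text = JSON.stringify(result.data, null, 2);
  const meta = result.metadata
    ? `\n\n[Provider: ${result.metadata.provider_used || "unknown"}, ${result.metadata.response_time_ms || 0}ms, ${result.credits_used || 0} credits]`
    : "";
  return { content: [{ type: "text", text: text + meta }] };
}

// Illustrative inputs only; these payloads are made up for the example.
const unauthorized = toToolResult(401, { detail: "Invalid API key" });
const failed = toToolResult(200, { success: false, error: { title: "Unsupported media" } });
const ok = toToolResult(200, { success: true, data: { text: "hello" }, credits_used: 3 });
```

With these inputs, `unauthorized` and `failed` both set `isError: true`, while `ok` carries only the serialized `data`, since no `metadata` field is present to produce the footer.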