Skip to main content
Glama

hyperframes_tts

Add speech to videos by converting text to audio with local TTS voices. List available voices for selection.

Instructions

Generate speech audio or list available Hyperframes local TTS voices.

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
text_or_fileNo
output_pathNo
voiceNo
speedNo
languageNo
list_voicesNo

Output Schema

TableJSON Schema
NameRequiredDescriptionDefault

No arguments

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description must carry the full burden of behavioral disclosure. It only says 'generate speech audio or list available Hyperframes local TTS voices' but does not disclose side effects, requirements (e.g., internet, authentication), or behavioral traits like whether generation is synchronous or asynchronous.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Description is a single sentence, concise and to the point. However, it could benefit from structuring the two modes explicitly (e.g., 'Use list_voices=true to list voices, otherwise generate audio from text').

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness1/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the complexity (6 parameters, no schema descriptions, no annotations), the description is severely incomplete. It does not explain how the tool works, what the output schema contains, or how to switch between generation and listing modes. An output schema exists but its content is unknown.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters1/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, meaning the schema itself has no parameter descriptions. The description does not explain any parameters (e.g., text_or_file, output_path, voice, speed, language, list_voices) or how they relate to the two modes. The agent cannot infer parameter semantics from the description alone.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states two functions: generate speech audio or list voices. Verb-resource pair is specific. However, it does not explicitly distinguish from sibling tools like audio_synthesize, which might also generate audio.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No usage guidance provided. It does not explain when to use this tool versus alternatives like audio_synthesize, nor does it mention prerequisites or typical use cases.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/KyaniteLabs/mcp-video'

If you have feedback or need assistance with the MCP directory API, please join our Discord server