261,118 tools. Last updated 2026-07-05 10:33

"Tools and Applications for Text-to-Speech Conversion" matching MCP tools:

createSpeech
Ludo AI Game Assets
Convert text to speech by cloning the voice from an audio sample you provide (voice-cloning text-to-speech). Both text and sample are required; the text is limited to 1000 characters and the sample is supplied as a URL or base64 audio that must be at most 15MB, with violations returning HTTP 400. Synchronous: the call blocks until generation finishes and returns a single audio result containing a URL; there is no separate polling step. Credits are charged on success. Use this when you have a reference voice sample to clone; use createSpeechPreset to speak with a built-in named preset voice instead, and createVoice to design a brand-new voice from a text description rather than cloning one. Pass an optional request_id to tag the result so you can locate it later via getAudioResults. Requires an API key (user scope). Credits: This endpoint consumes 1 credit per call.
Connector
check_credit_balance
mdmagic-mcp-server
Check the user's current MDMagic credit balance: subscription credits (renewable monthly), purchased credits (permanent), plan name, and plan status. CALL THIS PROACTIVELY when: - The user asks 'how many credits do I have' or similar - After a conversion, if the user wants to know what's left (also returned by convert_document directly) - Before a conversion of an unusually large document, to warn the user if balance is borderline
Connector
ai_models
dynamic-feed
Current & trending AI MODELS from the open-model ecosystem (Hugging Face) — name, org, task, popularity (likes/downloads) and release date. Use for "what AI models are trending / newest / what's the latest <X> model". This is the OPEN side (Llama, Qwen, DeepSeek, Mistral, Gemma, Phi…); for the closed flagships (GPT, Claude, Gemini, Grok) with pricing & versions use search_ai_models. Args: query: search a model name (e.g. llama, qwen, whisper). org: filter by org/author (e.g. meta-llama, deepseek-ai, Qwen, mistralai, google). task: text-generation (default), text-to-image, automatic-speech-recognition, … or 'any'. sort: trending (default) | newest | downloads. limit: max results. Every value is returned in an Ed25519-signed, provenance-stamped envelope (source and observation time) you can verify offline against /.well-known/keys, no account required.
Connector
courtlistener_get_oral_argument
courtlistener-mcp-server
Fetch the full detail record for a single oral argument audio recording by its ID (the audio_id from courtlistener_search_oral_arguments). Returns the case name, panel judge IDs, duration, MP3 download URL, linked docket, and the speech-to-text transcript when transcription has completed. The argument date is not on this record — it comes from the search result or the linked docket.
Connector
createSpeechPreset
Ludo AI Game Assets
Convert text to speech using a named built-in preset voice, with optional emotion and language settings. Both text and voice_preset_id are required and the text is limited to 1000 characters; invalid input returns HTTP 400. Synchronous: the call blocks until generation finishes and returns a single audio result containing a URL; there is no separate polling step. Credits are charged on success. Use this when you want a ready-made catalog voice and do not need to supply your own sample; use createSpeech to clone a voice from an audio sample instead, and createVoice to design a new voice from a text description. Pass an optional request_id to tag the result so you can locate it later via getAudioResults. Requires an API key (user scope). Credits: This endpoint consumes 1 credit per call.
Connector
render_document
Carbone MCP
Generate a document by merging a Carbone template with JSON data. Two modes: (1) pass templateId to use a previously uploaded template; (2) pass template (file path, URL, or base64) to upload and render in a single request without storing a template. Supports output format conversion, multilingual rendering, currency conversion, batch generation, and advanced PDF options (watermark, password, PDF/A). Async mode: pass webhookUrl to render asynchronously — Carbone will POST the renderId to your URL when the document is ready. Async mode is required when using batch generation (batchSplitBy).
Connector

Matching MCP Servers

io.github.Engr-FaizanAli/text
Text-to-Speech, Speech Processing, Audio Processing
Engr-FaizanAli
License: A, Quality: -, Maintenance: B
Text-to-speech MCP server that enables AI assistants to read text aloud on the user's computer using Windows SAPI, with no API key or cloud service required.
Last updated 2026-06-27
MIT
Speech-to-Text MCP Server
Speech Processing, Audio Processing
crackgo
License: F, Quality: -, Maintenance: D
An MCP server that provides speech-to-text transcription and speaker diarization using OpenAI Whisper and pyannote.audio.
Last updated 2025-12-30

Matching MCP Connectors

Speech AI - Pronunciation, STT & TTS
Pronunciation scoring, speech-to-text, and text-to-speech for language learning
Agent Tools
9 utility tools for agents: DNS, WHOIS, email, IP, URL, headers, QR, text, tech. x402 on Base.

adcritter_get_platform_overview
AdCritter for Builders
Start here. Returns the AdCritter platform overview - what AdCritter is, the entity hierarchy (organization > advertiser > campaign > ad), the happy path for getting ads running, and how to navigate the other MCP tools. Applications built from this guidance are REST API clients that call /v1/ endpoints, not MCP tool callers. Before writing code, call adcritter_get_api_reference(entity, action) for each entity and action you plan to use - tool descriptions and parameter names describe conceptual behavior only, and do not match actual API routes, field names, query parameters, or response shapes.
Connector
generate_speech_to_flow
Avocado AI
Convert text to natural-sounding speech and place the voiceover directly on a user's Avocado AI flow (the Flows Director). Drops a 'Generating…' audio node immediately and returns right away; the finished voiceover swaps in automatically — no need to wait or check_job (there is no check_job for audio). It appears live on the open canvas and in the Director Library (Audio). ElevenLabs voices (rachel..sam) cost 3 credits per 1000 characters. Seed Audio voices (vivi, mindy, kian, sophie, magnus, nadia — multilingual en/zh and more) are pro-rated at 5 credits per 1000 characters with a 1-credit minimum (cheaper for short lines; max 2048 characters). Use this (not generate_speech) when working on a flow.
Connector
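The Seed Audio pricing in the generate_speech_to_flow entry above (5 credits per 1000 characters, pro-rated, 1-credit minimum, 2048-character maximum) can be sketched as a small estimator. This is only an illustration: the function name is hypothetical, it assumes that billed characters equal Python's len() of the text, and it assumes no rounding is applied to fractional credits, none of which the listing confirms.

```python
# Figures taken from the listing; everything else is an assumption.
SEED_AUDIO_CREDITS_PER_1000 = 5
SEED_AUDIO_MIN_CREDITS = 1.0
SEED_AUDIO_MAX_CHARS = 2048


def estimate_seed_audio_credits(text: str) -> float:
    """Estimate the pro-rated credit cost of one Seed Audio voiceover line.

    Hypothetical helper: assumes characters are counted with len() and
    that fractional credits are not rounded.
    """
    n_chars = len(text)
    if n_chars == 0:
        raise ValueError("text must not be empty")
    if n_chars > SEED_AUDIO_MAX_CHARS:
        raise ValueError(f"text exceeds {SEED_AUDIO_MAX_CHARS} characters")
    pro_rated = SEED_AUDIO_CREDITS_PER_1000 * n_chars / 1000
    # Short lines are floored at the 1-credit minimum.
    return max(SEED_AUDIO_MIN_CREDITS, pro_rated)
```

Under these assumptions a 100-character line costs the 1-credit minimum rather than 0.5 credits, and a 1000-character line costs 5 credits.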
createVoice
Ludo AI Game Assets
Design a new voice from a character description (such as "deep-voiced warrior" or "cheerful young girl") and have it speak a short line of text, returning a sample of that newly created voice. Both voice_description and text are required, the spoken text is limited to 200 characters or the call returns HTTP 400, and type selects "human" or "non-human" voices. Synchronous: the call blocks until generation finishes and returns a single audio result containing a URL; there is no separate polling step. Credits are charged on success. Use this to invent and audition a voice from a description; use createSpeech for text-to-speech that clones a specific voice from an audio sample, and createSpeechPreset for text-to-speech using a named preset voice. Pass an optional request_id to tag the result so you can locate it later via getAudioResults. Requires an API key (user scope). Credits: This endpoint consumes 1 credits per call.
Connector
search_patents
patents
Search USPTO patent applications and grants. Use `query` for free-text keywords ("lithium battery", "crispr"). Optional structured filters: `applicant` (company name — use ALL CAPS like "APPLE INC." for best match), `filed_after` / `filed_before` (filing date range), `granted_after` / `granted_before` (grant date range). Results include title, application number, filing date, first applicant, all applicants, inventors, status, classification. Note: ODP filtering is approximate (weighted match, not strict equality) — counts and ordering are best-effort. Powered by the USPTO Open Data Portal (data.uspto.gov).
Connector
check_credit_balance
mdmagic
Check the user's current MDMagic credit balance: subscription credits (renewable monthly), purchased credits (permanent), plan name, and plan status. CALL THIS PROACTIVELY when: - The user asks 'how many credits do I have' or similar - After a conversion, if the user wants to know what's left (also returned by convert_document directly) - Before a conversion of an unusually large document, to warn the user if balance is borderline
Connector
estimate_conversion_cost
mdmagic
Estimate credit cost for a conversion BEFORE running it. Returns word count, page calculation (300 words/page), and a credit breakdown by format and template type. Use this when the user asks 'how much will this cost?' or when you suspect a conversion might exceed their balance — convert_document refuses to run if credits are insufficient, so estimating first is friendlier.
Connector
search_debates
Openparliament Ca
List Hansard speeches (debate contributions) by a specific MP and/or party, optionally within a date range. Returns who spoke, when, and the speech text. IMPORTANT: the OpenParliament API has NO free-text/topic search — you must filter by `politician` (MP slug; find one via list_politicians) and/or `party`. There is no way to search debates by subject keyword.
Connector
search_words
Jisho
Search the Jisho.org Japanese<->English dictionary. The keyword can be English (translate to Japanese), Japanese kanji/kana, or romaji. Returns up to `limit` matching dictionary entries, each with the headword (slug), whether it is a common word, JLPT level, all readings/spellings, and English meanings grouped into senses with parts of speech. Use this to translate, look up a kanji/kana word, or find Japanese words for an English concept.
Connector
get_scene
Quotes
Return the full text of a specific scene from a play (every speech, every line, in order). Example: author_id="william-shakespeare", work_slug="hamlet", act=3, scene=1 returns the entire "To be, or not to be" scene including all of Hamlet's soliloquy and the subsequent dialogue with Ophelia. Useful for context, citation, or close reading.
Connector
get_cost_estimate
Sats4AI - Bitcoin-Powered AI Tools
Get an exact sat cost quote for a service BEFORE creating a payment. Useful for budget-aware agents to price-check before committing. No payment required, no side effects. Pass service=text-to-speech&chars=1500, service=translate&chars=800, service=transcribe-audio&minutes=5, etc. Returns { amount_sats, breakdown, currency }. Omit params to see the full catalog of supported services.
Connector
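The parameter strings quoted in the get_cost_estimate entry above (for example service=text-to-speech&chars=1500) are ordinary URL-encoded key/value pairs. A minimal sketch of building them with the Python standard library follows; the helper name is hypothetical, and how the string is actually handed to the tool is not stated in the listing.

```python
from urllib.parse import urlencode


def build_quote_params(service: str, **amounts: int) -> str:
    """Build a parameter string such as 'service=translate&chars=800'.

    Hypothetical helper: the keys (chars, minutes) are the ones the
    listing shows; no others are assumed to exist.
    """
    return urlencode({"service": service, **amounts})


print(build_quote_params("text-to-speech", chars=1500))
# prints: service=text-to-speech&chars=1500
```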
mio_ai_voice_generator
MiOffice — AI-Powered Workspace Studio
AI Voice Generator: convert text to natural-sounding speech using AI, with 6 voices in English and Spanish and engine tiers for cleaner studio-grade output. AI Studio run: dispatches to our AI workers (Modal). Credits per run vary by model and file size. Day Pass and welcome credits do not include AI Studio. Files are deleted after processing; auditable at mioffice.ai/account/tasks (retention details at mioffice.ai/privacy). All three credit-based workspaces unlock with the same one-time credit pack; there is no per-workspace subscription. See mioffice.ai/pricing for current plans.
Connector
web_url_reader
Inferventis MCP Server
Fetches any public web page and returns clean, readable plain text stripped of HTML, navigation, scripts, advertisements, and boilerplate. Returns the page title, meta description, word count, and main body text ready for analysis or summarisation. Use this tool when an agent needs to read the content of a specific web page or article URL — for example to summarise an article, extract facts from a page, verify a claim by reading the source, or convert a web page into plain text to pass to another tool. Pass article URLs returned by web_news_headlines to this tool to read full article content. Do not use this tool to discover current news headlines — use web_news_headlines instead. Does not execute JavaScript — best suited for standard HTML content pages. Will not work with paywalled, login-protected, or JavaScript-rendered single-page applications.
Connector
web_url_reader
Inferventis — Financial Data, News & Web MCP
Fetches any public web page and returns clean, readable plain text stripped of HTML, navigation, scripts, advertisements, and boilerplate. Returns the page title, meta description, word count, and main body text ready for analysis or summarisation. Use this tool when an agent needs to read the content of a specific web page or article URL — for example to summarise an article, extract facts from a page, verify a claim by reading the source, or convert a web page into plain text to pass to another tool. Pass article URLs returned by web_news_headlines to this tool to read full article content. Do not use this tool to discover current news headlines — use web_news_headlines instead. Does not execute JavaScript — best suited for standard HTML content pages. Will not work with paywalled, login-protected, or JavaScript-rendered single-page applications.
Connector
fetch-page
PartsTable
Fetch a web page and return its content as text, Markdown, or HTML. Includes rate limiting (2s per domain, max 10 req/min) for legal compliance. Automatically handles HTML-to-text conversion. Max response size: 1MB. Use for OEM verification and manufacturer website scraping.
Connector
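The rate limits quoted in the fetch-page entry above (2s per domain, max 10 req/min) can be mirrored on the caller's side so that requests are spaced out before they are sent. The sketch below is not PartsTable's implementation: the class and method names are hypothetical, and it assumes the 10 requests per minute limit is global rather than per domain, which the listing does not specify.

```python
import time
from collections import deque
from urllib.parse import urlparse


class FetchThrottle:
    """Client-side throttle: a minimum gap per domain plus a sliding
    60-second window over all requests (assumed to be a global limit)."""

    def __init__(self, per_domain_gap=2.0, max_per_minute=10, clock=time.monotonic):
        self.per_domain_gap = per_domain_gap
        self.max_per_minute = max_per_minute
        self.clock = clock  # injectable so the logic can be tested without sleeping
        self._last_by_domain = {}
        self._recent = deque()  # timestamps of requests in the last 60 seconds

    def wait_time(self, url: str) -> float:
        """Return how many seconds to wait before fetching url (0.0 if none)."""
        now = self.clock()
        # Drop timestamps that have left the 60-second window.
        while self._recent and now - self._recent[0] >= 60.0:
            self._recent.popleft()
        wait = 0.0
        last = self._last_by_domain.get(urlparse(url).netloc)
        if last is not None:
            wait = max(wait, self.per_domain_gap - (now - last))
        if len(self._recent) >= self.max_per_minute:
            wait = max(wait, 60.0 - (now - self._recent[0]))
        return max(wait, 0.0)

    def record(self, url: str) -> None:
        """Record that a request to url was just sent."""
        now = self.clock()
        self._last_by_domain[urlparse(url).netloc] = now
        self._recent.append(now)
```

A caller would sleep for wait_time(url) seconds, send the request, and then call record(url).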