What can you do with this server?

The Vocametrix server provides clinical voice analysis, speech assessment, and AI-powered therapy planning tools for speech-language pathologists and voice professionals.

Voice Quality & Acoustic Analysis

* AVQI, DSI – Acoustic Voice Quality Index (overall dysphonia severity) and Dysphonia Severity Index, with age/gender norms
* CPP/CPPS – Cepstral Peak Prominence for breathiness/hoarseness detection
* HNR – Multi-band Harmonics-to-Noise Ratio
* Jitter & Shimmer – Period and amplitude perturbation
* Voice Range Profile – Frequency and intensity range from glissando recordings
* Spectral Analysis – Spectral tilt, slope, center of gravity, formant energy
* Formants (F1–F4) – Vowel space analysis
* S/Z Ratio, GNE, H1–H2, ABI – Vocal fold pathology indicators and breathiness measures
* Voice Dynamics – Dynamic range, pitch-intensity correlation, vocal stability
* Prosody Similarity – Compare prosodic patterns between recordings
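
To make the "period and amplitude perturbation" entry concrete, here is a minimal sketch of the common local jitter and shimmer definitions: the mean absolute difference between consecutive cycles, divided by the mean, in percent. The source does not say which variant the server computes, so this illustrates the general measure only. The helper name and the sample values are hypothetical.

```python
from statistics import mean


def local_perturbation(values):
    """Mean absolute difference between consecutive cycles, relative to the mean, in percent."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return 100.0 * mean(diffs) / mean(values)


# Hypothetical glottal-cycle measurements (illustrative, not patient data).
periods_ms = [5.00, 5.10, 4.90, 5.00]   # cycle durations -> jitter
amplitudes = [1.00, 0.90, 1.10, 1.00]   # cycle peak amplitudes -> shimmer

jitter_percent = local_perturbation(periods_ms)
shimmer_percent = local_perturbation(amplitudes)
print(f"jitter: {jitter_percent:.2f} %, shimmer: {shimmer_percent:.2f} %")
```

The same function serves both measures because jitter and shimmer differ only in what is measured per cycle (duration versus amplitude).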

Speech & Pronunciation

* Pronunciation Assessment – Phoneme-level scoring with per-word breakdowns (30+ locales), optionally combined with per-word F0 contours
* Transcription – Streaming ASR with word-level timing
* Text-to-Speech – Azure Neural voices; ElevenLabs TTS with per-character timing for lip-sync/subtitles

Audio Measures

* Sound Level – dB SPL and intensity statistics over time
* eGeMAPS – 88 acoustic features for ML-based voice pathology classification
* Phoneme Detection – Detect phonemes in French and Estonian audio
* Stuttering Classification – Dysfluency pattern classification with severity scoring

AI Agents

* Clinical Interpretation – Translate raw voice metrics into clinician-readable severity reports and recommendations
* Exercise & Word List Generation – Personalized therapy exercises and phoneme-targeted word lists by pathology/profile
* Adaptive Exercises – Adapted for specific learner profiles (ADHD, dyslexia, autism, etc.)
* Therapist Chat – Conversational AI speech-language therapist adapting to therapist/patient/parent roles
* French IPA Conversion, Spelling Interpretation, Syntax Checking, Vocabulary Tutor – Language support tools with spaced repetition

Therapy Planning

* Generate AI therapy plans asynchronously from session audio, poll for status, retrieve completed plans, and manage human-in-the-loop approval (approve/modify/reject)

Workflow Tools

* Full Voice Assessment – Run AVQI + CPP + HNR + jitter/shimmer + spectral in a single call
* Batch Pronunciation – Assess multiple WAV files against a reference text
* Full Therapy Workflow – End-to-end generate → poll → fetch → approval pipeline
* Audio Ingestion – Upload WAV files (base64) or ingest public HTTPS WAV URLs for stable blob references

Voice quality (acoustic)

| Tool | Description |
| --- | --- |
| `vocametrix_avqi` | Acoustic Voice Quality Index (AVQI): overall dysphonia severity |
| `vocametrix_dsi` | Dysphonia Severity Index (DSI) |
| `vocametrix_cpp_cpps` | Cepstral Peak Prominence: breathiness, hoarseness |
| `vocametrix_hnr` | Harmonics-to-Noise Ratio (multi-band) |
| `vocametrix_jitter_shimmer` | Period and amplitude perturbation |
| `vocametrix_vrp` | Voice Range Profile |
| `vocametrix_prosody_similarity` | Prosody similarity between two utterances |

Advanced voice analysis

| Tool | Description |
| --- | --- |
| `vocametrix_spectral` | Spectral tilt, slope, and formant energy |
| `vocametrix_formants` | Formant frequencies F1–F4 |
| `vocametrix_sz_ratio` | S/Z phonation ratio |
| `vocametrix_gne` | Glottal-to-Noise Excitation |
| `vocametrix_h1h2` | H1–H2 harmonic difference |
| `vocametrix_abi` | Acoustic Breathiness Index |
| `vocametrix_voice_dynamics` | Dynamic range and fundamental frequency statistics |

Ingestion utilities

| Tool | Description |
| --- | --- |
| `vocametrix_upload_audio` | Upload a WAV file (base64) → returns a stable blobUrl |
| `vocametrix_ingest_url` | Ingest a public HTTPS WAV URL → returns a stable blobUrl |
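
As a rough illustration of the base64 upload path, the sketch below synthesizes a short WAV in memory and wraps it in an MCP `tools/call` JSON-RPC request for `vocametrix_upload_audio`. The argument name `audioBase64` is an assumption made for this example; check the tool's input schema for the real field name.

```python
import base64
import io
import json
import math
import struct
import wave


def make_test_wav(seconds=0.1, rate=16000, freq=220.0):
    """Synthesize a short mono 16-bit sine tone so the example needs no audio file."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)
        w.setframerate(rate)
        frames = b"".join(
            struct.pack("<h", int(12000 * math.sin(2 * math.pi * freq * i / rate)))
            for i in range(int(seconds * rate))
        )
        w.writeframes(frames)
    return buf.getvalue()


def upload_request(wav_bytes, request_id=1):
    """Build an MCP tools/call request carrying the WAV as base64."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "vocametrix_upload_audio",
            # "audioBase64" is a hypothetical argument name, not confirmed by the source.
            "arguments": {"audioBase64": base64.b64encode(wav_bytes).decode("ascii")},
        },
    }


wav_bytes = make_test_wav()
request = upload_request(wav_bytes)
print(json.dumps(request)[:100] + "...")
```

The returned blobUrl, not the base64 payload, is what the analysis tools would then receive, so a large recording is transferred once and reused across calls.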

Speech and pronunciation

| Tool | Description |
| --- | --- |
| `vocametrix_assess_pronunciation` | Phoneme-level pronunciation scoring |
| `vocametrix_assess_pronunciation_pitch` | Pronunciation + pitch analysis combined |
| `vocametrix_transcribe` | Streaming ASR transcription with progress |
| `vocametrix_tts` | Text-to-speech synthesis |
| `vocametrix_tts_timing` | TTS with word-level timing data |

Audio measures

| Tool | Description |
| --- | --- |
| `vocametrix_sound_level` | dB SPL and intensity statistics |
| `vocametrix_egemaps` | Extended Geneva Minimalistic Acoustic Parameter Set |
| `vocametrix_phoneme_detection` | Phoneme presence/absence detection |
| `vocametrix_classify_stuttering` | Dysfluency classification |

vocametrix

@vocametrix/mcp-server

AI agents

| Tool | Description |
| --- | --- |
| `vocametrix_agent_interpret_metrics` | Clinical interpretation of voice metrics |
| `vocametrix_agent_exercises` | Personalized voice/speech exercise generation |
| `vocametrix_agent_word_list` | Target word list generation for therapy |
| `vocametrix_agent_therapist_chat` | Conversational AI speech-language therapist |
| `vocametrix_agent_french_ipa` | French text → IPA phonetic transcription |
| `vocametrix_agent_spell` | Spelling correction agent |
| `vocametrix_agent_syntax` | Syntax checking agent |
| `vocametrix_agent_vocabulary_tutor` | Vocabulary tutoring agent |
| `vocametrix_agent_adaptive_exercise` | Adaptive exercise generation |

Therapy planning

| Tool | Description |
| --- | --- |
| `vocametrix_generate_therapy_plan` | Generate an AI therapy plan |
| `vocametrix_get_therapy_status` | Poll therapy plan generation status |
| `vocametrix_get_therapy_result` | Fetch completed therapy plan |
| `vocametrix_approve_therapy_plan` | Approve a therapy plan |
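
The generate → poll → fetch sequence could be driven from a client roughly as follows. This is a sketch under assumptions: `call_tool` stands in for whatever MCP client function is used, and the field names `jobId`, `status`, and `audio`, as well as the status values, are hypothetical. Approval is deliberately left out of the loop so a human makes that decision.

```python
import time


def run_therapy_plan(call_tool, session_audio, poll_seconds=2.0, max_polls=30, sleep=time.sleep):
    """Generate a plan, poll until it completes, then fetch it."""
    # Argument and field names below are hypothetical placeholders.
    job = call_tool("vocametrix_generate_therapy_plan", {"audio": session_audio})
    job_id = job["jobId"]
    for _ in range(max_polls):
        status = call_tool("vocametrix_get_therapy_status", {"jobId": job_id})
        if status["status"] == "completed":
            return call_tool("vocametrix_get_therapy_result", {"jobId": job_id})
        if status["status"] == "failed":
            raise RuntimeError(f"therapy plan generation failed for {job_id}")
        sleep(poll_seconds)
    raise TimeoutError(f"plan {job_id} not ready after {max_polls} polls")


def make_fake_client():
    """In-memory stand-in for an MCP client so the sketch runs offline."""
    statuses = iter(["pending", "running", "completed"])
    calls = []

    def call_tool(name, arguments):
        calls.append(name)
        if name == "vocametrix_generate_therapy_plan":
            return {"jobId": "job-1"}
        if name == "vocametrix_get_therapy_status":
            return {"status": next(statuses)}
        if name == "vocametrix_get_therapy_result":
            return {"jobId": arguments["jobId"], "plan": "placeholder plan"}
        raise KeyError(name)

    return call_tool, calls


call_tool, calls = make_fake_client()
plan = run_therapy_plan(call_tool, "https://example.com/session.wav", sleep=lambda s: None)
print(plan["plan"], "after", calls.count("vocametrix_get_therapy_status"), "status polls")
```

Bounding the loop with `max_polls` keeps a stalled job from blocking the client indefinitely.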

Workflow tools

| Tool | Description |
| --- | --- |
| `vocametrix_full_voice_assessment` | Parallel AVQI + CPP + HNR + jitter/shimmer + spectral |
| `vocametrix_batch_pronunciation` | Assess a folder of WAV files |
| `vocametrix_full_therapy_workflow` | Generate → poll → fetch → approval flow |
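
For intuition about what "parallel" means for `vocametrix_full_voice_assessment`, the sketch below fans out the five underlying tools from the client side with `asyncio.gather`. It is not the server's implementation. `call_tool` is a hypothetical async client function and the `audio` argument name is an assumption.

```python
import asyncio

ASSESSMENT_TOOLS = [
    "vocametrix_avqi",
    "vocametrix_cpp_cpps",
    "vocametrix_hnr",
    "vocametrix_jitter_shimmer",
    "vocametrix_spectral",
]


async def full_voice_assessment(call_tool, blob_url):
    """Run every assessment tool concurrently and key the results by tool name."""
    results = await asyncio.gather(
        *(call_tool(name, {"audio": blob_url}) for name in ASSESSMENT_TOOLS),
        return_exceptions=True,  # one failing measure should not discard the others
    )
    return dict(zip(ASSESSMENT_TOOLS, results))


async def fake_call_tool(name, arguments):
    """Offline stand-in for an MCP client call."""
    await asyncio.sleep(0)
    return {"tool": name, "input": arguments["audio"]}


report = asyncio.run(full_voice_assessment(fake_call_tool, "https://example.com/blob/abc.wav"))
print(sorted(report))
```

In practice the single workflow tool is the simpler choice; the fan-out is only useful when a subset of measures is needed.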

How to pass audio to a tool

| Input | Hosted / remote server | Stdio / local server (`npx`, Claude Desktop) |
| --- | --- | --- |
| `https://...` blobUrl from `vocametrix_upload_audio` | ✅ recommended | ✅ |
| Public `https://...` URL to a WAV file | ✅ | ✅ |
| Public URL via `vocametrix_ingest_url` → returned blobUrl | ✅ recommended for URL inputs | ✅ |
| `data:audio/wav;base64,...` data URL | ✅ | ✅ |
| Raw base64 string (≥ 512 chars) | ✅ | ✅ |
| Absolute local path (`/home/...`, `C:\...`) | ❌ rejected | ⚠️ requires `VOCAMETRIX_MCP_LOCAL_FS=1` |
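
The input rules in the table can be read as a small decision procedure. The sketch below models the stdio/local column only and is a guess at the logic, not the server's code. The function name and return labels are hypothetical.

```python
import os
import re

BASE64_RE = re.compile(r"[A-Za-z0-9+/]+={0,2}")
WINDOWS_ABS_RE = re.compile(r"[A-Za-z]:\\")


def classify_audio_input(value, env=None):
    """Return the kind of audio input, or raise ValueError if it is not accepted."""
    env = os.environ if env is None else env
    if value.startswith("https://"):
        return "url"
    if value.startswith("data:audio/wav;base64,"):
        return "data-url"
    # Check base64 before paths: "/" is a valid base64 character.
    if len(value) >= 512 and BASE64_RE.fullmatch(value):
        return "base64"
    if value.startswith("/") or WINDOWS_ABS_RE.match(value):
        if env.get("VOCAMETRIX_MCP_LOCAL_FS") == "1":
            return "local-path"
        raise ValueError(
            "local paths are disabled; upload with vocametrix_upload_audio and pass the blobUrl"
        )
    raise ValueError("unrecognized audio input")


print(classify_audio_input("https://example.com/voice.wav"))
print(classify_audio_input("/home/user/voice.wav", env={"VOCAMETRIX_MCP_LOCAL_FS": "1"}))
```

On a hosted server the local-path branch would always reject, regardless of the environment variable.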

Environment variables

| Variable | Required | Description |
| --- | --- | --- |
| `VOCAMETRIX_API_KEY` | Yes | Your Vocametrix API key |
| `VOCAMETRIX_MCP_LOCAL_FS` | No | Set to `1` to allow analysis tools to read absolute local file paths (stdio/local deployments only). Default off: local paths are rejected with an actionable error so chat clients are pushed toward the `vocametrix_upload_audio` → `blobUrl` workflow. |
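
A Claude Desktop entry that sets these variables might look like the output of the sketch below. The `mcpServers` layout with `command`, `args`, and `env` is the standard Claude Desktop configuration shape. The `npx -y @vocametrix/mcp-server` invocation and the server key `vocametrix` are assumptions based on the package name, not confirmed by the source.

```python
import json


def claude_desktop_entry(api_key, allow_local_fs=False):
    """Build the mcpServers block for claude_desktop_config.json."""
    env = {"VOCAMETRIX_API_KEY": api_key}
    if allow_local_fs:
        # Opt in to reading absolute local paths (stdio/local deployments only).
        env["VOCAMETRIX_MCP_LOCAL_FS"] = "1"
    return {
        "mcpServers": {
            "vocametrix": {  # server key is an arbitrary label chosen for this example
                "command": "npx",
                "args": ["-y", "@vocametrix/mcp-server"],  # assumed invocation
                "env": env,
            }
        }
    }


config = claude_desktop_entry("YOUR_API_KEY")
print(json.dumps(config, indent=2))
```

Leaving `allow_local_fs` off matches the documented default and keeps clients on the upload → blobUrl path.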
