analyze_speaker_stress
Analyze vocal stress in live audio streams to detect nervousness, evasion, and uncertainty, flagging potential market-moving signals such as guidance revisions.
Instructions
Analyse the speaker's vocal stress over a recent time window.
Extracts F0 pitch contour, measures pitch jitter (vocal tremor), and detects hesitation patterns (pauses > 400ms). Returns a composite stress score from 0–100 where:
0–20 = confident, fluent delivery 20–45 = normal variation 45–75 = elevated stress — worth monitoring 75–100 = high stress — potential market-moving signal
Higher scores correlate with executive nervousness, evasion, and uncertainty — the kind of prosodic signals that precede guidance revisions and earnings misses.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| stream_id | Yes | ||
| time_window_seconds | No | Analysis window in seconds |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| stream_id | Yes | ||
| time_window_seconds | Yes | ||
| stress_score | Yes | Composite vocal stress score 0-100 | |
| pitch_mean_hz | Yes | Mean fundamental frequency in Hz | |
| pitch_std_hz | Yes | Standard deviation of F0 | |
| pitch_jitter | Yes | Normalised pitch jitter (coefficient of variation) | |
| hesitation_ratio | Yes | Fraction of audio that is silence >400ms | |
| voiced_fraction | Yes | Fraction of voiced frames | |
| pause_count | Yes | Number of significant pauses detected | |
| longest_pause_ms | Yes | Duration of longest pause in ms | |
| analysis | Yes | Human-readable stress interpretation | |
| chunks_analyzed | Yes | Number of audio chunks processed |