diarize_speech
Transcribe audio files into text while identifying different speakers, saving the output with speaker labels for clear conversation analysis.
Instructions
Convert speech to text with speaker diarization and save the output text file to a given directory. Directory is optional, if not provided, the output file will be saved to $HOME/Desktop.
⚠️ COST WARNING: This tool makes an API call to Whissle which may incur costs. Only use when explicitly requested by the user.
Args:
audio_file_path (str): Path to the audio file to transcribe
model_name (str, optional): The name of the ASR model to use. Defaults to "en-NER"
max_speakers (int, optional): Maximum number of speakers to identify
boosted_lm_words (List[str], optional): Words to boost in recognition
boosted_lm_score (int, optional): Score for boosted words (0-100)
output_directory (str, optional): Directory where files should be saved.
Defaults to $HOME/Desktop if not provided.
Returns:
TextContent with the diarized transcription and path to the output file.
Input Schema
TableJSON Schema
| Name | Required | Description | Default |
|---|---|---|---|
| audio_file_path | Yes | ||
| model_name | No | en-NER | |
| max_speakers | No | ||
| boosted_lm_words | No | ||
| boosted_lm_score | No |