speech_to_text | Convert speech to text with a given model and save the output text file to a given directory.
Directory is optional, if not provided, the output file will be saved to $HOME/Desktop. ⚠️ COST WARNING: This tool makes an API call to Whissle which may incur costs. Only use when explicitly requested by the user.
Args:
audio_file_path (str): Path to the audio file to transcribe
model_name (str, optional): The name of the ASR model to use. Defaults to "en-NER"
timestamps (bool, optional): Whether to include word timestamps
boosted_lm_words (List[str], optional): Words to boost in recognition
boosted_lm_score (int, optional): Score for boosted words (0-100)
output_directory (str, optional): Directory where files should be saved.
Defaults to $HOME/Desktop if not provided.
Returns:
TextContent with the transcription and path to the output file. |
diarize_speech | Convert speech to text with speaker diarization and save the output text file to a given directory.
Directory is optional, if not provided, the output file will be saved to $HOME/Desktop. ⚠️ COST WARNING: This tool makes an API call to Whissle which may incur costs. Only use when explicitly requested by the user.
Args:
audio_file_path (str): Path to the audio file to transcribe
model_name (str, optional): The name of the ASR model to use. Defaults to "en-NER"
max_speakers (int, optional): Maximum number of speakers to identify
boosted_lm_words (List[str], optional): Words to boost in recognition
boosted_lm_score (int, optional): Score for boosted words (0-100)
output_directory (str, optional): Directory where files should be saved.
Defaults to $HOME/Desktop if not provided.
Returns:
TextContent with the diarized transcription and path to the output file. |
translate_text | Translate text from one language to another. ⚠️ COST WARNING: This tool makes an API call to Whissle which may incur costs. Only use when explicitly requested by the user.
Args:
text (str): The text to translate
source_language (str): Source language code (e.g., "en" for English)
target_language (str): Target language code (e.g., "es" for Spanish)
Returns:
TextContent with the translated text. |
summarize_text | Summarize text using an LLM model. ⚠️ COST WARNING: This tool makes an API call to Whissle which may incur costs. Only use when explicitly requested by the user.
Args:
content (str): The text to summarize
model_name (str, optional): The LLM model to use. Defaults to "openai"
instruction (str, optional): Specific instructions for summarization
Returns:
TextContent with the summary. |
list_asr_models | List all available ASR models and their capabilities. |