A smolagents Tool that converts text to audio data.
Reads an audio file and returns its transcript.
Generates speech from text using Hugging Face models.
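
A minimal sketch of how a text-to-audio tool of this kind might be laid out. It deliberately does not import smolagents or any Hugging Face model: `TextToAudioSketch` is a hypothetical stand-in class that only mirrors the attribute layout a smolagents Tool is expected to expose (`name`, `description`, `inputs`, `output_type`, `forward`), and the "synthesis" step is a placeholder that writes silence with a duration proportional to the text length. The sample rate and `SAMPLES_PER_CHAR` are arbitrary assumptions for illustration.

```python
import io
import wave


class TextToAudioSketch:
    """Hypothetical stand-in mirroring a smolagents Tool's attribute layout."""

    name = "text_to_audio_sketch"  # hypothetical tool name
    description = "Returns audio data (WAV bytes) for the given text."
    inputs = {"text": {"type": "string", "description": "Text to convert to audio."}}
    output_type = "audio"

    SAMPLE_RATE = 16_000     # assumed sample rate in Hz
    SAMPLES_PER_CHAR = 800   # assumed placeholder: 0.05 s of audio per character

    def forward(self, text: str) -> bytes:
        # Placeholder synthesis: silent 16-bit mono frames instead of real speech.
        # A real tool would call a text-to-speech model at this point.
        n_frames = len(text) * self.SAMPLES_PER_CHAR
        buffer = io.BytesIO()
        with wave.open(buffer, "wb") as wav:
            wav.setnchannels(1)
            wav.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
            wav.setframerate(self.SAMPLE_RATE)
            wav.writeframes(b"\x00\x00" * n_frames)
        return buffer.getvalue()


def wav_duration_seconds(data: bytes) -> float:
    """Read WAV bytes back and report their duration in seconds."""
    with wave.open(io.BytesIO(data), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


if __name__ == "__main__":
    audio = TextToAudioSketch().forward("hello")
    print(len(audio), wav_duration_seconds(audio))
```

Returning the audio as in-memory WAV bytes keeps the sketch free of file-system side effects; a transcription tool, as described above, would run in the opposite direction and take an audio file as its input.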