MU-NLPC/whisper-large-v2-audio-captioning
Updated
•
250
•
8
Whisper models finetuned on audio captioning instead of speech recognition. These model aim to briefly describe what happens in the audio scene.