cahya/whisper-small-audio-caption-v1.0

Whisper small audio captioning

This model is a whisper-small model fine-tuned on 500k audio samples from the mitermix/audiosnippets dataset.
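The card does not say how to run the model, so the following is only a minimal sketch. It assumes the checkpoint loads with the standard Whisper classes from the transformers library and that the repository ships processor and tokenizer files alongside the weights; neither is confirmed by the card. The helper name `caption_audio`, the `max_new_tokens` default, and the use of librosa for decoding are illustrative choices, not part of the original.

```python
MODEL_ID = "cahya/whisper-small-audio-caption-v1.0"
SAMPLE_RATE = 16_000  # Whisper feature extractors expect 16 kHz mono audio


def caption_audio(path: str, max_new_tokens: int = 64) -> str:
    """Return a text caption for the audio file at `path` (hypothetical helper)."""
    # Imported lazily so this module can be imported without the heavy dependencies.
    import librosa
    import torch
    from transformers import WhisperForConditionalGeneration, WhisperProcessor

    # Assumption: the repository contains processor files next to the weights.
    processor = WhisperProcessor.from_pretrained(MODEL_ID)
    model = WhisperForConditionalGeneration.from_pretrained(MODEL_ID)
    model.eval()

    # Decode the file as mono and resample it to 16 kHz.
    waveform, _ = librosa.load(path, sr=SAMPLE_RATE, mono=True)
    inputs = processor(waveform, sampling_rate=SAMPLE_RATE, return_tensors="pt")

    # Generate caption token ids without tracking gradients.
    with torch.no_grad():
        generated_ids = model.generate(
            inputs.input_features, max_new_tokens=max_new_tokens
        )

    # Convert token ids back to text, dropping special tokens.
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
```

A call such as `caption_audio("clip.wav")` (with `clip.wav` standing in for any local audio file) would download the weights on first use and return a single caption string, provided the assumptions above hold.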

Safetensors

Model size: 242M params

Tensor type: F32
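As a rough guide to what these figures imply, the size of the stored weights can be estimated from the parameter count and the tensor type. The sketch below assumes 4 bytes per F32 parameter and uses the rounded count of 242M from the card, so the result is approximate; the function name `weights_size_bytes` is hypothetical.

```python
def weights_size_bytes(num_params: int, bytes_per_param: int = 4) -> int:
    """Estimate raw weight storage; F32 uses 4 bytes per parameter."""
    return num_params * bytes_per_param


# 242M is the rounded parameter count reported on the card.
size = weights_size_bytes(242_000_000)
print(f"{size / 1e6:.0f} MB")  # prints "968 MB"
```

Loading the weights in a 2-byte type such as float16 would roughly halve this figure, which can be estimated by passing `bytes_per_param=2`.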
