Fine-tunining Whisper models for shorter audio segments

#34
by Malishevsky - opened

Hi all. My project needs to recognize many short audio parts. Can I use fine to change the multilingual model for short audios like 10 seconds ? If not, can I train the model from scratch for these purposes? I would be grateful for any help and hints.

Sign up or log in to comment