Improve suggestion

by LukeJacob2023 - opened Aug 26

LukeJacob2023

Aug 26

•

Hello, antony66, thanks for your contribute of this model. I am trying to improve russian whisper too. In my option, finetune on common voice will not improve much, because I think the original whisper have collect the dataset and train. I think we need some accurate dataset. Generally after finetune, the wer can reduce 50%.

LukeJacob2023

Aug 26

This model improve much in it's dataset. I think original whisper may has not train it.
https://github.com/sovse/base_rus_whisper_stt

And original whisper may not use dataset:
https://github.com/snakers4/open_stt

antony66

Owner Aug 26

Hey! Yes this is exactly my thought as well. So I had been working on a custom dataset until I had to switch to another task temporarily. Looking forward to returning to this task soon

LukeJacob2023

Aug 26

This comment has been hidden

LukeJacob2023 changed discussion status to closed Aug 26

LukeJacob2023 changed discussion status to open Aug 26

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment