Question regarding model evaluation

#1
by marcoullmann - opened

Hello Nizar,

great work. Thank you very much!

Did the model see the test data during training time? If not, can you provide details about the split ratios?

Thank you. Best,
Marco

Hello Marco,

Thank you for your feedback!

No the model was trained only on the training splits given with the databases I gathered online. Here's a summary of the split ratios.

I stay available for any other question about the model :)

Best,
Nizar

Database Total Train Test Train_Percentage Test_Percentage
ASGD_Test_Set 5750 0 5750 0% 100%
SDS-200_Corpus 138907 135271 3636 97% 3%
SPC 150702 147370 3332 98% 2%
STT4SG-350 247527 222922 24605 90% 10%
SwissDialZH1_1 30921 30921 0 100% 0%

Thank you! Hope I can use the model soon in a real world project.

You're welcome! That's cool, keep me updated :)
In any case here's what I did with the model for my master thesis:

  • Speeding up inference by translating the model with CTranslate2.
  • Added word-alignment using the model with WhisperX, from the ct2 translated model. (There are 1-2 tricks to make it run properly since v3 natively uses more log-mel channels, I can provide some code to to that if needed)

Sign up or log in to comment