Commit 3f171d3 ("Add metrics") committed by waveletdeboshir. 1 parent: dd57f71.
README.md
CHANGED
@@ -25,7 +25,7 @@ model-index:
     metrics:
     - name: WER
       type: wer
-      value:
+      value: 26.52
   - task:
       name: Speech Recognition
       type: automatic-speech-recognition
@@ -36,7 +36,7 @@ model-index:
     metrics:
     - name: WER (without punctuation)
       type: wer
-      value:
+      value: 21.35
 datasets:
 - mozilla-foundation/common_voice_15_0
 ---
@@ -50,11 +50,13 @@ Model was finetuned on russian part of [mozilla-foundation/common_voice_15_0](ht
 
 ## Metrics
 
-| metric | dataset | waveletdeboshir/whisper-base-ru-pruned | waveletdeboshir/whisper-
+| metric | dataset | waveletdeboshir/whisper-base-ru-pruned | waveletdeboshir/whisper-base-ru-pruned-ft |
 | :------ | :------ | :------ | :------ |
-| WER (without punctuation) | common_voice_15_0_test |
-| WER | common_voice_15_0_test |
+| WER (without punctuation) | common_voice_15_0_test | 0.3352 | **0.2135** |
+| WER | common_voice_15_0_test | 0.4050 | **0.2652** |
 
+## Limitations
+Because texts in Common Voice don't contain digits and other characters except letters and punctuation signs, model lost an ability to predict numbers and special characters.
 
 ## Size
 Only 10% tokens was left including special whisper tokens (no language tokens except \<|ru|\> and \<|en|\>, no timestamp tokens), 200 most popular tokens from tokenizer and 4000 most popular Russian tokens computed by tokenization of russian text corpus.
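The WER values added in this commit are word-level edit distance divided by the number of reference words. A minimal pure-Python sketch of the metric (not the exact evaluation script used for this card; the "without punctuation" variant additionally strips punctuation and lowercases both strings before scoring):

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance / number of reference words.

    Assumes a non-empty reference. Whitespace tokenization only; real
    evaluation pipelines usually apply text normalization first.
    """
    ref, hyp = reference.split(), hypothesis.split()
    # One-row dynamic-programming edit distance over words.
    d = list(range(len(hyp) + 1))
    for i in range(1, len(ref) + 1):
        prev, d[0] = d[0], i
        for j in range(1, len(hyp) + 1):
            cur = d[j]
            d[j] = min(
                d[j] + 1,                            # deletion
                d[j - 1] + 1,                        # insertion
                prev + (ref[i - 1] != hyp[j - 1]),   # substitution (or match)
            )
            prev = cur
    return d[-1] / len(ref)
```

For the "WER (without punctuation)" row, both strings would be passed through something like `s.lower().translate(str.maketrans("", "", string.punctuation))` first (ASCII punctuation only; Russian quote marks like «» need extra handling).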
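The vocabulary pruning described in the Size section (keep the special Whisper tokens plus the most frequent tokens observed when tokenizing a Russian corpus) can be sketched as below. `prune_vocab` is a hypothetical helper, and the real procedure also rebuilds the tokenizer and slices the embedding and output-projection matrices down to the kept rows, which is omitted here:

```python
from collections import Counter

def prune_vocab(corpus_ids, special_ids, top_k=4000):
    """Return the sorted set of token ids to keep: all special tokens plus
    the top_k most frequent non-special ids seen in the tokenized corpus.

    corpus_ids: iterable of per-sentence lists of token ids (already
    tokenized). Hypothetical sketch of the selection step only; remapping
    old ids to new ids and slicing model weights is not shown.
    """
    counts = Counter(tid for sent in corpus_ids for tid in sent)
    specials = set(special_ids)
    frequent = [tid for tid, _ in counts.most_common() if tid not in specials]
    return sorted(specials | set(frequent[:top_k]))
```

In the card's setup the kept set would combine the special tokens, the 200 most popular tokenizer tokens, and the 4000 most popular tokens from the Russian corpus; a kept-id list like this also defines the old-id to new-id mapping used to reindex the embedding rows.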