Salama1429/KalemaTech-Arabic-STT-ASR-based-on-Whisper-Small

Automatic Speech Recognition

hf-asr-leaderboard

Generated from Trainer


Salama1429 committed on Dec 28, 2022

Commit 399c972 · 1 Parent(s): 29a8690

Update README.md

Files changed (1)

README.md +6 -2


@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 # KalemaTech-Arabic-STT-ASR-based-on-Whisper-Small
-This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on an unknown dataset.
+This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on Common_Voice_Arabic_12.0_Augmented.
 It achieves the following results on the evaluation set:
 - Loss: 0.5362
 - Wer: 58.5848
@@ -29,7 +29,11 @@ More information needed
 ## Training and evaluation data
-More information needed
+Common_Voice_Arabic_12.0 and I made some augmentations to it as follows:
+- 25% of the data TimeMasking
+- 25% of the data SpecAugmentation
+- 25% of the data WavAugmentation (AddGaussianNoise)
+- The final dataset is the original common voice plus the augmented files
 ## Training procedure
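
Reviewer note: the augmentation mix described in the added README lines (a quarter of the clips per technique, with the augmented copies appended to the original set) could be sketched roughly as below. This is only an illustration under stated assumptions, not the author's actual pipeline: the function names are hypothetical, plain Python lists stand in for waveforms, the noise amplitude and mask width are made-up values, and the per-technique subsets are sampled independently because the README does not say whether they are disjoint. SpecAugmentation is left out because it operates on spectrograms rather than raw samples.

```python
import random


def add_gaussian_noise(samples, amplitude=0.01, rng=random):
    """Return a copy of the waveform with zero-mean Gaussian noise added."""
    return [s + rng.gauss(0.0, amplitude) for s in samples]


def time_mask(samples, max_fraction=0.1, rng=random):
    """Return a copy with one random contiguous span replaced by silence."""
    n = len(samples)
    width = rng.randint(0, int(n * max_fraction))  # mask length in samples
    start = rng.randint(0, n - width)              # mask start index
    return samples[:start] + [0.0] * width + samples[start + width:]


def build_augmented_dataset(clips, fraction=0.25, seed=0):
    """Return the original clips plus augmented copies of random subsets.

    Assumption: each technique draws its own random `fraction` of the
    clips, so the subsets may overlap.
    """
    rng = random.Random(seed)
    per_technique = int(len(clips) * fraction)
    augmented = []
    for augment in (time_mask, add_gaussian_noise):
        for clip in rng.sample(clips, per_technique):
            augmented.append(augment(clip, rng=rng))
    # The originals are kept untouched; augmented copies are appended.
    return clips + augmented
```

With eight input clips and `fraction=0.25`, each technique contributes two augmented copies, so the returned list holds twelve clips with the originals first.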