Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,49 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
# distil-whisper-large-v3-tr
|
2 |
+
|
3 |
+
## Model Description
|
4 |
+
|
5 |
+
`distil-whisper-large-v3-tr` is a distilled version of the Whisper model, fine-tuned for Turkish language tasks. This model has been trained and evaluated using a comprehensive dataset to achieve high accuracy in Turkish speech recognition.
|
6 |
+
|
7 |
+
## Training and Evaluation Metrics
|
8 |
+
|
9 |
+
The model was trained and evaluated using the `wandb` tool, with the following results:
|
10 |
+
|
11 |
+
### Evaluation Metrics
|
12 |
+
|
13 |
+
- **Cross-Entropy Loss (eval/ce_loss):** 0.53218
|
14 |
+
- **Epoch (eval/epoch):** 28
|
15 |
+
- **KL Loss (eval/kl_loss):** 0.34883
|
16 |
+
- **Total Loss (eval/loss):** 0.77457
|
17 |
+
- **Evaluation Time (eval/time):** 397.1784 seconds
|
18 |
+
- **Word Error Rate (eval/wer):** 14.43288%
|
19 |
+
- **Orthographic Word Error Rate (eval/wer_ortho):** 21.55298%
|
20 |
+
|
21 |
+
### Training Metrics
|
22 |
+
|
23 |
+
- **Cross-Entropy Loss (train/ce_loss):** 0.04695
|
24 |
+
- **Epoch (train/epoch):** 28
|
25 |
+
- **KL Loss (train/kl_loss):** 0.24143
|
26 |
+
- **Learning Rate (train/learning_rate):** 0.0001
|
27 |
+
- **Total Loss (train/loss):** 0.27899
|
28 |
+
- **Training Time (train/time):** 12426.92106 seconds
|
29 |
+
|
30 |
+
## Run History
|
31 |
+
|
32 |
+
### Overall Metrics
|
33 |
+
|
34 |
+
- **Real-Time Factor (all/rtf):** 392.23396
|
35 |
+
- **Word Error Rate (all/wer):** 14.33829
|
36 |
+
|
37 |
+
### Common Voice 17.0 Turkish Pseudo-Labelled Dataset
|
38 |
+
|
39 |
+
- **Real-Time Factor (common_voice_17_0_tr_pseudo_labelled/test/rtf):** 392.23396
|
40 |
+
- **Word Error Rate (common_voice_17_0_tr_pseudo_labelled/test/wer):** 14.33829
|
41 |
+
|
42 |
+
## Author
|
43 |
+
|
44 |
+
**Sercan Çepni**
|
45 |
+
Email: [email protected]
|
46 |
+
|
47 |
+
---
|
48 |
+
|
49 |
+
For any questions or further information, please feel free to contact the author.
|