korbih commited on
Commit
9b0a79d
1 Parent(s): 4390c49

End of training

Browse files
README.md CHANGED
@@ -8,9 +8,24 @@ tags:
8
  - generated_from_trainer
9
  datasets:
10
  - mozilla-foundation/common_voice_11_0
 
 
11
  model-index:
12
  - name: Whisper Small Hi - Sanchit Gandhi
13
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
14
  ---
15
 
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -19,6 +34,9 @@ should probably proofread and complete it, then remove this comment. -->
19
  # Whisper Small Hi - Sanchit Gandhi
20
 
21
  This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
 
 
 
22
 
23
  ## Model description
24
 
@@ -39,7 +57,7 @@ More information needed
39
  The following hyperparameters were used during training:
40
  - learning_rate: 1e-05
41
  - train_batch_size: 16
42
- - eval_batch_size: 8
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
@@ -47,6 +65,21 @@ The following hyperparameters were used during training:
47
  - training_steps: 4000
48
  - mixed_precision_training: Native AMP
49
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
50
  ### Framework versions
51
 
52
  - Transformers 4.40.0.dev0
 
8
  - generated_from_trainer
9
  datasets:
10
  - mozilla-foundation/common_voice_11_0
11
+ metrics:
12
+ - wer
13
  model-index:
14
  - name: Whisper Small Hi - Sanchit Gandhi
15
+ results:
16
+ - task:
17
+ name: Automatic Speech Recognition
18
+ type: automatic-speech-recognition
19
+ dataset:
20
+ name: Common Voice 11.0
21
+ type: mozilla-foundation/common_voice_11_0
22
+ config: hi
23
+ split: None
24
+ args: 'config: hi, split: test'
25
+ metrics:
26
+ - name: Wer
27
+ type: wer
28
+ value: 32.984847202234825
29
  ---
30
 
31
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
34
  # Whisper Small Hi - Sanchit Gandhi
35
 
36
  This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
37
+ It achieves the following results on the evaluation set:
38
+ - Loss: 0.4377
39
+ - Wer: 32.9848
40
 
41
  ## Model description
42
 
 
57
  The following hyperparameters were used during training:
58
  - learning_rate: 1e-05
59
  - train_batch_size: 16
60
+ - eval_batch_size: 32
61
  - seed: 42
62
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
63
  - lr_scheduler_type: linear
 
65
  - training_steps: 4000
66
  - mixed_precision_training: Native AMP
67
 
68
+ ### Training results
69
+
70
+ | Training Loss | Epoch | Step | Validation Loss | Wer |
71
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|
72
+ | No log | 0.0 | 1 | 2.2652 | 86.7857 |
73
+ | 0.1858 | 1.22 | 500 | 0.3301 | 39.7317 |
74
+ | 0.0881 | 2.44 | 1000 | 0.2966 | 34.9065 |
75
+ | 0.0457 | 3.67 | 1500 | 0.3160 | 33.8695 |
76
+ | 0.0195 | 4.89 | 2000 | 0.3571 | 33.9287 |
77
+ | 0.0047 | 6.11 | 2500 | 0.3913 | 33.4843 |
78
+ | 0.0014 | 7.33 | 3000 | 0.4186 | 32.9637 |
79
+ | 0.0005 | 8.56 | 3500 | 0.4286 | 33.0737 |
80
+ | 0.0005 | 9.78 | 4000 | 0.4377 | 32.9848 |
81
+
82
+
83
  ### Framework versions
84
 
85
  - Transformers 4.40.0.dev0
generation_config.json CHANGED
@@ -160,6 +160,7 @@
160
  "<|yo|>": 50325,
161
  "<|zh|>": 50260
162
  },
 
163
  "max_initial_timestamp_index": 50,
164
  "max_length": 448,
165
  "no_timestamps_token_id": 50363,
 
160
  "<|yo|>": 50325,
161
  "<|zh|>": 50260
162
  },
163
+ "language": "hi",
164
  "max_initial_timestamp_index": 50,
165
  "max_length": 448,
166
  "no_timestamps_token_id": 50363,
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0d4ca1263512e2daffab031eefa5b4df4b6ba57c9e6faceca76182e58055e9e1
3
  size 966995080
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cbc45000948638da4096e1e4853ddb446272e4c2d573ff3a46c7041bbfd46e14
3
  size 966995080
runs/Mar20_19-13-54_2b76929eb25a/events.out.tfevents.1710962048.2b76929eb25a.12451.1 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fe71920ff383a2bcb26f3d0e90503435a110e6e6214a4d12e3d37f3e886a6792
3
- size 42031
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0bf47c22dac6a94b906585398f08760c26a6daa3576e52bf593555a7c670d571
3
+ size 42385