End of training
- README.md +8 -2
- generation_config.json +1 -0
README.md
CHANGED
@@ -43,10 +43,16 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
--
-- training_steps: 4000
+- training_steps: 10
 - mixed_precision_training: Native AMP
 
+### Training results
+
+| Training Loss | Epoch | Step | Validation Loss | Wer |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|
+| No log | 10.0 | 10 | 1.1995 | 68.6047 |
+
+
 ### Framework versions
 
 - Transformers 4.40.0.dev0
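The `Wer` column added above is a word error rate: word-level edit distance between the reference transcript and the model output, divided by the number of reference words. The value 68.6047 looks like a percentage rather than a fraction, but the diff does not show the evaluation code, so that scaling is an assumption. A minimal pure-Python sketch of the metric follows; the function name `word_error_rate` is illustrative and is not taken from the training script.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return the word error rate as a percentage (assumed scaling)."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution plus one deletion against four reference words.
    print(word_error_rate("a b c d", "a x c"))
```

Because insertions count as errors, this metric can exceed 100 when the hypothesis is much longer than the reference.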
generation_config.json
CHANGED
@@ -160,6 +160,7 @@
     "<|yo|>": 50325,
     "<|zh|>": 50260
   },
+  "language": "hi",
   "max_initial_timestamp_index": 50,
   "max_length": 448,
   "no_timestamps_token_id": 50363,
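The added `"language": "hi"` entry sits next to a table of language tokens such as `"<|zh|>"`. One plausible reading is that the language code is wrapped as `<|code|>` and looked up in that table to get the token that conditions decoding. The sketch below illustrates that lookup on the excerpt only. The key name `lang_to_id` is an assumption (the diff does not show the name of the enclosing map), the helper names are hypothetical, and the real library logic may differ.

```python
import json

# Only the entries visible in the diff are reproduced here;
# "lang_to_id" is an assumed name for the enclosing map.
EXCERPT = json.loads("""
{
  "lang_to_id": {"<|yo|>": 50325, "<|zh|>": 50260},
  "language": "hi",
  "max_initial_timestamp_index": 50,
  "max_length": 448,
  "no_timestamps_token_id": 50363
}
""")


def language_token(config: dict) -> str:
    """Wrap the configured language code in the <|..|> token syntax."""
    return f"<|{config['language']}|>"


def language_token_id(config: dict, table_key: str = "lang_to_id"):
    """Look up the language token id, or None if the table lacks it."""
    return config.get(table_key, {}).get(language_token(config))


if __name__ == "__main__":
    print(language_token(EXCERPT))
    # The "hi" entry is outside the excerpt shown in the diff, so the
    # lookup on the excerpt alone returns None.
    print(language_token_id(EXCERPT))
```

With the full file loaded instead of the excerpt, the same lookup would return whatever id the complete table assigns to the configured language.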