gweltou
/

wav2vec2-large-xlsr-53-br

@@ -1,25 +1,40 @@
 ---
 license: apache-2.0
 tags:
 - generated_from_trainer
-base_model: facebook/wav2vec2-large-xlsr-53
 metrics:
 - wer
 model-index:
-- name: wav2vec2-large-xlsr-53-breton
-  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# wav2vec2-large-xlsr-53-breton
-This model is a fine-tuned version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.9840
-- Wer: 0.5852
-- Cer: 0.2130
 ## Model description
@@ -39,45 +54,39 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 6e-05
-- train_batch_size: 16
 - eval_batch_size: 8
 - seed: 42
-- gradient_accumulation_steps: 2
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_ratio: 0.08
-- num_epochs: 50
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Wer    | Cer    |
-|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|
-| 11.8947       | 2.56  | 250  | 3.4769          | 1.0    | 0.9862 |
-| 3.1668        | 5.13  | 500  | 3.0459          | 1.0    | 0.9862 |
-| 2.6491        | 7.69  | 750  | 1.6416          | 0.9319 | 0.4441 |
-| 1.4107        | 10.26 | 1000 | 1.1000          | 0.7751 | 0.2852 |
-| 0.9989        | 12.82 | 1250 | 0.9827          | 0.7092 | 0.2578 |
-| 0.8238        | 15.38 | 1500 | 0.9543          | 0.6864 | 0.2476 |
-| 0.7193        | 17.95 | 1750 | 0.9241          | 0.6547 | 0.2371 |
-| 0.6377        | 20.51 | 2000 | 0.9296          | 0.6452 | 0.2352 |
-| 0.5865        | 23.08 | 2250 | 0.9287          | 0.6320 | 0.2301 |
-| 0.541         | 25.64 | 2500 | 0.9359          | 0.6205 | 0.2231 |
-| 0.4988        | 28.21 | 2750 | 0.9850          | 0.6149 | 0.2244 |
-| 0.4691        | 30.77 | 3000 | 0.9566          | 0.6065 | 0.2192 |
-| 0.4568        | 33.33 | 3250 | 0.9653          | 0.6019 | 0.2175 |
-| 0.4485        | 35.9  | 3500 | 0.9760          | 0.5949 | 0.2175 |
-| 0.4219        | 38.46 | 3750 | 0.9824          | 0.5926 | 0.2177 |
-| 0.397         | 41.03 | 4000 | 0.9669          | 0.5885 | 0.2138 |
-| 0.3912        | 43.59 | 4250 | 0.9857          | 0.5908 | 0.2145 |
-| 0.3764        | 46.15 | 4500 | 0.9937          | 0.5886 | 0.2145 |
-| 0.3742        | 48.72 | 4750 | 0.9840          | 0.5852 | 0.2130 |
 ### Framework versions
-- Transformers 4.35.2
-- Pytorch 2.1.0+cu121
-- Datasets 2.16.1
-- Tokenizers 0.15.0

 ---
 license: apache-2.0
+base_model: facebook/wav2vec2-large-xlsr-53
 tags:
 - generated_from_trainer
+datasets:
+- common_voice_15_0
 metrics:
 - wer
 model-index:
+- name: wav2vec2-large-xlsr-53-br
+  results:
+  - task:
+      name: Automatic Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: common_voice_15_0
+      type: common_voice_15_0
+      config: br
+      split: None
+      args: br
+    metrics:
+    - name: Wer
+      type: wer
+      value: 54.71511888739345
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# wav2vec2-large-xlsr-53-br
+This model is a fine-tuned version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on the common_voice_15_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.7879
+- Wer: 54.7151
+- Cer: 19.2493
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 6e-05
+- train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
+- gradient_accumulation_steps: 4
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 300
+- num_epochs: 30
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer     | Cer     |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
+| 6.3257        | 2.18  | 500  | 3.0700          | 100.0   | 99.0871 |
+| 2.2071        | 4.36  | 1000 | 1.1541          | 80.0449 | 29.4230 |
+| 1.0019        | 6.54  | 1500 | 0.8986          | 69.2059 | 24.3938 |
+| 0.7796        | 8.71  | 2000 | 0.8015          | 63.3737 | 22.1296 |
+| 0.6677        | 10.89 | 2500 | 0.8014          | 61.4984 | 21.4568 |
+| 0.5937        | 13.07 | 3000 | 0.7623          | 58.9323 | 20.4929 |
+| 0.5454        | 15.25 | 3500 | 0.7975          | 57.8466 | 20.2585 |
+| 0.5075        | 17.43 | 4000 | 0.7831          | 56.7250 | 19.7879 |
+| 0.4837        | 19.61 | 4500 | 0.7902          | 55.9623 | 19.5101 |
+| 0.4529        | 21.79 | 5000 | 0.7851          | 54.9753 | 19.0924 |
+| 0.4381        | 23.97 | 5500 | 0.7865          | 55.1727 | 19.3211 |
+| 0.4208        | 26.14 | 6000 | 0.8168          | 55.1817 | 19.3967 |
+| 0.4197        | 28.32 | 6500 | 0.7879          | 54.7151 | 19.2493 |
 ### Framework versions
+- Transformers 4.39.1
+- Pytorch 2.0.1+cu117
+- Datasets 2.18.0
+- Tokenizers 0.15.2

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:062d783998ef2b200d2c7094dddda6a478e6944534b38e2b56da97f0efc6c58f
 size 1262012432

 version https://git-lfs.github.com/spec/v1
+oid sha256:1c604794129ec1687d5ee04edcae82b445f864b30c3be119899c787a85c59eb4
 size 1262012432

runs/Jun03_08-47-04_gweltaz-NUC10i7FNK/events.out.tfevents.1717397345.gweltaz-NUC10i7FNK.2937.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:833f0ee3b70e655516dfbfe85ecf2e7520a579d3120753d0e7a9cf469dbdaa99
-size 13814

 version https://git-lfs.github.com/spec/v1
+oid sha256:19812879513e1a20a3853cea6e6fd5e63ad5f4132b9ec49c5c0c553a77391644
+size 14168