fsicoli committed
Commit ad631fd
1 parent: 8c4e3c4

Model save

README.md CHANGED
@@ -1,79 +1,74 @@
- ---
- library_name: transformers
- license: apache-2.0
- base_model: openai/whisper-large-v3
- tags:
- - generated_from_trainer
- datasets:
- - fsicoli/cv19-fleurs
- metrics:
- - wer
- model-index:
- - name: whisper-large-v3-pt-cv19-fleurs
-   results:
-   - task:
-       name: Automatic Speech Recognition
-       type: automatic-speech-recognition
-     dataset:
-       name: fsicoli/cv19-fleurs default
-       type: fsicoli/cv19-fleurs
-       args: default
-     metrics:
-     - name: Wer
-       type: wer
-       value: 0.07299865617200998
- ---
-
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
-
- # whisper-large-v3-pt-cv19-fleurs
-
- This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the fsicoli/cv19-fleurs default dataset in Portuguese.
- It achieves the following results on the evaluation set:
- - Loss: 0.2108
- - Wer: 0.0729
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
-
- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 6.25e-06
- - train_batch_size: 8
- - eval_batch_size: 8
- - seed: 42
- - gradient_accumulation_steps: 2
- - total_train_batch_size: 16
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- - lr_scheduler_type: linear
- - lr_scheduler_warmup_steps: 10000
- - training_steps: 50000
- - mixed_precision_training: Native AMP
-
- ### Training results
-
- | Training Loss | Epoch | Step | Validation Loss | Wer |
- |:-------------:|:-------:|:-----:|:---------------:|:------:|
- | 0.0339 | 2.2883 | 5000 | 0.1096 | 0.0729 |
-
-
-
- ### Framework versions
-
- - Transformers 4.45.0.dev0
- - Pytorch 2.4.1
- - Datasets 2.21.0
- - Tokenizers 0.19.1
+ ---
+ library_name: transformers
+ license: apache-2.0
+ base_model: openai/whisper-large-v3
+ tags:
+ - generated_from_trainer
+ metrics:
+ - wer
+ model-index:
+ - name: whisper-large-v3-pt-cv19-fleurs
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # whisper-large-v3-pt-cv19-fleurs
+
+ This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.1848
+ - Wer: 0.0756
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 6.25e-06
+ - train_batch_size: 8
+ - eval_batch_size: 8
+ - seed: 42
+ - gradient_accumulation_steps: 2
+ - total_train_batch_size: 16
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - lr_scheduler_warmup_steps: 10000
+ - training_steps: 50000
+ - mixed_precision_training: Native AMP
+
+ ### Training results
+
+ | Training Loss | Epoch   | Step  | Validation Loss | Wer    |
+ |:-------------:|:-------:|:-----:|:---------------:|:------:|
+ | 0.0559        | 2.2883  | 5000  | 0.1096          | 0.0730 |
+ | 0.0581        | 4.5767  | 10000 | 0.1326          | 0.0829 |
+ | 0.0225        | 6.8650  | 15000 | 0.1570          | 0.0849 |
+ | 0.0088        | 9.1533  | 20000 | 0.1704          | 0.0840 |
+ | 0.0065        | 11.4416 | 25000 | 0.1823          | 0.0849 |
+ | 0.0060        | 13.7300 | 30000 | 0.1808          | 0.0809 |
+ | 0.0055        | 16.0183 | 35000 | 0.1811          | 0.0790 |
+ | 0.0031        | 18.3066 | 40000 | 0.1907          | 0.0784 |
+ | 0.0011        | 20.5950 | 45000 | 0.1852          | 0.0771 |
+ | 0.0003        | 22.8833 | 50000 | 0.1848          | 0.0756 |
+
+
+ ### Framework versions
+
+ - Transformers 4.45.0.dev0
+ - Pytorch 2.4.1
+ - Datasets 2.21.0
+ - Tokenizers 0.19.1
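
For context on using the checkpoint this commit saves: a minimal inference sketch with the `transformers` ASR pipeline. The Hub id `fsicoli/whisper-large-v3-pt-cv19-fleurs` and the audio path are assumptions for illustration, not taken from the commit itself.

```python
# Minimal sketch: transcribe Portuguese audio with the fine-tuned checkpoint.
from transformers import pipeline

asr = pipeline(
    "automatic-speech-recognition",
    model="fsicoli/whisper-large-v3-pt-cv19-fleurs",  # assumed Hub id for this repo
)

# chunk_length_s lets the pipeline handle audio longer than Whisper's 30 s window;
# "sample.wav" is a placeholder path.
result = asr(
    "sample.wav",
    chunk_length_s=30,
    generate_kwargs={"language": "portuguese", "task": "transcribe"},
)
print(result["text"])
```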
 
 
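The card's headline metric, WER, is (substitutions + deletions + insertions) divided by the number of reference words. A sketch of scoring transcripts the same way with the `evaluate` library; the example strings are invented:

```python
# WER = (S + D + I) / N, computed over whitespace-split words by `evaluate`.
import evaluate

wer_metric = evaluate.load("wer")

# Invented example pair: one substitution against a five-word reference.
predictions = ["o gato sentou no tapete"]
references = ["o gato deitou no tapete"]

print(wer_metric.compute(predictions=predictions, references=references))  # 0.2
```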
 
 
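The hyperparameter list in the card maps directly onto `transformers` training arguments; a sketch of that mapping, with only the output directory invented. The effective train batch size of 16 comes from 8 per device × 2 accumulation steps.

```python
# Sketch: the card's hyperparameters expressed as Seq2SeqTrainingArguments.
# Adam betas (0.9, 0.999) and epsilon 1e-08 are the library defaults.
from transformers import Seq2SeqTrainingArguments

args = Seq2SeqTrainingArguments(
    output_dir="whisper-large-v3-pt-cv19-fleurs",  # invented for illustration
    learning_rate=6.25e-6,
    per_device_train_batch_size=8,  # total train batch 16 with accumulation
    per_device_eval_batch_size=8,
    gradient_accumulation_steps=2,
    seed=42,
    lr_scheduler_type="linear",
    warmup_steps=10_000,
    max_steps=50_000,
    fp16=True,  # "Native AMP" mixed precision
)
```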
 
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:242ddaa463a64ced245bd1a75f41ebe2270e2d2ac6d434f748e8c72719472b7c
+ oid sha256:667b425605cb54201276f4b1043b82f818c772a56947650ef279a065dfa00b52
  size 4993448880
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8c21aaa3c64158023633b474a22a98400eb1876dea9acdee8740d5d24136ba5a
+ oid sha256:fc9953958ba23eec7d5e1d56d43c22e30896791ecb209951e89a72e9790e3261
  size 1180663192
runs/Oct01_15-09-05_DITEC2014063010/events.out.tfevents.1727806253.DITEC2014063010 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:738d401364b78169883771181c43ee91cfd79ec68374b16270b9ec4afb30b36b
- size 49102
+ oid sha256:8e51de7802391e4955a5e0a004a08bb1380b3b193945f8ae7e836c3cf2327596
+ size 49462