kiranpantha
/

w2v-bert-2.0-nepali-unlabeled-2

+---
+library_name: transformers
+language:
+- ne
+license: mit
+base_model: kiranpantha/w2v-bert-2.0-nepali-unlabeled-1
+tags:
+- generated_from_trainer
+datasets:
+- kiranpantha/OpenSLR54-Balanced-Nepali
+metrics:
+- wer
+model-index:
+- name: Wave2Vec2-Bert2.0 - Kiran Pantha
+  results:
+  - task:
+      name: Automatic Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: OpenSLR54
+      type: kiranpantha/OpenSLR54-Balanced-Nepali
+      config: default
+      split: test
+      args: 'config: ne, split: train,test'
+    metrics:
+    - name: Wer
+      type: wer
+      value: 0.44966842373745963
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# Wave2Vec2-Bert2.0 - Kiran Pantha
+This model is a fine-tuned version of [kiranpantha/w2v-bert-2.0-nepali-unlabeled-1](https://huggingface.co/kiranpantha/w2v-bert-2.0-nepali-unlabeled-1) on the OpenSLR54 dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.5190
+- Wer: 0.4497
+- Cer: 0.1090
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 500
+- num_epochs: 2
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch  | Step  | Cer    | Validation Loss | Wer    |
+|:-------------:|:------:|:-----:|:------:|:---------------:|:------:|
+| 0.4494        | 0.0375 | 300   | 0.1147 | 0.5118          | 0.4793 |
+| 0.5556        | 0.075  | 600   | 0.1448 | 0.6503          | 0.5808 |
+| 0.5684        | 0.1125 | 900   | 0.1418 | 0.6258          | 0.5741 |
+| 0.5309        | 0.15   | 1200  | 0.1446 | 0.6867          | 0.5391 |
+| 0.615         | 0.1875 | 1500  | 0.1566 | 0.6692          | 0.5844 |
+| 0.5627        | 0.225  | 1800  | 0.1434 | 0.6586          | 0.5597 |
+| 0.6188        | 0.2625 | 2100  | 0.1500 | 0.6250          | 0.5559 |
+| 0.5888        | 0.3    | 2400  | 0.1624 | 0.6863          | 0.6162 |
+| 0.5435        | 0.3375 | 2700  | 0.1551 | 0.6415          | 0.5736 |
+| 0.5667        | 0.375  | 3000  | 0.1478 | 0.6041          | 0.5661 |
+| 0.5323        | 0.4125 | 3300  | 0.1392 | 0.5805          | 0.5327 |
+| 0.5471        | 0.45   | 3600  | 0.1390 | 0.5699          | 0.5327 |
+| 0.5939        | 0.4875 | 3900  | 0.1341 | 0.5739          | 0.5169 |
+| 0.5795        | 0.525  | 4200  | 0.1392 | 0.6036          | 0.5278 |
+| 0.4974        | 0.5625 | 4500  | 0.1255 | 0.5331          | 0.4997 |
+| 0.5247        | 0.6    | 4800  | 0.1300 | 0.5649          | 0.5190 |
+| 0.5035        | 0.6375 | 5100  | 0.1292 | 0.5583          | 0.5067 |
+| 0.5354        | 0.675  | 5400  | 0.1270 | 0.5472          | 0.5115 |
+| 0.536         | 0.7125 | 5700  | 0.1283 | 0.5406          | 0.5012 |
+| 0.498         | 0.75   | 6000  | 0.1331 | 0.5747          | 0.5167 |
+| 0.4339        | 0.7875 | 6300  | 0.1266 | 0.5224          | 0.4846 |
+| 0.4504        | 0.825  | 6600  | 0.1234 | 0.5549          | 0.4982 |
+| 0.4237        | 0.8625 | 6900  | 0.1221 | 0.5376          | 0.4759 |
+| 0.4434        | 0.9    | 7200  | 0.1303 | 0.5651          | 0.5080 |
+| 0.443         | 0.9375 | 7500  | 0.1219 | 0.5222          | 0.4889 |
+| 0.4282        | 0.975  | 7800  | 0.1247 | 0.5297          | 0.4936 |
+| 0.4128        | 1.0125 | 8100  | 0.1230 | 0.5263          | 0.4804 |
+| 0.4507        | 1.05   | 8400  | 0.1254 | 0.5548          | 0.4881 |
+| 0.4008        | 1.0875 | 8700  | 0.1232 | 0.5411          | 0.4816 |
+| 0.4834        | 1.125  | 9000  | 0.1215 | 0.5264          | 0.4853 |
+| 0.3955        | 1.1625 | 9300  | 0.1232 | 0.5288          | 0.4876 |
+| 0.3837        | 1.2    | 9600  | 0.1224 | 0.5496          | 0.4853 |
+| 0.3819        | 1.2375 | 9900  | 0.5215 | 0.4739          | 0.1232 |
+| 0.3771        | 1.275  | 10200 | 0.5115 | 0.4641          | 0.1188 |
+| 0.4067        | 1.3125 | 10500 | 0.5274 | 0.4810          | 0.1236 |
+| 0.3561        | 1.35   | 10800 | 0.5366 | 0.4739          | 0.1182 |
+| 0.3971        | 1.3875 | 11100 | 0.4951 | 0.4669          | 0.1178 |
+| 0.337         | 1.425  | 11400 | 0.5180 | 0.4630          | 0.1156 |
+| 0.4031        | 1.4625 | 11700 | 0.4895 | 0.4664          | 0.1156 |
+| 0.4278        | 1.5    | 12000 | 0.4858 | 0.4469          | 0.1107 |
+| 0.3332        | 1.5375 | 12300 | 0.4986 | 0.4546          | 0.1130 |
+| 0.3516        | 1.575  | 12600 | 0.5067 | 0.4677          | 0.1148 |
+| 0.4022        | 1.6125 | 12900 | 0.5022 | 0.4638          | 0.1114 |
+| 0.3922        | 1.65   | 13200 | 0.4753 | 0.4588          | 0.1130 |
+| 0.3483        | 1.6875 | 13500 | 0.4812 | 0.4562          | 0.1135 |
+| 0.3572        | 1.725  | 13800 | 0.4940 | 0.4461          | 0.1083 |
+| 0.2796        | 1.7625 | 14100 | 0.4854 | 0.4457          | 0.1082 |
+| 0.2555        | 1.8    | 14400 | 0.5231 | 0.4482          | 0.1099 |
+| 0.2823        | 1.8375 | 14700 | 0.5126 | 0.4475          | 0.1093 |
+| 0.2478        | 1.875  | 15000 | 0.5063 | 0.4458          | 0.1087 |
+| 0.2435        | 1.9125 | 15300 | 0.5151 | 0.4409          | 0.1077 |
+| 0.2478        | 1.95   | 15600 | 0.5185 | 0.4464          | 0.1084 |
+| 0.2653        | 1.9875 | 15900 | 0.5190 | 0.4497          | 0.1090 |
+### Framework versions
+- Transformers 4.45.0.dev0
+- Pytorch 2.4.1+cu121
+- Datasets 2.21.0
+- Tokenizers 0.19.1

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5ab9d126876dbf69b7eaccc8a8a3e593ad4288b4ad360743cd47034f663f6744
 size 2423081060

 version https://git-lfs.github.com/spec/v1
+oid sha256:0eace2ee8e5b068bbcb047d925c8780f5d190a9a625d53797f463e3dd6c3b9b9
 size 2423081060

runs/Sep25_23-50-40_kiran/events.out.tfevents.1727287634.kiran.151812.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d3e68b7a56b83e9eb0dbdb32dff0f8fffa812f446dd6d0359367ed68a64e1144
-size 17817

 version https://git-lfs.github.com/spec/v1
+oid sha256:ee762563956ef39469e19b5af89449a4e71fcb9ee220d5c555a730ed2ae7fdea
+size 18747