End of training

Files changed (6) hide show

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [kiranpantha/whisper-large-v3-nepali](https://huggingface.co/kiranpantha/whisper-large-v3-nepali) on the OpenSLR54 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2792
 ## Model description
@@ -46,16 +46,23 @@ The following hyperparameters were used during training:
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 10
-- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 9    | 0.5391          |
-| No log        | 2.0   | 18   | 0.2760          |
-| 0.4778        | 3.0   | 27   | 0.2792          |
 ### Framework versions

 This model is a fine-tuned version of [kiranpantha/whisper-large-v3-nepali](https://huggingface.co/kiranpantha/whisper-large-v3-nepali) on the OpenSLR54 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4135
 ## Model description
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 10
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 14   | 0.3783          |
+| 0.5414        | 2.0   | 28   | 0.3666          |
+| 0.5414        | 3.0   | 42   | 0.3630          |
+| 0.1128        | 4.0   | 56   | 0.3761          |
+| 0.1128        | 5.0   | 70   | 0.3991          |
+| 0.0601        | 6.0   | 84   | 0.4439          |
+| 0.0601        | 7.0   | 98   | 0.4124          |
+| 0.0442        | 8.0   | 112  | 0.4095          |
+| 0.0239        | 9.0   | 126  | 0.4067          |
+| 0.0239        | 10.0  | 140  | 0.4135          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -26,8 +26,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
-    "k_proj"
   ],
   "task_type": null,
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "q_proj",
+    "v_proj"
   ],
   "task_type": null,
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d2554f0bc9eadf0f76511ef8d19b8f1fd48d49114222852034d55db994f81fe6
 size 62969640

 version https://git-lfs.github.com/spec/v1
+oid sha256:bf3a922bcc8bbab947942232e1eabb5dec81a990f76ecb2a6ebb405c692735ca
 size 62969640

runs/Jan14_19-13-22_idc-training-gpu-compute-27/events.out.tfevents.1736882003.idc-training-gpu-compute-27.649182.4 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:e0bd503e4d6ce8eaa61db7dd3ae65e015ae2dd4f23f6dba58f6735dab4e2225e
+size 9891

runs/Jan14_19-19-03_idc-training-gpu-compute-27/events.out.tfevents.1736882347.idc-training-gpu-compute-27.649182.5 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:201cca6697b327007d7f62d3efe54b945ef6f7437f38c3d9266ed3055d7923f4
+size 10033

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:74e09717bf6af6ef900d73788a8ccd8689ff92ec4ca7e567a268e030b9ff7c77
 size 5560

 version https://git-lfs.github.com/spec/v1
+oid sha256:7fce2cb692eab6e32e790eb26265b9adcd6d4bc2e936d00513f2201e1c916b89
 size 5560