japanese-denim/mbart-50-finetuned-naga-to-eng

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.4960
-- Bleu: 20.9248
+- Loss: 1.6053
+- Bleu: 27.9236
 ## Model description
@@ -45,6 +45,7 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
+- mixed_precision_training: Native AMP
 ### Training results
@@ -52,7 +53,7 @@ The following hyperparameters were used during training:
 ### Framework versions
-- Transformers 4.33.3
-- Pytorch 2.0.1+cu118
-- Datasets 2.14.5
-- Tokenizers 0.13.3
+- Transformers 4.35.2
+- Pytorch 2.1.0+cu118
+- Datasets 2.14.7
+- Tokenizers 0.15.0
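
The README diff above changes both reported evaluation metrics in the same direction: loss goes up and BLEU goes up. A minimal sketch for quantifying that shift follows. The dictionary names and the `relative_change` helper are hypothetical, introduced only for illustration; the numbers are copied from the diff.

```python
# Metric values copied from the removed (-) and added (+) README lines.
before = {"loss": 1.4960, "bleu": 20.9248}
after = {"loss": 1.6053, "bleu": 27.9236}


def relative_change(old: float, new: float) -> float:
    """Return the change from old to new as a percentage of old."""
    return (new - old) / old * 100.0


# Percentage change per metric, keyed by metric name.
changes = {name: relative_change(before[name], after[name]) for name in before}

for name, pct in changes.items():
    print(f"{name}: {before[name]:.4f} -> {after[name]:.4f} ({pct:+.1f}%)")
```

Under these numbers the evaluation loss rises by roughly 7% while BLEU rises by roughly 33%. The diff does not say whether both runs used the same evaluation set, so the comparison is only indicative.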

generation_config.json CHANGED

@@ -7,5 +7,5 @@
   "max_length": 200,
   "num_beams": 5,
   "pad_token_id": 1,
-  "transformers_version": "4.33.3"
+  "transformers_version": "4.35.2"
 }

runs/Nov15_21-37-13_46535a2a894d/events.out.tfevents.1700095755.46535a2a894d.430.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:c95d98e3218898e0be5720f81947a92f6f7fed324b25ed6f5758eaca3d2b4b12
+size 413
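
The three added lines are a Git LFS pointer, not the TensorBoard event file itself: the repository stores only the spec version, the content hash, and the byte size. A minimal sketch of reading those fields is below, assuming the pointer text has had its diff `+` prefixes stripped. `parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", split on the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer content from the diff, with the leading "+" markers removed.
pointer = """\
version https://git-lfs.github.com/spec/v1
oid sha256:c95d98e3218898e0be5720f81947a92f6f7fed324b25ed6f5758eaca3d2b4b12
size 413
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algo"], info["size"])
```

The parsed size shows the referenced event file is 413 bytes.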