japanese-denim commited on
Commit
f679b5a
1 Parent(s): 0692b09

Training complete

Browse files
README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 1.4960
22
- - Bleu: 20.9248
23
 
24
  ## Model description
25
 
@@ -45,6 +45,7 @@ The following hyperparameters were used during training:
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
  - num_epochs: 3
 
48
 
49
  ### Training results
50
 
@@ -52,7 +53,7 @@ The following hyperparameters were used during training:
52
 
53
  ### Framework versions
54
 
55
- - Transformers 4.33.3
56
- - Pytorch 2.0.1+cu118
57
- - Datasets 2.14.5
58
- - Tokenizers 0.13.3
 
18
 
19
  This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 1.6053
22
+ - Bleu: 27.9236
23
 
24
  ## Model description
25
 
 
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
  - num_epochs: 3
48
+ - mixed_precision_training: Native AMP
49
 
50
  ### Training results
51
 
 
53
 
54
  ### Framework versions
55
 
56
+ - Transformers 4.35.2
57
+ - Pytorch 2.1.0+cu118
58
+ - Datasets 2.14.7
59
+ - Tokenizers 0.15.0
generation_config.json CHANGED
@@ -7,5 +7,5 @@
7
  "max_length": 200,
8
  "num_beams": 5,
9
  "pad_token_id": 1,
10
- "transformers_version": "4.33.3"
11
  }
 
7
  "max_length": 200,
8
  "num_beams": 5,
9
  "pad_token_id": 1,
10
+ "transformers_version": "4.35.2"
11
  }
runs/Nov15_21-37-13_46535a2a894d/events.out.tfevents.1700095755.46535a2a894d.430.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c95d98e3218898e0be5720f81947a92f6f7fed324b25ed6f5758eaca3d2b4b12
3
+ size 413