dq158 committed on
Commit 5587dc0
1 Parent(s): a034cdf

End of training

Files changed (2)
  1. README.md +20 -21
  2. generation_config.json +1 -1
README.md CHANGED
@@ -1,10 +1,8 @@
  ---
- license: apache-2.0
- base_model: google/flan-t5-base
  tags:
  - generated_from_trainer
  metrics:
- - rouge
+ - bleu
  model-index:
  - name: morbius
  results: []
@@ -15,14 +13,15 @@ should probably proofread and complete it, then remove this comment. -->
 
  # morbius
 
- This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on an unknown dataset.
+ This model was trained from scratch on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: nan
- - Rouge1: 2.1823
- - Rouge2: 0.226
- - Rougel: 2.1188
- - Rougelsum: 2.1216
- - Gen Len: 8.9152
+ - Loss: 2.3311
+ - Bleu: 0.0490
+ - Precisions: [0.12658339197748064, 0.058000714881448825, 0.031020853918560506, 0.0276665140764477]
+ - Brevity Penalty: 0.9781
+ - Length Ratio: 0.9783
+ - Translation Length: 45472
+ - Reference Length: 46479
 
  ## Model description
 
@@ -42,8 +41,8 @@ More information needed
 
  The following hyperparameters were used during training:
  - learning_rate: 5e-05
- - train_batch_size: 8
- - eval_batch_size: 8
+ - train_batch_size: 12
+ - eval_batch_size: 12
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
@@ -51,18 +50,18 @@ The following hyperparameters were used during training:
 
  ### Training results
 
- | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
- |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
- | 0.0 | 1.0 | 7626 | nan | 2.1823 | 0.226 | 2.1188 | 2.1216 | 8.9152 |
- | 0.0 | 2.0 | 15252 | nan | 2.1823 | 0.226 | 2.1188 | 2.1216 | 8.9152 |
- | 0.0 | 3.0 | 22878 | nan | 2.1823 | 0.226 | 2.1188 | 2.1216 | 8.9152 |
- | 0.0 | 4.0 | 30504 | nan | 2.1823 | 0.226 | 2.1188 | 2.1216 | 8.9152 |
- | 0.0 | 5.0 | 38130 | nan | 2.1823 | 0.226 | 2.1188 | 2.1216 | 8.9152 |
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Precisions | Brevity Penalty | Length Ratio | Translation Length | Reference Length |
+ |:-------------:|:-----:|:-----:|:---------------:|:------:|:--------------------------------------------------------------------------------------:|:---------------:|:------------:|:------------------:|:----------------:|
+ | 2.6085 | 1.0 | 2630 | 2.3793 | 0.0398 | [0.11484440108136675, 0.05086452177719413, 0.022402389588222743, 0.019262093750807972] | 1.0 | 1.0585 | 49197 | 46479 |
+ | 2.5537 | 2.0 | 5260 | 2.3538 | 0.0451 | [0.12435074854873206, 0.053338059789672695, 0.02736549165120594, 0.024163621427155037] | 0.9858 | 0.9859 | 45822 | 46479 |
+ | 2.427 | 3.0 | 7890 | 2.3412 | 0.0478 | [0.12566410537870473, 0.05610922151130985, 0.029971974257836827, 0.026891236083357122] | 0.9798 | 0.9800 | 45550 | 46479 |
+ | 2.3716 | 4.0 | 10520 | 2.3347 | 0.0487 | [0.12663965838169275, 0.0574505431946487, 0.030477866031926728, 0.027230821761893922] | 0.9823 | 0.9825 | 45665 | 46479 |
+ | 2.3494 | 5.0 | 13150 | 2.3311 | 0.0490 | [0.12658339197748064, 0.058000714881448825, 0.031020853918560506, 0.0276665140764477] | 0.9781 | 0.9783 | 45472 | 46479 |
 
 
  ### Framework versions
 
- - Transformers 4.33.3
+ - Transformers 4.34.0
  - Pytorch 2.0.1+cu118
  - Datasets 2.14.5
- - Tokenizers 0.13.3
+ - Tokenizers 0.14.0
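The BLEU fields reported in the new README.md can be cross-checked against each other. The following is a minimal sketch, assuming the metric follows the standard BLEU definition: the geometric mean of the four n-gram precisions with uniform weights, multiplied by a brevity penalty of exp(1 - reference_length / translation_length) when the output is shorter than the reference. The commit does not say which implementation produced the numbers, so that definition is an assumption, and the variable names are illustrative.

```python
import math

# Final-epoch values copied from the new README.md table (epoch 5.0).
precisions = [0.12658339197748064, 0.058000714881448825,
              0.031020853918560506, 0.0276665140764477]
translation_length = 45472
reference_length = 46479

# Assumption: standard BLEU with uniform weights over 1- to 4-grams.
length_ratio = translation_length / reference_length
if translation_length >= reference_length:
    # No penalty when the output is at least as long as the reference.
    brevity_penalty = 1.0
else:
    brevity_penalty = math.exp(1.0 - reference_length / translation_length)

# Geometric mean of the n-gram precisions, computed in log space.
geo_mean = math.exp(sum(math.log(p) for p in precisions) / len(precisions))
bleu = brevity_penalty * geo_mean

print(f"length_ratio={length_ratio:.4f}")
print(f"brevity_penalty={brevity_penalty:.4f}")
print(f"bleu={bleu:.4f}")
```

Under that assumption the recomputed values agree with the card's Length Ratio (0.9783), Brevity Penalty (0.9781), and Bleu (0.0490) to four decimal places. The same rule also explains the epoch 1.0 row, where the translation length (49197) exceeds the reference length and the brevity penalty is 1.0.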
generation_config.json CHANGED
@@ -2,5 +2,5 @@
  "decoder_start_token_id": 0,
  "eos_token_id": 1,
  "pad_token_id": 0,
- "transformers_version": "4.33.3"
+ "transformers_version": "4.34.0"
  }