Sidharthkr commited on
Commit
8df041d
1 Parent(s): 2f4a74c

End of training

Browse files
Files changed (3) hide show
  1. README.md +11 -11
  2. pytorch_model.bin +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 0.3565
21
- - Rouge1: 47.5452
22
- - Rouge2: 25.3674
23
- - Rougel: 36.0812
24
- - Rougelsum: 36.2167
25
- - Gen Len: 107.74
26
 
27
  ## Model description
28
 
@@ -42,11 +42,11 @@ More information needed
42
 
43
  The following hyperparameters were used during training:
44
  - learning_rate: 2e-05
45
- - train_batch_size: 5
46
  - eval_batch_size: 4
47
  - seed: 42
48
  - gradient_accumulation_steps: 2
49
- - total_train_batch_size: 10
50
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
  - lr_scheduler_type: linear
52
  - num_epochs: 3
@@ -55,9 +55,9 @@ The following hyperparameters were used during training:
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
57
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
58
- | 0.5304 | 1.0 | 500 | 0.3506 | 46.2605 | 23.1881 | 33.9634 | 34.0538 | 108.25 |
59
- | 0.2188 | 2.0 | 1000 | 0.3462 | 48.0532 | 25.4668 | 36.3198 | 36.449 | 106.76 |
60
- | 0.1677 | 3.0 | 1500 | 0.3565 | 47.5452 | 25.3674 | 36.0812 | 36.2167 | 107.74 |
61
 
62
 
63
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.3548
21
+ - Rouge1: 47.5134
22
+ - Rouge2: 24.7121
23
+ - Rougel: 35.7366
24
+ - Rougelsum: 35.6499
25
+ - Gen Len: 111.96
26
 
27
  ## Model description
28
 
 
42
 
43
  The following hyperparameters were used during training:
44
  - learning_rate: 2e-05
45
+ - train_batch_size: 6
46
  - eval_batch_size: 4
47
  - seed: 42
48
  - gradient_accumulation_steps: 2
49
+ - total_train_batch_size: 12
50
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
  - lr_scheduler_type: linear
52
  - num_epochs: 3
 
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
57
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
58
+ | No log | 1.0 | 417 | 0.3468 | 44.9326 | 22.3736 | 33.008 | 32.9247 | 116.43 |
59
+ | 0.5244 | 2.0 | 834 | 0.3440 | 46.9139 | 24.683 | 35.3699 | 35.333 | 119.65 |
60
+ | 0.2061 | 3.0 | 1251 | 0.3548 | 47.5134 | 24.7121 | 35.7366 | 35.6499 | 111.96 |
61
 
62
 
63
  ### Framework versions
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2d77d40f521ed26202512f7dd3e6eee8b4d279d54f04983ed281042ce96cd90e
3
  size 1625537802
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a974372ecd23751fbe71ad5e8ec22179b98d2a947233343705c2db6f5edd8d28
3
  size 1625537802
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1aaff9143bea6a363b218702354c87478cdeb0d266f9a38da3fddac70d075da2
3
  size 4664
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:21af4c0dcb418ec5753bb0293ad8df699e767a7a156c565b38d83ec4b2b3b0a2
3
  size 4664