nandavikas16's picture
Model save
a665b82 verified
|
raw
history blame
No virus
3.37 kB
metadata
license: mit
base_model: facebook/bart-large-cnn
tags:
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: bart-large-cnn-finetuned-scope-summarization-train-test-split
    results: []

bart-large-cnn-finetuned-scope-summarization-train-test-split

This model is a fine-tuned version of facebook/bart-large-cnn on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 1.2315
  • Rouge1: 52.3537
  • Rouge2: 31.6854
  • Rougel: 36.6454
  • Rougelsum: 50.8292

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5.6e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 20

Training results

Training Loss Epoch Step Validation Loss Rouge1 Rouge2 Rougel Rougelsum
No log 1.0 25 0.6966 51.835 31.057 37.6234 50.2076
0.6673 2.0 50 0.6823 48.381 28.6493 37.1777 46.9784
0.5505 3.0 75 0.6825 51.1061 31.5147 38.5282 49.8741
0.5505 4.0 100 0.7131 51.0351 32.3268 39.7744 49.4893
0.4736 5.0 125 0.6975 52.9068 32.4415 39.5503 51.2993
0.4033 6.0 150 0.7925 51.3766 30.4233 37.7124 49.5155
0.3306 7.0 175 0.8079 52.2073 31.8487 38.6156 50.8166
0.3306 8.0 200 0.9168 51.6434 31.3338 37.4811 50.1527
0.256 9.0 225 0.9810 49.7984 30.3608 36.7693 48.7107
0.1823 10.0 250 0.9289 51.679 31.2458 36.4793 50.2032
0.1355 11.0 275 1.0269 52.0775 31.1824 37.5405 50.5995
0.1355 12.0 300 1.0736 51.3365 31.2121 38.37 50.0703
0.0974 13.0 325 1.0935 52.4146 32.5704 38.0578 51.424
0.0681 14.0 350 1.1100 51.5136 31.6307 38.5212 50.2267
0.0476 15.0 375 1.1507 51.9246 31.5588 36.8706 50.7219
0.0476 16.0 400 1.1667 53.7686 33.3238 38.145 52.2277
0.0336 17.0 425 1.1606 51.9682 31.4379 37.6764 50.8294
0.0232 18.0 450 1.1961 51.6253 31.6575 37.5128 50.406
0.0232 19.0 475 1.2162 51.7758 31.8239 36.3796 50.3009
0.0182 20.0 500 1.2315 52.3537 31.6854 36.6454 50.8292

Framework versions

  • Transformers 4.38.1
  • Pytorch 2.2.1+cu121
  • Datasets 2.17.1
  • Tokenizers 0.15.2