GuysTrans
/

bart-base-re-attention-seq-512

Text2Text Generation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

GuysTrans commited on Oct 31, 2023

Commit

8ca5247

•

1 Parent(s): e14dd31

End of training

Files changed (3) hide show

README.md +22 -2
generation_config.json +1 -0
pytorch_model.bin +1 -1

README.md CHANGED Viewed

@@ -1,8 +1,10 @@
 ---
 license: apache-2.0
-base_model: facebook/bart-base
 tags:
 - generated_from_trainer
 model-index:
 - name: bart-base-re-attention-seq-512
   results: []
@@ -13,7 +15,18 @@ should probably proofread and complete it, then remove this comment. -->
 # bart-base-re-attention-seq-512
-This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
 ## Model description
@@ -40,6 +53,13 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: linear
 - num_epochs: 1
 ### Framework versions
 - Transformers 4.33.0

 ---
 license: apache-2.0
+base_model: GuysTrans/bart-base-re-attention-seq-512
 tags:
 - generated_from_trainer
+metrics:
+- rouge
 model-index:
 - name: bart-base-re-attention-seq-512
   results: []
 # bart-base-re-attention-seq-512
+This model is a fine-tuned version of [GuysTrans/bart-base-re-attention-seq-512](https://huggingface.co/GuysTrans/bart-base-re-attention-seq-512) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.2424
+- Rouge1: 40.8423
+- Rouge2: 32.0048
+- Rougel: 38.1663
+- Rougelsum: 40.3421
+- Bleu-1: 26.2378
+- Bleu-2: 21.0039
+- Bleu-3: 18.6195
+- Bleu-4: 17.1228
+- Gen Len: 92.6278
 ## Model description
 - lr_scheduler_type: linear
 - num_epochs: 1
+### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Bleu-1  | Bleu-2  | Bleu-3  | Bleu-4  | Gen Len |
+|:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|:-------:|:-------:|:-------:|:-------:|
+| 2.3482        | 1.0   | 18247 | 1.2424          | 40.8423 | 32.0048 | 38.1663 | 40.3421   | 26.2378 | 21.0039 | 18.6195 | 17.1228 | 92.6278 |
 ### Framework versions
 - Transformers 4.33.0

generation_config.json CHANGED Viewed

@@ -5,6 +5,7 @@
   "eos_token_id": 2,
   "forced_bos_token_id": 0,
   "forced_eos_token_id": 2,
   "no_repeat_ngram_size": 3,
   "num_beams": 4,
   "pad_token_id": 1,

   "eos_token_id": 2,
   "forced_bos_token_id": 0,
   "forced_eos_token_id": 2,
+  "max_length": 512,
   "no_repeat_ngram_size": 3,
   "num_beams": 4,
   "pad_token_id": 1,

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0e2d8653718fda18af50778f4a8f75763f4858c7cd3a9b74cdf825fd19b69b4a
 size 558018637

 version https://git-lfs.github.com/spec/v1
+oid sha256:ca0278fa5b9a1ea0802afcdb1eeaf616782e7672cb5e591caa9d4ad82a24ec19
 size 558018637