Add evaluation results on the default config and test split of multi_news

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the default config and test split of the [multi_news](https://huggingface.co./datasets/multi_news) dataset by @pszemraj, using the predictions stored [here](https://huggingface.co./datasets/autoevaluate/autoeval-eval-multi_news-default-e22c67-2252871794).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co./spaces/autoevaluate/leaderboards?dataset=multi_news).\
Evaluate your model on more datasets [here](https://huggingface.co./spaces/autoevaluate/model-evaluator?dataset=multi_news).

Files changed (1)

README.md +34 -1

@@ -9,7 +9,7 @@ tags:
 - booksum
 - long-document
 - long-form
-license:
+license:
 - apache-2.0
 - bsd-3-clause
 datasets:
@@ -278,6 +278,39 @@ model-index:
       type: gen_len
       value: 163.9394
       verified: true
+  - task:
+      type: summarization
+      name: Summarization
+    dataset:
+      name: multi_news
+      type: multi_news
+      config: default
+      split: test
+    metrics:
+    - name: ROUGE-1
+      type: rouge
+      value: 39.0834
+      verified: true
+    - name: ROUGE-2
+      type: rouge
+      value: 11.4043
+      verified: true
+    - name: ROUGE-L
+      type: rouge
+      value: 19.1813
+      verified: true
+    - name: ROUGE-LSUM
+      type: rouge
+      value: 35.1581
+      verified: true
+    - name: loss
+      type: loss
+      value: 4.654905319213867
+      verified: true
+    - name: gen_len
+      type: gen_len
+      value: 186.2494
+      verified: true
 ---
 # Longformer Encoder-Decoder (LED) for Narrative-Esque Long Text Summarization
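
The entry this PR adds to `model-index` is plain nested data, so it can be read back with ordinary dictionary access. Below is a minimal sketch, assuming the README.md front matter has already been parsed into a Python dict. The variable `multi_news_result` and the helpers `metrics_by_name` and `describe` are hypothetical names for illustration and are not part of any Hugging Face API. The values are copied from the diff above.

```python
# Hypothetical in-memory copy of the result entry added in this PR.
# Assumption: the YAML front matter was parsed elsewhere into this dict.
multi_news_result = {
    "task": {"type": "summarization", "name": "Summarization"},
    "dataset": {
        "name": "multi_news",
        "type": "multi_news",
        "config": "default",
        "split": "test",
    },
    "metrics": [
        {"name": "ROUGE-1", "type": "rouge", "value": 39.0834, "verified": True},
        {"name": "ROUGE-2", "type": "rouge", "value": 11.4043, "verified": True},
        {"name": "ROUGE-L", "type": "rouge", "value": 19.1813, "verified": True},
        {"name": "ROUGE-LSUM", "type": "rouge", "value": 35.1581, "verified": True},
        {"name": "loss", "type": "loss", "value": 4.654905319213867, "verified": True},
        {"name": "gen_len", "type": "gen_len", "value": 186.2494, "verified": True},
    ],
}


def metrics_by_name(result):
    """Map each metric name to its value, keeping only verified entries."""
    return {
        metric["name"]: metric["value"]
        for metric in result["metrics"]
        if metric.get("verified")
    }


def describe(result):
    """Build a short plain-text summary: dataset header, then one metric per line."""
    dataset = result["dataset"]
    header = f'{dataset["name"]} ({dataset["config"]}, {dataset["split"]})'
    lines = [f"{name}: {value:.4f}" for name, value in metrics_by_name(result).items()]
    return "\n".join([header] + lines)


print(describe(multi_news_result))
```

Filtering on `verified` is a choice made for this sketch only; every metric in this PR is marked `verified: true`, so all six entries appear in the summary.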