pszemraj committed
Commit 1c62066
1 Parent(s): d97a777

booksum link

Files changed (1): README.md (+2 −2)
README.md CHANGED
@@ -65,7 +65,7 @@ parameters:
 
  ---
 
- # long-t5-tglobal-base-16384-booksum
+ # long-t5-tglobal-base-16384 + BookSum
 
  - summarize long text and get a SparkNotes-esque summary of arbitrary topics!
  - generalizes reasonably well to academic & narrative text.
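For context, a minimal usage sketch of the checkpoint this README describes, assuming the published model id `pszemraj/long-t5-tglobal-base-16384-book-summary` and illustrative beam-search values; neither is taken from the diff itself:

```python
from transformers import pipeline

# Load the summarization checkpoint; the model id is an assumption,
# inferred from the repo this README belongs to.
summarizer = pipeline(
    "summarization",
    model="pszemraj/long-t5-tglobal-base-16384-book-summary",
)

long_text = "..."  # replace with a long document; LongT5 here accepts up to 16384 input tokens

# Beam-search generation parameters are illustrative defaults,
# not values prescribed by the model card.
result = summarizer(
    long_text,
    max_length=256,
    min_length=8,
    no_repeat_ngram_size=3,
    num_beams=4,
    early_stopping=True,
)
print(result[0]["summary_text"])
```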
@@ -116,7 +116,7 @@ Pass [other parameters related to beam search textgen](https://huggingface.co/bl
 
  ## Training and evaluation data
 
- `kmfoda/booksum` dataset. Summaries longer than 1024 LongT5 tokens were filtered out with the intent of preventing the model from learning to generate "partial" summaries.
+ `kmfoda/booksum` dataset on HuggingFace - read [the original paper here](https://arxiv.org/abs/2105.08209). Summaries longer than 1024 LongT5 tokens were filtered out with the intent of preventing the model from learning to generate "partial" summaries.
 
  > - early checkpoints of this model were trained on a "smaller" subsection of the dataset as it was filtered for summaries of **1024 characters**. This was subsequently caught and adjusted to **1024 tokens** and then trained further for at least five epochs.
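The token-length filter described in the changed line could be reproduced roughly as follows. This is a sketch under stated assumptions: the `summary_text` column name and the `google/long-t5-tglobal-base` tokenizer are guesses, not confirmed by the diff:

```python
from datasets import load_dataset
from transformers import AutoTokenizer

# Base LongT5 tokenizer; assumed to match the one used for filtering.
tokenizer = AutoTokenizer.from_pretrained("google/long-t5-tglobal-base")
dataset = load_dataset("kmfoda/booksum", split="train")

MAX_SUMMARY_TOKENS = 1024  # tokens, not characters (see the note above)

def short_enough(example):
    # Keep only examples whose reference summary fits in 1024 LongT5
    # tokens, so training never sees truncated "partial" summaries.
    ids = tokenizer(example["summary_text"], truncation=False)["input_ids"]
    return len(ids) <= MAX_SUMMARY_TOKENS

dataset = dataset.filter(short_enough)
```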