Model Card for mT5-base-HunSum-2

mT5-base-HunSum-2 is a Hungarian abstractive summarization model, fine-tuned from google/mt5-base on the SZTAKI-HLT/HunSum-2-abstractive dataset.

Intended uses & limitations

  • Model type: Text Summarization
  • Language(s) (NLP): Hungarian
  • Resource(s) for more information:

Parameters

  • Batch Size: 12
  • Learning Rate: 5e-5
  • Weight Decay: 0.01
  • Warmup Steps: 3000
  • Epochs: 10
  • no_repeat_ngram_size: 3
  • num_beams: 5
  • early_stopping: False
  • encoder_no_repeat_ngram_size: 4
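
The last four entries in the list above are decoding settings rather than training hyperparameters. As a rough sketch, they could be passed to the Hugging Face transformers `generate` call as shown below. The hub id is taken from this card; the `summarize` helper and its length limits (`max_input_tokens`, `max_new_tokens`) are illustrative assumptions, not values stated by the authors.

```python
# Decoding settings copied from the Parameters list above.
GENERATION_KWARGS = {
    "num_beams": 5,
    "no_repeat_ngram_size": 3,
    "encoder_no_repeat_ngram_size": 4,
    "early_stopping": False,
}

# Hub id as it appears on this model card.
MODEL_ID = "SZTAKI-HLT/mT5-base-HunSum-2"


def summarize(text, max_input_tokens=512, max_new_tokens=128):
    """Summarize one Hungarian article.

    The two length limits are hypothetical defaults chosen for
    illustration; the card does not specify them.
    """
    # Imported lazily so the settings above can be inspected
    # without transformers (and torch) installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID)

    # Tokenize and truncate the article to the assumed input limit.
    inputs = tokenizer(
        text,
        return_tensors="pt",
        truncation=True,
        max_length=max_input_tokens,
    )

    # Beam search with the n-gram repetition constraints listed on the card.
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        **GENERATION_KWARGS,
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `summarize(article_text)` downloads the checkpoint on first use. For repeated calls it would be better to load the tokenizer and model once and reuse them.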

Results

Metric    Value
ROUGE-1   40.06
ROUGE-2   12.67
ROUGE-L   25.93
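
For readers unfamiliar with the metric, the sketch below shows how a ROUGE-N F1 score can be computed from n-gram overlap. It is a simplified illustration using whitespace tokenization. The card does not say which ROUGE implementation, tokenizer, or variant produced the numbers above, and reading them as F1 scores multiplied by 100 is an assumption.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_f1(reference, candidate, n=1):
    """ROUGE-N F1 between a reference and a candidate summary.

    Simplified: lowercases and splits on whitespace, with no stemming.
    """
    ref = ngrams(reference.lower().split(), n)
    cand = ngrams(candidate.lower().split(), n)

    # Clipped overlap: each n-gram counts at most as often as it
    # appears in both texts.
    overlap = sum((ref & cand).values())
    if overlap == 0:
        return 0.0

    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    reference = "a b c d"
    candidate = "a b e"
    print(rouge_n_f1(reference, candidate, n=1))
    print(rouge_n_f1(reference, candidate, n=2))
```

ROUGE-L, the third row, is based on the longest common subsequence rather than fixed-length n-grams and is not covered by this sketch.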