Model Card for mT5-base-HunSum-2
The mT5-base-HunSum-2 is a Hungarian abstractive summarization model, which was trained on the SZTAKI-HLT/HunSum-2-abstractive dataset. The model is based on google/mt5-base.
Intended uses & limitations
- Model type: Text Summarization
- Language(s) (NLP): Hungarian
- Resource(s) for more information:
Parameters
- Batch Size: 12
- Learning Rate: 5e-5
- Weight Decay: 0.01
- Warmup Steps: 3000
- Epochs: 10
- no_repeat_ngram_size: 3
- num_beams: 5
- early_stopping: False
- encoder_no_repeat_ngram_size: 4
Results
Metric | Value |
---|---|
ROUGE-1 | 40.06 |
ROUGE-2 | 12.67 |
ROUGE-L | 25.93 |
- Downloads last month
- 112
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.
Model tree for SZTAKI-HLT/mT5-base-HunSum-2
Base model
google/mt5-base