--- license: apache-2.0 tags: - generated_from_trainer metrics: - rouge model-index: - name: t5-small-machine-articles-tag-generation results: [] --- # t5-small-machine-articles-tag-generation This model is a fine-tuned version of [t5-small](https://huggingface.co./t5-small) on the None dataset. It achieves the following results on the evaluation set: - Loss: 1.9833 - Rouge1: 35.3543 - Rouge2: 18.1226 - Rougel: 31.3958 - Rougelsum: 31.414 - Gen Len: 17.6596 ## Model description More information needed ## Intended uses & limitations More information needed ## Training and evaluation data More information needed ## Training procedure ### Training hyperparameters The following hyperparameters were used during training: - learning_rate: 2e-05 - train_batch_size: 16 - eval_batch_size: 16 - seed: 42 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08 - lr_scheduler_type: linear - num_epochs: 20 - mixed_precision_training: Native AMP ### Training results | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len | |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:| | 3.7917 | 1.0 | 47 | 3.0002 | 19.9138 | 6.9215 | 17.6969 | 17.7888 | 18.9787 | | 3.0113 | 2.0 | 94 | 2.5823 | 22.9993 | 9.0341 | 20.8118 | 20.7657 | 18.7021 | | 2.7086 | 3.0 | 141 | 2.3643 | 26.7716 | 12.2207 | 24.1983 | 24.2611 | 18.3298 | | 2.5192 | 4.0 | 188 | 2.2361 | 28.5866 | 13.6305 | 26.1201 | 26.1367 | 17.9894 | | 2.4089 | 5.0 | 235 | 2.1661 | 30.1919 | 13.8779 | 27.1523 | 27.1256 | 18.0638 | | 2.3293 | 6.0 | 282 | 2.1185 | 31.1222 | 15.6736 | 27.3953 | 27.4457 | 17.8404 | | 2.2635 | 7.0 | 329 | 2.0875 | 32.3166 | 16.3032 | 28.7062 | 28.732 | 17.9149 | | 2.2349 | 8.0 | 376 | 2.0653 | 31.8387 | 15.616 | 28.3254 | 28.4288 | 17.7979 | | 2.1945 | 9.0 | 423 | 2.0473 | 32.388 | 16.4027 | 28.5642 | 28.6096 | 17.6809 | | 2.1658 | 10.0 | 470 | 2.0352 | 33.9489 | 16.999 | 29.8446 | 29.8251 | 17.5426 | | 2.1414 | 11.0 | 517 | 2.0252 | 34.0804 | 17.6999 | 30.1921 | 30.2739 | 17.5106 | | 2.1103 | 12.0 | 564 | 2.0155 | 34.3488 | 17.8273 | 30.2613 | 30.3358 | 17.5957 | | 2.1052 | 13.0 | 611 | 2.0053 | 35.1038 | 18.3494 | 30.6999 | 30.7655 | 17.6064 | | 2.0795 | 14.0 | 658 | 2.0004 | 35.366 | 18.8791 | 31.4931 | 31.5691 | 17.7872 | | 2.0612 | 15.0 | 705 | 1.9951 | 36.1778 | 18.7911 | 31.5974 | 31.6309 | 17.6064 | | 2.0792 | 16.0 | 752 | 1.9886 | 35.0387 | 18.2363 | 31.5279 | 31.5694 | 17.6702 | | 2.0695 | 17.0 | 799 | 1.9868 | 36.1432 | 18.4902 | 31.8314 | 31.7955 | 17.617 | | 2.0593 | 18.0 | 846 | 1.9844 | 35.7847 | 18.3497 | 31.745 | 31.7007 | 17.6809 | | 2.0395 | 19.0 | 893 | 1.9842 | 36.0629 | 18.9649 | 32.098 | 32.0453 | 17.5745 | | 2.0623 | 20.0 | 940 | 1.9833 | 35.3543 | 18.1226 | 31.3958 | 31.414 | 17.6596 | ### Framework versions - Transformers 4.26.1 - Pytorch 1.13.1+cu116 - Datasets 2.9.0 - Tokenizers 0.13.2