MahsaShahidi
/

Persian-Image-Captioning

Image-Text-to-Text

vision-encoder-decoder

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

MahsaShahidi commited on Feb 22, 2022

Commit

30b2eb0

•

1 Parent(s): 91f5872

Update README.md

Files changed (1) hide show

README.md +0 -30

README.md CHANGED Viewed

@@ -11,36 +11,6 @@ should probably proofread and complete it, then remove this comment. -->
 # Persian-Image-Captioning
 This model is a fine-tuned version of [Vision Encoder Decoder](https://huggingface.co/docs/transformers/model_doc/vision-encoder-decoder) on coco-flickr-farsi.
-It achieves the following results on the evaluation set:
-- Loss: 2.4300
-## Training procedure
-### Training hyperparameters
-The following hyperparameters were used during training:
-- learning_rate: 4e-05
-- train_batch_size: 8
-- eval_batch_size: 8
-- seed: 42
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
-- lr_scheduler_type: linear
-- num_epochs: 2
-- mixed_precision_training: Native AMP
-### Training results
-| Training Loss | Epoch | Step  | Validation Loss |
-|:-------------:|:-----:|:-----:|:---------------:|
-| 3.4861        | 0.24  | 3500  | 2.9843          |
-| 2.8854        | 0.47  | 7000  | 2.7656          |
-| 2.701         | 0.71  | 10500 | 2.6337          |
-| 2.6279        | 0.95  | 14000 | 2.5680          |
-| 2.4782        | 1.19  | 17500 | 2.5179          |
-| 2.4321        | 1.42  | 21000 | 2.4838          |
-| 2.3876        | 1.66  | 24500 | 2.4513          |
-| 2.3854        | 1.9   | 28000 | 2.4300          |
 ### Framework versions

 # Persian-Image-Captioning
 This model is a fine-tuned version of [Vision Encoder Decoder](https://huggingface.co/docs/transformers/model_doc/vision-encoder-decoder) on coco-flickr-farsi.
 ### Framework versions