ernie-ai
/

autotrain-document-text-language-ar-en-zh-3338392240

Image Classification

Trained with AutoTrain

Inference Endpoints

Model card Files Files and versions Community

ernie-ai commited on Feb 8, 2023

Commit

d9dfe10

·

1 Parent(s): 250d708

Update README.md

Files changed (1) hide show

README.md +14 -0

README.md CHANGED Viewed

@@ -15,6 +15,20 @@ widget:
 co2_eq_emissions:
   emissions: 2.2266908460523576
 ---
 # Model Trained Using AutoTrain

 co2_eq_emissions:
   emissions: 2.2266908460523576
 ---
+# finetuned-vit-doc-text-classifer
+This model is a fine-tuned version of Microsoft’s Swin Transformer tiny-sized model [microsoft/swin-tiny-patch4-window7-224](https://huggingface.co/microsoft/swin-tiny-patch4-window7-224) on the ernie-ai/image-text-examples-ar-cn-latin-notext dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.267
+- Accuracy: 0.882
+## Model description
+It is an image classificatin model fine-tuned to predict whether an images contains text and if that text is Latin script, Chinese or Arabic. It also classifies non-text images.
+## Training and evaluation data
+Dataset: [ernie-ai/image-text-examples-ar-cn-latin-notext]
 # Model Trained Using AutoTrain