Update README.md
Browse files
README.md
CHANGED
@@ -15,6 +15,20 @@ widget:
|
|
15 |
co2_eq_emissions:
|
16 |
emissions: 2.2266908460523576
|
17 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18 |
|
19 |
# Model Trained Using AutoTrain
|
20 |
|
|
|
15 |
co2_eq_emissions:
|
16 |
emissions: 2.2266908460523576
|
17 |
---
|
18 |
+
# finetuned-vit-doc-text-classifer
|
19 |
+
|
20 |
+
This model is a fine-tuned version of Microsoft’s Swin Transformer tiny-sized model [microsoft/swin-tiny-patch4-window7-224](https://huggingface.co/microsoft/swin-tiny-patch4-window7-224) on the ernie-ai/image-text-examples-ar-cn-latin-notext dataset.
|
21 |
+
It achieves the following results on the evaluation set:
|
22 |
+
- Loss: 0.267
|
23 |
+
- Accuracy: 0.882
|
24 |
+
|
25 |
+
## Model description
|
26 |
+
|
27 |
+
It is an image classificatin model fine-tuned to predict whether an images contains text and if that text is Latin script, Chinese or Arabic. It also classifies non-text images.
|
28 |
+
|
29 |
+
## Training and evaluation data
|
30 |
+
|
31 |
+
Dataset: [ernie-ai/image-text-examples-ar-cn-latin-notext]
|
32 |
|
33 |
# Model Trained Using AutoTrain
|
34 |
|