Update README.md
Browse files
README.md
CHANGED
@@ -10,6 +10,9 @@ pipeline_tag: image-feature-extraction
|
|
10 |
|
11 |
This is fashion image feature extractor model.
|
12 |
|
|
|
|
|
|
|
13 |
# 1. Model Architecture
|
14 |
|
15 |
I used [microsoft/swin-base-patch4-window7-224](https://huggingface.co/microsoft/swin-base-patch4-window7-224) for base image encoder model. Just added a 128 size fully connected layer to lower embedding size.
|
@@ -26,11 +29,10 @@ Initially, anchor - positive - negative pairs were explicitly constructed in a 1
|
|
26 |
User posting images from onthelook and kream were crawled and preprocessed. First, raw data of image-product thumbnail combinations from posts were collected. Then, object detection was performed on posting images, and category classification was performed on product thumbnails to pair images of the same category together. For thumbnail category classification, a trained category classifier was used. Finally, about 290,000 anchor-positive image pairs were created for 6 categories: tops, bottoms, outer, shoes, bags, and hats.
|
27 |
Finally, approximately 290,000 anchor-positive image pairs were created for 6 categories: tops, bottoms, outer, shoes, bags, and hats.
|
28 |
|
|
|
29 |
|
30 |
-
You can find object-detection model -> [https://huggingface.co/yainage90/fashion-object-detection](https://huggingface.co/yainage90/fashion-object-detection)
|
31 |
-
|
32 |
-
You can find details of model in this github repo -> [fashion-visual-search](https://github.com/yainage90/fashion-visual-search)
|
33 |
|
|
|
34 |
|
35 |
```python
|
36 |
from PIL import Image
|
|
|
10 |
|
11 |
This is fashion image feature extractor model.
|
12 |
|
13 |
+
You can find object-detection model -> [https://huggingface.co/yainage90/fashion-object-detection](https://huggingface.co/yainage90/fashion-object-detection)
|
14 |
+
You can find details of model in this github repo -> [fashion-visual-search](https://github.com/yainage90/fashion-visual-search)
|
15 |
+
|
16 |
# 1. Model Architecture
|
17 |
|
18 |
I used [microsoft/swin-base-patch4-window7-224](https://huggingface.co/microsoft/swin-base-patch4-window7-224) for base image encoder model. Just added a 128 size fully connected layer to lower embedding size.
|
|
|
29 |
User posting images from onthelook and kream were crawled and preprocessed. First, raw data of image-product thumbnail combinations from posts were collected. Then, object detection was performed on posting images, and category classification was performed on product thumbnails to pair images of the same category together. For thumbnail category classification, a trained category classifier was used. Finally, about 290,000 anchor-positive image pairs were created for 6 categories: tops, bottoms, outer, shoes, bags, and hats.
|
30 |
Finally, approximately 290,000 anchor-positive image pairs were created for 6 categories: tops, bottoms, outer, shoes, bags, and hats.
|
31 |
|
32 |
+
<img src="data_sample.png" width="300" alt="data sample">
|
33 |
|
|
|
|
|
|
|
34 |
|
35 |
+
# 3. Usage
|
36 |
|
37 |
```python
|
38 |
from PIL import Image
|