yainage90 commited on
Commit
65bf6be
·
verified ·
1 Parent(s): 88cea97

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -3
README.md CHANGED
@@ -10,6 +10,9 @@ pipeline_tag: image-feature-extraction
10
 
11
  This is fashion image feature extractor model.
12
 
 
 
 
13
  # 1. Model Architecture
14
 
15
  I used [microsoft/swin-base-patch4-window7-224](https://huggingface.co/microsoft/swin-base-patch4-window7-224) for base image encoder model. Just added a 128 size fully connected layer to lower embedding size.
@@ -26,11 +29,10 @@ Initially, anchor - positive - negative pairs were explicitly constructed in a 1
26
  User posting images from onthelook and kream were crawled and preprocessed. First, raw data of image-product thumbnail combinations from posts were collected. Then, object detection was performed on posting images, and category classification was performed on product thumbnails to pair images of the same category together. For thumbnail category classification, a trained category classifier was used. Finally, about 290,000 anchor-positive image pairs were created for 6 categories: tops, bottoms, outer, shoes, bags, and hats.
27
  Finally, approximately 290,000 anchor-positive image pairs were created for 6 categories: tops, bottoms, outer, shoes, bags, and hats.
28
 
 
29
 
30
- You can find object-detection model -> [https://huggingface.co/yainage90/fashion-object-detection](https://huggingface.co/yainage90/fashion-object-detection)
31
-
32
- You can find details of model in this github repo -> [fashion-visual-search](https://github.com/yainage90/fashion-visual-search)
33
 
 
34
 
35
  ```python
36
  from PIL import Image
 
10
 
11
  This is fashion image feature extractor model.
12
 
13
+ You can find object-detection model -> [https://huggingface.co/yainage90/fashion-object-detection](https://huggingface.co/yainage90/fashion-object-detection)
14
+ You can find details of model in this github repo -> [fashion-visual-search](https://github.com/yainage90/fashion-visual-search)
15
+
16
  # 1. Model Architecture
17
 
18
  I used [microsoft/swin-base-patch4-window7-224](https://huggingface.co/microsoft/swin-base-patch4-window7-224) for base image encoder model. Just added a 128 size fully connected layer to lower embedding size.
 
29
  User posting images from onthelook and kream were crawled and preprocessed. First, raw data of image-product thumbnail combinations from posts were collected. Then, object detection was performed on posting images, and category classification was performed on product thumbnails to pair images of the same category together. For thumbnail category classification, a trained category classifier was used. Finally, about 290,000 anchor-positive image pairs were created for 6 categories: tops, bottoms, outer, shoes, bags, and hats.
30
  Finally, approximately 290,000 anchor-positive image pairs were created for 6 categories: tops, bottoms, outer, shoes, bags, and hats.
31
 
32
+ <img src="data_sample.png" width="300" alt="data sample">
33
 
 
 
 
34
 
35
+ # 3. Usage
36
 
37
  ```python
38
  from PIL import Image