update README.md
Browse files
README.md
CHANGED
@@ -58,4 +58,13 @@ with torch.no_grad():
|
|
58 |
|
59 |
print("Label probs:", text_probs) # prints: [[1.0, 0.0, 0.0]]
|
60 |
```
|
|
|
|
|
61 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
58 |
|
59 |
print("Label probs:", text_probs) # prints: [[1.0, 0.0, 0.0]]
|
60 |
```
|
61 |
+
# Model architecture
|
62 |
+
The model was trained a ViT-B/16 Transformer architecture as an image encoder and uses a 12-layer RoBERTa as a text encoder. The text encoder was trained upon the pre-trained Japanese RoBERTa model [rinna/japanese-roberta-base](https://huggingface.co/rinna/japanese-roberta-base) with the same sentencepiece tokenizer.
|
63 |
|
64 |
+
# Training
|
65 |
+
The model was trained on [CC12M](https://github.com/google-research-datasets/conceptual-12m) translated the captions to Japanese.
|
66 |
+
|
67 |
+
|
68 |
+
# License
|
69 |
+
|
70 |
+
[Apache-2.0 license](https://www.apache.org/licenses/LICENSE-2.0)
|