danielheinz committed
Commit • e9bd328
1 Parent(s): f23cbcf
Update README.md
README.md CHANGED
@@ -18,6 +18,8 @@ model-index:
 - type: spearmanr
 value: 0.904
 ---
+**INFO**: The model is being continuously updated.
+
 The model is a [multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) model fine-tuned with the task of semantic textual similarity in mind.
 
 ## Model Training
@@ -26,6 +28,10 @@ The model has been fine-tuned on the German subsets of the following datasets:
 - [paws-x](https://huggingface.co/datasets/paws-x)
 - [stsb_multi_mt](https://huggingface.co/datasets/stsb_multi_mt)
 
+The training procedure can be divided into two stages:
+- training on paraphrase corpora with the Multiple Negatives Ranking Loss
+- training on semantic textual similarity using the Cosine Similarity Loss
+
 # Results
 The model achieves the following results:
 - 0.920 on stsb's validation subset
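
The two training stages that this commit adds to the README can be illustrated with a minimal pure-Python sketch of the two objectives. This is not the author's training code: the README names the losses but not the library or hyperparameters. The function names, the toy embeddings, and the `scale=20.0` factor are assumptions made for illustration. The sketch also assumes that the Cosine Similarity Loss is the mean squared error between the predicted cosine similarity and a gold score, and that the Multiple Negatives Ranking Loss is a cross-entropy over in-batch candidates; libraries such as sentence-transformers ship losses under these names.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors given as lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def multiple_negatives_ranking_loss(anchors, positives, scale=20.0):
    """Stage 1 sketch (paraphrase corpora).

    positives[i] is the paraphrase of anchors[i]; every other positives[j]
    in the batch acts as a negative. The loss is the mean cross-entropy of
    picking the correct paraphrase from the scaled cosine scores.
    `scale` is an assumed hyperparameter, not taken from the README.
    """
    total = 0.0
    for i, anchor in enumerate(anchors):
        scores = [scale * cosine(anchor, p) for p in positives]
        # Log-sum-exp with the maximum subtracted for numerical stability.
        top = max(scores)
        log_z = top + math.log(sum(math.exp(s - top) for s in scores))
        total += log_z - scores[i]
    return total / len(anchors)


def cosine_similarity_loss(emb_a, emb_b, gold_scores):
    """Stage 2 sketch (semantic textual similarity).

    Mean squared error between the cosine similarity of each embedding
    pair and its gold similarity score.
    """
    errors = [(cosine(a, b) - y) ** 2 for a, b, y in zip(emb_a, emb_b, gold_scores)]
    return sum(errors) / len(errors)


# Toy 2-d "embeddings" (hypothetical data, for illustration only).
anchors = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[0.9, 0.1], [0.1, 0.9]]      # each paraphrase sits near its anchor
misaligned = [[0.1, 0.9], [0.9, 0.1]]   # paraphrases swapped

loss_aligned = multiple_negatives_ranking_loss(anchors, aligned)
loss_misaligned = multiple_negatives_ranking_loss(anchors, misaligned)
sts_loss = cosine_similarity_loss([[1.0, 0.0]], [[0.0, 1.0]], [1.0])
```

In this toy setup the ranking loss is small when each paraphrase is closest to its own anchor and large when the pairs are swapped, while the similarity loss penalizes the gap between the predicted cosine and the gold score. In an actual fine-tuning run the embeddings would come from the encoder and the losses would be minimized by gradient descent.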