Huertas97 committed 5b95be3 (1 parent: 5329fcf): README avg results

README.md CHANGED
@@ -79,7 +79,19 @@ print(sentence_embeddings)
 ## Evaluation Results
 
 <!--- Describe how your model was evaluated -->
-Check the test results in the Semantic Textual Similarity Tasks. The 15 languages available at the [Multilingual STSB](https://github.com/Huertas97/Multilingual-STSB) have been combined into monolingual and cross-lingual tasks, giving a total of 31 tasks. Monolingual tasks have both sentences from the same language source (e.g., Ar-Ar, Es-Es), while cross-lingual tasks have two sentences, each in a different language being one of them English (e.g., en-ar, en-es).
+Check the test results in the Semantic Textual Similarity Tasks. The 15 languages available at the [Multilingual STSB](https://github.com/Huertas97/Multilingual-STSB) have been combined into monolingual and cross-lingual tasks, giving a total of 31 tasks. Monolingual tasks have both sentences from the same language source (e.g., Ar-Ar, Es-Es), while cross-lingual tasks have two sentences, each in a different language, one of them being English (e.g., en-ar, en-es).
+
+Here we compare the average multilingual semantic textual similarity capabilities of the `paraphrase-multilingual-mpnet-base-v2` base model and the `mstsb-paraphrase-multilingual-mpnet-base-v2` fine-tuned model across the 31 tasks. It is worth noting that both models are multilingual, but the second has been fine-tuned with multilingual data for semantic similarity. The average of the correlation coefficients is computed by transforming each coefficient to a Fisher's z value, averaging the z values, and then back-transforming the mean to a correlation coefficient.
+
+| Model                                       | Average Spearman Cosine Test |
+|---------------------------------------------|------------------------------|
+| mstsb-paraphrase-multilingual-mpnet-base-v2 | 0.835890                     |
+| paraphrase-multilingual-mpnet-base-v2       | 0.818896                     |
+
+For the sake of readability, the tasks have been split into monolingual and cross-lingual tasks.
 
 | Monolingual Task | Pearson Cosine test | Spearman Cosine test |
 |------------------|---------------------|-----------------------|
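The Fisher's z averaging that the added README text describes can be sketched as below. This is a minimal illustration of the averaging procedure, not code from the repository; the function name and the sample coefficients are made up for the example.

```python
import numpy as np

def average_correlations(rs):
    """Average correlation coefficients via Fisher's z-transform.

    Each coefficient r is mapped to z = arctanh(r), the z values are
    averaged, and the mean z is mapped back with tanh. This weights
    strong correlations more faithfully than a plain arithmetic mean.
    """
    z = np.arctanh(np.asarray(rs, dtype=float))
    return float(np.tanh(z.mean()))

# Three hypothetical per-task Spearman coefficients:
avg = average_correlations([0.80, 0.85, 0.90])
print(avg)  # ≈ 0.855, slightly above the plain mean of 0.85
```

Because arctanh is convex on positive correlations, the back-transformed average sits slightly above the arithmetic mean when all coefficients are positive, which is why the reported averages are computed this way rather than by naive averaging.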