projecte-aina
/

roberta-base-ca-cased-sts

Text Classification

semantic textual similarity

Catalan Textual Corpus

Inference Endpoints

Model card Files Files and versions Community

bsc-temu commited on Nov 26, 2021

Commit

ab7e37c

•

1 Parent(s): c1c2dea

Update README.md

Files changed (1) hide show

README.md +10 -3

README.md CHANGED Viewed

@@ -59,11 +59,18 @@ The **roberta-base-ca-cased-sts** is a Semantic Textual Similarity (STS) model f
 We used the TE dataset in Catalan called [STS-ca](https://huggingface.co/datasets/projecte-aina/sts-ca) for training and evaluation.
 ## Evaluation and results
-Below, the evaluation result on the STS-ca test set:
-| Task        | STS-ca (pearson) |
-| ------------|:----|
 | BERTa       | **81.20** |
 For more details, check the fine-tuning and evaluation scripts in the official [GitHub repository](https://github.com/projecte-aina/berta).
 ## Citing

 We used the TE dataset in Catalan called [STS-ca](https://huggingface.co/datasets/projecte-aina/sts-ca) for training and evaluation.
 ## Evaluation and results
+We evaluated the _roberta-base-ca-cased-sts_ on the STS-ca test set against standard multilingual and monolingual baselines:
+| Model       | STS-ca (Pearson)   |
+|:------------|:----|
 | BERTa       | **81.20** |
+| mBERT       | 76.34 |
+| XLM-RoBERTa | 75.40 |
+| WikiBERT-ca | 77.18 |
+| Task | Model        | STS-ca (pearson) |
+|:------------|:------------|:----|
+| Semantic Textual Similarity | roberta-base-ca-cased-sts     | **81.20** |
 For more details, check the fine-tuning and evaluation scripts in the official [GitHub repository](https://github.com/projecte-aina/berta).
 ## Citing