bhavitvyamalik
commited on
Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -16,14 +16,14 @@ This repository contains the model weights for translation models trained with M
|
|
16 |
* Dataset: All of OPUS including HPLT
|
17 |
* Model: transformer-base
|
18 |
* Tokenizer: SentencePiece (Unigram)
|
19 |
-
* Cleaning: We use OpusCleaner for cleaning the corpus. Details about rules used can be found in the filter files in [Github](https://github.com/hplt-project/HPLT-MT-Models/tree/main/v1.0/data/ca
|
20 |
|
21 |
To run inference with Marian, refer to the [Inference/Decoding/Translation](https://github.com/hplt-project/HPLT-MT-Models/tree/main/v1.0#inferencedecodingtranslation) section of our GitHub repository.
|
22 |
|
23 |
|
24 |
## Benchmarks
|
25 |
|
26 |
-
| testset | BLEU | chr-F |
|
27 |
| -------------------------------------- | ---- | ----- | ----- |
|
28 |
-
| flores200 | 44.5 | 66.5 | 0.
|
29 |
-
| ntrex | 35.7 | 60.2 | 0.
|
|
|
16 |
* Dataset: All of OPUS including HPLT
|
17 |
* Model: transformer-base
|
18 |
* Tokenizer: SentencePiece (Unigram)
|
19 |
+
* Cleaning: We use OpusCleaner for cleaning the corpus. Details about rules used can be found in the filter files in [Github](https://github.com/hplt-project/HPLT-MT-Models/tree/main/v1.0/data/en-ca/raw/v2)
|
20 |
|
21 |
To run inference with Marian, refer to the [Inference/Decoding/Translation](https://github.com/hplt-project/HPLT-MT-Models/tree/main/v1.0#inferencedecodingtranslation) section of our GitHub repository.
|
22 |
|
23 |
|
24 |
## Benchmarks
|
25 |
|
26 |
+
| testset | BLEU | chr-F | COMET-22 |
|
27 |
| -------------------------------------- | ---- | ----- | ----- |
|
28 |
+
| flores200 | 44.5 | 66.5 | 0.8771 |
|
29 |
+
| ntrex | 35.7 | 60.2 | 0.8545 |
|