Update README.md
Browse files
README.md
CHANGED
@@ -12,7 +12,8 @@ library_name: fairseq
|
|
12 |
## Model description
|
13 |
|
14 |
This model was trained from scratch using the [Fairseq toolkit](https://fairseq.readthedocs.io/en/latest/) on a combination of Basque-Catalan datasets
|
15 |
-
totalling approximately 75 million sentence pairs.Parallel Basque-Catalan data was collected from
|
|
|
16 |
|
17 |
## Intended uses and limitations
|
18 |
|
|
|
12 |
## Model description
|
13 |
|
14 |
This model was trained from scratch using the [Fairseq toolkit](https://fairseq.readthedocs.io/en/latest/) on a combination of Basque-Catalan datasets
|
15 |
+
totalling approximately 75 million sentence pairs.Parallel Basque-Catalan data was collected from Opus and additional synthetic data was created from
|
16 |
+
the [Projecte Aina ES-CA corpus](https://huggingface.co/projecte-aina/mt-aina-ca-es) by translating the Spanish side using the ES-EU translator of [HiTZ](http://hitz.eus/). The model was evaluated on the Flores and NTREX evaluation datasets.
|
17 |
|
18 |
## Intended uses and limitations
|
19 |
|