Update README.md
Browse files
README.md
CHANGED
@@ -18,7 +18,8 @@ The model architecture and config are the same as [M2M-100](https://huggingface.
|
|
18 |
SMaLL-100 is a seq-to-seq model for the translation task. The input to the model is ```source:[tgt_lang_code] + src_tokens + [EOS]``` and ```target: tgt_tokens + [EOS]```.
|
19 |
|
20 |
# `small-100-th` is the fine-tuned version of SMALL-100 for Thai
|
21 |
-
The dataset can be acquired from [
|
|
|
22 |
|
23 |
## small-100-th inference
|
24 |
```
|
|
|
18 |
SMaLL-100 is a seq-to-seq model for the translation task. The input to the model is ```source:[tgt_lang_code] + src_tokens + [EOS]``` and ```target: tgt_tokens + [EOS]```.
|
19 |
|
20 |
# `small-100-th` is the fine-tuned version of SMALL-100 for Thai
|
21 |
+
The dataset can be acquired from [scb-mt-en-th-2020](https://airesearch.in.th/releases/machine-translation-datasets/) and [OPUS](https://opus.nlpl.eu/).
|
22 |
+
It can also be directly download from [Vistec](https://github.com/vistec-AI/thai2nmt/releases/tag/scb-mt-en-th-2020%2Bmt-opus_v1.0).
|
23 |
|
24 |
## small-100-th inference
|
25 |
```
|