---
language:
- en
- ur
license: apache-2.0
datasets:
- iwslt14
metrics:
- bleu
library_name: fairseq
pipeline_tag: translation
---
### English to Urdu Translation
This English-to-Urdu translation model is a Transformer trained on IWSLT back-translated data using Fairseq.
The model was produced during experiments on building context-aware NMT models for low-resource languages such as Urdu, Hindi, Sindhi, Pashto and Punjabi. This particular model does not contain any contextual information; it is a baseline sentence-level Transformer model.
Evaluation is done on the WMT2017 standard test set.
* source group: English
* target group: Urdu
* model: transformer
* Contextual: no (sentence-level baseline)
* Test Set: WMT2017
* pre-processing: Moses + Indic Tokenizer (see the sketch below)
* Dataset + Library Details: [DLNMT](https://github.com/sami-haq99/nrpu-dlnmt)
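The list above only names the pre-processing tools; the exact pipeline (truecasing, BPE, etc.) is defined in the DLNMT repository linked above. Below is a minimal sketch of what that pre-processing could look like, assuming the `sacremoses` and `indic-nlp-library` Python packages:
```
# Illustrative only: the actual pre-processing pipeline is in the DLNMT repository.
from sacremoses import MosesTokenizer
from indicnlp.tokenize import indic_detokenize

en_tok = MosesTokenizer(lang='en')

def preprocess_en(text):
    # Moses tokenization for the English source side
    return en_tok.tokenize(text, return_str=True)

def postprocess_ur(text):
    # Indic detokenization for the Urdu target side
    return indic_detokenize.trivial_detokenize(text, lang='ur')
```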
## Benchmarks
| Test set | BLEU  |
|----------|-------|
| WMT2017  | 57.95 |
## How to use the model
* This model can be accessed via git clone:
```
git clone https://huggingface.co./samiulhaq/iwslt-bt-en-ur
```
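Note that Hugging Face stores model checkpoints via Git LFS; if the cloned files are only small pointer files, install LFS and clone again:
```
git lfs install
```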
* You can use the Fairseq library to load the model and generate translations:
```
from fairseq.models.transformer import TransformerModel
```
#### Load the model
```
# 'path/to/model' is the directory cloned above (e.g. ./iwslt-bt-en-ur)
model = TransformerModel.from_pretrained('path/to/model')
```
#### Set the model to evaluation mode
```
model.eval()
```
#### Perform inference
```
input_text = 'Hello, how are you?'
output_text = model.translate(input_text)
print(output_text)
```
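Putting the steps together, here is a minimal end-to-end sketch. The checkpoint file name and data directory passed to `from_pretrained` are assumptions; adjust them to whatever the cloned repository actually contains:
```
from fairseq.models.transformer import TransformerModel

# 'iwslt-bt-en-ur' is the directory created by the `git clone` above;
# the checkpoint and dictionary file names are assumptions -- check the repo.
en2ur = TransformerModel.from_pretrained(
    'iwslt-bt-en-ur',
    checkpoint_file='model.pt',   # assumed checkpoint name
    data_name_or_path='.',        # folder holding the fairseq dictionaries
)
en2ur.eval()

print(en2ur.translate('Hello, how are you?'))
```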