Update README.md
Browse files
README.md
CHANGED
@@ -33,7 +33,7 @@ output = model.generate(input_ids, max_length=50)
|
|
33 |
print(tokenizer.decode(output[0], skip_special_tokens=True))
|
34 |
```
|
35 |
### Data
|
36 |
-
We use
|
37 |
|
38 |
### Contributors
|
39 |
Main Contributor: [Sa Phyo Thu Htet](https://github.com/SaPhyoThuHtet)
|
|
|
33 |
print(tokenizer.decode(output[0], skip_special_tokens=True))
|
34 |
```
|
35 |
### Data
|
36 |
+
We use 20,000 Burmese sentences and most are from our open-source [data](https://huggingface.co/datasets/Simbolo-Servicio/wiki-burmese-sentences) which contains 100,000 sentences sourced from Wikipedia.
|
37 |
|
38 |
### Contributors
|
39 |
Main Contributor: [Sa Phyo Thu Htet](https://github.com/SaPhyoThuHtet)
|