Text Generation
Transformers
Safetensors
llama
conversational
text-generation-inference
Inference Endpoints
haoranxu committed
Commit c57c245
1 Parent(s): 0c92966

Update README.md

Files changed (1): README.md +12 -2
README.md CHANGED
@@ -60,8 +60,18 @@ base_model:
 ---
 
 
-X-ALMA builds upon [ALMA-R](https://arxiv.org/pdf/2401.08417) by expanding support from 6 to 50 languages. It utilizes a plug-and-play architecture with language-specific modules, complemented by a carefully designed training recipe. This release includes the **X-ALMA pre-trained base model**.
-
+[X-ALMA](https://arxiv.org/pdf/2410.03115) builds upon [ALMA-R](https://arxiv.org/pdf/2401.08417) by expanding support from 6 to 50 languages. It utilizes a plug-and-play architecture with language-specific modules, complemented by a carefully designed training recipe. This release includes the **X-ALMA pre-trained base model**.
+```
+@misc{xu2024xalmaplugplay,
+  title={X-ALMA: Plug & Play Modules and Adaptive Rejection for Quality Translation at Scale},
+  author={Haoran Xu and Kenton Murray and Philipp Koehn and Hieu Hoang and Akiko Eriguchi and Huda Khayrallah},
+  year={2024},
+  eprint={2410.03115},
+  archivePrefix={arXiv},
+  primaryClass={cs.CL},
+  url={https://arxiv.org/abs/2410.03115},
+}
+```
 X-ALMA-13B-Pretrain is pre-trained on 50 languages: en,da,nl,de,is,no,sv,af,ca,ro,gl,it,pt,es,bg,mk,sr,uk,ru,id,ms,th,vi,mg,fr,hu,el,cs,pl,lt,lv,ka,zh,ja,ko,fi,et,gu,hi,mr,ne,ur,az,kk,ky,tr,uz,ar,he,fa.
 
 All X-ALMA checkpoints are released at huggingface:
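
Since the updated card describes a pre-trained, llama-architecture base model released on the Hub, a minimal loading sketch with the `transformers` library may help readers of this diff. The repository ID `haoranxu/X-ALMA-13B-Pretrain`, the fp16/`device_map` settings, and the translation-style prompt below are assumptions for illustration, not taken from the diff itself.

```python
# Minimal sketch: load the X-ALMA pre-trained base model and generate a continuation.
# Assumptions (not from the diff): repo ID "haoranxu/X-ALMA-13B-Pretrain",
# half-precision weights, and a plain text-completion prompt (this is a base model,
# not an instruction-tuned one).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "haoranxu/X-ALMA-13B-Pretrain"  # assumed Hub repository ID

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # load in fp16 to reduce memory
    device_map="auto",          # requires the `accelerate` package
)

# Plain continuation prompt in a translation style; adjust to your use case.
prompt = "Translate this from English to German:\nEnglish: Hello, how are you?\nGerman:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=64, do_sample=False)

# Strip the prompt tokens and decode only the newly generated text.
new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
print(tokenizer.decode(new_tokens, skip_special_tokens=True))
```

For translation with a specific language pair, the language-specific module checkpoints referenced above would be the ones to load instead of this base model.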