Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning Paper • 2301.09626 • Published Jan 23, 2023
Embedding structure matters: Comparing methods to adapt multilingual vocabularies to new languages Paper • 2309.04679 • Published Sep 9, 2023
An Empirical Study on Cross-lingual Vocabulary Adaptation for Efficient Generative LLM Inference Paper • 2402.10712 • Published Feb 16, 2024
FOCUS: Effective Embedding Initialization for Specializing Pretrained Multilingual Models on a Single Language Paper • 2305.14481 • Published May 23, 2023
The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction Paper • 2312.13558 • Published Dec 21, 2023