English
t1101675 commited on
Commit
d9b2b78
·
verified ·
1 Parent(s): 5726922

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -2
README.md CHANGED
@@ -10,6 +10,17 @@ base_model:
10
 
11
  # Data Scorer
12
 
13
- The model to scorer data for data selection in the paper [Data Selection via Optimal Learning for Language Models](https://arxiv.org/abs/2410.07064). To use the model, follow the instructions [here](https://github.com/microsoft/LMOps/tree/main/data_selection#5-use-the-data-scorer-to-score-examples).
14
 
15
- NOTE: you may need to download the [fairseq-125M](https://huggingface.co/KoboldAI/fairseq-dense-125M) to `${PATH_TO_DATA_SELECTION_REPO}/checkpoints/fairseq/125M` to prepare the tokenizer and config.json for the base model
 
 
 
 
 
 
 
 
 
 
 
 
10
 
11
  # Data Scorer
12
 
13
+ The model to score data for data selection in the paper [Data Selection via Optimal Learning for Language Models](https://arxiv.org/abs/2410.07064). To use the model, follow the instructions [here](https://github.com/microsoft/LMOps/tree/main/data_selection#5-use-the-data-scorer-to-score-examples).
14
 
15
+ NOTE: you may need to download the [fairseq-125M](https://huggingface.co/KoboldAI/fairseq-dense-125M) to `${PATH_TO_DATA_SELECTION_REPO}/checkpoints/fairseq/125M` to prepare the tokenizer and config.json for the base model.
16
+
17
+ ### Citation
18
+
19
+ ```bibtex
20
+ @article{gu2024data,
21
+ title={Data Selection via Optimal Control for Language Models},
22
+ author={Gu, Yuxian and Dong, Li and Wang, Hongning and Hao, Yaru and Dong, Qingxiu and Wei, Furu and Huang, Minlie},
23
+ journal={arXiv preprint arXiv:2410.07064},
24
+ year={2024}
25
+ }
26
+ ```