Update README.md
README.md
CHANGED
@@ -10,6 +10,17 @@ base_model:
 
 # Data Scorer
 
-The model to
+The model to score data for data selection in the paper [Data Selection via Optimal Control for Language Models](https://arxiv.org/abs/2410.07064). To use the model, follow the instructions [here](https://github.com/microsoft/LMOps/tree/main/data_selection#5-use-the-data-scorer-to-score-examples).
 
-NOTE: you may need to download the [fairseq-125M](https://huggingface.co/KoboldAI/fairseq-dense-125M) to `${PATH_TO_DATA_SELECTION_REPO}/checkpoints/fairseq/125M` to prepare the tokenizer and config.json for the base model
+NOTE: you may need to download the [fairseq-125M](https://huggingface.co/KoboldAI/fairseq-dense-125M) to `${PATH_TO_DATA_SELECTION_REPO}/checkpoints/fairseq/125M` to prepare the tokenizer and config.json for the base model.
+
+### Citation
+
+```bibtex
+@article{gu2024data,
+title={Data Selection via Optimal Control for Language Models},
+author={Gu, Yuxian and Dong, Li and Wang, Hongning and Hao, Yaru and Dong, Qingxiu and Wei, Furu and Huang, Minlie},
+journal={arXiv preprint arXiv:2410.07064},
+year={2024}
+}
+```
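
The download step described in the NOTE could be scripted roughly as follows. This is a minimal sketch, not part of the commit: it assumes the `huggingface_hub` package is installed and that `PATH_TO_DATA_SELECTION_REPO` is set as an environment variable. The helper names `base_model_dir` and `download_base_model` are hypothetical and do not come from the LMOps repository.

```python
import os
from pathlib import Path

# Model repository named in the README NOTE.
REPO_ID = "KoboldAI/fairseq-dense-125M"


def base_model_dir(repo_root: str) -> Path:
    """Return the checkpoint directory the NOTE asks for (hypothetical helper)."""
    return Path(repo_root) / "checkpoints" / "fairseq" / "125M"


def download_base_model(repo_root: str) -> Path:
    """Download the base model files (tokenizer, config.json, ...) into place."""
    # Imported lazily so the path helper above works without the dependency.
    from huggingface_hub import snapshot_download

    target = base_model_dir(repo_root)
    target.mkdir(parents=True, exist_ok=True)
    snapshot_download(repo_id=REPO_ID, local_dir=str(target))
    return target


if __name__ == "__main__":
    # Assumption: the repo location is provided through the environment.
    root = os.environ.get("PATH_TO_DATA_SELECTION_REPO")
    if root:
        print(download_base_model(root))
    else:
        print("Set PATH_TO_DATA_SELECTION_REPO before running this script.")
```

`snapshot_download` fetches every file in the model repository, which is more than the tokenizer and `config.json` the NOTE mentions; restricting the download to those files is possible but left out here to keep the sketch short.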