A Survey on Data Selection for LLM Instruction Tuning
Paper
•
2402.05123
•
Published
•
3
Note 综述
Note 其他相关文章: The Art of Data Selection: A Survey on Data Selection for Fine-Tuning Large Language Models https://openreview.net/pdf?id=hTBD3LYoqd A Survey on Data Quality Dimensions and Tools for Machine Learning https://arxiv.org/abs/2406.19614
Note Stanford. 数据多样性
Note https://huggingface.co./datasets/HuggingFaceFW/fineweb-edu-score-2 https://huggingface.co./HuggingFaceFW/fineweb-edu-classifier