ProX Refining Models
Collection
Adapted small language models used to generate data refining programs
•
5 items
•
Updated
•
2
Math-doc-refining-lm is an adapted 0.7B-ProX model, fine-tuned for doc level refining via program generation, and can be applied over math pre-training corpus such as open-web-math.
@article{zhou2024programming,
title={Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale},
author={Zhou, Fan and Wang, Zengzhi and Liu, Qian and Li, Junlong and Liu, Pengfei},
journal={arXiv preprint arXiv:2409.17115},
year={2024}
}
Base model
gair-prox/RedPJ-ProX-0.7B