view article Article Optimizing Pretraining Data Mixes with LLM-Estimated Utility By WillHeld • Jan 22 • 3
Tristan/dclm-perplexity-correlations-spearmanr-no-samp-410m Text Generation • Updated Nov 22, 2024 • 173
Tristan/dclm-perplexity-correlations-spearmanr-no-samp-160m Text Generation • Updated Nov 22, 2024 • 171
Tristan/dclm-perplexity-correlations-160m-target-to-be-bad Text Generation • Updated Nov 19, 2024 • 62