ioi-leaderboard/ioi-eval-dummy-openrouter_openai_gpt-3.5-turbo Viewer • Updated about 8 hours ago • 6
ioi-leaderboard/ioi-eval-dummy-openrouter_openai_gpt-3.5-turbo Viewer • Updated about 8 hours ago • 6
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 24 days ago • 195
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 24 days ago • 195
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25, 2024 • 93
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25, 2024 • 93
A Dataset and Strong Baselines for Classification of Czech News Texts Paper • 2307.10666 • Published Jul 20, 2023
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only Paper • 2306.01116 • Published Jun 1, 2023 • 33