The case for specialized pre-training: ultra-fast foundation models for dedicated tasks Aug 4, 2024 • 28
Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing Jul 19, 2024 • 20
Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data Apr 18, 2024 • 22