The case for specialized pre-training: ultra-fast foundation models for dedicated tasks (Aug 4, 2024)
Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing (Jul 19, 2024)
Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data (Apr 18, 2024)