CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Paper • 2504.13161 • Published 7 days ago • 86
An Empirical Study of GPT-4o Image Generation Capabilities Paper • 2504.05979 • Published 16 days ago • 61
CoRe^2: Collect, Reflect and Refine to Generate Better and Faster Paper • 2503.09662 • Published Mar 12 • 34