Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 16 days ago • 637
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper • 2307.09288 • Published Jul 18, 2023 • 243
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published 9 days ago • 68
Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published Nov 11 • 34
Weighted-Reward Preference Optimization for Implicit Model Fusion Paper • 2412.03187 • Published 18 days ago • 9
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper • 2412.05271 • Published 16 days ago • 116
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published 18 days ago • 118
Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis Paper • 2412.01819 • Published 20 days ago • 30
Open-Sora Plan: Open-Source Large Video Generation Model Paper • 2412.00131 • Published 24 days ago • 32
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated 25 days ago • 61
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 25 days ago • 49