Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated about 3 hours ago • 6
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated about 2 hours ago • 7
view article Article Halo: Open Source Health Tracking with Wearables By cyrilzakka • 2 days ago • 62
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated 3 days ago • 223
view article Article PyTorchModelHubMixin: Bridging the Gap for Custom AI Models on Hugging Face By not-lain • 10 days ago • 11
Cosmos Tokenizer Collection A suite of image and video tokenizers • 10 items • Updated 15 days ago • 18
C4AI Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 28 days ago • 26
view article Article Transformers.js v3: WebGPU support, new models & tasks, and more… about 1 month ago • 63
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens Paper • 2410.13863 • Published Oct 17 • 35
LoLCATS Collection Linearizing LLMs with high quality and efficiency. We linearize the full Llama 3.1 model family -- 8b, 70b, 405b -- for the first time! • 4 items • Updated Oct 14 • 14
view article Article Assisted Generation: a new direction toward low-latency text generation May 11, 2023 • 30
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22 • 118
view article Article Improving Hugging Face Training Efficiency Through Packing with Flash Attention Aug 21 • 22