Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published 4 days ago • 31
Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 • 3 days ago • 17
Article Halo: Open Source Health Tracking with Wearables By cyrilzakka • 5 days ago • 75
Hermes: A Large Language Model Framework on the Journey to Autonomous Networks Paper • 2411.06490 • Published 14 days ago • 6
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 9 days ago • 94
Article Releasing the largest multilingual open pretraining dataset By Pclanglais • 11 days ago • 94
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion Paper • 2411.04928 • Published 17 days ago • 48
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated 3 days ago • 176
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping Paper • 2402.14083 • Published Feb 21 • 47
C4AI Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated about 1 month ago • 26
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages Paper • 2410.16153 • Published Oct 21 • 42
Pangea Collection A Fully Open Multilingual Multimodal LLM for 39 Languages • 18 items • Updated 22 days ago • 17
Granite 3.0 Language Models Collection A series of language models trained by IBM and licensed under the Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 20 days ago • 89
LoLCATS Collection Linearizing LLMs with high quality and efficiency. We linearize the full Llama 3.1 model family -- 8b, 70b, 405b -- for the first time! • 4 items • Updated Oct 14 • 14