view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 688
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 135
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Dec 13, 2024 • 85
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 21 days ago • 344
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 9 days ago • 182
Running on Zero 1.76k 1.76k Chat With Janus-Pro-7B 🌍 A unified multimodal understanding and generation model.