Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 870
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 141
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 2 days ago • 85
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 8 items • Updated 19 days ago • 397
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 2 days ago • 209
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 16 days ago • 107
Phi-4 Collection Phi-4 family of small language and multi-modal models. • 7 items • Updated 11 days ago • 109
Cosmos Tokenizer Collection A suite of image and video tokenizers • 13 items • Updated about 14 hours ago • 39
LLM2CLIP Collection LLM2CLIP makes SOTA pretrained CLIP models even more SOTA. • 11 items • Updated 2 days ago • 55