SketchAgent: Language-Driven Sequential Sketch Generation Paper • 2411.17673 • Published 25 days ago • 18
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks Paper • 2412.14161 • Published 3 days ago • 41
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated 24 days ago • 256
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 20 days ago • 195
Pangea Collection A Fully Open Multilingual Multimodal LLM for 39 Languages • 18 items • Updated Nov 2 • 17
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 24 days ago • 289
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper • 2407.16741 • Published Jul 23 • 68
UICoder: Finetuning Large Language Models to Generate User Interface Code through Automated Feedback Paper • 2406.07739 • Published Jun 11 • 2
MobileCLIP Models + DataCompDR Data Collection MobileCLIP: Mobile-friendly image-text models with SOTA zero-shot capabilities. DataCompDR: Improved datasets for training image-text SOTA models. • 22 items • Updated Oct 4 • 25
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 8 days ago • 142
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated 4 days ago • 180
Vision Language Models Papers 🖼️💬📝 Collection Papers about vision-language models, most important ones are on top of the list. • 27 items • Updated Apr 30 • 34
LLaVA-Video Collection Models focus on video understanding (previously known as LLaVA-NeXT-Video). • 6 items • Updated Oct 5 • 55