Running 116 116 starvector-1b-im2svg 📈 Convert images and text into scalable vector graphics (SVG) code
MoshiVis v0.1 Collection MoshiVis is a Vision Speech Model built as a perceptually-augmented version of Moshi v0.1 for conversing about image inputs • 8 items • Updated 6 days ago • 15
MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. • 13 items • Updated about 10 hours ago • 29
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 15 days ago • 347
Babel Collection Open Multilingual Large Language Models Serving Over 90% of Global Speakers • 7 items • Updated 19 days ago • 16