Multimodal Models • Collection • Multimodal models with leading performance. • 17 items • Updated Jan 17 • 33
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning? • Paper • 2407.01284 • Published Jul 1, 2024 • 81
Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking • Paper • 2502.02339 • Published Feb 4 • 22
Qwen2.5-VL • Collection • Vision-language model series based on Qwen2.5 • 11 items • Updated 16 days ago • 443
ContactDoctor/Bio-Medical-MultiModal-Llama-3-8B-V1 • Image-Text-to-Text • Updated Oct 17, 2024 • 1.54k • 120
Qwen2-VL • Collection • Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 211