Visual models

Qwen2-VL-72B (Running): Engage in multi-modal conversations with images and videos.