This model uses the Idefics2 architecture with pretrained backbones (Mistral-7B for the language model and SigLIP for the vision encoder) and a randomly initialized connector. It is intended as a starting point for pre-training.
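
Because only the connector starts from random weights, one option when beginning pre-training is to freeze the two backbones and train the connector alone. The sketch below is an illustration of that idea, not the authors' recipe. The helper `freeze_all_but` is hypothetical, and it assumes that connector parameters carry the substring `connector` in their names, which should be checked against `model.named_parameters()` before relying on it.

```python
def freeze_all_but(model, trainable_substring="connector"):
    """Freeze every parameter whose name lacks `trainable_substring`.

    Hypothetical helper: works on any object that exposes
    `named_parameters()` the way `torch.nn.Module` does.
    Returns the names of the parameters left trainable.
    """
    trainable = []
    for name, param in model.named_parameters():
        # Assumption: connector weights contain "connector" in their name.
        keep = trainable_substring in name
        param.requires_grad = keep
        if keep:
            trainable.append(name)
    return trainable


# Possible usage (not executed here; the repository id is a placeholder):
#   from transformers import Idefics2ForConditionalGeneration
#   model = Idefics2ForConditionalGeneration.from_pretrained("<this-repo-id>")
#   print(len(freeze_all_but(model)), "connector tensors left trainable")
```

Whether to keep the backbones frozen, unfreeze them later, or use adapters is a training choice this card does not specify.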