Mantis model family optimized for multi-image reasoning with interleaved text/image format
Multimodal Language Model