Wasm: A Pipeline for Constructing Structured Arabic Interleaved Multimodal Corpora Paper • 2511.07080 • Published Nov 10, 2025 • 33
Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR Paper • 2509.18174 • Published Sep 17, 2025 • 134
Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model Paper • 2505.17894 • Published May 23, 2025 • 220