CaptionEmporium/flickr-megalith-10m-internvl2-multi-caption Viewer • Updated Aug 28 • 8.51M • 151 • 7
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer Paper • 2401.10208 • Published Jan 18 • 1
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process Paper • 2306.05423 • Published Jun 8, 2023