Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated about 21 hours ago • 275
Iterative Object Count Optimization for Text-to-image Diffusion Models Paper • 2408.11721 • Published Aug 21 • 5
Discriminative Class Tokens for Text-to-Image Diffusion Models Paper • 2303.17155 • Published Mar 30, 2023 • 1
Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation Paper • 2309.16429 • Published Sep 28, 2023 • 11