Improving Visual Commonsense in Language Models via Multiple Image Generation Paper • 2406.13621 • Published about 1 month ago • 13
Make It Count: Text-to-Image Generation with an Accurate Number of Objects Paper • 2406.10210 • Published Jun 14 • 75