Running on Zero 11 💻 Newborn Article Impact Predict Use title and abstract to predict future academic impact
Identity-Preserving Text-to-Video Generation by Frequency Decomposition Paper • 2411.17440 • Published about 1 month ago • 35
Identity-Preserving Text-to-Video Generation by Frequency Decomposition Paper • 2411.17440 • Published about 1 month ago • 35
FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity Paper • 2411.15411 • Published Nov 23 • 7
VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models Paper • 2411.17451 • Published about 1 month ago • 10
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models Paper • 2410.10139 • Published Oct 14 • 51
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions Paper • 2409.18042 • Published Sep 26 • 36