27 Story-to-Motion: Synthesizing Infinite and Controllable Character Animation from Long Text · 4 authors
25 Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models · 14 authors 1
13 MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks · 11 authors
11 GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation · 12 authors 1
11 The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4 · 2 authors
9 SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models · 16 authors