26 AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn · 7 authors 2
14 Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration · 8 authors 4
4 NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations · 16 authors
3 Neural Relighting with Subsurface Scattering by Learning the Radiance Transfer Gradient · 6 authors