34 JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models · 12 authors 1
28 Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model · 10 authors 1
6 Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities · 6 authors 1
5 Hiformer: Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems · 8 authors 1