PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published 12 days ago • 112
Towards Learning a Generalist Model for Embodied Navigation Paper • 2312.02010 • Published Dec 4, 2023
Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding Paper • 2412.00493 • Published 16 days ago • 15 • 2
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning Paper • 2412.03248 • Published 12 days ago • 25