SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation • Paper • 2501.18564 • Published Jan 30, 2025
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics • Paper • 2406.10721 • Published Jun 15, 2024