F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions Paper โข 2407.12435 โข Published Jul 17 โข 13
SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding Paper โข 2401.09340 โข Published Jan 17 โข 19