Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published 1 day ago • 34
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training Paper • 2303.13510 • Published Mar 23, 2023 • 1
Robo3D: Towards Robust and Reliable 3D Perception against Corruptions Paper • 2303.17597 • Published Mar 30, 2023 • 1
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction Paper • 2310.01403 • Published Oct 2, 2023 • 1
Evaluating Hallucinations in Chinese Large Language Models Paper • 2310.03368 • Published Oct 5, 2023
MultiModal-GPT: A Vision and Language Model for Dialogue with Humans Paper • 2305.04790 • Published May 8, 2023 • 1
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models Paper • 2306.09347 • Published Jun 15, 2023 • 1
CLIM: Contrastive Language-Image Mosaic for Region Representation Paper • 2312.11376 • Published Dec 18, 2023
T-Eval: Evaluating the Tool Utilization Capability Step by Step Paper • 2312.14033 • Published Dec 21, 2023 • 2
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI Paper • 2312.16170 • Published Dec 26, 2023 • 1
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest Paper • 2307.03601 • Published Jul 7, 2023 • 12
Unified Human-Scene Interaction via Prompted Chain-of-Contacts Paper • 2309.07918 • Published Sep 14, 2023 • 1
Code Needs Comments: Enhancing Code LLMs with Comment Augmentation Paper • 2402.13013 • Published Feb 20, 2024 • 1