TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies Paper • 2412.10345 • Published 20 days ago • 2
Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension Paper • 2412.03704 • Published 29 days ago • 6
GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment Paper • 2410.08193 • Published Oct 10, 2024 • 3
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning Paper • 2410.06508 • Published Oct 9, 2024 • 10
Decodable and Sample Invariant Continuous Object Encoder Paper • 2311.00187 • Published Oct 31, 2023 • 1
MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences Paper • 2402.08925 • Published Feb 14, 2024 • 1
AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models Paper • 2310.15140 • Published Oct 23, 2023 • 1
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL Paper • 2310.07220 • Published Oct 11, 2023 • 1
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences Paper • 2401.10529 • Published Jan 19, 2024 • 1
Explore Spurious Correlations at the Concept Level in Language Models for Text Classification Paper • 2311.08648 • Published Nov 15, 2023 • 2
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization Paper • 2310.19668 • Published Oct 30, 2023 • 3