Submitted by deleted 82 Unified Reward Model for Multimodal Understanding and Generation · 5 authors 2
Submitted by deleted 51 EuroBERT: Scaling Multilingual Encoders for European Languages · 19 authors 6
Submitted by deleted 37 S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information · 6 authors 1
Submitted by deleted 32 Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching · 3 authors 2
Submitted by deleted 13 R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning · 8 authors 1
Submitted by deleted 12 VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control · 7 authors 1
Submitted by deleted 9 TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models · 4 authors 1
Submitted by deleted 9 R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning · 3 authors 2
Submitted by deleted 8 BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities · 10 authors 1
Submitted by deleted 6 TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation · 17 authors 1
Submitted by deleted 5 R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model · 6 authors 1
Submitted by deleted 5 An Empirical Study on Eliciting and Improving R1-like Reasoning Models · 13 authors 1
Submitted by deleted 5 LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding · 11 authors 1
Submitted by deleted 2 Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles · 6 authors 2
Submitted by deleted 1 AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM · 6 authors 1
Submitted by deleted 1 EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test · 4 authors 1