SVRL/verl-scalable-0402_batch768_clipratio0.3_Qwen2.5-14B_scale-reasoning-data-v2-nos-fixmc-fil0-fil8 Updated 2 days ago
ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations Paper • 2504.00824 • Published 10 days ago • 38
MoCha: Towards Movie-Grade Talking Character Synthesis Paper • 2503.23307 • Published 12 days ago • 115
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows Paper • 2411.07763 • Published Nov 12, 2024
When Attention Sink Emerges in Language Models: An Empirical View Paper • 2410.10781 • Published Oct 14, 2024
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18 • 16
Predictive Data Selection: The Data That Predicts Is the Data That Teaches Paper • 2503.00808 • Published Mar 2 • 57
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild Paper • 2503.18892 • Published 18 days ago • 29
SkyLadder: Better and Faster Pretraining via Context Window Scheduling Paper • 2503.15450 • Published 22 days ago • 11
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers Paper • 2503.11579 • Published 28 days ago • 18
YuE: Scaling Open Foundation Models for Long-Form Music Generation Paper • 2503.08638 • Published about 1 month ago • 62
ABC: Achieving Better Control of Multimodal Embeddings using VLMs Paper • 2503.00329 • Published Mar 1 • 18
Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models Paper • 2310.07712 • Published Oct 11, 2023
DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers Paper • 2502.18460 • Published Feb 25 • 2
Document Screenshot Retrievers are Vulnerable to Pixel Poisoning Attacks Paper • 2501.16902 • Published Jan 28
VISA: Retrieval Augmented Generation with Visual Source Attribution Paper • 2412.14457 • Published Dec 19, 2024