Rethinking Data Selection at Scale: Random Selection is Almost All You Need Paper • 2410.09335 • Published Oct 12 • 14
From Generalist to Specialist: Adapting Vision Language Models via Task-Specific Visual Instruction Tuning Paper • 2410.06456 • Published Oct 9 • 35
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints Paper • 2410.06458 • Published Oct 9 • 8
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective Paper • 2410.23743 • Published 15 days ago • 57
Constraint Back-translation Improves Complex Instruction Following of Large Language Models Paper • 2410.24175 • Published 14 days ago • 15
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages Paper • 2410.23825 • Published 15 days ago • 3
Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse Paper • 2410.21333 • Published 18 days ago • 9
Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning Paper • 2410.19290 • Published 21 days ago • 10
AutoTrain: No-code training for state-of-the-art models Paper • 2410.15735 • Published 25 days ago • 56
Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published 3 days ago • 15
IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization Paper • 2411.06208 • Published 6 days ago • 18
LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation Paper • 2411.04997 • Published 7 days ago • 27
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding Paper • 2411.04282 • Published 8 days ago • 22
Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published 2 days ago • 39