- Rho-1: Not All Tokens Are What You Need
  Paper • 2404.07965 • Published • 79
- VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
  Paper • 2404.10667 • Published • 12
- Instruction-tuned Language Models are Better Knowledge Learners
  Paper • 2402.12847 • Published • 25
- DoRA: Weight-Decomposed Low-Rank Adaptation
  Paper • 2402.09353 • Published • 18
Collections
Collections including paper arxiv:2404.03715
- Self-Rewarding Language Models
  Paper • 2401.10020 • Published • 135
- Self-Discover: Large Language Models Self-Compose Reasoning Structures
  Paper • 2402.03620 • Published • 102
- OS-Copilot: Towards Generalist Computer Agents with Self-Improvement
  Paper • 2402.07456 • Published • 39
- Learning From Mistakes Makes LLM Better Reasoner
  Paper • 2310.20689 • Published • 24

- Adapting Large Language Models via Reading Comprehension
  Paper • 2309.09530 • Published • 69
- Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
  Paper • 2404.03715 • Published • 57
- Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
  Paper • 2404.05719 • Published • 57

- Chain-of-Thought Reasoning Without Prompting
  Paper • 2402.10200 • Published • 90
- Self-Discover: Large Language Models Self-Compose Reasoning Structures
  Paper • 2402.03620 • Published • 102
- Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
  Paper • 2404.03715 • Published • 57
- Do language models plan ahead for future tokens?
  Paper • 2404.00859 • Published • 2