Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published 3 days ago • 30
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization Paper • 2411.10442 • Published 9 days ago • 47
RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published 5 days ago • 42
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published 17 days ago • 109
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22 • 126
Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse Paper • 2410.21333 • Published 28 days ago • 9
Counting Ability of Large Language Models and Impact of Tokenization Paper • 2410.19730 • Published 30 days ago • 10
NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples Paper • 2410.14669 • Published Oct 18 • 35
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Paper • 2410.17243 • Published Oct 22 • 88
Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch Paper • 2410.18693 • Published about 1 month ago • 40
Why Does the Effective Context Length of LLMs Fall Short? Paper • 2410.18745 • Published about 1 month ago • 16
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices Paper • 2410.11795 • Published Oct 15 • 16
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 15 items • Updated 22 days ago • 76
Thinking LLMs: General Instruction Following with Thought Generation Paper • 2410.10630 • Published Oct 14 • 16