- Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS
  Paper • 2411.18478 • Published • 32
- o1-Coder: an o1 Replication for Coding
  Paper • 2412.00154 • Published • 41
- A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models
  Paper • 2411.19477 • Published • 5
- Reverse Thinking Makes LLMs Stronger Reasoners
  Paper • 2411.19865 • Published • 19
Collections including paper arxiv:2411.17863
- Selecting Influential Samples for Long Context Alignment via Homologous Models' Guidance and Contextual Awareness Measurement
  Paper • 2410.15633 • Published • 7
- When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training
  Paper • 2411.13476 • Published • 15
- LongKey: Keyphrase Extraction for Long Documents
  Paper • 2411.17863 • Published • 10

- LLoCO: Learning Long Contexts Offline
  Paper • 2404.07979 • Published • 20
- LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
  Paper • 2402.13753 • Published • 112
- LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration
  Paper • 2402.11550 • Published • 16
- LongAlign: A Recipe for Long Context Alignment of Large Language Models
  Paper • 2401.18058 • Published • 20