O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper • 2411.16489 • Published 27 days ago • 40
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published 13 days ago • 63
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator Paper • 2412.12094 • Published 6 days ago • 9
Qwen2-Math Collection Math-specific model series based on Qwen2 • 8 items • Updated 25 days ago • 46
NuminaMath Collection Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21 • 67
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies Paper • 2407.13623 • Published Jul 18 • 53