Longhui98 (Longhui Yu)

upvoted a paper 2 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 112

upvoted 2 papers 3 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 48

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 82

upvoted 2 papers 4 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 363

SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator

Paper • 2412.12094 • Published Dec 16, 2024 • 11

upvoted a collection 8 months ago

Qwen2-Math

Collection

Math-specific model series based on Qwen2 • 8 items • Updated Nov 28, 2024 • 51

upvoted a collection 9 months ago

NuminaMath

Collection

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated Feb 10 • 76

upvoted a paper 9 months ago

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18, 2024 • 56

upvoted an article 9 months ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11, 2024

• 118

upvoted a collection 9 months ago

AIMO Progress Prize

Collection

Models and datasets used in the winning solution to the AIMO 1st Progress Prize • 7 items • Updated Jul 19, 2024 • 12

upvoted a paper 11 months ago

Self-Play Preference Optimization for Language Model Alignment

Paper • 2405.00675 • Published May 1, 2024 • 27

upvoted 2 papers over 1 year ago

Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization

Paper • 2311.06243 • Published Nov 10, 2023 • 22

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Paper • 2309.12284 • Published Sep 21, 2023 • 18

Longhui Yu

AI & ML interests

Organizations

Longhui98's activity

Kimi k1.5: Scaling Reinforcement Learning with LLMs

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Qwen2.5 Technical Report

SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator

Qwen2-Math

NuminaMath

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

How NuminaMath Won the 1st AIMO Progress Prize

AIMO Progress Prize

Self-Play Preference Optimization for Language Model Alignment

Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models