4 15 25

Longhui Yu

Longhui98

https://yulonghui.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Kimi-VL Technical Report

liked a model 4 days ago

moonshotai/Kimi-VL-A3B-Thinking

authored a paper 3 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

View all activity

Organizations

Longhui98's activity

upvoted a paper 3 days ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published 4 days ago • 102

liked a model 4 days ago

moonshotai/Kimi-VL-A3B-Thinking

Image-Text-to-Text • Updated about 7 hours ago • 4.69k • 267

authored 4 papers 3 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 113

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Paper • 2403.09472 • Published Mar 14, 2024 • 1

Forward-Backward Reasoning in Large Language Models for Mathematical Verification

Paper • 2308.07758 • Published Aug 15, 2023 • 4

DeepVecFont-v2: Exploiting Transformers to Synthesize Vector Fonts with Higher Quality

Paper • 2303.14585 • Published Mar 25, 2023

upvoted a paper 3 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 113

liked 2 models 3 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 18 days ago • 1.45M • • 11.9k

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 18 days ago • 6.7k • 896

upvoted 4 papers 4 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 49

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 83

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 365

SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator

Paper • 2412.12094 • Published Dec 16, 2024 • 11

upvoted a collection 8 months ago

Qwen2-Math

Collection

Math-specific model series based on Qwen2 • 8 items • Updated Nov 28, 2024 • 51

liked 2 datasets 9 months ago

AI-MO/NuminaMath-TIR

Viewer • Updated Nov 25, 2024 • 72.5k • 10.5k • 124

AI-MO/NuminaMath-CoT

Viewer • Updated Nov 25, 2024 • 860k • 4.17k • 440

updated a Space 9 months ago

README

🦀

upvoted a collection 9 months ago

NuminaMath

Collection

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated Feb 10 • 76

upvoted a paper 9 months ago

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18, 2024 • 57

liked a model 9 months ago

mistralai/Mathstral-7B-v0.1

Text Generation • Updated Jul 31, 2024 • 41.2k • 223