Zeyu Qin

qqqzzzyyy

https://alan-qin.github.io/

Alan-Qin

AI & ML interests

Scalable Oversight, AI safety

Recent Activity

updated a model 5 days ago

qqqzzzyyy/qwen2.5-1.5b-simple-rl-math3to5-adaptive_s4

published a model 5 days ago

qqqzzzyyy/qwen2.5-1.5b-simple-rl-math3to5-adaptive_s4

liked a dataset 9 days ago

agentica-org/DeepCoder-Preview-Dataset

View all activity

Organizations

None yet

qqqzzzyyy's activity

upvoted 3 collections about 1 month ago

upvoted 2 papers about 2 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103

Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?

Paper • 2502.12215 • Published Feb 17 • 16

upvoted 2 papers 2 months ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 48

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 118

upvoted a paper 3 months ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 48

upvoted a paper 4 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 83

upvoted a paper 5 months ago

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3, 2024 • 51

upvoted a paper 6 months ago

Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 78

upvoted a collection 7 months ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 11 items • Updated Jan 14 • 80

upvoted a paper 8 months ago

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30, 2024 • 50

upvoted an article 9 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 354

upvoted 3 collections 10 months ago

Qwen1.5

Collection

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated Nov 28, 2024 • 209

Coding Instruction datasets

Collection

4 items • Updated Nov 25, 2024 • 1

Awesome SFT datasets

Collection

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12, 2024 • 131

upvoted a paper 10 months ago

Adam-mini: Use Fewer Learning Rates To Gain More

Paper • 2406.16793 • Published Jun 24, 2024 • 69

upvoted a paper 12 months ago

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

Paper • 2404.16873 • Published Apr 21, 2024 • 30