Zeyu Qin

qqqzzzyyy

https://alan-qin.github.io/

Alan-Qin

AI & ML interests

Scalable Oversight, AI safety

Recent Activity

updated a model 8 days ago

qqqzzzyyy/qwen2.5-1.5b-simple-rl-math3to5-adaptive_s4

published a model 8 days ago

qqqzzzyyy/qwen2.5-1.5b-simple-rl-math3to5-adaptive_s4

liked a dataset 12 days ago

agentica-org/DeepCoder-Preview-Dataset

View all activity

Organizations

None yet

qqqzzzyyy's activity

updated a model 8 days ago

qqqzzzyyy/qwen2.5-1.5b-simple-rl-math3to5-adaptive_s4

Updated 8 days ago • 1

published a model 8 days ago

qqqzzzyyy/qwen2.5-1.5b-simple-rl-math3to5-adaptive_s4

Updated 8 days ago • 1

liked a dataset 12 days ago

agentica-org/DeepCoder-Preview-Dataset

Viewer • Updated 13 days ago • 25k • 3.5k • 67

liked a dataset 19 days ago

AndrewZeng/math_level1to5_qwen_prompt

Viewer • Updated 20 days ago • 12k • 54 • 1

liked a dataset 20 days ago

Goedel-LM/Goedel-Pset-v1

Viewer • Updated 4 days ago • 1.73M • 177 • 6

liked a dataset about 1 month ago

obiwan96/owm-cog-behaviors

Viewer • Updated Mar 19 • 28.3k • 301 • 2

upvoted a collection about 1 month ago

Cognitive Behaviors

Collection

4 items • Updated Mar 19 • 2

liked a dataset about 1 month ago

allenai/reward-bench

Viewer • Updated Sep 9, 2024 • 8.11k • 6.98k • 93

upvoted a collection about 1 month ago

DeepSeek-R1

Collection

8 items • Updated Jan 21 • 613

liked a model about 2 months ago

TIGER-Lab/Qwen2.5-Math-7B-CFT

Text Generation • Updated Feb 2 • 46 • 8

liked a dataset about 2 months ago

TIGER-Lab/WebInstruct-CFT

Viewer • Updated Feb 2 • 654k • 378 • 50

upvoted a collection about 2 months ago

NuminaMath

Collection

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated Feb 10 • 77

liked a Space about 2 months ago

README

👀

liked a model about 2 months ago

meta-llama/Llama-3.2-3B

Text Generation • Updated Oct 24, 2024 • 684k • • 551

upvoted 3 papers 2 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103

Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?

Paper • 2502.12215 • Published Feb 17 • 16

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 48

liked a dataset 2 months ago

PKU-Alignment/BeaverTails

Viewer • Updated Oct 17, 2023 • 364k • 6.3k • 53

upvoted a paper 3 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 120

liked a model 3 months ago

LLM-LAT/robust-llama3-8b-instruct

Text Generation • Updated Aug 1, 2024 • 191 • 12