Chujie Zheng

chujiezheng

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

liked a model 21 days ago

Qwen/Qwen2.5-Omni-7B

upvoted a paper about 2 months ago

START: Self-taught Reasoner with Tools

liked a model about 2 months ago

Qwen/QwQ-32B-GGUF

chujiezheng's activity

commented a paper 2 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103

commented 2 papers 3 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 99

commented 2 papers 4 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 365

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 83

commented 2 papers 5 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 83

Yi-Lightning Technical Report

Paper • 2412.01253 • Published Dec 2, 2024 • 29

New activity in chujiezheng/Mistral7B-PairRM-SPPO-ExPO 7 months ago

Adding Evaluation Results

#1 opened 7 months ago by

leaderboard-pr-bot

New activity in mistralai/Mistral-7B-Instruct-v0.3 11 months ago

no system message?

#14 opened 11 months ago by

mclassHF2023

commented a paper 11 months ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25, 2024 • 11

New activity in chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO 11 months ago

Possibly wrong model

#1 opened 11 months ago by

ByteBrew23

New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO 11 months ago

Update README.md

#3 opened 11 months ago by

chujiezheng

New activity in chujiezheng/Llama3-8B-Chinese-Chat-ExPO 11 months ago

Update README.md

#2 opened 11 months ago by

chujiezheng

New activity in chujiezheng/Llama3-70B-Chinese-Chat-ExPO 11 months ago

Create README.md

#1 opened 11 months ago by

chujiezheng

New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO 11 months ago

Update README.md

#2 opened 11 months ago by

chujiezheng

New activity in chujiezheng/Llama3-8B-Chinese-Chat-ExPO 11 months ago

Create README.md

#1 opened 11 months ago by

chujiezheng

New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO 11 months ago

Create README.md

#1 opened 11 months ago by

chujiezheng

New activity in chujiezheng/LLaMA3-iterative-DPO-final-ExPO 11 months ago

Create README.md

#1 opened 11 months ago by

chujiezheng

New activity in chujiezheng/tulu-2-dpo-13b 11 months ago

Update tokenizer_config.json

#2 opened 11 months ago by

chujiezheng

New activity in allenai/tulu-2-dpo-13b 11 months ago

Update tokenizer_config.json

#4 opened 11 months ago by

chujiezheng