8 11

Bowen Yu

Tigerph

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

upvoted a paper 2 months ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

upvoted a paper 3 months ago

Evaluating and Aligning CodeLLMs on Human Preference

View all activity

Organizations

Tigerph's activity

upvoted a paper about 2 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 92

upvoted a paper 2 months ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 50

upvoted 2 papers 3 months ago

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published Dec 6, 2024 • 48

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 80

commented a paper 3 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 80 •

upvoted a paper 4 months ago

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 20

commented a paper 4 months ago

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 20 •

upvoted a paper 4 months ago

Language Models can Self-Lengthen to Generate Long Texts

Paper • 2410.23933 • Published Oct 31, 2024 • 18

commented a paper 4 months ago

Language Models can Self-Lengthen to Generate Long Texts

Paper • 2410.23933 • Published Oct 31, 2024 • 18 •

upvoted a paper 5 months ago

Aligning Large Language Models via Self-Steering Optimization

Paper • 2410.17131 • Published Oct 22, 2024 • 23

commented a paper 5 months ago

Aligning Large Language Models via Self-Steering Optimization

Paper • 2410.17131 • Published Oct 22, 2024 • 23 •

upvoted a paper 5 months ago

A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models

Paper • 2410.13841 • Published Oct 17, 2024 • 17

commented a paper 5 months ago

A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models

Paper • 2410.13841 • Published Oct 17, 2024 • 17 •

upvoted a paper 5 months ago

Rethinking Data Selection at Scale: Random Selection is Almost All You Need

Paper • 2410.09335 • Published Oct 12, 2024 • 17

commented a paper 5 months ago

Rethinking Data Selection at Scale: Random Selection is Almost All You Need

Paper • 2410.09335 • Published Oct 12, 2024 • 17 •

upvoted a paper 8 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 161

authored 4 papers 9 months ago

Domain Incremental Lifelong Learning in an Open World

Paper • 2305.06555 • Published May 11, 2023 • 1

Qwen Technical Report

Paper • 2309.16609 • Published Sep 28, 2023 • 35

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch

Paper • 2311.03099 • Published Nov 6, 2023 • 29

Improving Question Generation with Multi-level Content Planning

Paper • 2310.13512 • Published Oct 20, 2023