Peng

BaolinPeng

AI & ML interests

None yet

Organizations

None yet

BaolinPeng's activity

upvoted a paper about 1 month ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published Feb 18 • 57

upvoted a paper 2 months ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 59

upvoted a paper 3 months ago

HUNYUANPROVER: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving

Paper • 2412.20735 • Published Dec 30, 2024 • 11

upvoted a paper 4 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 150

upvoted a paper 5 months ago

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 21

upvoted a paper 6 months ago

Improving Autonomous AI Agents with Reflective Tree Search and Self-Learning

Paper • 2410.02052 • Published Oct 2, 2024 • 9

upvoted 3 papers 9 months ago

upvoted a paper 10 months ago

Matryoshka Multimodal Models

Paper • 2405.17430 • Published May 27, 2024 • 32

upvoted a paper 11 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 122

upvoted a paper 12 months ago

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18, 2024 • 55

upvoted a paper about 1 year ago

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Paper • 2401.06066 • Published Jan 11, 2024 • 53

upvoted a paper over 1 year ago

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Paper • 2310.12823 • Published Oct 19, 2023 • 35

liked a dataset over 1 year ago

kaist-ai/CoT-Collection

Viewer • Updated Oct 14, 2023 • 1.84M • 1.34k • 142

upvoted a paper over 1 year ago

Stabilizing RLHF through Advantage Model and Selective Rehearsal

Paper • 2309.10202 • Published Sep 18, 2023 • 11