Xu Yifan's picture

3 4

Xu Yifan

xuyifan

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

upvoted a paper 3 months ago

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

upvoted a paper 3 months ago

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

View all activity

Organizations

None yet

xuyifan's activity

upvoted a paper about 2 months ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published Dec 16, 2024 • 17

upvoted 2 papers 3 months ago

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4, 2024 • 34

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Paper • 2410.24024 • Published Oct 31, 2024 • 48

New activity in meta-llama/Llama-3.2-11B-Vision 4 months ago

Error encountered when fine-tuning

#30 opened 4 months ago by

commented a paper 10 months ago

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Paper • 2404.02893 • Published Apr 3, 2024 • 21 •

upvoted a paper 10 months ago

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Paper • 2404.02893 • Published Apr 3, 2024 • 21

New activity in bigscience/bloom over 2 years ago

How to use bloom-176B to generate or evaluate on Multi-graphics?

#62 opened over 2 years ago by

How to use bloom-176B to generate or evaluate on Multi-graphics?

#62 opened over 2 years ago by