Shudan Zhang

zdaniel0222

Daniel-0222

AI & ML interests

None yet

Organizations

None yet

zdaniel0222's activity

upvoted a paper 4 months ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published Dec 16, 2024 • 18

authored a paper 5 months ago

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Paper • 2410.24024 • Published Oct 31, 2024 • 50

upvoted 3 papers 5 months ago

NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Prompts

Paper • 2405.04520 • Published May 7, 2024 • 1

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4, 2024 • 37

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Paper • 2410.24024 • Published Oct 31, 2024 • 50

authored a paper 8 months ago

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Paper • 2408.06327 • Published Aug 12, 2024 • 17