4 15 3

Kanzhi Cheng

cckevinn

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

upvoted a paper 2 days ago

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

upvoted a paper 3 days ago

Breaking the Data Barrier -- Building GUI Agents Through Task Generalization

View all activity

Organizations

cckevinn's activity

authored a paper 1 day ago

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Paper • 2504.08672 • Published 7 days ago • 50

upvoted a paper 2 days ago

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Paper • 2504.08672 • Published 7 days ago • 50

upvoted a paper 3 days ago

Breaking the Data Barrier -- Building GUI Agents Through Task Generalization

Paper • 2504.10127 • Published 4 days ago • 15

upvoted 2 papers 29 days ago

STEVE: AStep Verification Pipeline for Computer-use Agent Training

Paper • 2503.12532 • Published Mar 16 • 14

φ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation

Paper • 2503.13288 • Published Mar 17 • 49

upvoted a paper about 1 month ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published Mar 16 • 24

commented a paper about 1 month ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published Mar 16 • 24 •

authored a paper about 1 month ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published Mar 16 • 24

liked a Space about 1 month ago

CapArena Auto 1

🥇

Display Leaderboard of LLM Model Evaluations

liked a Space about 2 months ago

ACL Pubcheck

📝

Check your paper for ACL guidelines

upvoted 3 papers 2 months ago

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Paper • 2502.07346 • Published Feb 11 • 53

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published Feb 5 • 24

Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models

Paper • 2501.18119 • Published Jan 30 • 25

updated 2 datasets 3 months ago

OS-Copilot/OS-Genesis-web-data

Updated Mar 17 • 47 • 2

OS-Copilot/OS-Genesis-mobile-data

Viewer • Updated Mar 17 • 51.1k • 170 • 2

authored 4 papers 3 months ago

SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents

Paper • 2401.10935 • Published Jan 17, 2024 • 4

Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models

Paper • 2406.11736 • Published Jun 17, 2024 • 5

Vision-Language Models Can Self-Improve Reasoning via Reflection

Paper • 2411.00855 • Published Oct 30, 2024 • 5

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 89

upvoted a paper 4 months ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 89