arxiv:2605.20668
Seungone Kim PRO
seungone
AI & ML interests
Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment
Recent Activity
upvoted a paper about 1 hour ago
K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts updated a dataset about 1 hour ago
prometheus-eval/k-browsecomp submitted a paper about 1 hour ago
K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts