2

Ben Shi

benshi34

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago

Can Language Models Solve Olympiad Programming?

authored a paper 5 days ago

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

authored a paper 5 days ago

IMPersona: Evaluating Individual Level LM Impersonation

Organizations

None yet

benshi34's activity

authored 3 papers 5 days ago

Can Language Models Solve Olympiad Programming?

Paper • 2404.10952 • Published Apr 16, 2024 • 1

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Paper • 2407.12883 • Published Jul 16, 2024 • 9

IMPersona: Evaluating Individual Level LM Impersonation

Paper • 2504.04332 • Published 17 days ago • 1

upvoted a paper 5 days ago

IMPersona: Evaluating Individual Level LM Impersonation

Paper • 2504.04332 • Published 17 days ago • 1

upvoted an article 3 months ago

Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs

Article • Apr 16, 2024 • 15

updated a dataset 4 months ago

benshi34/qual-analysis-reasoning-retrieval

Viewer • Updated Jan 7 • 80 • 22