Wenting Zhao

wentingzhao

wentingzhao's activity

updated a dataset 19 days ago

commit0/mbpp

Viewer • Updated 19 days ago • 974 • 743

commented a paper 21 days ago

Challenges in Trustworthy Human Evaluation of Chatbots

Paper • 2412.04363 • Published 22 days ago • 2

updated a dataset 23 days ago

commit0/openai_humaneval

Viewer • Updated 23 days ago • 164 • 98

updated a dataset 24 days ago

wentingzhao/mbpp_predictions_1

Viewer • Updated 24 days ago • 500 • 34

updated a dataset 26 days ago

commit0/commit0

Viewer • Updated 26 days ago • 54 • 45

updated a dataset about 2 months ago

wentingzhao/SWE-bench_Verified

Viewer • Updated Nov 11 • 500 • 33

New activity in wentingzhao/WildHallucinations about 2 months ago

What exactly is the pipeline to use this data?

#2 opened 5 months ago by Ouz-G

Arxiv link to WildHallucinations instead of WildChat (maybe have both)

#3 opened 4 months ago by monsoon-nlp

updated a dataset about 2 months ago

wentingzhao/commit0_combined

Viewer • Updated Oct 28 • 54 • 562

updated 6 datasets 2 months ago

updated a dataset 3 months ago

wentingzhao/WildHallucinations

Viewer • Updated Sep 22 • 7.92k • 68 • 3

authored a paper 3 months ago

A Controlled Study on Long Context Extension and Generalization in LLMs

Paper • 2409.12181 • Published Sep 18 • 43

upvoted a paper 3 months ago

A Controlled Study on Long Context Extension and Generalization in LLMs

Paper • 2409.12181 • Published Sep 18 • 43

authored 2 papers 4 months ago

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations

Paper • 2311.08469 • Published Nov 14, 2023 • 10

WildChat: 1M ChatGPT Interaction Logs in the Wild

Paper • 2405.01470 • Published May 2 • 61