Yuxuan Fan's picture

5 4 4

Yuxuan Fan

feiba54

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation

upvoted a paper 5 months ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

upvoted a paper 5 months ago

Harnessing Webpage UIs for Text-Rich Visual Understanding

View all activity

Organizations

None yet

feiba54's activity

upvoted a paper 4 days ago

FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation

Paper • 2503.06680 • Published 5 days ago • 17

upvoted 2 papers 5 months ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 75

Harnessing Webpage UIs for Text-Rich Visual Understanding

Paper • 2410.13824 • Published Oct 17, 2024 • 31

upvoted a paper 6 months ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4, 2024 • 72

liked a dataset 6 months ago

NeelNanda/counterfact-tracing

Viewer • Updated Nov 5, 2022 • 21.9k • 140 • 13

liked a dataset 8 months ago

Dahoas/full-hh-rlhf

Viewer • Updated Feb 23, 2023 • 125k • 3.37k • 78

New activity in open-llm-leaderboard/open_llm_leaderboard about 1 year ago

meta-llama/Llama-2-70b-hf is set as "Private or deleted"

#580 opened about 1 year ago by

What's the difference between 'acc' and 'acc_norm' metric?

#578 opened about 1 year ago by

Where to find the evaluation results of certain model?

#582 opened about 1 year ago by

Where to find evaluation results in detail?

#568 opened about 1 year ago by

How are eval details formatted?

#567 opened about 1 year ago by

liked a dataset over 1 year ago

pietrolesci/amazoncat-13k

Viewer • Updated Oct 2, 2023 • 5.99M • 1.04k • 1

liked a model almost 2 years ago

luodian/llama-7b-hf

Text Generation • Updated Jun 23, 2023 • 1.56k • 35