Zhanhui Zhou

ZHZisZZ

ZHZisZZ

AI & ML interests

None yet

Recent Activity

updated a dataset 19 days ago

ZHZisZZ/imdb_preference

updated a dataset 23 days ago

ZHZisZZ/lima-cot

updated a dataset 23 days ago

ZHZisZZ/helpful-anthropic-raw-cot

View all activity

Organizations

ZHZisZZ's activity

updated a dataset 19 days ago

ZHZisZZ/imdb_preference

Viewer • Updated 19 days ago • 25k • 100 • 3

updated 2 datasets 23 days ago

ZHZisZZ/lima-cot

Viewer • Updated 23 days ago • 1.32k • 37

ZHZisZZ/helpful-anthropic-raw-cot

Viewer • Updated 23 days ago • 59.9k • 115

updated a dataset about 1 month ago

ZHZisZZ/ultrachat_200k_cot

Viewer • Updated about 1 month ago • 60k • 114

liked a dataset 5 months ago

ZHZisZZ/imdb_preference

Viewer • Updated 19 days ago • 25k • 100 • 3

authored a paper 7 months ago

Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level

Paper • 2406.11817 • Published Jun 17, 2024 • 13

updated a model 9 months ago

ZHZisZZ/zephyr-7b-dpo-full

Text Generation • Updated Apr 11, 2024 • 14

liked a Space 9 months ago

Running

308

📐

Reward Bench Leaderboard

updated a model 9 months ago

ZHZisZZ/zephyr-7b-rm-qlora

Updated Apr 4, 2024

liked a model 11 months ago

tomh/toxigen_hatebert

Text Classification • Updated May 2, 2022 • 5k • 11

liked a dataset 12 months ago

mmathys/openai-moderation-api-evaluation

Viewer • Updated Aug 28, 2023 • 1.68k • 166 • 25

liked a dataset about 1 year ago

nvidia/HelpSteer

Viewer • Updated 19 days ago • 37.1k • 1.62k • 229

liked a model about 1 year ago

tiiuae/falcon-180B

Text Generation • Updated Sep 6, 2023 • 1.85k • 1.13k

liked a model over 1 year ago

OpenAssistant/reward-model-deberta-v3-large-v2

Text Classification • Updated Feb 1, 2023 • 19.8k • 212

liked a dataset over 1 year ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 8.38k • 1.24k