Wei Xiong's picture

Wei Xiong

weqweasdas

·

https://weixiongust.github.io/WeiXiongUST/index.html

AI & ML interests

Machine learning, RLHF

Recent Activity

updated a dataset 1 day ago

raftstudy/uf_iter3

published a dataset 1 day ago

raftstudy/uf_iter3

updated a dataset 1 day ago

raftstudy/uf_iter2

View all activity

Organizations

Papers 4

arxiv:2405.07863

arxiv:2312.11456

arxiv:2306.12420

arxiv:2304.06767

models 23

weqweasdas/zephyr-7b-dpo-full

Text Generation • Updated May 3, 2024 • 2

weqweasdas/zephyr-7b-gemma-dpo

Updated May 1, 2024

weqweasdas/zephyr-7b-sft-full

Updated Apr 30, 2024

weqweasdas/zephyr-7b-dpo-qlora

Updated Apr 30, 2024

weqweasdas/gpt2-cpt-dutch

Text Generation • Updated Apr 29, 2024 • 9

weqweasdas/zephyr-7b-gemma-sft

Updated Apr 29, 2024

weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6_weight085

Text Generation • Updated Apr 16, 2024 • 8

weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6

Text Generation • Updated Apr 16, 2024 • 2

weqweasdas/raft_baseline_zephyr_packing_model6

Text Generation • Updated Apr 15, 2024 • 2

weqweasdas/raft_baseline_openchat_llama13b_model1

Text Generation • Updated Apr 14, 2024 • 6

datasets 184

weqweasdas/amc23

Viewer • Updated 18 days ago • 40 • 80

weqweasdas/minerva_math

Viewer • Updated 18 days ago • 272 • 88

weqweasdas/olympiadbench

Viewer • Updated 18 days ago • 675 • 84

weqweasdas/aime24

Viewer • Updated 18 days ago • 30 • 80

weqweasdas/math500

Viewer • Updated 18 days ago • 500 • 78

weqweasdas/medium

Viewer • Updated Feb 14 • 10.7k • 55

weqweasdas/numia_hard

Viewer • Updated Feb 14 • 29.2k • 100

weqweasdas/rs_numia30k

Viewer • Updated Jan 30 • 30.6k • 33

weqweasdas/rs_math_train

Viewer • Updated Jan 29 • 7.5k • 30

weqweasdas/rs_math_test

Viewer • Updated Jan 29 • 5k • 45