YigeYuan's picture

14

YigeYuan

1t4chi

·

AI & ML interests

None yet

Recent Activity

updated a model 16 days ago

1t4chi/Qwen2.5-Math-7B-QwQMath8K4096-BS128-LR1e-5-SFT

published a model 16 days ago

1t4chi/Qwen2.5-Math-7B-QwQMath8K4096-BS128-LR1e-5-SFT

updated a model about 1 month ago

1t4chi/Qwen2.5-Math-7B-QwQMath8K-SFT

View all activity

Organizations

None yet

1t4chi's activity

liked a Space 4 months ago

Reward Bench Leaderboard

Explore and analyze RewardBench leaderboard data

liked a model 4 months ago

RLHFlow/RewardModel-Mistral-7B-for-DPA-v1

Text Classification • Updated May 23, 2024 • 591 • 4

liked 3 models 6 months ago

allenai/tulu-v2.5-dpo-13b-hh-rlhf

Text Generation • Updated Jun 14, 2024 • 4 • 1

allenai/tulu-2-dpo-13b

Text Generation • Updated May 17, 2024 • 1.63k • 20

PKU-Alignment/beaver-7b-v1.0

Reinforcement Learning • Updated May 9, 2024 • 36 • 10

liked 3 datasets 6 months ago

PKU-Alignment/PKU-SafeRLHF

Viewer • Updated Oct 18, 2024 • 164k • 4.66k • 134

PKU-Alignment/PKU-SafeRLHF-10K

Viewer • Updated Jul 20, 2023 • 10k • 397 • 63

unalignment/toxic-dpo-v0.2

Viewer • Updated Jan 9, 2024 • 541 • 1.26k • 125

liked 2 models 6 months ago

ChenmieNLP/Zephyr-7B-Beta-Helpful

Text Generation • Updated Oct 10, 2024 • 1 • 1

HelpingAI/HelpingAI-9B

Text Generation • Updated Oct 31, 2024 • 55 • 25

liked 2 datasets 7 months ago

rngusry/UltraFeedback-honesty-preferences

Viewer • Updated Aug 3, 2024 • 251k • 25 • 1

rngusry/UltraFeedback-truthfulness-preferences

Viewer • Updated Jul 25, 2024 • 217k • 22 • 1

liked 2 models 7 months ago

jointpreferences/mistral_7b_sft_helpful

Text Generation • Updated Apr 2, 2024 • 2 • 1

GraySwanAI/Mistral-7B-Instruct-RR

Text Generation • Updated Jul 9, 2024 • 156 • 4