YigeYuan

1t4chi

AI & ML interests

None yet

Recent Activity

updated a model 8 days ago

1t4chi/Qwen2.5-Math-7B-QwQMath8K4096-BS128-LR1e-5-SFT

published a model 8 days ago

1t4chi/Qwen2.5-Math-7B-QwQMath8K4096-BS128-LR1e-5-SFT

updated a model 23 days ago

1t4chi/Qwen2.5-Math-7B-QwQMath8K-SFT

View all activity

Organizations

None yet

1t4chi's activity

updated a model 8 days ago

1t4chi/Qwen2.5-Math-7B-QwQMath8K4096-BS128-LR1e-5-SFT

Updated 8 days ago • 5

published a model 8 days ago

1t4chi/Qwen2.5-Math-7B-QwQMath8K4096-BS128-LR1e-5-SFT

Updated 8 days ago • 5

updated a model 23 days ago

1t4chi/Qwen2.5-Math-7B-QwQMath8K-SFT

Text Generation • Updated 23 days ago • 300

published a model 23 days ago

1t4chi/Qwen2.5-Math-7B-QwQMath8K-SFT

Text Generation • Updated 23 days ago • 300

published a model 24 days ago

1t4chi/Qwen2.5-Math-7B-QwQMath6K-SFT

Updated 24 days ago

updated a model 27 days ago

1t4chi/Qwen2.5-Math-7B-KL0-1.0e-07-checkpoint-532

Updated 27 days ago • 19

published a model 27 days ago

1t4chi/Qwen2.5-Math-7B-KL0-1.0e-07-checkpoint-532

Updated 27 days ago • 19

published a model 29 days ago

1t4chi/Qwen2.5-Math-7B-4GPU-Nothink-KL0.00005

Updated 29 days ago

published a model 30 days ago

1t4chi/Qwen2.5-Math-7B-HJX8k-4GPU-Nothink-KL0.0-FindData

Updated 30 days ago

published a model about 1 month ago

1t4chi/mistral-7b-base-simper

Updated Feb 21

liked a Space 3 months ago

350

Reward Bench Leaderboard

📐

Explore and analyze RewardBench leaderboard data

liked a model 4 months ago

RLHFlow/RewardModel-Mistral-7B-for-DPA-v1

Text Classification • Updated May 23, 2024 • 599 • 3

liked 3 models 5 months ago

liked 3 datasets 5 months ago

PKU-Alignment/PKU-SafeRLHF

Viewer • Updated Oct 18, 2024 • 164k • 4.64k • 133

PKU-Alignment/PKU-SafeRLHF-10K

Viewer • Updated Jul 20, 2023 • 10k • 315 • 63

unalignment/toxic-dpo-v0.2

Viewer • Updated Jan 9, 2024 • 541 • 836 • 126

liked 2 models 5 months ago

ChenmieNLP/Zephyr-7B-Beta-Helpful

Text Generation • Updated Oct 10, 2024 • 4 • 1

HelpingAI/HelpingAI-9B

Text Generation • Updated Oct 31, 2024 • 130 • 25