Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
14
YigeYuan
1t4chi
Follow
AI & ML interests
None yet
Recent Activity
liked
a Space
about 2 months ago
allenai/reward-bench
liked
a model
2 months ago
RLHFlow/RewardModel-Mistral-7B-for-DPA-v1
liked
a model
4 months ago
allenai/tulu-v2.5-dpo-13b-hh-rlhf
View all activity
Organizations
None yet
1t4chi
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a Space
about 2 months ago
Running
330
330
Reward Bench Leaderboard
📐
Explore and analyze RewardBench leaderboard data
liked
a model
2 months ago
RLHFlow/RewardModel-Mistral-7B-for-DPA-v1
Text Classification
•
Updated
May 23, 2024
•
475
•
3
liked
3 models
4 months ago
allenai/tulu-v2.5-dpo-13b-hh-rlhf
Text Generation
•
Updated
Jun 14, 2024
•
20
•
1
allenai/tulu-2-dpo-13b
Text Generation
•
Updated
May 17, 2024
•
3.63k
•
20
PKU-Alignment/beaver-7b-v1.0
Reinforcement Learning
•
Updated
May 9, 2024
•
185
•
10
liked
3 datasets
4 months ago
PKU-Alignment/PKU-SafeRLHF
Viewer
•
Updated
Oct 18, 2024
•
164k
•
2.71k
•
126
PKU-Alignment/PKU-SafeRLHF-10K
Viewer
•
Updated
Jul 20, 2023
•
10k
•
138
•
63
unalignment/toxic-dpo-v0.2
Viewer
•
Updated
Jan 9, 2024
•
541
•
99
•
122
liked
2 models
4 months ago
ChenmieNLP/Zephyr-7B-Beta-Helpful
Text Generation
•
Updated
Oct 10, 2024
•
140
•
1
HelpingAI/HelpingAI-9B
Text Generation
•
Updated
Oct 31, 2024
•
162
•
25
liked
2 datasets
5 months ago
rngusry/UltraFeedback-honesty-preferences
Viewer
•
Updated
Aug 3, 2024
•
251k
•
49
•
1
rngusry/UltraFeedback-truthfulness-preferences
Viewer
•
Updated
Jul 25, 2024
•
217k
•
32
•
1
liked
2 models
6 months ago
jointpreferences/mistral_7b_sft_helpful
Text Generation
•
Updated
Apr 2, 2024
•
24
•
1
GraySwanAI/Mistral-7B-Instruct-RR
Text Generation
•
Updated
Jul 9, 2024
•
222
•
4