Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
12
10
18
Wei Xiong
weqweasdas
Follow
dangkai-nk's profile picture
mzhaoshuai's profile picture
yifeihe3's profile picture
14 followers
·
2 following
https://weixiongust.github.io/WeiXiongUST/index.html
AI & ML interests
Machine learning, RLHF
Recent Activity
updated
a dataset
about 18 hours ago
weqweasdas/new_8b_self_corr_sft
updated
a dataset
about 18 hours ago
weqweasdas/new_8b_self_corr
updated
a dataset
about 18 hours ago
weqweasdas/new_8b_corr_math
View all activity
Organizations
weqweasdas
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a dataset
12 days ago
RLHFlow/RLHFlow-SFT-Dataset-ver2
Viewer
•
Updated
Nov 2
•
2.32M
•
90
•
4
liked
a model
22 days ago
RLHFlow/Llama3.1-8B-PRM-Mistral-Data
Text Generation
•
Updated
24 days ago
•
1.27k
•
5
liked
2 models
4 months ago
NCSOFT/Llama-3-OffsetBias-RM-8B
Text Classification
•
Updated
Sep 6
•
670
•
21
RLHFlow/LLaMA3-SFT
Text Generation
•
Updated
30 days ago
•
8.13k
•
8
liked
2 models
6 months ago
RLHFlow/LLaMA3-iterative-DPO-final
Text Generation
•
Updated
Oct 14
•
8.57k
•
40
RLHFlow/ArmoRM-Llama3-8B-v0.1
Text Classification
•
Updated
Sep 23
•
11.1k
•
158
liked
4 models
7 months ago
RLHFlow/pair-preference-model-LLaMA3-8B
Text Generation
•
Updated
Oct 14
•
2.39k
•
36
Salesforce/LLaMA-3-8B-SFR-RM-R
Text Classification
•
Updated
May 31
•
17
•
10
Salesforce/LLaMA-3-8B-SFR-SFT-R
Text Generation
•
Updated
May 31
•
7
•
7
Salesforce/LLaMA-3-8B-SFR-Iterative-DPO-R
Text Generation
•
Updated
Jun 12
•
264
•
76
liked
3 models
8 months ago
sfairXC/FsfairX-LLaMA3-RM-v0.1
Text Classification
•
Updated
Oct 14
•
16.9k
•
48
sfairXC/FsfairX-Zephyr-Chat-v0.1
Text Generation
•
Updated
Apr 24
•
17
•
8
weqweasdas/RM-Mistral-7B
Text Classification
•
Updated
Mar 31
•
511
•
22
liked
a Space
9 months ago
Running
279
📐
Reward Bench Leaderboard
liked
2 models
9 months ago
weqweasdas/RM-Gemma-7B
Text Classification
•
Updated
Mar 22
•
189
•
8
weqweasdas/RM-Gemma-2B
Text Classification
•
Updated
Mar 22
•
635
•
17
liked
a model
over 1 year ago
weqweasdas/hh_rlhf_rm_open_llama_3b
Text Classification
•
Updated
Feb 25
•
172
•
16
liked
a Space
over 1 year ago
Runtime error
66
🔥
Robin 7b