Wei Xiong
weqweasdas
14 followers · 2 following
https://weixiongust.github.io/WeiXiongUST/index.html
AI & ML interests
Machine learning, RLHF
Recent Activity
updated a dataset 8 days ago
weqweasdas/ep1_2
updated a dataset 9 days ago
weqweasdas/ep1_6
updated a dataset 9 days ago
weqweasdas/ep1_5
weqweasdas's activity
New activity in RLHFlow/LLaMA3-SFT 2 months ago
LLaMA3.1-SFT
3
#3 opened 2 months ago by jackzhang
New activity in Qwen/Qwen2.5-Math-RM-72B 2 months ago
example to service the RM
1
#2 opened 2 months ago by weqweasdas
New activity in RLHFlow/LLaMA3-SFT 3 months ago
How to use llama 3sft model, pipeline or tokenizer.apply_chat_template. Can you provide a simple example? Thank you very much for your contribution
2
#2 opened 3 months ago by ZHIYII
Missing BOS token in tokenized text
2
#1 opened 3 months ago by ZhaofengWu
New activity in RLHF4MATH/Gemma-7B-it-SFT3epoch 4 months ago
Update README.md
#1 opened 4 months ago by weqweasdas
New activity in RLHFlow/ArmoRM-Llama3-8B-v0.1 4 months ago
Special tokens in the vocabulary?
4
#13 opened 4 months ago by nshen7
New activity in sfairXC/FsfairX-LLaMA3-RM-v0.1 5 months ago
TypeError: Got unsupported ScalarType BFloat16
1
#5 opened 5 months ago by AIR-hl
New activity in RLHFlow/pair-preference-model-LLaMA3-8B 5 months ago
Could you please test the consistency of preference between `RLHFlow/pair-preference-model-LLaMA3-8B` and GPT-4 on alpacaeval dataset?
1
#2 opened 5 months ago by rungao2001
commented on a paper 6 months ago
RLHF Workflow: From Reward Modeling to Online RLHF
Paper • 2405.07863 • Published May 13 • 67 • 5
New activity in weqweasdas/RM-Mistral-7B 6 months ago
why vocab size is 32001
1
#3 opened 6 months ago by yechenzhi1
New activity in weqweasdas/RM-Mistral-7B 8 months ago
License
1
#2 opened 8 months ago by ravir123
Fix dataset link
#1 opened 8 months ago by ZennyKenny