XueyingJia/qwen-0.5b-sft-HH-online-dpo
Transformers
TensorBoard
Safetensors
XueyingJia/hh-rlhf-train
Generated from Trainer
trl
online-dpo
Inference Endpoints
arxiv:2402.04792
Commit History (branch: main)
End of training
b572617
verified
XueyingJia
committed on
Dec 11, 2024
Model save
008aa21
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2699
1c1fc28
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2600
c3df3e7
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2500
476f8b2
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2400
1dde8d7
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2300
8d8aca7
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2200
40bb5de
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2100
1eca532
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2000
9c399dd
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1900
a3aecb9
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1800
dafc75f
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1700
9b1ccf8
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1600
0f456ee
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1500
ab8fc54
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1400
5de55a8
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1300
738b75e
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1200
3171dab
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1100
85c6dda
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1000
62d2a97
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 900
557d483
verified
XueyingJia
committed on
Dec 10, 2024
Training in progress, step 800
e9cb018
verified
XueyingJia
committed on
Dec 10, 2024
Training in progress, step 700
40040bc
verified
XueyingJia
committed on
Dec 10, 2024
Training in progress, step 600
b276466
verified
XueyingJia
committed on
Dec 10, 2024
Training in progress, step 500
1142817
verified
XueyingJia
committed on
Dec 10, 2024
Training in progress, step 400
bd94697
verified
XueyingJia
committed on
Dec 10, 2024
Training in progress, step 300
467caa1
verified
XueyingJia
committed on
Dec 10, 2024
Training in progress, step 200
67407c1
verified
XueyingJia
committed on
Dec 10, 2024
Training in progress, step 100
c8026cf
verified
XueyingJia
committed on
Dec 10, 2024
initial commit
dfca4a3
verified
XueyingJia
committed on
Dec 10, 2024