AmberYifan/llama-7b-sft-DPO
Text Generation
Transformers
TensorBoard
Safetensors
Dahoas/full-hh-rlhf
llama
alignment-handbook
trl
dpo
Generated from Trainer
text-generation-inference
Inference Endpoints
Commit History
Training in progress, step 1600
916d115
verified
AmberYifan committed on May 1
Training in progress, step 1500
ec5e6ca
verified
AmberYifan committed on May 1
Training in progress, step 1400
447402c
verified
AmberYifan committed on May 1
Training in progress, step 1300
38992ae
verified
AmberYifan committed on May 1
Training in progress, step 1200
e6a887e
verified
AmberYifan committed on May 1
Training in progress, step 1100
093385c
verified
AmberYifan committed on May 1
Training in progress, step 1000
2e7e3c9
verified
AmberYifan committed on Apr 30
Training in progress, step 900
82e5822
verified
AmberYifan committed on Apr 30
Training in progress, step 800
dd57f78
verified
AmberYifan committed on Apr 30
Training in progress, step 700
f8eecc8
verified
AmberYifan committed on Apr 30
Training in progress, step 600
802d34b
verified
AmberYifan committed on Apr 30
Training in progress, step 500
01e9f4d
verified
AmberYifan committed on Apr 30
Training in progress, step 400
f75a0b3
verified
AmberYifan committed on Apr 30
Training in progress, step 300
8cb8d93
verified
AmberYifan committed on Apr 30
Training in progress, step 200
e0af658
verified
AmberYifan committed on Apr 30
Training in progress, step 100
5ef624a
verified
AmberYifan committed on Apr 30
initial commit
7d5648f
verified
AmberYifan committed on Apr 30