Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
haizhongzheng
/
Llama-3.2-1B-dpo-lora
like
0
Text Generation
Transformers
TensorBoard
Safetensors
llama
Generated from Trainer
trl
dpo
conversational
text-generation-inference
Inference Endpoints
arxiv:
2305.18290
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
10cc041
Llama-3.2-1B-dpo-lora
/
runs
Commit History
Model save
10cc041
verified
haizhongzheng
commited on
Nov 26
Model save
a02ff0c
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 3821
bc6be28
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 3800
53daba5
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 3700
768122b
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 3600
0a19fb9
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 3500
4fb2e5e
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 3400
56b8cb8
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 3300
ba6927d
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 3200
c281472
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 3100
9880d22
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 3000
26cc6d4
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 2900
62d5e08
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 2800
1baef0d
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 2700
683d5d1
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 2600
03a5f83
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 2500
a713a59
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 2400
65e6267
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 2300
5de0558
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 2200
3a79b09
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 2100
980e0a2
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 2000
aae9afa
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 1900
63c8252
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 1800
c8da2dd
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 1700
0559c8c
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 1600
c245413
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 1500
e1e636d
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 1400
274a53b
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 1300
50a79b3
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 1200
286b939
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 1100
7888d19
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 1000
b74f17b
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 900
8f442a5
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 800
bc6c055
verified
haizhongzheng
commited on
Nov 21
Training in progress, step 700
b3ba910
verified
haizhongzheng
commited on
Nov 20
Training in progress, step 600
bb3fcaa
verified
haizhongzheng
commited on
Nov 20
Training in progress, step 500
5a9c082
verified
haizhongzheng
commited on
Nov 20
Training in progress, step 400
918af94
verified
haizhongzheng
commited on
Nov 20
Training in progress, step 300
b03ae35
verified
haizhongzheng
commited on
Nov 20
Training in progress, step 200
a08a9f5
verified
haizhongzheng
commited on
Nov 20
Training in progress, step 100
4efb4ee
verified
haizhongzheng
commited on
Nov 20