martimfasantos/tinyllama-1.1b-mt-dpo-full_LR5e-7_BS32_rmsprop_3epochs_test
Text Generation
Transformers
TensorBoard
Safetensors
haoranxu/ALMA-R-Preference
llama
alignment-handbook
trl
dpo
Generated from Trainer
text-generation-inference
Inference Endpoints
License: apache-2.0
Commit History
9da9b97  End of training (verified), martimfasantos committed on Jul 12
34f0fcf  Model save (verified), martimfasantos committed on Jul 12
a540bc2  Training in progress, step 2000 (verified), martimfasantos committed on Jul 12
c105234  Training in progress, step 1900 (verified), martimfasantos committed on Jul 12
e5f7a18  Training in progress, step 1800 (verified), martimfasantos committed on Jul 12
68d06c4  Training in progress, step 1700 (verified), martimfasantos committed on Jul 12
9f13211  Training in progress, step 1500 (verified), martimfasantos committed on Jul 12
a94f5fc  Training in progress, step 1400 (verified), martimfasantos committed on Jul 12
122e274  Training in progress, step 1300 (verified), martimfasantos committed on Jul 12
1ab6605  Training in progress, step 1200 (verified), martimfasantos committed on Jul 12
5e36c31  Training in progress, step 1100 (verified), martimfasantos committed on Jul 12
62e1b53  Training in progress, step 1000 (verified), martimfasantos committed on Jul 12
f6a0123  Training in progress, step 900 (verified), martimfasantos committed on Jul 12
e95b63f  Training in progress, step 800 (verified), martimfasantos committed on Jul 12
4c47248  Training in progress, step 700 (verified), martimfasantos committed on Jul 12
201123a  Training in progress, step 600 (verified), martimfasantos committed on Jul 12
540288e  Training in progress, step 500 (verified), martimfasantos committed on Jul 12
0ad6150  Training in progress, step 400 (verified), martimfasantos committed on Jul 12
472f5a5  Training in progress, step 300 (verified), martimfasantos committed on Jul 12
ca03ad7  Training in progress, step 200 (verified), martimfasantos committed on Jul 12
e5c6f7c  Training in progress, step 100 (verified), martimfasantos committed on Jul 12
7170144  initial commit (verified), martimfasantos committed on Jul 11