Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
martimfasantos
/
tinyllama-1.1b-mt-dpo-full_LR5e-8_BS16_3epochs
like
0
Text Generation
Transformers
TensorBoard
Safetensors
haoranxu/ALMA-R-Preference
llama
alignment-handbook
trl
dpo
Generated from Trainer
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
bc95c6b
tinyllama-1.1b-mt-dpo-full_LR5e-8_BS16_3epochs
/
runs
/
Jul08_22-01-02_poseidon
Commit History
Model save
05db55f
verified
martimfasantos
commited on
Jul 9, 2024
Training in progress, step 4100
a090205
verified
martimfasantos
commited on
Jul 8, 2024
Training in progress, step 3900
9a34c87
verified
martimfasantos
commited on
Jul 8, 2024
Training in progress, step 3500
a7f6604
verified
martimfasantos
commited on
Jul 8, 2024
Training in progress, step 3100
14b8c67
verified
martimfasantos
commited on
Jul 8, 2024
Training in progress, step 2900
62f24f9
verified
martimfasantos
commited on
Jul 8, 2024
Training in progress, step 2700
6688735
verified
martimfasantos
commited on
Jul 8, 2024
Training in progress, step 2100
c1d5619
verified
martimfasantos
commited on
Jul 8, 2024
Training in progress, step 1900
ffef321
verified
martimfasantos
commited on
Jul 8, 2024
Training in progress, step 1700
0ca5de2
verified
martimfasantos
commited on
Jul 8, 2024
Training in progress, step 1500
5581888
verified
martimfasantos
commited on
Jul 8, 2024
Training in progress, step 1100
b659e2f
verified
martimfasantos
commited on
Jul 8, 2024
Training in progress, step 700
cc41280
verified
martimfasantos
commited on
Jul 8, 2024
Training in progress, step 500
ce1b31e
verified
martimfasantos
commited on
Jul 8, 2024
Training in progress, step 300
6bbc046
verified
martimfasantos
commited on
Jul 8, 2024