Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
martimfasantos
/
tinyllama-1.1b-sum-dpo-full_LR1e-7_2epochs
like
0
Text Generation
Transformers
TensorBoard
Safetensors
openai/summarize_from_feedback
llama
alignment-handbook
trl
dpo
Generated from Trainer
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
c9e9cc8
tinyllama-1.1b-sum-dpo-full_LR1e-7_2epochs
/
model.safetensors
Commit History
Training in progress, step 1700
60100c7
verified
martimfasantos
commited on
May 15
Training in progress, step 1600
c4a6a0d
verified
martimfasantos
commited on
May 15
Training in progress, step 1500
763fbb4
verified
martimfasantos
commited on
May 15
Training in progress, step 1400
0b52fb0
verified
martimfasantos
commited on
May 15
Training in progress, step 1300
9d18afc
verified
martimfasantos
commited on
May 15
Training in progress, step 1200
b1177d6
verified
martimfasantos
commited on
May 15
Training in progress, step 1100
89582b2
verified
martimfasantos
commited on
May 15
Training in progress, step 1000
92865cd
verified
martimfasantos
commited on
May 15
Training in progress, step 900
f49a51f
verified
martimfasantos
commited on
May 15
Training in progress, step 800
b17b3c2
verified
martimfasantos
commited on
May 15
Training in progress, step 700
9e19549
verified
martimfasantos
commited on
May 15
Training in progress, step 600
86f1ce6
verified
martimfasantos
commited on
May 15
Training in progress, step 500
853ef1f
verified
martimfasantos
commited on
May 15
Training in progress, step 400
08f5c5b
verified
martimfasantos
commited on
May 15
Training in progress, step 300
70f1915
verified
martimfasantos
commited on
May 15
Training in progress, step 200
fda4c7f
verified
martimfasantos
commited on
May 15
Training in progress, step 100
7f16762
verified
martimfasantos
commited on
May 15
Previous
1
2
3
Next