Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
martimfasantos
/
tinyllama-1.1b-mt-dpo-qlora
like
0
PEFT
TensorBoard
Safetensors
haoranxu/ALMA-R-Preference
llama
alignment-handbook
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
7174143
tinyllama-1.1b-mt-dpo-qlora
Commit History
Training in progress, step 2700
7174143
verified
martimfasantos
commited on
May 29
Training in progress, step 2600
757a047
verified
martimfasantos
commited on
May 29
Training in progress, step 2500
e58e4ef
verified
martimfasantos
commited on
May 29
Training in progress, step 2400
d52962e
verified
martimfasantos
commited on
May 29
Training in progress, step 2300
095afe2
verified
martimfasantos
commited on
May 29
Training in progress, step 2200
9eb1e20
verified
martimfasantos
commited on
May 29
Training in progress, step 2100
152618d
verified
martimfasantos
commited on
May 29
Training in progress, step 2000
20d2880
verified
martimfasantos
commited on
May 29
Training in progress, step 1900
2720db8
verified
martimfasantos
commited on
May 29
Training in progress, step 1800
42bd05b
verified
martimfasantos
commited on
May 29
Training in progress, step 1700
78f6ec8
verified
martimfasantos
commited on
May 29
Training in progress, step 1600
a14f91c
verified
martimfasantos
commited on
May 29
Training in progress, step 1500
ee689b0
verified
martimfasantos
commited on
May 29
Training in progress, step 1400
45cc2d1
verified
martimfasantos
commited on
May 29
Training in progress, step 1300
c975e63
verified
martimfasantos
commited on
May 29
Training in progress, step 1200
0953321
verified
martimfasantos
commited on
May 29
Training in progress, step 1100
50d4aa0
verified
martimfasantos
commited on
May 29
Training in progress, step 1000
9683cf5
verified
martimfasantos
commited on
May 29
Training in progress, step 900
c44d7a1
verified
martimfasantos
commited on
May 29
Training in progress, step 800
5a27950
verified
martimfasantos
commited on
May 29
Training in progress, step 700
61c2e35
verified
martimfasantos
commited on
May 29
Training in progress, step 600
5e24caa
verified
martimfasantos
commited on
May 29
Training in progress, step 500
38273c6
verified
martimfasantos
commited on
May 29
Training in progress, step 400
2c90df7
verified
martimfasantos
commited on
May 29
Training in progress, step 300
bf0750b
verified
martimfasantos
commited on
May 29
Training in progress, step 200
b55e023
verified
martimfasantos
commited on
May 29
Training in progress, step 100
b7f33dc
verified
martimfasantos
commited on
May 29
initial commit
33e4f6e
verified
martimfasantos
commited on
May 29