Magpie-Align/Llama-3-8B-Magpie-Align-v0.1
Text Generation
Transformers
TensorBoard
Safetensors
princeton-nlp/llama3-ultrafeedback
Magpie-Align/Magpie-Pro-MT-300K-v0.1
English
llama
alignment-handbook
axolotl
trl
dpo
sft
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
arxiv: 2406.08464
License: llama3
main
Llama-3-8B-Magpie-Align-v0.1
Commit History
Update README.md · f5f3f01 · verified · flydust committed on Aug 19
Update README.md · cf55cfe · verified · flydust committed on Jul 24
Update README.md · bab3aa1 · verified · flydust committed on Jul 18
Update README.md · 1dddcd8 · verified · flydust committed on Jul 9
Update README.md · 924ee48 · verified · flydust committed on Jul 7
Update README.md · d2ab7f2 · verified · flydust committed on Jul 5
Update README.md · a83ddac · verified · flydust committed on Jul 3
Upload magpie_logo.png · 06520aa · verified · flydust committed on Jul 3
Update README.md · 083b719 · verified · flydust committed on Jul 3
Upload magpie_logo.png · b6cf28b · verified · flydust committed on Jul 3
Update README.md · 15fa6c7 · verified · flydust committed on Jul 3
update tokenizer (fix wrong bos setup) · 26b5458 · verified · flydust committed on Jun 30
End of training · c1052f0 · verified · flydust committed on Jun 29
Model save · 568bba4 · verified · flydust committed on Jun 29
Training in progress, step 400 · 128bcc0 · verified · flydust committed on Jun 29
Training in progress, step 300 · 5eb2ed1 · verified · flydust committed on Jun 29
Training in progress, step 200 · 90132c2 · verified · flydust committed on Jun 29
Training in progress, step 100 · 31252d5 · verified · flydust committed on Jun 29
initial commit · 77a3456 · verified · flydust committed on Jun 29