Magpie-Align/Llama-3.1-8B-Magpie-Align-v0.2
Safetensors
Magpie-Align/Llama-3.1-70B-PO-100K-armorm
English
llama
alignment-handbook
trl
dpo
Generated from Trainer
arxiv:2406.08464
arxiv:2406.12845
License: llama3.1
Commit History
Update README.md (30e6682, verified): Zhangchen Xu committed on Aug 19, 2024
Update README.md (9ebe5f7, verified): Zhangchen Xu committed on Aug 19, 2024
Update README.md (88d17cd, verified): Zhangchen Xu committed on Aug 19, 2024
End of training (f98f101, verified): Zhangchen Xu committed on Aug 3, 2024
Model save (d8eec6f, verified): Zhangchen Xu committed on Aug 3, 2024
Training in progress, step 765 (cf8884e, verified): Zhangchen Xu committed on Aug 3, 2024
Training in progress, step 700 (037bbde, verified): Zhangchen Xu committed on Aug 3, 2024
Training in progress, step 600 (0fa5b18, verified): Zhangchen Xu committed on Aug 3, 2024
Training in progress, step 500 (0a183b3, verified): Zhangchen Xu committed on Aug 3, 2024
Training in progress, step 400 (b8c4d60, verified): Zhangchen Xu committed on Aug 3, 2024
Training in progress, step 300 (fa08f18, verified): Zhangchen Xu committed on Aug 3, 2024
Training in progress, step 200 (bf37a10, verified): Zhangchen Xu committed on Aug 2, 2024
Training in progress, step 100 (f92c0a2, verified): Zhangchen Xu committed on Aug 2, 2024
initial commit (ba80c4f, verified): Zhangchen Xu committed on Aug 2, 2024