Tags: Text Generation, Transformers, Safetensors, English, llama, alignment-handbook, trl, dpo, Generated from Trainer, conversational, text-generation-inference, Inference Endpoints
MagpieLM-4B-v0.1 / model-00002-of-00002.safetensors

Commit History

Training in progress, step 765
6271b91
verified

flydust committed on

Training in progress, step 700
9c55b20
verified

flydust committed on

Training in progress, step 600
7e1171d
verified

flydust committed on

Training in progress, step 500
12e0888
verified

flydust committed on

Training in progress, step 400
8e9ab73
verified

flydust committed on

Training in progress, step 300
4372244
verified

flydust committed on

Training in progress, step 200
957b9dd
verified

flydust committed on

Training in progress, step 100
9307052
verified

flydust committed on