Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
NicholasCorrado
/
uf-rlced-conifer_tulu-2-7b-dpo-full
like
0
Text Generation
Transformers
Safetensors
data/uf_rlced_conifer
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
uf-rlced-conifer_tulu-2-7b-dpo-full
/
model-00002-of-00003.safetensors
Commit History
Training in progress, step 720
4c99d17
verified
NicholasCorrado
commited on
Aug 30, 2024
Training in progress, step 700
7a5f712
verified
NicholasCorrado
commited on
Aug 30, 2024
Training in progress, step 600
da88afb
verified
NicholasCorrado
commited on
Aug 30, 2024
Training in progress, step 500
bd612dc
verified
NicholasCorrado
commited on
Aug 30, 2024
Training in progress, step 400
9a81384
verified
NicholasCorrado
commited on
Aug 30, 2024
Training in progress, step 300
aed764e
verified
NicholasCorrado
commited on
Aug 30, 2024
Training in progress, step 200
cdc0e13
verified
NicholasCorrado
commited on
Aug 30, 2024
Training in progress, step 100
8d568d2
verified
NicholasCorrado
commited on
Aug 30, 2024
End of training
4b85518
verified
NicholasCorrado
commited on
Aug 30, 2024
Model save
c6d9743
verified
NicholasCorrado
commited on
Aug 30, 2024