Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ShenaoZhang
/
0.001_ablation_5iters_bs128_iter_1
like
0
Text Generation
Transformers
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
conversational
Inference Endpoints
text-generation-inference
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
0.001_ablation_5iters_bs128_iter_1
Commit History
End of training
c4fbcb7
verified
ShenaoZhang
commited on
Apr 24
Model save
0e9204f
verified
ShenaoZhang
commited on
Apr 24
initial commit
aede1ba
verified
ShenaoZhang
commited on
Apr 24