Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
SiliangZ
/
RM_mistral_irl2_initilized_from_sft_lr_5e7_idpo
like
0
Safetensors
mistral
Model card
Files
Files and versions
Community
Train
No model card
Downloads last month
10
Safetensors
Model size
7.11B params
Tensor type
BF16
·
Inference API
Unable to determine this model's library. Check the
docs
.