Uploaded model

  • Developed by: icefog72
  • License: apache-2.0
  • Finetuned from model: icefog72/Ice0.101-20.03-RP

This Mistral model was trained 2x faster with Unsloth and Hugging Face's TRL library.

  • Format: Safetensors
  • Model size: 7.24B params
  • Tensor type: FP16
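
As a rough guide to what the parameter count and tensor type above imply, the sketch below estimates the memory needed for the weights alone. It assumes 2 bytes per FP16 parameter and ignores activations, the KV cache, and framework overhead, so real usage will be higher. The helper name `weight_memory_gib` is hypothetical and not part of any library.

```python
def weight_memory_gib(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate memory for the model weights alone, in GiB.

    bytes_per_param defaults to 2, the size of one FP16 value.
    """
    return num_params * bytes_per_param / 1024**3


# Parameter count taken from the model card (7.24B params).
PARAMS = 7.24e9

fp16_gib = weight_memory_gib(PARAMS)
print(f"FP16 weights: about {fp16_gib:.1f} GiB")
```

By this estimate, the FP16 weights alone need roughly 13.5 GiB, which is one reason quantized variants are often preferred on smaller GPUs.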

Model tree for icefog72/Ice0.101-20.03-RP-GRPO-1

  • Finetuned (1): this model
  • Finetunes: 1 model
  • Quantizations: 3 models