Model
- Developed by: DARJYO
- Base Type: Fine-tuned language model
- Finetuned model : persadian_14B-GRPO
- Base Architecture: Transformer-based/Phi-4
This model is fine-tuned on datasets for tasks with Unsloth and Huggingface's TRL library.
It is based on the unsloth/Phi-4
model and uses reinforcement learning for improved performance.
- Downloads last month
- 4
Hardware compatibility
Log In
to view the estimation