metadata
base_model: unsloth/qwen2.5-3b-instruct-unsloth-bnb-4bit
tags:
- text-generation-inference
- transformers
- qwen2
- trl
- grpo
- QLORA
license: apache-2.0
language:
- en
This is a resoning model for solving high school level Math.
Uploaded model
- Developed by: Tapan101
- License: apache-2.0
- Finetuned from model : unsloth/qwen2.5-3b-instruct-unsloth-bnb-4bit