NotAiLOL/Qwen2-0.5B-Math

Text Generation

Coding model coming soon!

Uploaded model

Developed by: NotAiLOL
License: apache-2.0
Finetuned from model: unsloth/Qwen2-0.5B-Instruct-bnb-4bit

This Qwen2 model was trained 2x faster with Unsloth and Hugging Face's TRL library.

Details

This model was trained on microsoft/orca-math-word-problems-200k for 3 epochs with rsLoRA + QLoRA.
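For readers unfamiliar with rsLoRA: it differs from standard LoRA only in how the low-rank update is scaled, using alpha / sqrt(r) instead of alpha / r. The sketch below compares the two factors. The values of `alpha` and `r` are hypothetical, since this card does not state the hyperparameters used, and `lora_scaling` is an illustrative helper, not part of Unsloth or TRL.

```python
import math


def lora_scaling(alpha: float, r: int, rank_stabilized: bool = False) -> float:
    """Return the factor that multiplies the low-rank update B @ A."""
    if r <= 0:
        raise ValueError("rank r must be positive")
    # Standard LoRA divides by r; rsLoRA divides by sqrt(r).
    return alpha / math.sqrt(r) if rank_stabilized else alpha / r


# Hypothetical hyperparameters (not taken from this model card).
alpha, r = 16, 64
standard = lora_scaling(alpha, r)
stabilized = lora_scaling(alpha, r, rank_stabilized=True)
print(f"LoRA scaling:   {standard}")
print(f"rsLoRA scaling: {stabilized}")
```

With standard scaling the update shrinks quickly as the rank grows, whereas the rank-stabilized factor decays more slowly, which is the motivation usually given for rsLoRA at higher ranks.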

Training Loss Graph

The model uses the following ChatML-style prompt format:

<|im_start|>system
You are a professional mathematician.<|im_end|>

<|im_start|>user
{}<|im_end|>

<|im_start|>assistant
{}
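A minimal sketch of filling in the template above for a single question. The helper name `build_prompt` and the sample question are hypothetical, and the exact whitespace between turns (single newlines here) is an assumption rather than something the card specifies. The assistant turn is left open so the model can generate the answer.

```python
SYSTEM_PROMPT = "You are a professional mathematician."


def build_prompt(question: str, system: str = SYSTEM_PROMPT) -> str:
    """Format one question using the <|im_start|>/<|im_end|> markers."""
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{question}<|im_end|>\n"
        "<|im_start|>assistant\n"  # left open for the model's answer
    )


# Hypothetical example question.
prompt = build_prompt("A train travels 60 km in 1.5 hours. What is its average speed?")
print(prompt)
```

The resulting string can then be tokenized and passed to whichever generation API is used to run the model.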

Safetensors

Model size: 494M params

Tensor type: FP16
