---
language: en
license: apache-2.0
base_model: Qwen/Qwen2.5-3B-Instruct
tags:
- qwen
- lora
- peft
- causal-lm
---
Qwen2.5-3B-Instruct Fine-tuned Model
This model is a fine-tuned version of Qwen/Qwen2.5-3B-Instruct using LoRA (Low-Rank Adaptation).
Training Details
- Trained for 3 epochs on a custom dataset of 553 examples
- Used 4-bit quantization of the base model to reduce memory use during training
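- Used the LoRA+ technique with a learning-rate ratio of 16.0 (the LoRA B matrices are trained with 16 times the learning rate of the A matrices)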
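The LoRA+ ratio above can be illustrated with a minimal sketch of how learning rates might be assigned per parameter. This is not the training script used for this model: the base learning rate `BASE_LR`, the helper `loraplus_learning_rates`, and the example parameter names are all hypothetical, chosen only to show the 16.0 ratio between the `lora_B` and `lora_A` matrices.

```python
# Sketch of LoRA+ learning-rate grouping. BASE_LR and the parameter names
# below are illustrative assumptions, not values from this model's training run.
LORAPLUS_RATIO = 16.0   # ratio stated in the training details
BASE_LR = 1e-4          # hypothetical learning rate for the LoRA A matrices


def loraplus_learning_rates(param_names, base_lr=BASE_LR, ratio=LORAPLUS_RATIO):
    """Map each LoRA parameter name to its LoRA+ learning rate."""
    rates = {}
    for name in param_names:
        if "lora_B" in name:
            rates[name] = base_lr * ratio   # B matrices get the scaled rate
        elif "lora_A" in name:
            rates[name] = base_lr           # A matrices keep the base rate
        # Any other name (e.g. frozen base weights) is skipped.
    return rates


# Hypothetical parameter names in the style used by LoRA adapters.
example_names = [
    "layers.0.self_attn.q_proj.lora_A.weight",
    "layers.0.self_attn.q_proj.lora_B.weight",
    "layers.0.self_attn.q_proj.base_layer.weight",
]

for name, lr in loraplus_learning_rates(example_names).items():
    print(f"{name}: lr={lr:g}")
```

In a real training run, a mapping like this would typically be turned into optimizer parameter groups, so that a single optimizer updates the A and B matrices at different rates.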