text-generation

Fine-tuned version of Qwen/Qwen2.5-0.5B-Instruct on the Databricks Dolly-15k instruction dataset using LoRA (PEFT) + TRL's SFTTrainer.

How to use

from transformers import pipeline
gen = pipeline("text-generation", model="samandar1105/text-generation")
result = gen([{"role": "user", "content": "Write a short poem about the ocean."}], max_new_tokens=200)
print(result[0]["generated_text"][-1]["content"])

Training details

  • Base model: Qwen/Qwen2.5-0.5B-Instruct
  • Method: LoRA (r=16, alpha=32) via PEFT + TRL SFTTrainer
  • Epochs: 3
  • Learning rate: 2e-4
Downloads last month
-
Safetensors
Model size
0.5B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for samandar1105/text-generation

Adapter
(646)
this model

Dataset used to train samandar1105/text-generation

Spaces using samandar1105/text-generation 2