text-generation

Fine-tuned version of Qwen/Qwen2.5-0.5B-Instruct on the Databricks Dolly-15k instruction dataset using LoRA (PEFT) + TRL's SFTTrainer.

How to use

from transformers import pipeline
gen = pipeline("text-generation", model="samandar1105/text-generation")
result = gen([{"role": "user", "content": "Write a short poem about the ocean."}], max_new_tokens=200)
print(result[0]["generated_text"][-1]["content"])

Training details

Base model: Qwen/Qwen2.5-0.5B-Instruct
Method: LoRA (r=16, alpha=32) via PEFT + TRL SFTTrainer
Epochs: 3
Learning rate: 2e-4

Downloads last month: -

Safetensors

Model size

0.5B params

Tensor type

BF16

Model tree for samandar1105/text-generation

Base model

Qwen/Qwen2.5-0.5B

Finetuned

Qwen/Qwen2.5-0.5B-Instruct

Adapter

(646)

this model

samandar1105
/

text-generation

text-generation

How to use

Training details

Model tree for samandar1105/text-generation

Dataset used to train samandar1105/text-generation

Spaces using samandar1105/text-generation 2