ChatWithMe — Qwen3-4B Fine-tuned on Alpaca

A Qwen3-4B model fine-tuned with LoRA on the Alpaca dataset for instruction-following conversations.

Model Details

  • Base model: Qwen/Qwen3-4B
  • Fine-tuning method: LoRA (r=8, alpha=16)
  • Dataset: yahma/alpaca-cleaned (~52k examples)
  • Chat format: ChatML
  • Training: 1 epoch, SFTTrainer, 4-bit quantization, bf16, cosine scheduler
  • Hardware: NVIDIA A100 80GB
  • Final training loss: 1.0875
  • Final validation loss: 1.0976
  • Perplexity: ~3.00

LoRA Configuration

  • Target modules: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
  • Rank: 8
  • Alpha: 16
  • Dropout: 0.05
  • Trainable parameters: ~13M / 4B total (0.3%)

Usage


Downloads last month
31
Safetensors
Model size
4B params
Tensor type
F16
·
Inference Providers NEW
Input a message to start chatting with welyty/qwen3-4b-alpaca-chatwithme.

Model tree for welyty/qwen3-4b-alpaca-chatwithme

Finetuned
Qwen/Qwen3-4B
Adapter
(1034)
this model
Adapters
1 model

Dataset used to train welyty/qwen3-4b-alpaca-chatwithme

Space using welyty/qwen3-4b-alpaca-chatwithme 1