Qwen3-1.7B-code-explainer

Model Description

Fine-tuned from Qwen/Qwen3-1.7B using QLoRA (4-bit) with supervised fine-tuning.

Training Details

  • Dataset: Dannys0n/tts-test-dataset
  • LoRA rank: 16, alpha: 32
  • Epochs: 3, Learning rate: 0.0002

Intended Use

This model is a test model used for the CS-394/594 class at DigiPen.

The model is designed as a text generation model that makes live Esports-style casting commentary via json-in & json-out

Limitations

Specifically deesigned for CS2 via GSI webhook on the map Dust2 This model is a single-turn model and has not been trained on support long, multi-turn conversations.

Downloads last month
1
Safetensors
Model size
2B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Dannys0n/Qwen3-1.7B-code-explainer

Finetuned
Qwen/Qwen3-1.7B
Adapter
(536)
this model