Qwen3-1.7B Self-Aligned LoRA

LoRA adapter trained for final instruction-following SFT in the self-alignment reproduction pipeline.

Base Model

  • Qwen/Qwen3-1.7B

Local Artifact Source

  • folder: adapter-261949
  • repo id: ITBill/qwen3-1.7b-self-aligned

Training Summary

  • global step: 3
  • train loss: 2.41523806254069
  • eval loss: 2.3819782733917236

Files

  • adapter_model.safetensors: LoRA weights
  • adapter_config.json: PEFT adapter config
  • tokenizer.json and tokenizer_config.json: tokenizer assets used for the run
  • training_summary.json: exported local training summary
Downloads last month
1
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ITBill/qwen3-1.7b-self-aligned

Finetuned
Qwen/Qwen3-1.7B
Adapter
(507)
this model