Qwen3-1.7B Backward LoRA

LoRA adapter for Qwen/Qwen3-1.7B, trained for response-to-instruction backtranslation (generating an instruction from a given response) as part of a reproduction of the self-alignment pipeline.
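
As a rough illustration of what "response-to-instruction" means for the training data, the sketch below swaps the roles of an ordinary (instruction, response) pair so that the response becomes the model input and the instruction becomes the target. The field names (`instruction`, `response`, `input`, `target`) and the sample pair are assumptions made for this example; the actual data schema and prompt template used in the run are not documented here.

```python
from typing import Dict, List


def to_backward_example(pair: Dict[str, str]) -> Dict[str, str]:
    """Swap roles: the response becomes the input, the instruction the target."""
    # Hypothetical field names; the real pipeline may use a different schema.
    return {"input": pair["response"], "target": pair["instruction"]}


def build_backward_dataset(pairs: List[Dict[str, str]]) -> List[Dict[str, str]]:
    """Convert forward (instruction -> response) pairs into backward examples."""
    return [to_backward_example(p) for p in pairs]


# Illustrative seed pair, not taken from the actual training set.
seed_pairs = [
    {
        "instruction": "Name the capital of France.",
        "response": "The capital of France is Paris.",
    },
]

backward_examples = build_backward_dataset(seed_pairs)
print(backward_examples[0])
```

A backward model trained on such examples can then be asked to propose an instruction for an unlabeled piece of text.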

Base Model

  • Qwen/Qwen3-1.7B

Local Artifact Source

  • folder: adapter-261924
  • repo id: ITBill/qwen3-1.7b-backward-lora
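
A minimal sketch of attaching the adapter to the base model with `transformers` and `peft` follows. It assumes both libraries are installed (in versions recent enough to support Qwen3) and that the repo id above is reachable from the Hub. Loading the tokenizer from the base model is a choice made for this example; the adapter repo also ships its own tokenizer assets. The function name `load_backward_model` is hypothetical.

```python
BASE_MODEL_ID = "Qwen/Qwen3-1.7B"
ADAPTER_REPO_ID = "ITBill/qwen3-1.7b-backward-lora"


def load_backward_model(base_id: str = BASE_MODEL_ID,
                        adapter_id: str = ADAPTER_REPO_ID):
    """Load the base model and wrap it with the LoRA adapter weights."""
    # Imported lazily so this module can be imported without the heavy deps.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_id)
    # PeftModel.from_pretrained reads adapter_config.json and the adapter
    # weights from the given repo id or local folder.
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()
    return tokenizer, model


if __name__ == "__main__":
    tokenizer, model = load_backward_model()
    print(type(model).__name__)
```

Passing the local folder path (for example `adapter-261924`) as `adapter_id` should work the same way, since `peft` accepts either a Hub repo id or a directory.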

Training Summary

  • global step: 834
  • train loss: 1.7093052646810774
  • eval loss: 1.5609089136123657
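
To make the loss values easier to interpret, the sketch below converts them to perplexity. It assumes the reported losses are mean per-token cross-entropy in nats, which is the usual convention for causal language model training but is not stated in the summary.

```python
import math

# Values copied from the training summary above.
TRAIN_LOSS = 1.7093052646810774
EVAL_LOSS = 1.5609089136123657


def perplexity(mean_nll: float) -> float:
    """Perplexity is the exponential of the mean negative log-likelihood."""
    return math.exp(mean_nll)


train_ppl = perplexity(TRAIN_LOSS)
eval_ppl = perplexity(EVAL_LOSS)

print(f"train perplexity: {train_ppl:.2f}")
print(f"eval perplexity:  {eval_ppl:.2f}")
```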

Files

  • adapter_model.safetensors: LoRA weights
  • adapter_config.json: PEFT adapter config
  • tokenizer.json and tokenizer_config.json: tokenizer assets used for the run
  • training_summary.json: exported local training summary
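
Before loading from a local copy, it can help to confirm that the files listed above are present. The helper below is a small sketch for that check; the function name `missing_files` is hypothetical, and the folder name in the usage line is the local artifact folder mentioned earlier.

```python
from pathlib import Path
from typing import List

# File names taken from the list above.
EXPECTED_FILES = [
    "adapter_model.safetensors",
    "adapter_config.json",
    "tokenizer.json",
    "tokenizer_config.json",
    "training_summary.json",
]


def missing_files(folder: str) -> List[str]:
    """Return the expected file names that are absent from `folder`."""
    root = Path(folder)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


if __name__ == "__main__":
    absent = missing_files("adapter-261924")
    print("all files present" if not absent else f"missing: {absent}")
```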