Training procedure

  • total_batch_size: 32
  • epoch: 3
  • lr: 1.0e-4
  • warm-up rate: 0.1
  • type: Lora

Framework versions

  • LLaMA-Factory: v0.9.0

Paper

  • link: arxiv.org/abs/2412.04905

Data

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for iiiiwis/DEMO_Agent

Base model

Qwen/Qwen2-7B
Finetuned
(85)
this model