This model is DeciLM-6b-Instruct, trained specifically for medicine.

Galen uses either of the following prompt templates:

    ### User: {prompt}

    ### Response:

or

    {prompt}
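As a minimal sketch of how the two prompt templates might be applied before sending text to the model, the helper below fills in `{prompt}` for either form. The names `build_prompt`, `INSTRUCT_TEMPLATE`, and `PLAIN_TEMPLATE` are hypothetical, and the exact whitespace (one blank line between the two markers, a newline after `### Response:`) is an assumption rather than something the model card specifies.

```python
# Hypothetical helper names; the template text follows the model card,
# but the exact newlines are an assumption.
INSTRUCT_TEMPLATE = "### User: {prompt}\n\n### Response:\n"
PLAIN_TEMPLATE = "{prompt}"


def build_prompt(prompt: str, instruct: bool = True) -> str:
    """Return the prompt wrapped in the instruct template, or unchanged."""
    template = INSTRUCT_TEMPLATE if instruct else PLAIN_TEMPLATE
    return template.format(prompt=prompt)


if __name__ == "__main__":
    print(build_prompt("What are common side effects of ibuprofen?"))
```

The resulting string can then be tokenized and passed to whatever generation call you use; the plain form simply forwards the text as-is.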

Galen Training Recipe:

  • target_modules = ["q_proj", "v_proj", "gate_proj", "down_proj", "up_proj", "k_proj", "o_proj"]
  • Learning Rate: 4e-4
  • LR Scheduler: constant
  • 250 Steps

Loss (figure omitted)

T3: 1 Hour
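The recipe above can be collected into a small configuration object, sketched here under some assumptions. The `target_modules` list suggests a LoRA-style adapter setup (for example via the `peft` library), but the model card does not say so explicitly, so treat that as a guess. `GalenRecipe`, `lora_kwargs`, and `trainer_kwargs` are hypothetical names, and values the card does not list (rank, alpha, dropout, batch size) are deliberately left out rather than invented.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GalenRecipe:
    """Hyperparameters listed in the model card; nothing else is filled in."""
    target_modules: tuple = (
        "q_proj", "v_proj", "gate_proj", "down_proj",
        "up_proj", "k_proj", "o_proj",
    )
    learning_rate: float = 4e-4
    lr_scheduler_type: str = "constant"
    max_steps: int = 250


def lora_kwargs(recipe: GalenRecipe) -> dict:
    """Keyword arguments for an adapter config (assumed to be LoRA)."""
    return {"target_modules": list(recipe.target_modules)}


def trainer_kwargs(recipe: GalenRecipe) -> dict:
    """Keyword arguments for the optimizer/scheduler side of training."""
    return {
        "learning_rate": recipe.learning_rate,
        "lr_scheduler_type": recipe.lr_scheduler_type,
        "max_steps": recipe.max_steps,
    }


if __name__ == "__main__":
    recipe = GalenRecipe()
    print(lora_kwargs(recipe))
    print(trainer_kwargs(recipe))
```

If `peft` and `transformers` are installed, these dictionaries could plausibly be unpacked into `peft.LoraConfig(**lora_kwargs(recipe))` and `transformers.TrainingArguments(output_dir=..., **trainer_kwargs(recipe))`, with all unlisted settings left at the library defaults.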


Dataset used to train NewstaR/StableGalen-6b