vishaljangra29's picture
Promptengineering/text-to-sql
4ce9ae1 verified
metadata
library_name: peft
tags:
  - trl
  - sft
  - generated_from_trainer
base_model: sid321axn/Mistral-7B-text-to-sql-finetuned
datasets:
  - generator
model-index:
  - name: mistral_instruct_generation
    results: []

mistral_instruct_generation

This model is a fine-tuned version of sid321axn/Mistral-7B-text-to-sql-finetuned on the generator dataset. It achieves the following results on the evaluation set:

  • Loss: 0.0869

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0002
  • train_batch_size: 4
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: constant
  • lr_scheduler_warmup_steps: 0.03
  • training_steps: 100

Training results

Training Loss Epoch Step Validation Loss
0.3337 2.5 20 0.1577
0.0427 5.0 40 0.0759
0.0195 7.5 60 0.0768
0.011 10.0 80 0.0795
0.0065 12.5 100 0.0869

Framework versions

  • PEFT 0.10.0
  • Transformers 4.40.2
  • Pytorch 2.2.2+cu121
  • Datasets 2.19.1
  • Tokenizers 0.19.1