Edit model card

Training procedure

The following bitsandbytes quantization config was used during training:

  • quant_method: gptq
  • bits: 4
  • tokenizer: None
  • dataset: ['Mihir1108/llm4']
  • group_size: 128
  • damp_percent: 0.1
  • desc_act: False
  • sym: True
  • true_sequential: True
  • use_cuda_fp16: False
  • model_seqlen: 4096
  • block_name_to_quantize: model.layers
  • module_name_preceding_first_block: ['model.embed_tokens']
  • batch_size: 1
  • pad_token_id: None
  • disable_exllama: True
  • max_input_length: None

Framework versions

  • PEFT 0.5.0
Downloads last month
1
Inference API
Unable to determine this model’s pipeline type. Check the docs .