metadata
license: other
library_name: peft
tags:
  - llama2

Model Card for PIPPA ShareGPT Subset QLora 13b

This is an experimental LoRA focused on roleplay, trained on a subset of PIPPA ShareGPT.

Usage

Custom

SYSTEM: Do thing
USER: {prompt}
CHARACTER:
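
The template above can be filled in programmatically. The following is a minimal sketch, not part of the original card: the helper name `build_prompt` is hypothetical, the default system text simply reuses the "Do thing" placeholder, and it assumes the three lines are separated by single newlines with `CHARACTER:` kept as a literal label.

```python
def build_prompt(user_message: str, system: str = "Do thing") -> str:
    """Format a single turn in the SYSTEM / USER / CHARACTER layout.

    Assumption: lines are joined with one newline each and the prompt
    ends right after "CHARACTER:" so the model continues from there.
    """
    return f"SYSTEM: {system}\nUSER: {user_message}\nCHARACTER:"


if __name__ == "__main__":
    print(build_prompt("Hello, who are you?"))
```

The resulting string would be passed to whatever generation call you use with the base model and this adapter loaded.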

Bias, Risks, and Limitations

This LoRA is not intended for supplying factual information or advice in any form.

Training Details

Training Data

1k conversations from PIPPA ShareGPT.

Training Procedure

The version of this LoRA uploaded to this repository was trained on an 8x RTX A6000 cluster in 8-bit with regular LoRA adapters and a 32-bit AdamW optimizer.

Training Hyperparameters

Trained using a fork of Axolotl with two patches (Patch 1, Patch 2):

  • load_in_4bit: true
  • lora_r: 16
  • lora_alpha: 16
  • lora_dropout: 0.01
  • gradient_accumulation_steps: 8
  • micro_batch_size: 4
  • num_epochs: 3
  • learning_rate: 0.000065
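
Two derived quantities follow from the values listed above: the effective batch size and the LoRA scaling factor. The sketch below is only illustrative. It assumes plain data parallelism with one process per GPU on the 8-GPU cluster mentioned in this card, and the standard LoRA scaling of `lora_alpha / lora_r`; neither assumption is confirmed by the training logs.

```python
# Values copied from the hyperparameter list above.
micro_batch_size = 4
gradient_accumulation_steps = 8
lora_r = 16
lora_alpha = 16

# Assumption: one data-parallel worker per GPU on the 8x A6000 cluster.
num_gpus = 8

# Sequences contributing to one optimizer step on a single GPU.
sequences_per_gpu_step = micro_batch_size * gradient_accumulation_steps

# Sequences contributing to one optimizer step across all GPUs.
effective_batch_size = sequences_per_gpu_step * num_gpus

# Standard LoRA scaling applied to the low-rank update (alpha / r).
lora_scaling = lora_alpha / lora_r

print(f"effective batch size: {effective_batch_size}")  # 256
print(f"LoRA scaling factor: {lora_scaling}")           # 1.0
```

With `lora_alpha` equal to `lora_r`, the adapter update is applied at a scale of 1, so the learning rate alone controls how strongly the adapter weights move.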

Environmental Impact

Fine-tuning this model (13B) on 8x NVIDIA A6000 48GB GPUs in parallel takes about 30 minutes.
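
For a rough sense of scale, the figures above convert to GPU-hours as sketched below. The energy estimate is an upper bound resting on an assumption that is not from this card: a 300 W board power per GPU, drawn continuously for the whole run.

```python
# Figures stated in this card.
num_gpus = 8
wall_clock_minutes = 30

# Total GPU time consumed by the run.
gpu_hours = num_gpus * wall_clock_minutes / 60

# Assumption (not from the card): each GPU draws 300 W for the full run.
assumed_watts_per_gpu = 300
energy_kwh_upper_bound = gpu_hours * assumed_watts_per_gpu / 1000

print(f"GPU-hours: {gpu_hours}")                          # 4.0
print(f"energy upper bound (kWh): {energy_kwh_upper_bound}")
```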