|
--- |
|
license: other |
|
datasets: |
|
- ehartford/wizard_vicuna_70k_unfiltered |
|
language: |
|
- en |
|
tags: |
|
- text-generation-inference |
|
pipeline_tag: text-generation |
|
--- |
|
Buy me a coffee if you like this project ;) |
|
<a href="https://www.buymeacoffee.com/s3nh"><img src="https://www.buymeacoffee.com/assets/img/guidelines/download-assets-sm-1.svg" alt=""></a> |
|
|
|
#### Description |
|
|
|
GGML Format model files for [This project](https://huggingface.co/georgesung/llama2_7b_chat_uncensored). |
|
|
|
|
|
### inference |
|
|
|
|
|
```python |
|
|
|
import ctransformers |
|
|
|
from ctransformers import AutoModelForCausalLM |
|
|
|
model = AutoModelForCausalLM.from_pretrained(output_dir, ggml_file, |
|
gpu_layers=32, model_type="llama") |
|
|
|
manual_input: str = "Tell me about your last dream, please." |
|
|
|
|
|
llm(manual_input, |
|
max_new_tokens=256, |
|
temperature=0.9, |
|
top_p= 0.7) |
|
|
|
``` |
|
|
|
|
|
|
|
|
|
#### Original Model card |
|
|
|
# Overview |
|
Fine-tuned [Llama-2 7B](https://huggingface.co/TheBloke/Llama-2-7B-fp16) with an uncensored/unfiltered Wizard-Vicuna conversation dataset [ehartford/wizard_vicuna_70k_unfiltered](https://huggingface.co/datasets/ehartford/wizard_vicuna_70k_unfiltered). |
|
Used QLoRA for fine-tuning. Trained for one epoch on a 24GB GPU (NVIDIA A10G) instance, took ~19 hours to train. |
|
|
|
# Prompt style |
|
The model was trained with the following prompt style: |
|
``` |
|
### HUMAN: |
|
Hello |
|
|
|
### RESPONSE: |
|
Hi, how are you? |
|
|
|
### HUMAN: |
|
I'm fine. |
|
|
|
### RESPONSE: |
|
How can I help you? |
|
... |
|
``` |
|
|
|
# Training code |
|
Code used to train the model is available [here](https://github.com/georgesung/llm_qlora). |
|
|
|
To reproduce the results: |
|
``` |
|
git clone https://github.com/georgesung/llm_qlora |
|
cd llm_qlora |
|
pip install -r requirements.txt |
|
python train.py configs/llama2_7b_chat_uncensored.yaml |
|
``` |