llama2-test / env_examples /.env.7b_ggmlv3_q4_0_example
umamicode's picture
Upload folder using huggingface_hub
9bddec3
raw
history blame
741 Bytes
MODEL_PATH = "/path-to/llama-2-7b-chat.ggmlv3.q4_0.bin"
LOAD_IN_8BIT = False
LOAD_IN_4BIT = True
LLAMA_CPP = True
MAX_MAX_NEW_TOKENS = 2048
DEFAULT_MAX_NEW_TOKENS = 1024
MAX_INPUT_TOKEN_LENGTH = 4000
DEFAULT_SYSTEM_PROMPT = "\
You are a helpful, respectful and honest assistant. Always answer as helpfully as possible, while being safe. Your answers should not include any harmful, unethical, racist, sexist, toxic, dangerous, or illegal content. Please ensure that your responses are socially unbiased and positive in nature.
If a question does not make any sense, or is not factually coherent, explain why instead of answering something not correct. If you don't know the answer to a question, please don't share false information.\
"