may
committed on
Commit
•
a4ece6d
1
Parent(s):
404d57d
Update README.md
Browse files
README.md
CHANGED
@@ -1,10 +1,64 @@
|
|
1 |
---
|
2 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
3 |
---
|
4 |
-
## Training procedure
|
5 |
|
6 |
-
|
7 |
|
8 |
-
|
9 |
|
10 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
---
license: mit
language:
- ru
- en
pipeline_tag: conversational
inference: false
tags:
- gpt3
- qlora
- ruGPT-3.5
- chitchat
datasets:
- SiberiaSoft/SiberianPersonaChat
---
This is a chitchat qlora model for [Gaivoronsky/ruGPT-3.5-13B-8bit](https://huggingface.co/Gaivoronsky/ruGPT-3.5-13B-8bit)

## Examples of usage

```python
|
22 |
+
from transformers import AutoTokenizer
|
23 |
+
from auto_gptq import AutoGPTQForCausalLM, get_gptq_peft_model
|
24 |
+
from auto_gptq.utils.peft_utils import GPTQLoraConfig
|
25 |
+
|
26 |
+
|
27 |
+
device = 'cuda:0'
|
28 |
+
model_name = 'Gaivoronsky/ruGPT-3.5-13B-8bit'
|
29 |
+
model_basename = 'gptq_model-8bit-128g'
|
30 |
+
|
31 |
+
|
32 |
+
tokenizer = AutoTokenizer.from_pretrained(model_name, use_fast=True)
|
33 |
+
model = AutoGPTQForCausalLM.from_quantized(
|
34 |
+
'Gaivoronsky/ruGPT-3.5-13B-8bit',
|
35 |
+
model_basename='gptq_model-8bit-128g',
|
36 |
+
variant='bin',
|
37 |
+
trust_remote_code=True,
|
38 |
+
device=device,
|
39 |
+
use_triton=False,
|
40 |
+
quantize_config=None
|
41 |
+
)
|
42 |
+
peft_config = GPTQLoraConfig(
|
43 |
+
inference_mode=True,
|
44 |
+
)
|
45 |
+
model = get_gptq_peft_model(model, peft_config, 'yupich17/SiberianPersona-ruGPT-3.5-qlora')
|
46 |
+
|
47 |
+
|
48 |
+
prompt = """
|
49 |
+
Ты девушка Саша, художница. Увлекаешься нейросетевым искусством. Умеешь программировать. Любишь рисовать. Продолжи диалог:
|
50 |
+
Собеседник: Привет
|
51 |
+
Ты: Привет
|
52 |
+
Собеседник: Как зовут?
|
53 |
+
Ты:
|
54 |
+
""".strip()
|
55 |
+
|
56 |
+
encoded_input = tokenizer(prompt, return_tensors='pt').to(device)
|
57 |
+
output = model.generate(
|
58 |
+
**encoded_input,
|
59 |
+
max_new_tokens=100,
|
60 |
+
do_sample=True,
|
61 |
+
temperature=1,
|
62 |
+
)
|
63 |
+
print(tokenizer.decode(output[0], skip_special_tokens=True))
|
64 |
+
```
|