Commit fcaea19 by Gaivoronsky (parent: 75a98eb): Update README.md
---

This is a generative model converted to fp16 format, based on [IlyaGusev/saiga_mistral_7b_lora](https://huggingface.co/IlyaGusev/saiga_mistral_7b_lora).

Install vLLM:

```bash
pip install vllm
```

Start the server:

```bash
python -u -m vllm.entrypoints.openai.api_server --host 0.0.0.0 --model Gaivoronsky/Mistral-7B-Saiga
```
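
Loading the model can take a while before the server starts answering. A small readiness check can help; this is a sketch assuming vLLM's default port 8000 and its OpenAI-compatible `/v1/models` listing endpoint:

```python
import time
from urllib.error import URLError
from urllib.request import urlopen

def wait_for_server(url="http://localhost:8000/v1/models", timeout=120):
    """Poll the server's model-listing endpoint until it responds with 200."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with urlopen(url, timeout=5) as resp:
                if resp.status == 200:
                    return True
        except (URLError, OSError):
            # Server not up yet (connection refused); retry shortly.
            time.sleep(2)
    return False
```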

Client:

```python
import openai

# Point the legacy openai client at the local vLLM server
# (OpenAI-compatible API; port 8000 is vLLM's default).
openai.api_base = "http://localhost:8000/v1"
openai.api_key = "EMPTY"  # vLLM does not check the key

response = openai.ChatCompletion.create(
    model="Gaivoronsky/Mistral-7B-Saiga",
    messages=[{"role": "user", "content": "Привет"}],
    max_tokens=512,
)
response["choices"][0]["message"]["content"]
```
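
The same request can also be made without the `openai` package, using only the standard library. A minimal sketch, assuming vLLM's default endpoint `http://localhost:8000/v1`:

```python
import json
from urllib.request import Request, urlopen

# OpenAI-style chat-completions payload, mirroring the client call above.
payload = {
    "model": "Gaivoronsky/Mistral-7B-Saiga",
    "messages": [{"role": "user", "content": "Привет"}],
    "max_tokens": 512,
}

def ask(base_url="http://localhost:8000/v1"):
    # POST the payload to the OpenAI-compatible endpoint served by vLLM.
    req = Request(
        f"{base_url}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urlopen(req, timeout=120) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```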