Moses25
/

Mistral-7B-Instruct-V0.4

Text Generation

Inference Endpoints

text-generation-inference

Model card Files Files and versions Community

Moses25 commited on Apr 12

Commit

820797a

•

1 Parent(s): f6f2d08

Update README.md

Files changed (1) hide show

README.md +12 -0

README.md CHANGED Viewed

@@ -53,3 +53,15 @@ def predict(content_prompt):
 predict(text)
 output：你好！作为一个大型语言模型，我一直在学习和提高自己的能力。最近，我一直在努力学习新知识、改进算法，以便更好地回答用户的问题并提供帮助。同时，我也会定期接受人工智能专家的指导和评估，以确保我的表现不断提升。希望这些信息对你有所帮助！
 ```

 predict(text)
 output：你好！作为一个大型语言模型，我一直在学习和提高自己的能力。最近，我一直在努力学习新知识、改进算法，以便更好地回答用户的问题并提供帮助。同时，我也会定期接受人工智能专家的指导和评估，以确保我的表现不断提升。希望这些信息对你有所帮助！
 ```
+## vllm server
+```
+llama2-chat-template.jinja file is chat-template above
+model_path=Mistral-7B-Instruct-V0.4
+python  -m vllm.entrypoints.openai.api_server --model=$model_path \
+        --trust-remote-code --host 0.0.0.0  --port 7777 \
+        --gpu-memory-utilization 0.8 \
+        --max-model-len 8192 --chat-template llama2-chat-template.jinja \
+        --tensor-parallel-size 1 --served-model-name chatbot
+```