Xorbits
/

qwen-chat-14B-ggml

ChengjieLi commited on Nov 23, 2023

Commit

11efca5

•

1 Parent(s): 90132bf

Upload folder using huggingface_hub

Files changed (4) hide show

README.md ADDED Viewed

+---
+license: apache-2.0
+---
+## qwen-chat-14B-ggml
+This repo contains GGML format model files for qwen-chat-14B.
+### Example code
+#### Install packages
+```bash
+pip install xinference[ggml]>=0.4.3
+pip install qwen-cpp
+```
+If you want to run with GPU acceleration, refer to [installation](https://github.com/xorbitsai/inference#installation).
+####  Start a local instance of Xinference
+```bash
+xinference -p 9997
+```
+#### Launch and inference
+```python
+from xinference.client import Client
+client = Client("http://localhost:9997")
+model_uid = client.launch_model(
+    model_name="qwen-chat",
+    model_format="ggmlv3",
+    model_size_in_billions=14,
+    quantization="q4_0",
+    )
+model = client.get_model(model_uid)
+chat_history = []
+prompt = "最大的动物是什么？"
+model.chat(
+    prompt,
+    chat_history,
+    generate_config={"max_tokens": 1024}
+)
+```
+### More information
+[Xinference](https://github.com/xorbitsai/inference) Replace OpenAI GPT with another LLM in your app
+by changing a single line of code. Xinference gives you the freedom to use any LLM you need.
+With Xinference, you are empowered to run inference with any open-source language models,
+speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
+<i><a href="https://join.slack.com/t/xorbitsio/shared_invite/zt-1z3zsm9ep-87yI9YZ_B79HLB2ccTq4WA">👉 Join our Slack community!</a></i>

configuration.json ADDED Viewed

+{
+    "framework": "xinference",
+    "task": "code",
+    "model": {
+        "type": "qwen-chat"
+    },
+    "allow_remote": true,
+    "pipeline": {
+        "type": "text-generation-chat-pipeline"
+    }
+}

qwen.tiktoken ADDED Viewed

The diff for this file is too large to render. See raw diff

qwen14b-ggml-q4_0.bin ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:4513662ff761c0edda9730105448ec6cbca3d86a312dc3b657f78d9151edb262
+size 7972657952