check

Files changed (3) hide show

README.md ADDED Viewed

+---
+frameworks:
+- Pytorch
+license: other
+license_name: glm-4
+license_link: LICENSE
+pipeline_tag: text-generation
+tags:
+  - glm
+  - edge
+inference: false
+---
+# GLM-Edge-4B-Chat
+中文阅读, 点击[这里](README_zh.md)
+## Inference with Transformers
+### Installation
+Install the transformers library from the source code:
+```shell
+pip install git+https://github.com/huggingface/transformers.git
+```
+### Inference
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+MODEL_PATH = "THUDM/glm-edge-4b-chat"
+tokenizer = AutoTokenizer.from_pretrained(MODEL_PATH)
+model = AutoModelForCausalLM.from_pretrained(MODEL_PATH, device_map="auto")
+message = [{"role": "user", "content": "hello!"}]
+inputs = tokenizer.apply_chat_template(
+    message,
+    return_tensors="pt",
+    add_generation_prompt=True,
+    return_dict=True,
+).to(model.device)
+generate_kwargs = {
+    "input_ids": inputs["input_ids"],
+    "attention_mask": inputs["attention_mask"],
+    "max_new_tokens": 128,
+    "do_sample": False,
+}
+out = model.generate(**generate_kwargs)
+print(tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
+```
+## License
+The usage of this model’s weights is subject to the terms outlined in the [LICENSE](LICENSE).

README_zh.md ADDED Viewed

+# GLM-Edge-4B-Chat
+## 使用 transformers 库进行推理
+### 安装
+请安装源代码的transformers库。
+```shell
+pip install git+https://github.com/huggingface/transformers.git
+```
+### 推理
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+MODEL_PATH = "THUDM/glm-edge-4b-chat"
+tokenizer = AutoTokenizer.from_pretrained(MODEL_PATH)
+model = AutoModelForCausalLM.from_pretrained(MODEL_PATH, device_map="auto")
+message = [{"role": "user", "content": "hello!"}]
+inputs = tokenizer.apply_chat_template(
+    message,
+    return_tensors="pt",
+    add_generation_prompt=True,
+    return_dict=True,
+).to(model.device)
+generate_kwargs = {
+    "input_ids": inputs["input_ids"],
+    "attention_mask": inputs["attention_mask"],
+    "max_new_tokens": 128,
+    "do_sample": False,
+}
+out = model.generate(**generate_kwargs)
+print(tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
+```
+## 协议
+本模型的权重的使用则需要遵循 [LICENSE](LICENSE)。

generation_config.json DELETED Viewed

@@ -1,10 +0,0 @@
-{
-  "_from_model_config": true,
-  "eos_token_id": [
-    59246,
-    59253,
-    59255
-  ],
-  "pad_token_id": 59246,
-  "transformers_version": "4.47.0.dev0"
-}