clonefy commited on
Commit
b6eb0ef
1 Parent(s): e6f98dd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -2,7 +2,7 @@
2
  license: other
3
  license_name: tongyi-qianwen-research
4
  license_link: >-
5
- https://huggingface.co/Qwen/Qwen-1.5-0_5B-Chat/blob/main/LICENSE
6
  language:
7
  - en
8
  pipeline_tag: text-generation
@@ -10,7 +10,7 @@ tags:
10
  - chat
11
  ---
12
 
13
- # Qwen-1.5-0.5B-Chat
14
 
15
 
16
  ## Introduction
@@ -47,10 +47,10 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
47
  device = "cuda" # the device to load the model onto
48
 
49
  model = AutoModelForCausalLM.from_pretrained(
50
- "Qwen/Qwen-1.5-0_5B-Chat",
51
  device_map="auto"
52
  )
53
- tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen-1.5-0_5B-Chat")
54
 
55
  prompt = "Give me a short introduction to large language model."
56
  messages = [
@@ -75,7 +75,7 @@ generated_ids = [
75
  response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
76
  ```
77
 
78
- For quantized models, we advise you to use the GPTQ, AWQ, and GGUF correspondents, namely `Qwen-beta-0_5B-Chat-GPTQ`, `Qwen-beta-0_5B-Chat-AWQ`, and `Qwen-beta-0_5B-Chat-GGUF`.
79
 
80
 
81
  ## Limitations
 
2
  license: other
3
  license_name: tongyi-qianwen-research
4
  license_link: >-
5
+ https://huggingface.co/Qwen/Qwen1.5-0.5B-Chat/blob/main/LICENSE
6
  language:
7
  - en
8
  pipeline_tag: text-generation
 
10
  - chat
11
  ---
12
 
13
+ # Qwen1.5-0.5B-Chat
14
 
15
 
16
  ## Introduction
 
47
  device = "cuda" # the device to load the model onto
48
 
49
  model = AutoModelForCausalLM.from_pretrained(
50
+ "Qwen/Qwen1.5-0.5B-Chat",
51
  device_map="auto"
52
  )
53
+ tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen1.5-0.5B-Chat")
54
 
55
  prompt = "Give me a short introduction to large language model."
56
  messages = [
 
75
  response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
76
  ```
77
 
78
+ For quantized models, we advise you to use the GPTQ, AWQ, and GGUF correspondents, namely `Qwen1.5-0.5B-Chat-GPTQ`, `Qwen1.5-0.5B-Chat-AWQ`, and `Qwen1.5-0.5B-Chat-GGUF`.
79
 
80
 
81
  ## Limitations