prince-canuma commited on
Commit
05f7afe
1 Parent(s): 39bec73

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -0
README.md ADDED
@@ -0,0 +1,31 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: other
5
+ tags:
6
+ - pretrained
7
+ - moe
8
+ - mlx
9
+ license_name: tongyi-qianwen
10
+ license_link: https://huggingface.co/Qwen/Qwen1.5-MoE-A2.7B/blob/main/LICENSE
11
+ pipeline_tag: text-generation
12
+ ---
13
+
14
+ # mlx-community/Qwen1.5-MoE-A2.7B-Chat-4bit
15
+ This model was converted to MLX format from [`Qwen/Qwen1.5-MoE-A2.7B`](https://huggingface.co/Qwen/Qwen1.5-MoE-A2.7B-Chat) using mlx-lm version version [d661440](https://github.com/ml-explore/mlx-examples/commit/d661440dbb8e1970fadad79c5061e786fe1c54ca).
16
+
17
+ Model added by [Prince Canuma](https://twitter.com/Prince_Canuma).
18
+
19
+ Refer to the [original model card](https://huggingface.co/Qwen/Qwen1.5-MoE-A2.7B) for more details on the model.
20
+ ## Use with mlx
21
+
22
+ ```bash
23
+ pip install mlx-lm
24
+ ```
25
+
26
+ ```python
27
+ from mlx_lm import load, generate
28
+
29
+ model, tokenizer = load("mlx-community/Qwen1.5-MoE-A2.7B-4bit")
30
+ response = generate(model, tokenizer, prompt="Write a story about Einstein", verbose=True)
31
+ ```