thomadev0 committed on
Commit
f1d96ff
1 Parent(s): 695f3cb

Update README.md

Files changed (1)
  1. README.md +1 -0
README.md CHANGED
@@ -37,5 +37,6 @@ response = generate(model, tokenizer, prompt="hello", verbose=True)
  ## Use with mlx_lm cli
  
  ```bash
+ pip install -U mlx-lm
  python3 -m mlx_lm.generate --model mlx-community/Nous-Hermes-2-Mixtral-8x7B-DPO-4bit --prompt "<|im_start|>system\nYou are an accurate, educational, and helpful information assistant<|im_end|>\n<|im_start|>user\nWhat is the difference between awq vs gptq quantitization?<|im_end|>\n<|im_start|>assistant\n" --max-tokens 2048
  ```
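For comparison with the CLI invocation above, the `generate(model, tokenizer, prompt="hello", verbose=True)` line in the hunk context comes from the README's Python usage section. A minimal sketch of that usage, assuming the standard `mlx_lm` `load`/`generate` API and reusing the same model repo and ChatML-style prompt as the CLI example:

```python
# Minimal sketch of the Python usage referenced in the hunk context above.
# Assumes mlx-lm is installed (pip install -U mlx-lm) and an Apple Silicon machine.
from mlx_lm import load, generate

# Download/load the 4-bit quantized model and its tokenizer from the Hub.
model, tokenizer = load("mlx-community/Nous-Hermes-2-Mixtral-8x7B-DPO-4bit")

# Same ChatML-style prompt the CLI example passes via --prompt.
prompt = (
    "<|im_start|>system\nYou are an accurate, educational, and helpful "
    "information assistant<|im_end|>\n"
    "<|im_start|>user\nWhat is the difference between awq vs gptq "
    "quantitization?<|im_end|>\n"
    "<|im_start|>assistant\n"
)

# verbose=True streams the generated tokens to stdout as they are produced.
response = generate(model, tokenizer, prompt=prompt, verbose=True)
```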