--- language: - vi license: llama2 tags: - mlx --- # ontocord/vinallama-7b-chat-mlx-4bit This model was converted to MLX format from [`vilm/vinallama-7b-chat`](). Refer to the [original model card](https://huggingface.co/vilm/vinallama-7b-chat) for more details on the model. ## Use with mlx ```bash pip install mlx-lm ``` ```python from mlx_lm import load, generate model, tokenizer = load("ontocord/vinallama-7b-chat-mlx-4bit") response = generate(model, tokenizer, prompt="hello", verbose=True) ```