--- license: gemma library_name: transformers tags: - mlx widget: - messages: - role: user content: How does the brain work? inference: parameters: max_new_tokens: 200 extra_gated_heading: Access Gemma on Hugging Face extra_gated_prompt: To access Gemma on Hugging Face, you’re required to review and agree to Google’s usage license. To do this, please ensure you’re logged-in to Hugging Face and click below. Requests are processed immediately. extra_gated_button_content: Acknowledge license --- # batmac/gemma-1.1-7b-it-mlx-4bit This model was converted to MLX format from [`google/gemma-1.1-7b-it`]() using mlx-lm version **0.12.1**. Refer to the [original model card](https://huggingface.co/google/gemma-1.1-7b-it) for more details on the model. ## Use with mlx ```bash pip install mlx-lm ``` ```python from mlx_lm import load, generate model, tokenizer = load("batmac/gemma-1.1-7b-it-mlx-4bit") response = generate(model, tokenizer, prompt="hello", verbose=True) ```