Gemma-7B-it GGUF Quantized

Usage

This model can be used with the latest version of llama.cpp and LM Studio >0.2.16.

GGUF

Model size

8.54B params

Architecture

gemma

4-bit

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model has no library tag.