Gemma-7B-it GGUF Quantized

Usage

This model can be used with the latest version of llama.cpp and LM Studio >0.2.16.

Downloads last month
5
GGUF
Model size
8.54B params
Architecture
gemma

4-bit

Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.