Rename README.md to !huggingface-cli login

#91
opened by mandyLO
No description provided.
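Context for the snippet below: google/gemma-7b-it is a gated checkpoint, so it can only be downloaded after accepting the model license on the Hub and authenticating, e.g. with `huggingface-cli login` in a shell. A minimal programmatic sketch, assuming a valid access token:

```python
# Authenticate with the Hugging Face Hub before downloading gated weights
from huggingface_hub import login

login()  # prompts interactively for a token; login(token="hf_...") also works
```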

```python
# Import necessary classes for model loading and quantization
from transformers import AutoTokenizer, AutoModelForCausalLM, BitsAndBytesConfig

# Configure 4-bit quantization for memory and compute efficiency
quantization_config = BitsAndBytesConfig(load_in_4bit=True)

# Load the tokenizer for the Gemma 7B instruction-tuned model
tokenizer = AutoTokenizer.from_pretrained("google/gemma-7b-it")

# Load the Gemma 7B instruction-tuned model itself, with 4-bit quantization
model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-7b-it",
    quantization_config=quantization_config,
)
```
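As a quick smoke test that the quantized model loads and generates, something like the following should work; the prompt and `max_new_tokens` value are illustrative choices, not part of the original snippet:

```python
# Run a short generation with the 4-bit model (prompt and settings are illustrative)
inputs = tokenizer("Why is 4-bit quantization useful?", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

For finer control over the memory/quality trade-off, `BitsAndBytesConfig` also accepts options such as `bnb_4bit_quant_type="nf4"` and `bnb_4bit_compute_dtype=torch.bfloat16`.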

Ready to merge
This branch is ready to get merged automatically.
