How much GPU memory does the MoE model need?

#8
by Jazzlee - opened

Is int4 or int8 possible? How do I do it?

Owner

This model is about 61B parameters; I'd guess it needs roughly 64 GB for int8 and 32 GB for int4.
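As a rough sanity check on those numbers, the weight memory alone is parameter count times bytes per parameter (int8 = 1 byte, int4 = 0.5 bytes); real usage is higher because of activations, the KV cache, and runtime overhead:

```python
# Back-of-the-envelope weight-memory estimate for a ~61B-parameter model.
# Weights only; actual GPU usage is higher (activations, KV cache, overhead).
params = 61e9

for name, bytes_per_param in [("bf16", 2), ("int8", 1), ("int4", 0.5)]:
    gib = params * bytes_per_param / 1024**3
    print(f"{name}: ~{gib:.0f} GiB of weights")
```

This gives roughly 114 GiB for bf16, 57 GiB for int8, and 28 GiB for int4, consistent with the 64 GB / 32 GB estimates above once overhead is included.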

Owner

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained(model_path, use_default_system_prompt=False)
model = AutoModelForCausalLM.from_pretrained(
    model_path, torch_dtype=torch.bfloat16, device_map='auto', local_files_only=False, load_in_4bit=True
)
