GPU Memory Constraints for 01-ai/Yi-9B-200K Model

#3 by microcn - opened

What are the GPU memory requirements for loading the 01-ai/Yi-9B-200K model? I am currently facing an issue where loading the model with two RTX 4090 GPUs fails when using the following code:

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(MODEL_DIR, torch_dtype="auto")
tokenizer = AutoTokenizer.from_pretrained(MODEL_DIR, use_fast=False)

If you lower max_position_embeddings in config.json, the model should load. The required VRAM will also differ depending on whether you have flash attention installed.
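As a rough sanity check, the weights alone are about 18 GB in bf16 (9B parameters × 2 bytes), so it is the long-context buffers on top of that which overflow two 24 GB cards. A minimal sketch of the same idea without editing config.json by hand, overriding the value on an AutoConfig before loading; the 32768 cap below is only an illustrative value, not a recommendation:

from transformers import AutoConfig, AutoModelForCausalLM

config = AutoConfig.from_pretrained(MODEL_DIR)
config.max_position_embeddings = 32768  # illustrative cap, down from 200K
model = AutoModelForCausalLM.from_pretrained(MODEL_DIR, config=config, torch_dtype="auto")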

Add a device_map argument (e.g. device_map="auto") when loading the model so the weights are split across both GPUs; see the sketch below.
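A minimal sketch, assuming the accelerate package is installed (device_map="auto" relies on it to place layers across all visible GPUs):

from transformers import AutoModelForCausalLM, AutoTokenizer

# "auto" shards the checkpoint across both RTX 4090s instead of loading it onto one device
model = AutoModelForCausalLM.from_pretrained(MODEL_DIR, torch_dtype="auto", device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(MODEL_DIR, use_fast=False)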
