What is the required amount of VRAM for running it?

#5
by boohwooh - opened

I will run this on runpod.io for a test. The base model (Llama-2 70B) is 120 GB, but this one is more than 300 GB. How much VRAM is required to run it?
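For a rough sense of scale: weight memory is roughly parameter count × bytes per parameter, so a 70B model needs about 280 GB in fp32, 140 GB in fp16, and 35 GB at 4 bits, before KV cache and runtime overhead. A minimal sketch of that arithmetic (the 70e9 parameter count is assumed from the "70b" in the model name):

```python
# Back-of-envelope VRAM estimate for model weights only
# (ignores KV cache, activations, and CUDA context overhead).
params = 70e9  # assumed from "70b" in the model name

for precision, bytes_per_param in [("fp32", 4), ("fp16", 2), ("4-bit", 0.5)]:
    gb = params * bytes_per_param / 1e9
    print(f"{precision}: ~{gb:.0f} GB")
# fp32: ~280 GB, fp16: ~140 GB, 4-bit: ~35 GB
```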

I'm running it quantized to 4 bits (with a typical bitsandbytes load_in_4bit load) and it's using 46.511 GB.
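For reference, a minimal sketch of that kind of 4-bit load with transformers and bitsandbytes (the model id is a placeholder; `device_map="auto"` assumes accelerate is installed):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "meta-llama/Llama-2-70b-hf"  # placeholder: substitute the actual repo id

# 4-bit quantized load via bitsandbytes; matmuls computed in fp16
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.float16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",  # shard across available GPUs; requires accelerate
)
```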

Thank you. I was curious about the base model's VRAM requirements.

boohwooh changed discussion status to closed