
Necessary Hardware for Operating the 34B Model

#40
by blurjp - opened

I currently use a 4090, but the inference process is extremely slow. Is it impractical to expect this model to run efficiently on just a single 4090?

Did you solve that? I have the same problem.

You can use these 2-bit versions made with QuIP#. Inference is slower than usual, but it should work on a single 4090: at 2 bits per weight, a 34B model's weights take roughly 8.5 GB, well within the 4090's 24 GB of VRAM (at fp16 they would need about 68 GB). See the loading sketch after the links.

https://huggingface.co/KnutJaegersberg/Tess-M-34B-2bit
https://huggingface.co/KnutJaegersberg/orca-mini-70b-2bit
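
In case it helps anyone landing here, this is a minimal loading sketch, assuming the checkpoint can be loaded through plain transformers with trust_remote_code (QuIP# checkpoints sometimes need the quip-sharp package instead, so check the model card for exact instructions):

```python
# Sketch: loading a 2-bit QuIP# checkpoint on a single GPU.
# Assumes the repo ships its dequantization code for trust_remote_code;
# device_map="auto" requires the accelerate package.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "KnutJaegersberg/Tess-M-34B-2bit"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # compute dtype; the weights stay 2-bit
    device_map="auto",          # place the model on the 4090
    trust_remote_code=True,     # QuIP# loading lives in custom repo code
)

prompt = "Explain quantization in one sentence."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```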
