
Will this run on a 4090 and 64GB of DDR5?

#6 opened by AIGUYCONTENT

I know there is an 8B quant available. However, I need an intelligent AI that can help me reason through a problem across a multi-step conversation on a single topic.

The full model will not, but a quantized model will, and at Q4 (usually the preferred quant) it will certainly be better than the 8B model. You will need to trade off speed against quality. You can find performance comparisons here.

Hi, please try

https://huggingface.co/mradermacher/Smaug-Llama-3-70B-Instruct-i1-GGUF

at IQ4_XS.

If you have 64 GB of RAM you can use i1-Q5_K_M. A sketch of how to load it is below.
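Here is a minimal sketch of loading that quant with llama-cpp-python, offloading part of the model to the 4090 and keeping the rest in system RAM. The filename pattern, n_gpu_layers value, and context size are assumptions; check the actual file names in the repo and adjust the layer count to whatever fits in 24 GB of VRAM.

```python
# pip install llama-cpp-python huggingface_hub
from llama_cpp import Llama

# Download and load the IQ4_XS quant from the repo linked above.
# The glob pattern is an assumption -- verify the exact GGUF filename on the repo's file list.
llm = Llama.from_pretrained(
    repo_id="mradermacher/Smaug-Llama-3-70B-Instruct-i1-GGUF",
    filename="*IQ4_XS*",
    n_gpu_layers=40,   # offload as many layers as fit in the 4090's 24 GB VRAM; tune this
    n_ctx=8192,        # context window; lower it if you run out of memory
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Walk me through a multi-step plan for X."}],
)
print(out["choices"][0]["message"]["content"])
```

At IQ4_XS the 70B weights are roughly 38 GB, so they do not fit entirely in 24 GB of VRAM; the remaining layers run from system RAM, which is why n_gpu_layers is the main knob for balancing speed against memory.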
