
4.25bpw version

#1
by Apel-sin - opened

Big thanks for your work!
Could you make a 4.25bpw version? The 4.65bpw quant does not fit in 48 GB of VRAM :)

You're the best! Thanks!

@Apel-sin May I ask, when you say "Smaug-Llama-3-70B-Instruct-4.65bpw-h6-exl2" doesn't fit in 48 gigs of VRAM, you mean specifically the 32k version here?
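For context on why the 32k version is the tight case, here is a rough back-of-envelope sketch of the memory budget. It assumes Llama-3-70B's published shape (80 layers, 8 KV heads via GQA, head dim 128) and an fp16 KV cache at the full 32k context; it ignores the unquantized head/embeddings and runtime overhead, and exllamav2's quantized-cache options would shrink the cache term:

```python
def weight_gib(n_params: float, bpw: float) -> float:
    """Approximate size of the quantized weights in GiB.

    bpw = average bits per weight; ignores the h6 head and other
    tensors kept at higher precision, so this slightly undercounts.
    """
    return n_params * bpw / 8 / 1024**3

def kv_cache_gib(ctx_len: int, n_layers: int = 80, n_kv_heads: int = 8,
                 head_dim: int = 128, bytes_per_elem: int = 2) -> float:
    """fp16 K+V cache size in GiB for a Llama-3-70B-shaped model.

    2 * (K and V) * ctx_len * layers * kv_heads * head_dim * 2 bytes.
    """
    return 2 * ctx_len * n_layers * n_kv_heads * head_dim * bytes_per_elem / 1024**3

for bpw in (4.25, 4.65):
    total = weight_gib(70e9, bpw) + kv_cache_gib(32768)
    print(f"{bpw}bpw weights + 32k fp16 KV cache ~ {total:.1f} GiB")
```

Under these assumptions the 4.65bpw weights alone come to roughly 38 GiB, and a full fp16 32k cache adds about 10 GiB more, which lands essentially at the 48 GiB ceiling before any activation or framework overhead; 4.25bpw buys back around 3.3 GiB of headroom.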
