This should fit on 2x 3090s on Windows with an 18-19,24 GPU split and 6k-8k context. Uses the new EXL2 quant.
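As a sketch, the split above could be passed to a loader such as text-generation-webui's ExLlamaV2 backend; the model directory name below is a placeholder, and the exact flags assume that frontend:

```shell
# Hypothetical launch command (model path is a placeholder).
# --gpu-split allots ~18-19 GB to the first 3090 and 24 GB to the second;
# --max_seq_len caps the context at 8k to stay within the quoted VRAM budget.
python server.py --loader exllamav2 --model my-exl2-model \
  --gpu-split 18,24 --max_seq_len 8192
```

Lowering `--max_seq_len` toward 6k frees a little VRAM on the first card if the 18-19 GB split proves too tight.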