cognitivecomputations/dolphin-2.9-llama3-8b-256k
Text Generation · Transformers · Safetensors · llama · conversational · Inference Endpoints · text-generation-inference · License: llama3
Community (3)
View closed (1)
I was trying to quantize the model to 8 bits to reduce its VRAM footprint and got the output below.
#3 opened about 1 month ago by
BigDeeper
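For context, 8-bit loading of this model is usually done through `transformers` with a `bitsandbytes` quantization config. The sketch below is a common pattern, not the poster's actual code; only the model id comes from this page, and the helper name `load_quantized` is hypothetical.

```python
# Minimal sketch of 8-bit loading via transformers + bitsandbytes.
# The model id is from this page; the rest is a generic pattern.
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

MODEL_ID = "cognitivecomputations/dolphin-2.9-llama3-8b-256k"

# 8-bit weight quantization roughly halves VRAM use versus fp16.
bnb_config = BitsAndBytesConfig(load_in_8bit=True)

def load_quantized():
    # Hypothetical helper; requires a CUDA GPU and the bitsandbytes package.
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        quantization_config=bnb_config,
        device_map="auto",
    )
    return tokenizer, model
```

Passing `quantization_config` at load time quantizes the weights on the fly, so no separately converted checkpoint is needed.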
Please help
#2 opened about 1 month ago by
UNITYA