01-ai
/

Yi-34B-Chat-8bits

Text Generation

text-generation-inference

Inference Endpoints

8-bit precision

Model card Files Files and versions Community

Resources

View closed (2)

the int8 speed are very slow

#1 opened about 1 year ago by