Tags: Text Generation, Transformers, Safetensors, dbrx, conversational, text-generation-inference

Can you provide a quantized version? For example, one that can be run through llama.cpp.

#22
by edisonzf2020 - opened

Can you provide a quantized version? For example, one that can be run through llama.cpp.

Databricks org

Hi @edisonzf2020 , thanks for your question!

We are working with the community on enabling more quantized versions of models. A few examples to follow:

I'll close this discussion for now, but please re-open it if the approaches we are pursuing above don't answer your question.

hanlintang changed discussion status to closed
