Text Generation
Transformers
PyTorch
Safetensors
English
llama
conversational
Eval Results
Inference Endpoints
text-generation-inference

exl2 quanitzed version

#1
by Thireus - opened

I somehow missed this being released. It'll take 3+ hours for the measurement + quantization to complete.

Cognitive Computations org

because it was just released a few hours ago :D

Thank you so much!

Sign up or log in to comment