bartowski
/

Meta-Llama-3-8B-Instruct-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Resources

View closed (3)

Bug in tokenize()/detokenize()/tokenize() cycle

#9 opened 6 months ago by

Llama 3 8B Instruct - Q8 vs FP16 vs FP32

#8 opened 7 months ago by

Can you make CodeQwen1.5-7B-Chat IQ4_XS version?

#6 opened 8 months ago by

Anyone experiences quality degrade for math question?

#4 opened 9 months ago by

You think you could re-quant with the regex fix?

#3 opened 9 months ago by

Hi - are you going add new llama 70b version as well?

#1 opened 9 months ago by