metadata
license: apache-2.0
Introduce
Quantizing the shenzhi-wang/Llama3-8B-Chinese-Chat to f16, q2, q3, q4, q5, q6 and q8 with Llama.cpp.
license: apache-2.0
Quantizing the shenzhi-wang/Llama3-8B-Chinese-Chat to f16, q2, q3, q4, q5, q6 and q8 with Llama.cpp.