kaizen9
/

Llama_3_8B_SQ_21616_sparse

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Llama_3_8B_SQ_21616_sparse

1 contributor

History: 2 commits

kaizen9's picture

Quantized model upload with DuQuant

201b481 verified 2 months ago

.gitattributes

1.52 kB

initial commit 2 months ago
README.md

5.17 kB

Quantized model upload with DuQuant 2 months ago
config.json

891 Bytes

Quantized model upload with DuQuant 2 months ago
generation_config.json

121 Bytes

Quantized model upload with DuQuant 2 months ago
model-00001-of-00003.safetensors

4.98 GB
LFS

Quantized model upload with DuQuant 2 months ago
model-00002-of-00003.safetensors

5 GB
LFS

Quantized model upload with DuQuant 2 months ago
model-00003-of-00003.safetensors

3.9 GB
LFS

Quantized model upload with DuQuant 2 months ago
model.safetensors.index.json

20.2 kB

Quantized model upload with DuQuant 2 months ago