notes

experimental NVFP4A16 (confirmed forking with vLLM)

Please report if you find any issue with the model.

Any feedbacks are welcome!

'Make knowledge free for everyone'

Quantized version of: meta-llama/Meta-Llama-3-8B-Instruct Buy Me a Coffee at ko-fi.com

Downloads last month
1
Safetensors
Model size
5B params
Tensor type
BF16
F32
F8_E4M3
U8
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support

Model tree for DevQuasar/meta-llama.Meta-Llama-3-8B-Instruct-NVFP4A16

Quantized
(639)
this model