UQFF
Collection
UQFF models. Examples for each in the model card!
β’
16 items
β’
Updated
β’
14
meta-llama/Llama-4-Scout-17B-16E-Instruct
, UQFF quantization
Run with mistral.rs. Documentation: UQFF docs.
Note: If you are using an Apple Silicon device (on Metal), prefer using an π₯ AFQ quantization for best performance!
Quantization type(s) | Example |
---|---|
Q4K | ./mistralrs-server -i vision-plain -m EricB/Llama-4-Scout-17B-16E-Instruct-UQFF -a llama4 --from-uqff "llama4-scout-instruct-q4k-0.uqff;llama4-scout-instruct-q4k-1.uqff;llama4-scout-instruct-q4k-2.uqff;llama4-scout-instruct-q4k-3.uqff;llama4-scout-instruct-q4k-4.uqff;llama4-scout-instruct-q4k-5.uqff;llama4-scout-instruct-q4k-6.uqff" |
AFQ4 | Coming soon! |
Base model
meta-llama/Llama-4-Scout-17B-16E