Hugging Quants

AI & ML interests

Optimised quants for high-throughput deployments! Compatible with Transformers, TGI & vLLM 🤗

Recent Activity

medmekk updated a model 16 days ago

hugging-quants/Llama-4-Scout-17B-16E-Instruct-fbgemm

medmekk published a model 16 days ago

hugging-quants/Llama-4-Scout-17B-16E-Instruct-fbgemm

medmekk updated a model 16 days ago

hugging-quants/Llama-4-Scout-17B-16E-Instruct-fbgemm-unfused

Organization Card

Welcome to the home of exciting quantized models! We'd love to see wider adoption of powerful, state-of-the-art open models, and quantization is key to making them run on more types of hardware.
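To make the hardware point concrete, here is a rough back-of-the-envelope sketch in Python. It assumes the nominal Llama 3.1 parameter counts (8B, 70B, 405B) and counts only the raw weights, ignoring quantization scales, activations, and the KV cache. The helper names `weight_memory_gb` and `footprint_table` are made up for this illustration and are not part of any library.

```python
def weight_memory_gb(num_params: float, bits_per_weight: int) -> float:
    """Approximate memory for the weights alone, in decimal gigabytes.

    Assumption: every parameter is stored at the same bit width and
    per-group scales or zero points are ignored.
    """
    return num_params * bits_per_weight / 8 / 1e9


# Nominal (rounded) parameter counts for the Llama 3.1 family.
LLAMA_3_1_SIZES = {"8B": 8e9, "70B": 70e9, "405B": 405e9}


def footprint_table(sizes=LLAMA_3_1_SIZES, bit_widths=(16, 8, 4)):
    """Return {model_name: {bits: gigabytes}} for each size and bit width."""
    return {
        name: {bits: weight_memory_gb(n, bits) for bits in bit_widths}
        for name, n in sizes.items()
    }


if __name__ == "__main__":
    for name, row in footprint_table().items():
        cells = ", ".join(f"{bits}-bit: {gb:.1f} GB" for bits, gb in row.items())
        print(f"Llama 3.1 {name}: {cells}")
```

Under these assumptions, going from 16-bit to 4-bit weights cuts the weight footprint to a quarter, which is often the difference between needing several accelerators and fitting on one. Real quantized checkpoints are somewhat larger than this estimate because they also store scaling metadata.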

Resources:

Llama 3.1 Quantized Models: Optimised quants of Llama 3.1 for high-throughput deployments! Compatible with Transformers, TGI & vLLM 🤗.
Hugging Face Llama Recipes: A set of minimal recipes to get started with Llama 3.1.

Collections 3

Models 21

Datasets: none public yet

Team members 10