Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

Inference Endpoints

AutoTrain Compatible

8-bit precision

text-generation-inference

Misc with no match

4-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

66

Full-text search

Active filters: llmcompressor

RedHatAI/Llama-4-Scout-17B-16E-Instruct-quantized.w4a16

Image-Text-to-Text • Updated 8 days ago • 623 • 5

RedHatAI/Qwen3-32B-FP8-dynamic

Text Generation • Updated 4 days ago • 17 • 3

ISTA-DASLab/gemma-3-27b-it-GPTQ-4b-128g

Image-Text-to-Text • Updated Mar 20 • 24k • 27

RedHatAI/Qwen2.5-1.5B-quantized.w8a8

Text Generation • Updated Dec 3, 2024 • 75 • 1

RedHatAI/Qwen2.5-7B-quantized.w8a8

Text Generation • Updated Dec 3, 2024 • 3.18k • 1

RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8

Text Generation • Updated Jan 22 • 16.1k • 6

RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w8a8

Text Generation • Updated Feb 27 • 597 • 2

RedHatAI/DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8

Text Generation • Updated Feb 27 • 573 • 10

RedHatAI/DeepSeek-R1-Distill-Qwen-7B-quantized.w8a8

Text Generation • Updated Feb 27 • 5.72k • 4

RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w4a16

Text Generation • Updated Feb 27 • 1.23k • 4

RedHatAI/phi-4-quantized.w8a8

Text Generation • Updated 20 days ago • 81 • 1

RedHatAI/phi-4-quantized.w4a16

Text Generation • Updated 13 days ago • 896 • 1

ISTA-DASLab/Mistral-Small-3.1-24B-Instruct-2503-GPTQ-4b-128g

Image-Text-to-Text • Updated about 1 month ago • 18.1k • 13

RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-FP8-dynamic

Image-Text-to-Text • Updated 4 days ago • 2.84k • 6

RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic

Image-Text-to-Text • Updated 11 days ago • 7.33k • 8

ISTA-DASLab/gemma-3-4b-it-GPTQ-4b-128g

Image-Text-to-Text • Updated 24 days ago • 686 • 2

ISTA-DASLab/gemma-3-12b-it-GPTQ-4b-128g

Image-Text-to-Text • Updated 24 days ago • 1.53k • 2

adriabama06/DeepCoder-1.5B-Preview-FP8-W8A8

Text Generation • Updated 23 days ago • 21 • 1

RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8

Image-Text-to-Text • Updated 4 days ago • 1.34k • 3

RedHatAI/DeepSeek-R1-quantized.w4a16

Text Generation • Updated 14 days ago • 239 • 4

RedHatAI/Qwen3-30B-A3B-FP8-dynamic

Text Generation • Updated 1 day ago • 3 • 1

RedHatAI/Qwen3-235B-A22B-FP8-dynamic

Text Generation • Updated about 7 hours ago • 1

RedHatAI/Llama-3.2-1B-Instruct-quantized.w8a8

Text Generation • Updated Oct 16, 2024 • 13.9k • 7

RedHatAI/Llama-3.2-3B-Instruct-quantized.w8a8

Text Generation • Updated Oct 16, 2024 • 9.06k • 1

RedHatAI/Llama-3.2-1B-Instruct-FP8

Text Generation • Updated Oct 16, 2024 • 1.11k • 2

RedHatAI/Llama-3.2-3B-Instruct-FP8

Text Generation • Updated Oct 16, 2024 • 45.9k • 6

RedHatAI/Qwen2.5-0.5B-quantized.w8a8

Text Generation • Updated Dec 3, 2024 • 86

RedHatAI/Qwen2.5-0.5B-Instruct-quantized.w8a8

Text Generation • Updated Dec 9, 2024 • 80

RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a8

Text Generation • Updated 20 days ago • 198

RedHatAI/Qwen2.5-72B-quantized.w8a8

Text Generation • Updated Dec 3, 2024 • 115