Models: 217

Active filters: vllm

mradermacher/Nemotron-4-340B-Instruct-hf-i1-GGUF

Updated 3 days ago • 1

mradermacher/Nemotron-4-340B-Base-hf-GGUF

neuralmagic/SmolLM-360M-Instruct-quantized.w8a8

Text Generation • Updated Oct 9 • 65

neuralmagic/SmolLM-135M-Instruct-quantized.w8a8

Text Generation • Updated Oct 9 • 356

mradermacher/Nemotron-4-340B-Base-hf-i1-GGUF

Updated Aug 30 • 1

cmshin96/MN-12B-Lyra-v3-awq

Text Generation • Updated Sep 7 • 11

cmshin96/MN-12B-Lyra-v1-awq

Text Generation • Updated Sep 7 • 11

cmshin96/MN-12B-Lyra-v4-awq

Text Generation • Updated Sep 14 • 8

LlamaFinetuneBase/Pixtral-12B-2409

nm-testing/Meta-Llama-3.1-8B-Instruct-FP8-hf

Text Generation • Updated Sep 24 • 176

neuralmagic/Llama-3.2-1B-Instruct-FP8-dynamic

Text Generation • Updated Oct 9 • 698 • 2

soprasteria/Mixtral-8x7B-Instruct-v0.1-FP8

Updated Sep 27 • 131

TouchNight/Ministral-8B-Instruct-2410-HF

Updated Oct 18 • 66

TouchNight/Ministral-8B-Instruct-2410-HF-Q5_K_M-GGUF

Updated Oct 18 • 3

ijohn07/Ministral-8B-Instruct-2410-HF-Q8_0-GGUF

Updated Oct 19 • 5

adriabama06/reader-lm-1.5b-AWQ

Text Generation • Updated Nov 1 • 19 • 1

sasha0552/Ministral-8B-Instruct-2410

QuantFactory/TouchNight-Ministral-8B-Instruct-2410-HF-GGUF

Updated Oct 20 • 127 • 2

GrimsenClory/Ministral-8B-Instruct-2410-Q6_K-GGUF

Updated Oct 21 • 13

gphorvath/Ministral-8B-Instruct-2410-Q4_K_M-GGUF

Updated Oct 26 • 14

Gleisson1/Ministral-8B-Instruct-2410-HF-4bit

Updated Oct 26 • 23

paultimothymooney/Ministral-8B-Instruct-2410-Q8_0-GGUF

Updated Oct 28 • 6

paultimothymooney/Ministral-8B-Instruct-2410-Q4_K_M-GGUF

Updated Oct 28 • 6

LouiSeHU/Mistral-Small-Instruct-2409-Q4_0-GGUF

Updated Oct 29 • 26

yejingfu/nmagic-Meta-Llama-3.1-8B-Instruct-FP8

Text Generation • Updated Oct 31 • 5

Ritvik19/Ministral-8B-Instruct-2410-Q4_K_M-GGUF

Updated Nov 2 • 5

Gustav0-Freind/missmall

Updated Nov 5 • 4

yejingfu/nmagic-Meta-Llama-3.1-70B-Instruct-FP8

Text Generation • Updated Nov 5 • 11

SicariusSicariiStuff/DeepSeek-Coder-V2-Instruct-FP8

Updated Nov 8 • 10

neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic

Text Generation • Updated 2 days ago • 120