Edit Models filters

Inference Providers

HF Inference API

Misc

Inference Endpoints

AutoTrain Compatible

text-generation-inference

8-bit precision

Mixture of Experts

Misc with no match

4-bit precision

text-embeddings-inference

Carbon Emissions

Models

404

Full-text search

Active filters: fp8

neuralmagic/pixtral-12b-FP8-dynamic

Text Generation • Updated 9 days ago • 8.24k • 7

predibase/Qwen2.5-14B-FP8

Updated Oct 10, 2024 • 12

CalamitousFelicitousness/banana-2-b-72b-FP8-Dynamic

Updated Oct 11, 2024 • 2

taozi555/Llama-Guard-3-8B-FP8

Updated Oct 12, 2024 • 2

ajinkya-tejankar/Mistral-7B-Instruct-v0.2-FP8-UltraChat-2000-KV

Updated Oct 15, 2024 • 6

Infermatic/Lumimaid-v0.2-70B-FP8-Dynamic

Updated Oct 15, 2024 • 6

predibase/Qwen2.5-32B-Instruct-FP8

Updated Oct 16, 2024 • 99

Infermatic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-Dynamic

Text Generation • Updated Oct 16, 2024 • 61

predibase/Mistral-7B-Instruct-v0.2-FP8-UltraChat-2000-KV

Updated Oct 16, 2024 • 8

neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic

Text Generation • Updated Oct 17, 2024 • 916 • 14

Infermatic/magnum-v4-72b-FP8-Dynamic

Text Generation • Updated Oct 21, 2024 • 220 • 1

amd/dbrx-base-FP8-KV

Updated Dec 19, 2024 • 9

Infermatic/Stellar-Odyssey-12b-v0.0-FP8-Dynamic

Updated Oct 24, 2024 • 3

Infermatic/Chronos-Platinum-72B-FP8-Dynamic

Updated Oct 27, 2024 • 11

Infermatic/Nautilus-70B-v0.1-FP8-Dynamic

Updated Oct 28, 2024 • 7

yejingfu/nmagic-Meta-Llama-3.1-8B-Instruct-FP8

Text Generation • Updated Oct 31, 2024 • 209k

mysticbeing/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-DYNAMIC

Text Generation • Updated Nov 6, 2024 • 25 • 3

amd/Mistral-7B-v0.1-FP8-KV

Updated Nov 1, 2024 • 1.31k

yejingfu/nmagic-Meta-Llama-3.1-70B-Instruct-FP8

Text Generation • Updated Nov 5, 2024 • 8

tencent-community/Hunyuan-A52B-Instruct-FP8

Text Generation • Updated Nov 5, 2024 • 46 • 1

Dev0502/Qwen2.5-14B-Instruct-abliterated-v2-FP8

Updated Nov 7, 2024 • 135

andecy64/Nxcode-CQ-7B-orpo-FP8

Updated Dec 21, 2024 • 5

SicariusSicariiStuff/DeepSeek-Coder-V2-Instruct-FP8

Updated Nov 8, 2024 • 7

EmbeddedLLM/Qwen2.5-72B-Instruct-OCP-FP8-Quark

Updated Nov 15, 2024 • 17

yejingfu/nmagic-Meta-Llama-3-70B-Instruct-FP8

Updated Nov 15, 2024 • 4

EmbeddedLLM/Nexusflow_Athena-V2-Chat-OCP-FP8-Quark

Updated Nov 15, 2024 • 7

EmbeddedLLM/Nexusflow_Athena-V2-Agent-OCP-FP8-Quark

Updated Nov 16, 2024 • 9

liuxl12/Qwen2.5-32B-Instruct-FP8

Updated Nov 18, 2024 • 6

Model-SafeTensors/Meta-Llama-3-8B-Instruct-FP8

Updated Jul 18, 2024 • 190

taozi555/hiwaifu-12b-v1.1-fp8

Updated Nov 20, 2024 • 10