-
-
-
-
-
-
Inference Providers
Active filters:
fp8
neuralmagic/pixtral-12b-FP8-dynamic
Text Generation
•
Updated
•
8.24k
•
7
predibase/Qwen2.5-14B-FP8
Updated
•
12
CalamitousFelicitousness/banana-2-b-72b-FP8-Dynamic
taozi555/Llama-Guard-3-8B-FP8
ajinkya-tejankar/Mistral-7B-Instruct-v0.2-FP8-UltraChat-2000-KV
Infermatic/Lumimaid-v0.2-70B-FP8-Dynamic
predibase/Qwen2.5-32B-Instruct-FP8
Updated
•
99
Infermatic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-Dynamic
Text Generation
•
Updated
•
61
predibase/Mistral-7B-Instruct-v0.2-FP8-UltraChat-2000-KV
neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic
Text Generation
•
Updated
•
916
•
14
Infermatic/magnum-v4-72b-FP8-Dynamic
Text Generation
•
Updated
•
220
•
1
amd/dbrx-base-FP8-KV
Infermatic/Stellar-Odyssey-12b-v0.0-FP8-Dynamic
Infermatic/Chronos-Platinum-72B-FP8-Dynamic
Updated
•
11
Infermatic/Nautilus-70B-v0.1-FP8-Dynamic
yejingfu/nmagic-Meta-Llama-3.1-8B-Instruct-FP8
Text Generation
•
Updated
•
209k
mysticbeing/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-DYNAMIC
Text Generation
•
Updated
•
25
•
3
amd/Mistral-7B-v0.1-FP8-KV
Updated
•
1.31k
yejingfu/nmagic-Meta-Llama-3.1-70B-Instruct-FP8
Text Generation
•
Updated
•
8
tencent-community/Hunyuan-A52B-Instruct-FP8
Text Generation
•
Updated
•
46
•
1
Dev0502/Qwen2.5-14B-Instruct-abliterated-v2-FP8
Updated
•
135
andecy64/Nxcode-CQ-7B-orpo-FP8
SicariusSicariiStuff/DeepSeek-Coder-V2-Instruct-FP8
EmbeddedLLM/Qwen2.5-72B-Instruct-OCP-FP8-Quark
Updated
•
17
yejingfu/nmagic-Meta-Llama-3-70B-Instruct-FP8
EmbeddedLLM/Nexusflow_Athena-V2-Chat-OCP-FP8-Quark
EmbeddedLLM/Nexusflow_Athena-V2-Agent-OCP-FP8-Quark
liuxl12/Qwen2.5-32B-Instruct-FP8
Model-SafeTensors/Meta-Llama-3-8B-Instruct-FP8
Updated
•
190
taozi555/hiwaifu-12b-v1.1-fp8
Updated
•
10