Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
ONNX
Inference Endpoints
AutoTrain Compatible
text-generation-inference
Eval Results
4-bit precision
Merge
custom_code
text-embeddings-inference
8-bit precision
Carbon Emissions
Mixture of Experts
Apply filters
Models
69
Full-text search
Edit filters
Sort: Trending
Active filters:
ONNX
Clear all
Intel/whisper-small-onnx-int4-inc
Automatic Speech Recognition
•
Updated
Oct 16, 2023
•
5
•
2
Intel/whisper-medium-onnx-int4-inc
Automatic Speech Recognition
•
Updated
Oct 16, 2023
•
9
•
1
Intel/whisper-large-v2-onnx-int4-inc
Automatic Speech Recognition
•
Updated
Oct 16, 2023
•
10
•
25
vgorce/distilbert-base-multi-cased-ner
Token Classification
•
Updated
Dec 8, 2023
•
2.11k
amd/HRNet
Image Segmentation
•
Updated
Jan 9
•
2
amd/retinaface
Updated
Mar 29
•
5
yilunzhang/all-mpnet-base-v2-onnx
Sentence Similarity
•
Updated
Jan 4
•
154
philipchung/bge-m3-onnx
Feature Extraction
•
Updated
Apr 4
•
20
•
1
renwoshin/Phi-3-mini-128k-instruct-onnx-tf
Text Generation
•
Updated
Apr 26
•
13
•
1
Xenova/Phi-3-mini-4k-instruct
Text Generation
•
Updated
May 7
•
283
•
16
fine-tuned/jina-embeddings-v2-base-en-03052024-im2p-webapp
Feature Extraction
•
Updated
May 3
•
4
Xenova/Phi-3-mini-4k-instruct_fp16
Text Generation
•
Updated
May 7
•
356
•
1
FusionQuill/Phi-3-mini-128k-instruct-onnx
Text Generation
•
Updated
May 17
•
8
microsoft/Phi-3-small-8k-instruct-onnx-cuda
Text Generation
•
Updated
May 22
•
1.81k
•
10
microsoft/Phi-3-medium-4k-instruct-onnx-cpu
Text Generation
•
Updated
May 23
•
158
•
3
microsoft/Phi-3-medium-128k-instruct-onnx-cpu
Text Generation
•
Updated
May 23
•
159
•
8
microsoft/Phi-3-medium-128k-instruct-onnx-cuda
Text Generation
•
Updated
May 23
•
177
•
22
microsoft/Phi-3-medium-128k-instruct-onnx-directml
Text Generation
•
Updated
May 22
•
41
•
5
luweigen/Llama-3-8B-Instruct-int4-onnx-directml
Text Generation
•
Updated
Jun 15
EmbeddedLLM/llama-2-7b-chat-int4-onnx-directml
Text Generation
•
Updated
Jun 19
•
200
EmbeddedLLM/llama-2-13b-chat-int4-onnx-directml
Text Generation
•
Updated
Jun 17
•
13
EmbeddedLLM/mistral-7b-instruct-v0.3-int4-onnx-directml
Text Generation
•
Updated
Jun 17
•
63
•
2
EmbeddedLLM/mistral-7b-instruct-v0.3-onnx
Text Generation
•
Updated
Jun 17
•
2
EmbeddedLLM/gemma-2b-it-onnx
Text Generation
•
Updated
Jun 17
EmbeddedLLM/Starling-LM-7b-beta-onnx
Text Generation
•
Updated
Jun 17
EmbeddedLLM/openchat-3.6-8b-20240522-onnx
Text Generation
•
Updated
Jun 17
EmbeddedLLM/Phi-3-vision-128k-instruct-onnx
Text Generation
•
Updated
Jun 20
EmbeddedLLM/01-ai_Yi-1.5-6B-Chat-onnx
Text Generation
•
Updated
Jun 20
EmbeddedLLM/gemma-7b-it-onnx
Text Generation
•
Updated
Jun 20
EmbeddedLLM/Phi-3-mini-4k-instruct-062024-onnx
Text Generation
•
Updated
Jul 5
Previous
1
2
3
Next