Hugging Face


Models

7
Active filters: voice-activity-detection

pyannote/segmentation

• Updated Nov 10, 2022 • 2.19M • 110

philschmid/pyannote-segmentation

• Updated Nov 8, 2022 • 11.2k • 1

philschmid/pyannote-speaker-diarization-endpoint

• Updated Nov 22, 2022 • 11.1k • 3

pyannote/brouhaha

• Updated Nov 15, 2022 • 159 • 7

anilbs/segmentation

• Updated Nov 1, 2022 • 15

julien-c/voice-activity-detection

• Updated Dec 21, 2020 • 3

d4data/Indian-voice-cloning

• Updated 19 days ago