---
language:
  - multilingual
license: apache-2.0
tags:
  - voice
  - classification
  - vocalization
  - speech
  - audio
datasets:
  - NonverbalVocalization
widget:
  - src: >-
      https://huggingface.co/padmalcom/wav2vec2-large-nonverbalvocalization-classification/resolve/main/test.wav
    example_title: Sample 1
pipeline_tag: audio-classification
metrics:
  - accuracy
---

This language-independent wav2vec2 classification model is based on the NonverbalVocalization dataset.

The sound classes are:

  • teeth-chattering
  • teeth-grinding
  • tongue-clicking
  • nose-blowing
  • coughing
  • yawning
  • throat clearing
  • sighing
  • lip-popping
  • lip-smacking
  • panting
  • crying
  • laughing
  • sneezing
  • moaning
  • screaming

inference.py shows how the model can be used.
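
As a complement, here is a minimal sketch of how one raw score per class could be turned into a ranked list of the sound classes listed above. It is not taken from inference.py. The helper names `softmax` and `top_k` are hypothetical, the index order of `LABELS` is an assumption (the authoritative mapping is whatever the model's configuration defines), and the scores in the demo are made up.

```python
import math

# Class names copied from the list in this README.
# ASSUMPTION: this index order is illustrative only; the real
# index-to-label mapping is defined by the model's configuration.
LABELS = [
    "teeth-chattering", "teeth-grinding", "tongue-clicking", "nose-blowing",
    "coughing", "yawning", "throat clearing", "sighing",
    "lip-popping", "lip-smacking", "panting", "crying",
    "laughing", "sneezing", "moaning", "screaming",
]


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k(logits, k=3, labels=LABELS):
    """Return the k most likely (label, probability) pairs, best first."""
    if len(logits) != len(labels):
        raise ValueError("expected one score per class")
    probs = softmax(logits)
    ranked = sorted(zip(labels, probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


if __name__ == "__main__":
    # Made-up scores for illustration only, not real model output.
    fake_logits = [0.0] * len(LABELS)
    fake_logits[LABELS.index("coughing")] = 4.0
    for label, prob in top_k(fake_logits):
        print(f"{label}: {prob:.3f}")
```

In practice the scores would come from the classifier's output for an audio clip; the ranking step stays the same regardless of how they are produced.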