Wav2Vec2-base-ADS and IDS Classification Fine-tuned facebook/wav2vec2-base on Adult and Infant directed speech dataset. | |
The data used for training was randomly sampled. | |
The data was 8kHz and hence it was upsampled to 16kHz for training. | |
When using this model, make sure that your speech input is sampled at 16kHz. |