XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model Paper • 2406.04904 • Published Jun 7, 2024 • 6
audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim Audio Classification • Updated Sep 19, 2024 • 69.5k • 99
loganhart/wav2vec2-base-960h-no-softmax-quality-daps Audio Classification • Updated May 5, 2024 • 166