MU-NLPC/whisper-large-v2-audio-captioning
Whisper models fine-tuned for audio captioning instead of speech recognition. These models aim to briefly describe what happens in the audio scene.