openai/whisper-base.en · Noise Level

Hey @MemberDS ! Sorry about the late reply here, that's a super interesting question regarding Whisper noise level. There are no details about the level of noise on which the model is trained on, but you can find details about the performance of the model under noise in Section 3.7 of the paper https://arxiv.org/pdf/2212.04356.pdf

We recommend normalising the audio before passing it through the Whisper model (see https://huggingface.co/docs/transformers/model_doc/whisper#transformers.WhisperFeatureExtractor.__call__.do_normalize and https://github.com/huggingface/transformers/issues/19888)

This package a provides a Python port of the Audacity noise reduction algorithm https://pypi.org/project/noisereduce/
You can try applying this to your audio to pre-process it and reduce the overall input noise