The model type whisper is not supported to be used with BetterTransformer

#21
by Venkatesh4342 - opened

Until yesterday this was working; now the error above is popping up.

Whisper Distillation org
edited Dec 29, 2023

Hey @Venkatesh4342 ! Whisper now has native support for PyTorch SDPA flash attention. To use it, first upgrade your version of PyTorch to 2.1.2: https://pytorch.org/get-started/locally/

Then update Transformers to use main: https://huggingface.co/docs/transformers/installation#install-from-source

Transformers will then use PyTorch SDPA by default, alongside faster Torch STFT pre-processing, which should give you a nice speed-up overall: https://huggingface.co/docs/transformers/perf_infer_gpu_one#flashattention-and-memory-efficient-attention-through-pytorchs-scaleddotproductattention
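For reference, the kernel Transformers dispatches to is PyTorch's `torch.nn.functional.scaled_dot_product_attention`. A minimal sketch of the call (shapes chosen arbitrarily for illustration), compared against a plain attention implementation:

```python
import torch
import torch.nn.functional as F

# Arbitrary shapes: (batch, heads, seq_len, head_dim)
q = torch.randn(2, 8, 16, 64)
k = torch.randn(2, 8, 16, 64)
v = torch.randn(2, 8, 16, 64)

# Fused scaled dot-product attention (PyTorch >= 2.0); on supported
# GPUs this dispatches to a flash-attention kernel automatically.
out = F.scaled_dot_product_attention(q, k, v)

# Naive reference implementation for comparison.
scores = q @ k.transpose(-2, -1) / (q.size(-1) ** 0.5)
ref = torch.softmax(scores, dim=-1) @ v

print(torch.allclose(out, ref, atol=1e-5))
```

On CPU the fused and naive paths agree up to floating-point tolerance; the speed-up comes from the fused kernels selected on GPU.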

Otherwise, using the latest version of Optimum should resolve the issue with BetterTransformer.
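To make the choice explicit rather than relying on the default, recent Transformers versions accept an `attn_implementation` argument at load time. A minimal sketch (the `openai/whisper-tiny` checkpoint is used here only to keep the download small; substitute your own):

```python
from transformers import WhisperForConditionalGeneration

# Request PyTorch SDPA attention explicitly at load time
# (requires a recent Transformers release and PyTorch >= 2.1).
model = WhisperForConditionalGeneration.from_pretrained(
    "openai/whisper-tiny", attn_implementation="sdpa"
)

print(model.config.model_type)  # whisper
```

With this path there is no need to call BetterTransformer at all; the model uses SDPA natively.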

sanchit-gandhi changed discussion status to closed

Thanks for the quick response @sanchit-gandhi, it worked.