Commit 7a5f483
Parent(s): 255ed69
Update README.md

README.md CHANGED

@@ -34,13 +34,13 @@ For most other applications, the [distil-medium.en](https://huggingface.co/disti
 or [distil-large-v2](https://huggingface.co/distil-whisper/distil-large-v2) checkpoints are recommended, since they are
 both faster and achieve better WER results:

-| Model | Params / M | Rel. Latency | Short-Form WER | Long-Form WER |
-|-------|------------|--------------|----------------|---------------|
-| [large-v2](https://huggingface.co/openai/whisper-large-v2) | 1550 | 1.0 | **9.1** | 11.7 |
-| | | | | |
-| [distil-large-v2](https://huggingface.co/distil-whisper/distil-large-v2) | 756 | 5.8 | 10.1 | **11.6** |
-| [distil-medium.en](https://huggingface.co/distil-whisper/distil-medium.en) | 394 | **6.8** | 11.1 | 12.4 |
-| [distil-small.en](https://huggingface.co/distil-whisper/distil-small.en) | **166** | 5.6 | 12.1 | 12.8 |
+| Model                                                                      | Params / M | Rel. Latency ↑ | Short-Form WER ↓ | Long-Form WER ↓ |
+|----------------------------------------------------------------------------|------------|----------------|------------------|-----------------|
+| [large-v2](https://huggingface.co/openai/whisper-large-v2)                 | 1550       | 1.0            | **9.1**          | 11.7            |
+|                                                                            |            |                |                  |                 |
+| [distil-large-v2](https://huggingface.co/distil-whisper/distil-large-v2)   | 756        | 5.8            | 10.1             | **11.6**        |
+| [distil-medium.en](https://huggingface.co/distil-whisper/distil-medium.en) | 394        | **6.8**        | 11.1             | 12.4            |
+| [distil-small.en](https://huggingface.co/distil-whisper/distil-small.en)   | **166**    | 5.6            | 12.1             | 12.8            |

 **Note:** Distil-Whisper is currently only available for English speech recognition. We are working with the community
 to distill Whisper on other languages. If you are interested in distilling Whisper in your language, check out the
@@ -168,9 +168,9 @@ result = pipe("https://huggingface.co/datasets/sanchit-gandhi/librispeech_long/r

 ### Speculative Decoding

-Distil-Whisper can be used as an assistant model to Whisper for speculative decoding. Speculative decoding mathematically
-ensures the exact same outputs as Whisper are obtained while being 2 times faster. This makes it the perfect drop-in
-replacement for existing Whisper pipelines, since the same outputs are guaranteed.
+Distil-Whisper can be used as an assistant model to Whisper for [speculative decoding](https://huggingface.co/blog/whisper-speculative-decoding).
+Speculative decoding mathematically ensures the exact same outputs as Whisper are obtained while being 2 times faster.
+This makes it the perfect drop-in replacement for existing Whisper pipelines, since the same outputs are guaranteed.

 In the following code-snippet, we load the assistant Distil-Whisper model standalone to the main Whisper pipeline. We then
 specify it as the "assistant model" for generation:
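The snippet referenced above sits outside this hunk's context, so it does not appear in the diff. A minimal sketch of what such a pipeline can look like, assuming the `transformers` assisted-generation API (passing `assistant_model` through `generate_kwargs`); the `max_new_tokens` value and the `audio.mp3` input are illustrative placeholders, not the README's exact code:

```python
import torch
from transformers import AutoModelForSpeechSeq2Seq, AutoProcessor, pipeline

device = "cuda:0" if torch.cuda.is_available() else "cpu"
torch_dtype = torch.float16 if torch.cuda.is_available() else torch.float32

# Draft model: Distil-Whisper cheaply proposes candidate tokens.
assistant_model = AutoModelForSpeechSeq2Seq.from_pretrained(
    "distil-whisper/distil-large-v2", torch_dtype=torch_dtype, low_cpu_mem_usage=True
).to(device)

# Main model: Whisper verifies the candidates, so the final transcript is
# exactly what Whisper alone would produce, only faster.
model_id = "openai/whisper-large-v2"
model = AutoModelForSpeechSeq2Seq.from_pretrained(
    model_id, torch_dtype=torch_dtype, low_cpu_mem_usage=True
).to(device)
processor = AutoProcessor.from_pretrained(model_id)

pipe = pipeline(
    "automatic-speech-recognition",
    model=model,
    tokenizer=processor.tokenizer,
    feature_extractor=processor.feature_extractor,
    max_new_tokens=128,
    generate_kwargs={"assistant_model": assistant_model},  # enables speculative decoding
    torch_dtype=torch_dtype,
    device=device,
)

result = pipe("audio.mp3")  # hypothetical input; any local file or URL works
print(result["text"])
```

Because Distil-Whisper keeps Whisper's tokenizer and vocabulary, the main model can verify the draft tokens directly, which is what guarantees output identical to running Whisper on its own.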