Audio Course documentation

Check your understanding of the course material

Hugging Face's logo
Join the Hugging Face community

and get access to the augmented documentation experience

to get started

Check your understanding of the course material

1. What is a vocoder?

2. Wav2Vec2 is an example of

3. What does a blank token in CTC algorithm do?

4. Which of the following statements about CTC models is FALSE?

5. Whisper is an example of

6. What is the easiest way to perform audio classification?

7. True or false? When treating spectrograms as images for classification, you will always benefit from image data augmentation techniques, such as shifting an image, cropping it, or resizing.

< > Update on GitHub