---
license: mit
language:
- en

---
### Model Description:

This is a wav2vec2-base model pre-trained on the My Science Tutor (MyST) dataset, using the 380-hour train split (available from [LDC](https://catalog.ldc.upenn.edu/LDC2021S05)).

Pre-training was done with [fairseq](https://github.com/facebookresearch/fairseq/blob/main/examples/wav2vec/README.md), using the `wav2vec2_base_librispeech` config.

The converged checkpoint was converted from a PyTorch model to a Hugging Face model using a modified version of the [conversion script](https://github.com/huggingface/transformers/blob/main/src/transformers/models/wav2vec2/convert_wav2vec2_original_pytorch_checkpoint_to_pytorch.py) provided by Hugging Face.
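
As a rough illustration of how the converted checkpoint might be used, the sketch below loads it with `transformers` and extracts frame-level features. It rests on several assumptions not stated in this card: `MODEL_ID` is a hypothetical placeholder (replace it with this model's repo ID or a local path), the input audio is mono at 16 kHz, and the convolutional feature encoder keeps the standard wav2vec2-base kernel and stride settings. Since the model is only pre-trained, it has no task head, so it would presumably need fine-tuning before it can be used for speech recognition.

```python
"""Sketch: extract frame-level features from the converted checkpoint."""

# Hypothetical placeholder: replace with this model's Hub repo ID or a local path.
MODEL_ID = "path/to/converted-wav2vec2-base-myst"

# Feature-encoder layout of the standard wav2vec2-base config
# (assumed unchanged here; not confirmed by the model card).
CONV_KERNELS = (10, 3, 3, 3, 3, 2, 2)
CONV_STRIDES = (5, 2, 2, 2, 2, 2, 2)
SAMPLE_RATE = 16_000  # assumed input sampling rate in Hz


def expected_num_frames(num_samples: int) -> int:
    """Return how many feature frames the encoder emits for a raw waveform."""
    frames = num_samples
    for kernel, stride in zip(CONV_KERNELS, CONV_STRIDES):
        # Output length of an unpadded 1-D convolution.
        frames = (frames - kernel) // stride + 1
    return frames


def extract_features(waveform, model_id: str = MODEL_ID):
    """Run the encoder on a 1-D float waveform sampled at SAMPLE_RATE."""
    # Imported lazily so the helper above works without torch/transformers.
    import torch
    from transformers import Wav2Vec2FeatureExtractor, Wav2Vec2Model

    # Built by hand in case the converted checkpoint ships no preprocessor config.
    extractor = Wav2Vec2FeatureExtractor(
        feature_size=1,
        sampling_rate=SAMPLE_RATE,
        padding_value=0.0,
        do_normalize=True,
        return_attention_mask=False,
    )
    model = Wav2Vec2Model.from_pretrained(model_id).eval()
    inputs = extractor(waveform, sampling_rate=SAMPLE_RATE, return_tensors="pt")
    with torch.no_grad():
        outputs = model(inputs.input_values)
    # Shape: (1, num_frames, hidden_size); hidden_size is 768 for the base architecture.
    return outputs.last_hidden_state
```

Calling `extract_features(waveform)` on one second of audio should yield `expected_num_frames(16000)` frames, which works out to roughly one frame per 20 ms under the assumptions above.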