OthmaneJ committed on
Commit
c7a9e39
β€’
1 Parent(s): 4800fde

with doc (no quantization)

Browse files
Files changed (1) hide show
  1. app.py +5 -5
app.py CHANGED
@@ -11,8 +11,8 @@ processor = Wav2Vec2Processor.from_pretrained(model_name)
11
  model = Wav2Vec2ForCTC.from_pretrained(model_name)
12
 
13
  # quantization
14
- model.eval()
15
- model_int8 = torch.quantization.quantize_dynamic(model, dtype=torch.qint8,inplace = True,)
16
 
17
  # define function to read in sound file
18
  # def map_to_array(file):
@@ -33,8 +33,8 @@ def inference(audio):
33
 
34
  inputs = gr.inputs.Audio(label="Input Audio", type="file")
35
  outputs = gr.outputs.Textbox(label="Output Text")
36
- title = "distilled wav2vec 2.0 (with quantization)"
37
- description = "Gradio demo for Robust wav2vec 2.0. To use it, simply upload your audio, or click one of the examples to load them. Read more at the links below. Currently supports .wav and .flac files"
38
- article = "<p style='text-align: center'><a href='https://arxiv.org/abs/2104.01027' target='_blank'>Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training</a> | <a href='https://github.com/pytorch/fairseq' target='_blank'>Github Repo</a></p>"
39
  examples=[['poem.wav']]
40
  gr.Interface(inference, inputs, outputs, title=title, description=description, article=article, examples=examples).launch()
 
11
  model = Wav2Vec2ForCTC.from_pretrained(model_name)
12
 
13
  # quantization
14
+ # model.eval()
15
+ # model_int8 = torch.quantization.quantize_dynamic(model, dtype=torch.qint8,inplace = True,)
16
 
17
  # define function to read in sound file
18
  # def map_to_array(file):
 
33
 
34
  inputs = gr.inputs.Audio(label="Input Audio", type="file")
35
  outputs = gr.outputs.Textbox(label="Output Text")
36
+ title = "distilled wav2vec 2.0"
37
+ description = "Gradio demo for a distilled wav2vec 2.0 (4x faster than large wav2vec 2.0, and 16x times smaller than base wav2vec 2.0 if combined with quantization). To use it, simply upload your audio, or click one of the examples to load them. Read more at the links below. Currently supports .wav and .flac files"
38
+ article = "<p style='text-align: center'><a href='https://github.com/OthmaneJ/distil-wav2vec2' target='_blank'> Github repo for demonstration </a> | <a href='https://huggingface.co/OthmaneJ/distil-wav2vec2' target='_blank'>Pretrained model</a></p>"
39
  examples=[['poem.wav']]
40
  gr.Interface(inference, inputs, outputs, title=title, description=description, article=article, examples=examples).launch()