Spaces: Sandiago21/automatic-speech-recognition-spanish

Sandiago21 committed on Aug 4, 2023

Commit f65e443 • 1 Parent(s): 6301fc2

Upload folder using huggingface_hub

Files changed (4)

README.md +3 -9
app.py +54 -0
example.wav +0 -0
requirements.txt +3 -0

README.md CHANGED

@@ -1,12 +1,6 @@
 ---
-title: Automatic Speech Recognition Spanish
-emoji: 🏆
-colorFrom: yellow
-colorTo: pink
-sdk: gradio
-sdk_version: 3.39.0
+title: automatic-speech-recognition-spanish
 app_file: app.py
-pinned: false
+sdk: gradio
+sdk_version: 3.36.0
 ---
-
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

app.py ADDED

@@ -0,0 +1,54 @@

+import torch
+import gradio as gr
+from transformers import pipeline
+model_id = "Sandiago21/whisper-large-v2-spanish"  # update with your model id
+pipe = pipeline("automatic-speech-recognition", model=model_id)
+title = "Automatic Speech Recognition (ASR)"
+description = """
+Demo for automatic speech recognition in Spanish. This demo uses the [Sandiago21/whisper-large-v2-spanish](https://huggingface.co/Sandiago21/whisper-large-v2-spanish) checkpoint, which is based on OpenAI's
+[Whisper](https://huggingface.co/openai/whisper-large-v2) model and fine-tuned on a Spanish audio dataset.
+![Automatic Speech Recognition (ASR)](https://datasets-server.huggingface.co/assets/huggingface-course/audio-course-images/--/huggingface-course--audio-course-images/train/2/image/image.png "Diagram of Automatic Speech Recognition (ASR)")
+"""
+def transcribe_speech(filepath):
+    output = pipe(
+        filepath,
+        max_new_tokens=256,
+        generate_kwargs={
+            "task": "transcribe",
+            "language": "spanish",
+        },  # update with the language you've fine-tuned on
+        chunk_length_s=30,
+        batch_size=8,
+    )
+    return output["text"]
+demo = gr.Blocks()
+mic_transcribe = gr.Interface(
+    fn=transcribe_speech,
+    inputs=gr.Audio(source="microphone", type="filepath"),
+    outputs=gr.outputs.Textbox(),
+    title=title,
+    description=description,
+)
+file_transcribe = gr.Interface(
+    fn=transcribe_speech,
+    inputs=gr.Audio(source="upload", type="filepath"),
+    outputs=gr.outputs.Textbox(),
+    examples=[["./example.wav"]],
+    title=title,
+    description=description,
+)
+with demo:
+    gr.TabbedInterface(
+        [mic_transcribe, file_transcribe],
+        ["Transcribe Microphone", "Transcribe Audio File"],
+    )
+demo.launch()
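
The `chunk_length_s=30` argument in `transcribe_speech` asks the pipeline to process long audio in 30-second windows. As a rough illustration of that idea only (this is not the `transformers` implementation), the sketch below computes overlapping window boundaries for a clip. The `chunk_bounds` helper and the 5-second overlap are assumptions made for this example.

```python
def chunk_bounds(total_s, chunk_s=30.0, overlap_s=5.0):
    """Return (start, end) offsets in seconds of overlapping windows.

    Hypothetical helper for illustration; overlap_s is an assumed value,
    not a setting taken from app.py.
    """
    if chunk_s <= overlap_s:
        raise ValueError("chunk_s must be larger than overlap_s")
    step = chunk_s - overlap_s  # how far each new window advances
    bounds = []
    start = 0.0
    while start < total_s:
        end = min(start + chunk_s, total_s)  # clamp the last window
        bounds.append((start, end))
        if end >= total_s:
            break
        start += step
    return bounds


# A 70-second clip is covered by three overlapping windows.
windows = chunk_bounds(70.0)
print(windows)
```

Overlapping the windows gives each chunk some shared context at its edges, which is the usual motivation for this style of long-form transcription.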

example.wav ADDED

Binary file (258 kB)

requirements.txt ADDED

@@ -0,0 +1,3 @@

+transformers
+torch
+gradio_client==0.2.7
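
Of the three requirements above, only `gradio_client` is pinned; `transformers` and `torch` resolve to whatever version is current when the Space is built. A minimal sketch for spotting unpinned entries follows, assuming simple `name==version` lines. The `unpinned` helper is hypothetical and ignores extras, environment markers, and other specifier forms.

```python
def unpinned(requirements_text):
    """Return requirement lines that lack an exact '==' pin."""
    names = []
    for raw in requirements_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        if "==" not in line:
            names.append(line)
    return names


# Contents of requirements.txt as added in this commit.
reqs = """transformers
torch
gradio_client==0.2.7
"""
print(unpinned(reqs))
```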