Spaces:

chrisjay
/

afro-speech

Build error

App Files Files Community

chrisjay commited on May 18, 2022

Commit

7a577ae

•

1 Parent(s): 73257d5

final commits to project

Browse files

Files changed (3) hide show

app.py +4 -3
article.py +9 -5
data +1 -1

app.py CHANGED Viewed

@@ -214,13 +214,14 @@ This is a platform to contribute to your African language by recording your voic
 markdown="""
 # 🌍 African Digits Recording Sprint
-> Record numbers 0-9 in your African language.
 1. Fill in your email. This is completely optional. We need this to track your progress for the prize.
 2. Choose your African language
 3. Fill in the speaker metadata (age, gender, accent). This is optional but important to build better speech models.
 4. You will see the image of a number __(this is the number you will record)__.
-5. Fill in the word of that number (optional)
 6. Click record and say the number in your African language.
 7. Click ‘Submit’. It will save your record and go to the next number.
 8. Repeat 4-7
@@ -232,7 +233,7 @@ markdown="""
 block = gr.Blocks(css=BLOCK_CSS)
 with block:
     gr.Markdown(markdown)
-    email = gr.inputs.Textbox(placeholder='your email',label="Email (if you want join the sprint)",default='')
     with gr.Tabs():
         with gr.TabItem('Record'):

 markdown="""
 # 🌍 African Digits Recording Sprint
+> Record numbers 0-9 in your African language. The ten people who record the most from 19th May - 19th June will each receive a prize.
 1. Fill in your email. This is completely optional. We need this to track your progress for the prize.
+__Note:__  You should record all numbers shown till the end. It does not count if you stop mid-way.
 2. Choose your African language
 3. Fill in the speaker metadata (age, gender, accent). This is optional but important to build better speech models.
 4. You will see the image of a number __(this is the number you will record)__.
+5. Fill in the word of that number (optional). You can leave this blank.
 6. Click record and say the number in your African language.
 7. Click ‘Submit’. It will save your record and go to the next number.
 8. Repeat 4-7
 block = gr.Blocks(css=BLOCK_CSS)
 with block:
     gr.Markdown(markdown)
+    email = gr.inputs.Textbox(placeholder='your email',label="Email (Your email is not made public. We need it to consider you for the prize.)",default='')
     with gr.Tabs():
         with gr.TabItem('Record'):

article.py CHANGED Viewed

@@ -7,14 +7,17 @@ Existing speech recognition services are not available in many African languages
 This dataset will boost speech technologies (like speech-to-text, text-to-speech, speech translation, and modeling) for African languages, which hitherto had little or no public dataset.
-**Note:** This is a continuous effort. This sprint is just to kick-start the event.
 **Benefits of such a dataset**
-- Useful dataset to introduce people to audio-related Machine Learning. It can be used as a simple training and/or evaluation dataset for speech processing tasks.
 **About the dataset**
-- The data (metadat,text, and audio recording) are uploaded to [a public Hugging Face dataset](https://huggingface.co/datasets/chrisjay/crowd-speech-africa).
 - We do not collect your name, address or other sensitive information.
 - If for some reason you want to remove your entry, please reach out by email.
 - Your email, if given, is used only to keep track of your progress in order to give the prizes to the top scorers. They are temporarily stored in [this private dataset](https://huggingface.co/datasets/chrisjay/african-digits-recording-sprint-email) and immediately deleted after the sprint.
@@ -22,7 +25,8 @@ This dataset will boost speech technologies (like speech-to-text, text-to-speech
 **Contact**
 In case of questions, issues or anything contact Chris Emezue at:
-- chris@huggingface.co
 """

 This dataset will boost speech technologies (like speech-to-text, text-to-speech, speech translation, and modeling) for African languages, which hitherto had little or no public dataset.
+**Note:** This is a continuous effort. This sprint is just to kick-start the event. Please feel free to share with your family and friends and keep recording more.
 **Benefits of such a dataset**
+- Useful dataset to learn audio-related Machine Learning (automatics speech recognition, text-to-speech, other types of speech processing).
+- It can be used as a simple training and/or evaluation dataset for speech processing tasks.
+- Very easy dataset to train your model on and get good results. With this dataset, you can easily train a model to recognize numbers in your language.
+- Opens up opportunities for more sophisticated speech processing models for African languages.
 **About the dataset**
+- The data (metadata, text, and audio recording) are uploaded to [a public Hugging Face dataset](https://huggingface.co/datasets/chrisjay/crowd-speech-africa). [This](https://huggingface.co/spaces/chrisjay/afro-speech/blob/main/app.py#L90-L106) is the part of our code that handles the upload.
 - We do not collect your name, address or other sensitive information.
 - If for some reason you want to remove your entry, please reach out by email.
 - Your email, if given, is used only to keep track of your progress in order to give the prizes to the top scorers. They are temporarily stored in [this private dataset](https://huggingface.co/datasets/chrisjay/african-digits-recording-sprint-email) and immediately deleted after the sprint.
 **Contact**
 In case of questions, issues or anything contact Chris Emezue at:
+- Email: chris@huggingface.co
+- [Twitter](https://twitter.com/ChrisEmezue)
+- [Telegram](https://t.me/realchrisjay)
 """

data CHANGED Viewed

	@@ -1 +1 @@
1	- Subproject commit ~~f378d5cd72892211c6ff30dbecf891f953836e0f~~


1	+ Subproject commit 4dcfaf1e20e56703bd47ac80aafcaddd43af279b