Spaces: asr-africa/Automatic_Speech_Recognition_for_African_Languages

Beijuka committed on Sep 27

Commit 96407d1 (verified) · 1 parent: f9d1370

Update src/streamlit_app.py

Files changed (1)

src/streamlit_app.py +15 -28

@@ -192,32 +192,19 @@ with tab5:
         We will build a test set that can be used for benchmarking ASR models in some of the 30 most spoken African languages. The benchmark dataset will be structured to consist of unique MP3 files and corresponding text files. We will ensure as much as possible that the benchmark datasets are as diverse as possible with dataset characteristics like gender, age, accent, variant, vocabulary, acoustic characteristics to help improve the accuracy of speech recognition models. The speech benchmark dataset will be reviewed, deemed highly quality, and split into dev, test and train sets. Due to the largely acoustic nature of African languages (mostly tonal, diacritical, etc.), a careful speech analysis of African languages is necessary and the benchmark dataset is important to spur more research in the African context.
     """)
     # Citation
-    CITATION_TEXT = """@misc{asr-africa-2025,
-    title        = {Automatic Speech Recognition for African Languages},
-    author       = {Dr Joyce Nakatumba-Nabende, Dr Peter Nabende, Dr Andrew Katumba, Alvin Nahabwe},
-    year         = 2025,
-    publisher    = {Hugging Face},
-    howpublished = "\\url{https://huggingface.co/spaces/asr-africa/Automatic_Speech_Recognition_for_African_Languages}"
-    }"""
-    with st.expander("📙 Citation", expanded=False):
-        st.text_area(
-            "Copy the BibTeX snippet to cite this source",
-            value=CITATION_TEXT,
-            height=150
-        )
-        st.markdown(
-            """
-            <script>
-            function copyText() {
-                const text = document.querySelector('textarea').value;
-                navigator.clipboard.writeText(text);
-                alert("Citation copied to clipboard!");
-            }
-            </script>
-            <button onclick="copyText()">Copy Citation</button>
-            """,
-            unsafe_allow_html=True
-        )

+CITATION_TEXT = """@misc{asr-africa-2025,
+title        = {Automatic Speech Recognition for African Languages},
+author       = {Dr Joyce Nakatumba-Nabende, Dr Peter Nabende, Dr Andrew Katumba, Alvin Nahabwe},
+year         = 2025,
+publisher    = {Hugging Face},
+howpublished = "\\url{https://huggingface.co/spaces/asr-africa/Automatic_Speech_Recognition_for_African_Languages}"
+}"""
+with st.expander("📙 Citation", expanded=False):
+    st.text_area(
+        "BibTeX snippet to cite this source",
+        value=CITATION_TEXT,
+        height=150,
+        disabled=True
+    )
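
A possible follow-up, offered as a sketch rather than as part of this commit: the change above drops the HTML/JavaScript copy button and shows the citation in a disabled `st.text_area`. If a one-click copy is still wanted, `st.code` may be a simpler fit, since recent Streamlit releases show a copy-to-clipboard icon on code blocks (worth confirming on the deployed version). The helper names `build_bibtex` and `render_citation` below are hypothetical and do not exist in `src/streamlit_app.py`. The sketch also joins the authors with `and`, which is how BibTeX separates names, and omits the "Dr" titles; both are editorial assumptions, not the committed text.

```python
def build_bibtex(key: str, fields: dict[str, str]) -> str:
    """Format a @misc BibTeX entry from a citation key and ordered fields."""
    width = max(len(name) for name in fields)  # align the '=' signs
    lines = [f"@misc{{{key},"]
    for name, value in fields.items():
        lines.append(f"  {name.ljust(width)} = {{{value}}},")
    lines.append("}")
    return "\n".join(lines)


# Author names taken from the diff above, titles omitted (assumption).
AUTHORS = [
    "Joyce Nakatumba-Nabende",
    "Peter Nabende",
    "Andrew Katumba",
    "Alvin Nahabwe",
]

CITATION_TEXT = build_bibtex(
    "asr-africa-2025",
    {
        "title": "Automatic Speech Recognition for African Languages",
        # BibTeX separates multiple authors with " and ", not commas.
        "author": " and ".join(AUTHORS),
        "year": "2025",
        "publisher": "Hugging Face",
        "howpublished": r"\url{https://huggingface.co/spaces/asr-africa/"
        r"Automatic_Speech_Recognition_for_African_Languages}",
    },
)


def render_citation() -> None:
    """Render the citation inside an expander (hypothetical helper)."""
    # Imported lazily so build_bibtex stays usable without Streamlit.
    import streamlit as st

    with st.expander("📙 Citation", expanded=False):
        # st.code renders a read-only monospace block.
        st.code(CITATION_TEXT, language="bibtex")
```

In the app, `render_citation()` would be called at the point where the citation expander currently sits. Keeping the string construction separate from the rendering means the BibTeX output can be checked without starting Streamlit.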