benderrodriguez commited on
Commit
9ab26e5
1 Parent(s): dcd6a69

Benchmark description update

Browse files
Files changed (1) hide show
  1. src/about.py +3 -3
src/about.py CHANGED
@@ -54,17 +54,17 @@ The following datasets are used in our evaluation:
54
  "SASPEECH: A Hebrew Single Speaker Dataset for Text To Speech and Voice Conversion" (Sharoni, O., Shenberg, R., Cooper, E. (2023) SASPEECH: A Hebrew Single Speaker Dataset for Text To Speech and Voice Conversion. Proc. INTERSPEECH 2023,)
55
 
56
  ### [google/fleurs/he](https://huggingface.co/datasets/google/fleurs)
57
- - **Size**: X hours
58
  - **Domain**: Read speech covering common topics and phrases in Hebrew
59
  - **Source**: Created as part of Google's FLEURS project, designed for multilingual speech tasks and evaluation. Data collected through crowdsourcing from Hebrew speakers.
60
 
61
  ### [mozilla-foundation/common_voice_17_0/he](https://huggingface.co/datasets/mozilla-foundation/common_voice_17_0)
62
- - **Size**: X hours (test set of the corpus)
63
  - **Domain**: Read sentences in Hebrew from various texts.
64
  - **Source**: Collected through Mozilla's Common Voice initiative, where volunteers contribute recordings and validate other speakers' contributions
65
 
66
  ### [imvladikon/hebrew_speech_kan](https://huggingface.co/datasets/imvladikon/hebrew_speech_kan)
67
- - **Size**: 1.7 hours (validation setof the corpus)
68
  - **Domain**: Varied content types from the Kan (Israeli Public Broadcasting Corporation) youtube channel
69
  - **Source**: Published by Vladimir Gurevich. Scraped audio and subtitles data from YouTube channel "כאן" (Kan).
70
  """
 
54
  "SASPEECH: A Hebrew Single Speaker Dataset for Text To Speech and Voice Conversion" (Sharoni, O., Shenberg, R., Cooper, E. (2023) SASPEECH: A Hebrew Single Speaker Dataset for Text To Speech and Voice Conversion. Proc. INTERSPEECH 2023,)
55
 
56
  ### [google/fleurs/he](https://huggingface.co/datasets/google/fleurs)
57
+ - **Size**: 2 hours (test set of the corpus)
58
  - **Domain**: Read speech covering common topics and phrases in Hebrew
59
  - **Source**: Created as part of Google's FLEURS project, designed for multilingual speech tasks and evaluation. Data collected through crowdsourcing from Hebrew speakers.
60
 
61
  ### [mozilla-foundation/common_voice_17_0/he](https://huggingface.co/datasets/mozilla-foundation/common_voice_17_0)
62
+ - **Size**: 2 hours (validated set of the corpus)
63
  - **Domain**: Read sentences in Hebrew from various texts.
64
  - **Source**: Collected through Mozilla's Common Voice initiative, where volunteers contribute recordings and validate other speakers' contributions
65
 
66
  ### [imvladikon/hebrew_speech_kan](https://huggingface.co/datasets/imvladikon/hebrew_speech_kan)
67
+ - **Size**: 1.7 hours (validation set of the corpus)
68
  - **Domain**: Varied content types from the Kan (Israeli Public Broadcasting Corporation) youtube channel
69
  - **Source**: Published by Vladimir Gurevich. Scraped audio and subtitles data from YouTube channel "כאן" (Kan).
70
  """