reazonspeech-espnet-v1

reazonspeech-espnet-v1 is an ESPnet model trained for Japanese automatic speech recognition (ASR).

  • This model was trained on 15,000 hours of ReazonSpeech corpus.
  • Make sure that your audio file is sampled at 16khz when using this model.

For more details, please visit the official project page.

Downloads last month
12
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train reazon-research/reazonspeech-espnet-v1

Spaces using reazon-research/reazonspeech-espnet-v1 2