metadata
license: apache-2.0
language:
- de
library_name: transformers
pipeline_tag: automatic-speech-recognition
model-index:
- name: whisper-large-v3-german by Florian Zimmermeister @primeLine
  results:
  - task:
      type: automatic-speech-recognition
      name: Speech Recognition
    dataset:
      name: Common Voice de
      type: common_voice_15
      args: de
    metrics:
    - type: wer
      value: 3.5
      name: Test WER
    - type: cer
      value: 1
      name: Test CER
Summary
This model card describes a model based on Whisper Large v3 that has been fine-tuned for German speech recognition. Whisper is a speech recognition model developed by OpenAI; this fine-tuned version has been optimized specifically for processing and recognizing German speech.
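The metadata reports a test WER of 3.5 and a test CER of 1 on Common Voice de, presumably as percentages. As a rough illustration of what these two metrics measure, the sketch below computes word and character error rates from a Levenshtein edit distance. It is not the evaluation script used for this model, and it applies no text normalization, since the card does not say which normalization (if any) was used.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (works for word lists and strings)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate in percent: word-level edits divided by reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return 100.0 * edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate in percent: character-level edits divided by reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return 100.0 * edit_distance(reference, hypothesis) / len(reference)
```

For example, a five-word reference with one misrecognized word gives a WER of 20 percent, while the CER for the same pair is much lower because only one character differs.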
Applications
This model can be used in various application areas, including:
- Transcription of spoken German language
- Voice commands and voice control
- Automatic subtitling for German videos
- Voice-based search queries in German
- Dictation functions in word processing programs
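A minimal transcription sketch for the use cases above, using the `transformers` library named in the metadata. The Hub ID `primeline/whisper-large-v3-german` is an assumption inferred from the model name and author handle; the card itself does not state it. The helper names `build_transcriber` and `transcribe` are illustrative only.

```python
# Assumption: the checkpoint is published on the Hugging Face Hub under this ID.
MODEL_ID = "primeline/whisper-large-v3-german"


def build_transcriber(model_id: str = MODEL_ID, device: str = "cpu"):
    """Create a speech-recognition pipeline (downloads the weights on first use)."""
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    return pipeline(
        task="automatic-speech-recognition",
        model=model_id,
        device=device,
        chunk_length_s=30,  # split long recordings into 30-second chunks
    )


def transcribe(audio_path: str, transcriber=None) -> str:
    """Return the German transcript for one audio file."""
    if transcriber is None:
        transcriber = build_transcriber()
    result = transcriber(
        audio_path,
        # Force German transcription instead of relying on language detection.
        generate_kwargs={"language": "german", "task": "transcribe"},
    )
    return result["text"].strip()


# Example call (needs the model weights and ffmpeg for decoding audio files):
# print(transcribe("recording.wav"))
```

Building the pipeline once and passing it to `transcribe` for each file avoids reloading the model on every call.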
Training data
The training data for this model includes a large amount of spoken German from various sources. The data was carefully selected and processed to optimize recognition performance.
Training process
The model was trained with the following hyperparameters:
- Batch size: 1024
- Epochs: 2
- Learning rate: 1e-5
- Data augmentation: No
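A batch size of 1024 is usually reached by combining several devices with gradient accumulation. The sketch below shows that arithmetic under stated assumptions: the device count, per-device batch size, and dataset size are hypothetical values chosen for illustration, since the card does not describe the hardware setup or the number of training examples.

```python
import math
from dataclasses import dataclass


@dataclass
class TrainingSetup:
    global_batch_size: int = 1024     # from the model card
    epochs: int = 2                   # from the model card
    learning_rate: float = 1e-5       # from the model card
    num_devices: int = 8              # hypothetical, not stated in the card
    per_device_batch_size: int = 16   # hypothetical, not stated in the card

    @property
    def gradient_accumulation_steps(self) -> int:
        """Accumulation steps needed so one optimizer step sees the global batch."""
        per_step = self.num_devices * self.per_device_batch_size
        if self.global_batch_size % per_step != 0:
            raise ValueError(
                "global batch size must be divisible by devices * per-device batch size"
            )
        return self.global_batch_size // per_step

    def optimizer_steps(self, num_examples: int) -> int:
        """Total optimizer updates over all epochs for a dataset of the given size."""
        return self.epochs * math.ceil(num_examples / self.global_batch_size)


setup = TrainingSetup()
print(setup.gradient_accumulation_steps)   # 1024 / (8 * 16) = 8
print(setup.optimizer_steps(1_000_000))    # hypothetical dataset of one million clips
```

With such a large batch and only two epochs, the total number of optimizer updates stays small, which is one reason a low learning rate like 1e-5 is a common choice for fine-tuning.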
Model author: Florian Zimmermeister