michaellin/morse-translate-transcribe-100k
Viewer • Updated • 100k • 27
This model is fine-tuned off of openai/whisper-small of 100k synthetic samples of Morse Code audio, transcription (raw capitalized decoded text), and translation (English interpretation). It uses the unused <|startoflm|> token as a language marker.
This model was trained off of synthetically generated text generated by Claude by Anthropic. It may contain biases of the underlying model, especially when using the translation task. Much of the generated text is relevant to the amateur radio domain.
Audio data has been generated with stochastically determined tone, noise, and WPM, off of the transcribed text ground truth.
This model was fine-tuned on a Nvidia RTX 4070 Ti Super for approximately 48 hours over the course of 15 epochs.
Base model
openai/whisper-small