speechbrain
/

slu-timers-and-such-direct-librispeech-asr

Spoken language understanding

Model card Files Files and versions Community

lorenlugosch commited on Apr 4, 2021

Commit

c0375f0

•

1 Parent(s): d15cb46

Update README.md

Files changed (1) hide show

README.md +13 -2

README.md CHANGED Viewed

@@ -13,7 +13,18 @@ metrics:
 # End-to-end SLU model for Timers and Such
-Attention-based RNN sequence-to-sequence model for Timers and Such trained on the `train-real` subset. Achieves  86.7% accuracy on `test-real`.
 #### Referencing SpeechBrain
@@ -24,7 +35,7 @@ title = {SpeechBrain},
 year = {2021},
 publisher = {GitHub},
 journal = {GitHub repository},
-howpublished = {\\\\\\\\url{https://github.com/speechbrain/speechbrain}},
 }
 ```

 # End-to-end SLU model for Timers and Such
+Attention-based RNN sequence-to-sequence model for [Timers and Such](https://zenodo.org/record/4623772) trained on the `train-real` subset. This model checkpoint achieves 86.7% accuracy on `test-real`.
+The model uses an ASR model trained on LibriSpeech (`speechbrain/asr-crdnn-rnnlm-librispeech`) to extract features from the input audio, then maps these features to an intent and slot labels using a beam search.
+The dataset has four intents: `SetTimer`, `SetAlarm`, `SimpleMath`, and `UnitConversion`. Try testing the model by saying something like "set a timer for 5 minutes" or "what's 32 degrees Celsius in Fahrenheit?"
+You can try the model on the `math.wav` file included here as follows:
+```
+from speechbrain.pretrained import EndToEndSLU
+slu = EndToEndSLU.from_hparams("speechbrain/slu-timers-and-such-direct-librispeech-asr")
+slu.decode_file("math.wav")
+```
 #### Referencing SpeechBrain
 year = {2021},
 publisher = {GitHub},
 journal = {GitHub repository},
+howpublished = {\\\\\\\\\\\\\\\\url{https://github.com/speechbrain/speechbrain}},
 }
 ```