ARespiratory audio classification model

This model classifies respiratory audio recordings from the ICBHI 2017 Challenge dataset into crackles, wheezes, both, or none (multi-label classification). It utilizes the AST encoder (MIT/ast-finetuned-audioset-14-14-0.443) with a lightweight classification head.

The model has been pushed to the Hub using the PyTorchModelHubMixin integration.


Dataset


Performance metrics

Label F1 Precision Recall AUC
Crackle 0.6756 0.6147 0.7500 0.7033
Wheeze 0.4853 0.6565 0.3849 0.8031
Macro Avg 0.5805 0.6356 0.5674 0.7532

Usage

Run inference using s05_inference.py.
Ensure you install the necessary dependencies. For setup instructions, please see the documentation.


Notes

For additional details, check the documentation notes.


Contact

For any questions or further information, feel free to reach out via email: fabiocat@mit.edu.

Downloads last month
51
Safetensors
Model size
85.8M params
Tensor type
F32
·
Inference API
Unable to determine this model's library. Check the docs .