ardneebwar commited on
Commit
13c1e51
1 Parent(s): d27efe8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -4
README.md CHANGED
@@ -15,25 +15,36 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  # hubert-finetuned-animals
17
 
18
- This model is a fine-tuned version of [facebook/hubert-base-ls960](https://huggingface.co/facebook/hubert-base-ls960) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
  - Loss: 0.5596
21
  - Accuracy: 0.95
22
 
23
  ## Model description
24
 
25
- More information needed
26
 
27
  ## Intended uses & limitations
28
 
29
- More information needed
 
 
 
 
30
 
31
  ## Training and evaluation data
32
 
33
- More information needed
34
 
35
  ## Training procedure
36
 
 
 
 
 
 
 
 
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
@@ -68,3 +79,7 @@ The following hyperparameters were used during training:
68
  - Pytorch 2.0.1+cu118
69
  - Datasets 2.14.5
70
  - Tokenizers 0.13.3
 
 
 
 
 
15
 
16
  # hubert-finetuned-animals
17
 
18
+ This model, `hubert-finetuned-animals`, is a fine-tuned version of `facebook/hubert-base-ls960` specifically for the task of animal sound classification. The model has been trained to identify various animal sounds from a subset of the ESC-50 dataset, focusing exclusively on animal categories.
19
  It achieves the following results on the evaluation set:
20
  - Loss: 0.5596
21
  - Accuracy: 0.95
22
 
23
  ## Model description
24
 
25
+ The HuBERT model, originally trained on large amounts of unlabelled audio data, has been fine-tuned here for a downstream task of animal sound classification. This fine-tuning allows the model to specialize in recognizing distinct animal sounds, such as those of dogs, cats, birds, etc., which can be particularly useful in applications such as bioacoustic monitoring, educational tools, and more interactive forms of wildlife conservation efforts.
26
 
27
  ## Intended uses & limitations
28
 
29
+ This model is intended for the classification of specific animal sounds within audio clips. It can be used in software applications related to wildlife research, educational content related to animals, or for entertainment purposes where animal sound recognition is needed.
30
+
31
+ ### Limitations
32
+
33
+ While the model shows high accuracy, it is trained on a limited set of categories from the ESC-50 dataset, which may not cover all possible animal sounds. The performance can vary significantly with audio quality, background noise, and animal sound variations not represented in the training data.
34
 
35
  ## Training and evaluation data
36
 
37
+ The model was fine-tuned on a subset of the ESC-50 dataset, which is a publicly available collection designed for environmental sound classification tasks. This subset specifically includes only the categories relevant to animal sounds. Each category in the dataset contains 40 examples, providing a diverse set of samples for model training and evaluation.
38
 
39
  ## Training procedure
40
 
41
+ The model was fine-tuned using the following procedure:
42
+
43
+ 1. Preprocessing: Audio files were converted into spectrograms.
44
+ 2. Data Split: The data was split into 70% training, 20% testing sets and 10% validation sets.
45
+ 3. Fine-tuning: The model was fine-tuned for 10 epochs on the training set.
46
+ 4. Evaluation: The model's performance was evaluated on the validation set after each epoch to monitor improvement and prevent overfitting.
47
+
48
  ### Training hyperparameters
49
 
50
  The following hyperparameters were used during training:
 
79
  - Pytorch 2.0.1+cu118
80
  - Datasets 2.14.5
81
  - Tokenizers 0.13.3
82
+
83
+ ### Github Repository
84
+
85
+ [Animal Sound Classification](https://github.com/rawbeen248/audio_classification_finetuning)