mrzbrt commited on
Commit
8deff84
1 Parent(s): d76da4b

End of training

Browse files
README.md ADDED
@@ -0,0 +1,86 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: ntu-spml/distilhubert
4
+ tags:
5
+ - generated_from_trainer
6
+ datasets:
7
+ - audiofolder
8
+ metrics:
9
+ - accuracy
10
+ model-index:
11
+ - name: distilhubert-finetuned-gtzan
12
+ results:
13
+ - task:
14
+ name: Audio Classification
15
+ type: audio-classification
16
+ dataset:
17
+ name: audiofolder
18
+ type: audiofolder
19
+ config: default
20
+ split: train
21
+ args: default
22
+ metrics:
23
+ - name: Accuracy
24
+ type: accuracy
25
+ value: 0.0
26
+ ---
27
+
28
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
29
+ should probably proofread and complete it, then remove this comment. -->
30
+
31
+ # distilhubert-finetuned-gtzan
32
+
33
+ This model is a fine-tuned version of [ntu-spml/distilhubert](https://huggingface.co/ntu-spml/distilhubert) on the audiofolder dataset.
34
+ It achieves the following results on the evaluation set:
35
+ - Loss: 5.7174
36
+ - Accuracy: 0.0
37
+
38
+ ## Model description
39
+
40
+ More information needed
41
+
42
+ ## Intended uses & limitations
43
+
44
+ More information needed
45
+
46
+ ## Training and evaluation data
47
+
48
+ More information needed
49
+
50
+ ## Training procedure
51
+
52
+ ### Training hyperparameters
53
+
54
+ The following hyperparameters were used during training:
55
+ - learning_rate: 5e-05
56
+ - train_batch_size: 8
57
+ - eval_batch_size: 8
58
+ - seed: 42
59
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
60
+ - lr_scheduler_type: linear
61
+ - lr_scheduler_warmup_ratio: 0.1
62
+ - num_epochs: 10
63
+ - mixed_precision_training: Native AMP
64
+
65
+ ### Training results
66
+
67
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
68
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|
69
+ | 5.6861 | 1.0 | 61 | 5.7174 | 0.0 |
70
+ | 5.573 | 2.0 | 122 | 5.7429 | 0.0 |
71
+ | 5.4992 | 3.0 | 183 | 5.7735 | 0.0 |
72
+ | 5.3129 | 4.0 | 244 | 5.7965 | 0.0 |
73
+ | 5.3243 | 5.0 | 305 | 5.8150 | 0.0 |
74
+ | 5.2456 | 6.0 | 366 | 5.7999 | 0.0 |
75
+ | 4.8339 | 7.0 | 427 | 5.8090 | 0.0 |
76
+ | 5.0512 | 8.0 | 488 | 5.8288 | 0.0 |
77
+ | 4.7789 | 9.0 | 549 | 5.8143 | 0.0 |
78
+ | 5.1463 | 10.0 | 610 | 5.8238 | 0.0 |
79
+
80
+
81
+ ### Framework versions
82
+
83
+ - Transformers 4.36.2
84
+ - Pytorch 2.1.0+cu121
85
+ - Datasets 2.16.0
86
+ - Tokenizers 0.15.0
config.json ADDED
@@ -0,0 +1,678 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "ntu-spml/distilhubert",
3
+ "activation_dropout": 0.1,
4
+ "apply_spec_augment": false,
5
+ "architectures": [
6
+ "HubertForSequenceClassification"
7
+ ],
8
+ "attention_dropout": 0.1,
9
+ "bos_token_id": 1,
10
+ "classifier_proj_size": 256,
11
+ "conv_bias": false,
12
+ "conv_dim": [
13
+ 512,
14
+ 512,
15
+ 512,
16
+ 512,
17
+ 512,
18
+ 512,
19
+ 512
20
+ ],
21
+ "conv_kernel": [
22
+ 10,
23
+ 3,
24
+ 3,
25
+ 3,
26
+ 3,
27
+ 2,
28
+ 2
29
+ ],
30
+ "conv_stride": [
31
+ 5,
32
+ 2,
33
+ 2,
34
+ 2,
35
+ 2,
36
+ 2,
37
+ 2
38
+ ],
39
+ "ctc_loss_reduction": "sum",
40
+ "ctc_zero_infinity": false,
41
+ "do_stable_layer_norm": false,
42
+ "eos_token_id": 2,
43
+ "feat_extract_activation": "gelu",
44
+ "feat_extract_norm": "group",
45
+ "feat_proj_dropout": 0.0,
46
+ "feat_proj_layer_norm": false,
47
+ "final_dropout": 0.0,
48
+ "hidden_act": "gelu",
49
+ "hidden_dropout": 0.1,
50
+ "hidden_size": 768,
51
+ "id2label": {
52
+ "0": "['Breathlike sound', 'Drone']",
53
+ "1": "['Breathlike sound', 'Noisy sound', 'Noise', 'Textural', 'Instrumental']",
54
+ "10": "['Cluster sound', 'Chop', 'Glitch']",
55
+ "100": "['Harmonic sound', 'Inharmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Textural']",
56
+ "101": "['Harmonic sound', 'Inharmonic sound', 'Instrumental']",
57
+ "102": "['Harmonic sound', 'Inharmonic sound', 'Noise', 'Textural', 'Instrumental']",
58
+ "103": "['Harmonic sound', 'Inharmonic sound', 'Noisy sound', 'Drone', 'Stab (attack resonance)', 'Noise', 'Instrumental']",
59
+ "104": "['Harmonic sound', 'Inharmonic sound', 'Noisy sound', 'Drone']",
60
+ "105": "['Harmonic sound', 'Inharmonic sound', 'Soundscape', 'Glitch', 'Instrumental']",
61
+ "106": "['Harmonic sound', 'Inharmonic sound', 'Soundscape']",
62
+ "107": "['Harmonic sound', 'Instrumental', 'Chirp']",
63
+ "108": "['Harmonic sound', 'Instrumental']",
64
+ "109": "['Harmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Synthesis', 'Honk']",
65
+ "11": "['Cluster sound', 'Chop', 'Impact', 'Synthesis']",
66
+ "110": "['Harmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Drone', 'Glitch', 'Synthesis', 'Noise', 'Percussive']",
67
+ "111": "['Harmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Noise', 'Textural', 'Instrumental']",
68
+ "112": "['Harmonic sound', 'Noisy sound', 'Drone', 'Synthesis', 'Noise']",
69
+ "113": "['Harmonic sound', 'Noisy sound', 'Drone', 'Synthesis']",
70
+ "114": "['Harmonic sound', 'Noisy sound', 'Glitch']",
71
+ "115": "['Harmonic sound', 'Noisy sound', 'Instrumental']",
72
+ "116": "['Harmonic sound', 'Noisy sound', 'Soundscape', 'Synthesis', 'Noise']",
73
+ "117": "['Harmonic sound', 'Noisy sound', 'Synthesis', 'Noise']",
74
+ "118": "['Harmonic sound', 'Soundscape', 'Vocal', 'Choral']",
75
+ "119": "['Harmonic sound', 'Stab (attack resonance)', 'Instrumental', 'Percussive']",
76
+ "12": "['Cluster sound', 'Composite or Stratified sound', 'Percussive']",
77
+ "120": "['Harmonic sound', 'Stab (attack resonance)', 'Instrumental']",
78
+ "121": "['Harmonic sound', 'Stab (attack resonance)', 'Noise']",
79
+ "122": "['Harmonic sound', 'Stab (attack resonance)', 'Percussive']",
80
+ "123": "['Harmonic sound', 'Sub', 'Synthesis']",
81
+ "124": "['Harmonic sound', 'Synthesis', 'Instrumental', 'Percussive']",
82
+ "125": "['Harmonic sound', 'Synthesis', 'Instrumental']",
83
+ "126": "['Harmonic sound', 'Synthesis', 'Textural']",
84
+ "127": "['Harmonic sound', 'Synthesis']",
85
+ "128": "['Harmonic sound', 'Textural', 'Instrumental']",
86
+ "129": "['Harmonic sound', 'Textural']",
87
+ "13": "['Cluster sound', 'Drone']",
88
+ "130": "['Harmonic sound', 'Vocal', 'Textural', 'Choral']",
89
+ "131": "['Harmonic sound']",
90
+ "132": "['Impact', 'Stab (attack resonance)', 'Synthesis', 'Percussive']",
91
+ "133": "['Impact', 'Synthesis']",
92
+ "134": "['Inharmonic sound', 'Breathlike sound', 'Composite or Stratified sound', 'Vocal', 'Textural', 'Percussive']",
93
+ "135": "['Inharmonic sound', 'Breathlike sound', 'Noisy sound', 'Composite or Stratified sound', 'Soundscape', 'Impact', 'Stab (attack resonance)', 'Synthesis', 'Noise']",
94
+ "136": "['Inharmonic sound', 'Breathlike sound', 'Noisy sound', 'Noise', 'Percussive']",
95
+ "137": "['Inharmonic sound', 'Breathlike sound', 'Noisy sound', 'Vocal', 'Textural', 'Percussive']",
96
+ "138": "['Inharmonic sound', 'Breathlike sound', 'Soundscape', 'Crackle']",
97
+ "139": "['Inharmonic sound', 'Breathlike sound', 'Soundscape', 'Glitch', 'Crackle', 'Noise', 'Textural']",
98
+ "14": "['Cluster sound', 'Glitch', 'Crackle']",
99
+ "140": "['Inharmonic sound', 'Chirp']",
100
+ "141": "['Inharmonic sound', 'Chop', 'Glitch', 'Scratch', 'Crackle']",
101
+ "142": "['Inharmonic sound', 'Cluster sound', 'Crackle', 'Percussive']",
102
+ "143": "['Inharmonic sound', 'Cluster sound', 'Glitch', 'Noise']",
103
+ "144": "['Inharmonic sound', 'Cluster sound', 'Glitch', 'Stab (attack resonance)']",
104
+ "145": "['Inharmonic sound', 'Cluster sound', 'Glitch', 'Synthesis', 'Percussive']",
105
+ "146": "['Inharmonic sound', 'Cluster sound', 'Impact']",
106
+ "147": "['Inharmonic sound', 'Cluster sound', 'Noisy sound', 'Chop', 'Glitch', 'Percussive']",
107
+ "148": "['Inharmonic sound', 'Combination or harmonic sounds', 'Drone', 'Synthesis']",
108
+ "149": "['Inharmonic sound', 'Combination or harmonic sounds', 'Synthesis', 'Instrumental']",
109
+ "15": "['Cluster sound', 'Glitch', 'Synthesis', 'Textural']",
110
+ "150": "['Inharmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Drone', 'Textural']",
111
+ "151": "['Inharmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Impact', 'Synthesis', 'Percussive']",
112
+ "152": "['Inharmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Soundscape', 'Synthesis']",
113
+ "153": "['Inharmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Synthesis', 'Scratch', 'Textural']",
114
+ "154": "['Inharmonic sound', 'Composite or Stratified sound', 'Drone', 'Synthesis', 'Crackle', 'Noise']",
115
+ "155": "['Inharmonic sound', 'Composite or Stratified sound', 'Drone', 'Synthesis', 'Crackle']",
116
+ "156": "['Inharmonic sound', 'Composite or Stratified sound', 'Drone', 'Synthesis']",
117
+ "157": "['Inharmonic sound', 'Composite or Stratified sound', 'Glitch', 'Impact', 'Synthesis', 'Percussive']",
118
+ "158": "['Inharmonic sound', 'Composite or Stratified sound', 'Glitch', 'Synthesis']",
119
+ "159": "['Inharmonic sound', 'Composite or Stratified sound', 'Scratch', 'Instrumental']",
120
+ "16": "['Cluster sound', 'Glitch']",
121
+ "160": "['Inharmonic sound', 'Composite or Stratified sound', 'Soundscape', 'Synthesis', 'Crackle', 'Textural']",
122
+ "161": "['Inharmonic sound', 'Composite or Stratified sound', 'Soundscape']",
123
+ "162": "['Inharmonic sound', 'Composite or Stratified sound', 'Synthesis', 'Crackle']",
124
+ "163": "['Inharmonic sound', 'Composite or Stratified sound', 'Synthesis', 'Textural']",
125
+ "164": "['Inharmonic sound', 'Composite or Stratified sound', 'Vocal', 'Noise', 'Textural']",
126
+ "165": "['Inharmonic sound', 'Crackle', 'Noise']",
127
+ "166": "['Inharmonic sound', 'Crackle', 'Percussive']",
128
+ "167": "['Inharmonic sound', 'Crackle']",
129
+ "168": "['Inharmonic sound', 'Drone', 'Noise']",
130
+ "169": "['Inharmonic sound', 'Drone', 'Sub']",
131
+ "17": "['Cluster sound', 'Impact', 'Stab (attack resonance)']",
132
+ "170": "['Inharmonic sound', 'Drone', 'Synthesis', 'Noise']",
133
+ "171": "['Inharmonic sound', 'Drone', 'Synthesis', 'Textural']",
134
+ "172": "['Inharmonic sound', 'Drone', 'Synthesis']",
135
+ "173": "['Inharmonic sound', 'Drone']",
136
+ "174": "['Inharmonic sound', 'Glitch', 'Noise', 'Percussive']",
137
+ "175": "['Inharmonic sound', 'Glitch', 'Percussive']",
138
+ "176": "['Inharmonic sound', 'Glitch', 'Synthesis', 'Chirp', 'Percussive']",
139
+ "177": "['Inharmonic sound', 'Glitch', 'Synthesis', 'Noise', 'Percussive']",
140
+ "178": "['Inharmonic sound', 'Glitch', 'Synthesis', 'Textural']",
141
+ "179": "['Inharmonic sound', 'Glitch', 'Synthesis']",
142
+ "18": "['Cluster sound', 'Impact', 'Synthesis', 'Percussive']",
143
+ "180": "['Inharmonic sound', 'Glitch', 'Vocal']",
144
+ "181": "['Inharmonic sound', 'Glitch']",
145
+ "182": "['Inharmonic sound', 'Impact']",
146
+ "183": "['Inharmonic sound', 'Instrumental', 'Percussive']",
147
+ "184": "['Inharmonic sound', 'Instrumental']",
148
+ "185": "['Inharmonic sound', 'Noise']",
149
+ "186": "['Inharmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Chop', 'Glitch', 'Vocal', 'Crackle', 'Noise', 'Textural']",
150
+ "187": "['Inharmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Drone', 'Glitch', 'Synthesis', 'Noise']",
151
+ "188": "['Inharmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Glitch', 'Synthesis', 'Crackle', 'Noise']",
152
+ "189": "['Inharmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Percussive']",
153
+ "19": "['Cluster sound', 'Impact', 'Synthesis', 'Vocal']",
154
+ "190": "['Inharmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Soundscape', 'Crackle', 'Noise', 'Textural']",
155
+ "191": "['Inharmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Synthesis']",
156
+ "192": "['Inharmonic sound', 'Noisy sound', 'Crackle', 'Noise']",
157
+ "193": "['Inharmonic sound', 'Noisy sound', 'Crackle', 'Textural']",
158
+ "194": "['Inharmonic sound', 'Noisy sound', 'Crackle']",
159
+ "195": "['Inharmonic sound', 'Noisy sound', 'Drone', 'Sub', 'Glitch', 'Noise', 'Textural', 'Instrumental']",
160
+ "196": "['Inharmonic sound', 'Noisy sound', 'Glitch', 'Noise']",
161
+ "197": "['Inharmonic sound', 'Noisy sound', 'Glitch', 'Synthesis', 'Noise', 'Percussive']",
162
+ "198": "['Inharmonic sound', 'Noisy sound', 'Glitch', 'Synthesis', 'Noise']",
163
+ "199": "['Inharmonic sound', 'Noisy sound', 'Glitch']",
164
+ "2": "['Breathlike sound', 'Noisy sound', 'Noise', 'Textural']",
165
+ "20": "['Cluster sound', 'Impact']",
166
+ "200": "['Inharmonic sound', 'Noisy sound', 'Impact', 'Synthesis', 'Noise', 'Percussive']",
167
+ "201": "['Inharmonic sound', 'Noisy sound', 'Impact', 'Textural']",
168
+ "202": "['Inharmonic sound', 'Noisy sound', 'Noise', 'Textural']",
169
+ "203": "['Inharmonic sound', 'Noisy sound', 'Noise']",
170
+ "204": "['Inharmonic sound', 'Noisy sound', 'Percussive']",
171
+ "205": "['Inharmonic sound', 'Noisy sound', 'Scratch', 'Crackle', 'Noise', 'Chirp']",
172
+ "206": "['Inharmonic sound', 'Noisy sound', 'Scratch', 'Crackle', 'Noise', 'Instrumental']",
173
+ "207": "['Inharmonic sound', 'Noisy sound', 'Sub', 'Glitch', 'Impact', 'Noise']",
174
+ "208": "['Inharmonic sound', 'Noisy sound', 'Sub', 'Synthesis', 'Noise']",
175
+ "209": "['Inharmonic sound', 'Noisy sound', 'Synthesis', 'Noise']",
176
+ "21": "['Cluster sound', 'Noisy sound', 'Drone', 'Noise']",
177
+ "210": "['Inharmonic sound', 'Noisy sound', 'Vocal']",
178
+ "211": "['Inharmonic sound', 'Noisy sound']",
179
+ "212": "['Inharmonic sound', 'Percussive']",
180
+ "213": "['Inharmonic sound', 'Soundscape', 'Textural']",
181
+ "214": "['Inharmonic sound', 'Soundscape']",
182
+ "215": "['Inharmonic sound', 'Stab (attack resonance)', 'Instrumental']",
183
+ "216": "['Inharmonic sound', 'Sub', 'Textural']",
184
+ "217": "['Inharmonic sound', 'Synthesis', 'Chirp']",
185
+ "218": "['Inharmonic sound', 'Synthesis', 'Noise', 'Percussive']",
186
+ "219": "['Inharmonic sound', 'Synthesis', 'Noise']",
187
+ "22": "['Cluster sound', 'Noisy sound', 'Glitch', 'Textural']",
188
+ "220": "['Inharmonic sound', 'Synthesis', 'Percussive']",
189
+ "221": "['Inharmonic sound', 'Synthesis', 'Textural']",
190
+ "222": "['Inharmonic sound', 'Synthesis']",
191
+ "223": "['Inharmonic sound', 'Textural', 'Instrumental']",
192
+ "224": "['Inharmonic sound', 'Textural']",
193
+ "225": "['Inharmonic sound', 'Vocal']",
194
+ "226": "['Inharmonic sound']",
195
+ "227": "['Instrumental', 'Percussive']",
196
+ "228": "['Instrumental']",
197
+ "229": "['Noise']",
198
+ "23": "['Cluster sound', 'Noisy sound', 'Synthesis']",
199
+ "230": "['Noisy sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Instrumental']",
200
+ "231": "['Noisy sound', 'Composite or Stratified sound', 'Drone', 'Glitch', 'Noise', 'Textural']",
201
+ "232": "['Noisy sound', 'Composite or Stratified sound', 'Drone', 'Synthesis', 'Noise']",
202
+ "233": "['Noisy sound', 'Composite or Stratified sound', 'Glitch', 'Noise', 'Percussive']",
203
+ "234": "['Noisy sound', 'Composite or Stratified sound', 'Glitch', 'Scratch', 'Crackle', 'Noise', 'Textural']",
204
+ "235": "['Noisy sound', 'Composite or Stratified sound', 'Soundscape', 'Crackle']",
205
+ "236": "['Noisy sound', 'Composite or Stratified sound', 'Soundscape', 'Drone', 'Crackle', 'Noise', 'Textural']",
206
+ "237": "['Noisy sound', 'Composite or Stratified sound', 'Soundscape', 'Glitch', 'Noise', 'Textural']",
207
+ "238": "['Noisy sound', 'Composite or Stratified sound', 'Soundscape', 'Impact', 'Stab (attack resonance)']",
208
+ "239": "['Noisy sound', 'Composite or Stratified sound', 'Soundscape', 'Percussive']",
209
+ "24": "['Cluster sound', 'Sub']",
210
+ "240": "['Noisy sound', 'Composite or Stratified sound', 'Sub', 'Glitch', 'Impact', 'Synthesis']",
211
+ "241": "['Noisy sound', 'Composite or Stratified sound', 'Synthesis', 'Noise']",
212
+ "242": "['Noisy sound', 'Composite or Stratified sound']",
213
+ "243": "['Noisy sound', 'Crackle', 'Instrumental']",
214
+ "244": "['Noisy sound', 'Crackle', 'Noise', 'Textural']",
215
+ "245": "['Noisy sound', 'Crackle']",
216
+ "246": "['Noisy sound', 'Drone', 'Honk']",
217
+ "247": "['Noisy sound', 'Drone']",
218
+ "248": "['Noisy sound', 'Glitch', 'Crackle', 'Noise', 'Textural']",
219
+ "249": "['Noisy sound', 'Glitch', 'Noise', 'Percussive']",
220
+ "25": "['Cluster sound', 'Synthesis']",
221
+ "250": "['Noisy sound', 'Glitch', 'Noise', 'Textural', 'Percussive']",
222
+ "251": "['Noisy sound', 'Glitch', 'Noise', 'Textural']",
223
+ "252": "['Noisy sound', 'Glitch', 'Noise']",
224
+ "253": "['Noisy sound', 'Glitch', 'Synthesis', 'Noise']",
225
+ "254": "['Noisy sound', 'Glitch']",
226
+ "255": "['Noisy sound', 'Impact']",
227
+ "256": "['Noisy sound', 'Noise']",
228
+ "257": "['Noisy sound', 'Percussive']",
229
+ "258": "['Noisy sound', 'Scratch', 'Crackle', 'Noise']",
230
+ "259": "['Noisy sound', 'Soundscape', 'Drone', 'Glitch', 'Synthesis']",
231
+ "26": "['Combination or harmonic sounds', 'Instrumental']",
232
+ "260": "['Noisy sound', 'Soundscape', 'Noise', 'Textural']",
233
+ "261": "['Noisy sound', 'Soundscape', 'Synthesis', 'Noise', 'Textural']",
234
+ "262": "['Noisy sound', 'Soundscape', 'Textural']",
235
+ "263": "['Noisy sound', 'Sub', 'Glitch', 'Impact', 'Percussive']",
236
+ "264": "['Noisy sound', 'Sub', 'Glitch', 'Impact', 'Synthesis', 'Percussive']",
237
+ "265": "['Noisy sound', 'Sub', 'Glitch', 'Synthesis', 'Noise']",
238
+ "266": "['Noisy sound', 'Sub', 'Glitch', 'Synthesis']",
239
+ "267": "['Noisy sound', 'Synthesis', 'Noise', 'Textural']",
240
+ "268": "['Percussive']",
241
+ "269": "['Sinusoidal sound', 'Harmonic sound', 'Inharmonic sound', 'Cluster sound', 'Breathlike sound', 'Noisy sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Soundscape', 'Drone', 'Chop', 'Sub', 'Glitch', 'Impact', 'Stab (attack resonance)', 'Synthesis', 'Vocal', 'Scratch', 'Crackle', 'Noise', 'Textural', 'Instrumental', 'Chirp', 'Percussive', 'Honk', 'Choral']",
242
+ "27": "['Combination or harmonic sounds', 'Soundscape', 'Instrumental']",
243
+ "270": "['Sinusoidal sound', 'Harmonic sound', 'Inharmonic sound', 'Cluster sound', 'Breathlike sound', 'Noisy sound', 'Composite or Stratified sound', 'Combination or harmonic sounds']",
244
+ "271": "['Sinusoidal sound', 'Harmonic sound', 'Inharmonic sound', 'Percussive']",
245
+ "272": "['Sinusoidal sound', 'Harmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Drone', 'Glitch', 'Synthesis', 'Crackle', 'Textural']",
246
+ "273": "['Sinusoidal sound', 'Harmonic sound', 'Noisy sound', 'Synthesis']",
247
+ "274": "['Sinusoidal sound', 'Harmonic sound', 'Sub', 'Stab (attack resonance)']",
248
+ "275": "['Sinusoidal sound', 'Inharmonic sound', 'Cluster sound', 'Breathlike sound', 'Noisy sound', 'Composite or Stratified sound']",
249
+ "276": "['Sinusoidal sound', 'Inharmonic sound', 'Cluster sound', 'Composite or Stratified sound', 'Chop', 'Synthesis']",
250
+ "277": "['Sinusoidal sound', 'Inharmonic sound', 'Cluster sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Drone', 'Synthesis', 'Choral']",
251
+ "278": "['Sinusoidal sound', 'Inharmonic sound', 'Cluster sound', 'Synthesis']",
252
+ "279": "['Sinusoidal sound', 'Inharmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Soundscape', 'Synthesis']",
253
+ "28": "['Combination or harmonic sounds', 'Synthesis']",
254
+ "280": "['Sinusoidal sound', 'Inharmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Synthesis']",
255
+ "281": "['Sinusoidal sound', 'Inharmonic sound', 'Sub']",
256
+ "282": "['Sinusoidal sound', 'Noisy sound', 'Percussive']",
257
+ "283": "['Sinusoidal sound', 'Synthesis', 'Chirp']",
258
+ "284": "['Sinusoidal sound', 'Synthesis']",
259
+ "285": "['Soundscape', 'Chop', 'Sub', 'Stab (attack resonance)', 'Instrumental']",
260
+ "286": "['Soundscape', 'Drone', 'Chop', 'Sub', 'Glitch', 'Impact', 'Stab (attack resonance)', 'Synthesis', 'Vocal', 'Scratch', 'Crackle', 'Noise', 'Textural', 'Instrumental', 'Chirp', 'Percussive', 'Honk', 'Choral']",
261
+ "287": "['Soundscape', 'Noise', 'Textural']",
262
+ "288": "['Soundscape', 'Synthesis', 'Textural']",
263
+ "289": "['Soundscape']",
264
+ "29": "['Combination or harmonic sounds', 'Textural']",
265
+ "290": "['Stab (attack resonance)', 'Synthesis', 'Honk']",
266
+ "291": "['Sub', 'Glitch', 'Impact', 'Synthesis', 'Noise']",
267
+ "292": "['Sub', 'Textural']",
268
+ "293": "['Synthesis', 'Textural']",
269
+ "294": "['Synthesis']",
270
+ "295": "['Textural']",
271
+ "296": "['Vocal', 'Choral']",
272
+ "297": "['Vocal', 'Instrumental', 'Percussive']",
273
+ "298": "['Vocal', 'Percussive']",
274
+ "299": "['Vocal']",
275
+ "3": "['Breathlike sound', 'Noisy sound', 'Soundscape', 'Drone', 'Sub', 'Scratch', 'Noise']",
276
+ "30": "['Combination or harmonic sounds', 'Vocal', 'Choral']",
277
+ "300": "[]",
278
+ "31": "['Composite or Stratified sound', 'Chop', 'Synthesis', 'Textural']",
279
+ "32": "['Composite or Stratified sound', 'Chop', 'Synthesis']",
280
+ "33": "['Composite or Stratified sound', 'Combination or harmonic sounds', 'Soundscape']",
281
+ "34": "['Composite or Stratified sound', 'Drone', 'Synthesis', 'Textural']",
282
+ "35": "['Composite or Stratified sound', 'Drone', 'Synthesis']",
283
+ "36": "['Composite or Stratified sound', 'Drone']",
284
+ "37": "['Composite or Stratified sound', 'Glitch', 'Percussive']",
285
+ "38": "['Composite or Stratified sound', 'Glitch', 'Synthesis', 'Chirp']",
286
+ "39": "['Composite or Stratified sound', 'Glitch']",
287
+ "4": "['Breathlike sound', 'Sub', 'Synthesis']",
288
+ "40": "['Composite or Stratified sound', 'Instrumental']",
289
+ "41": "['Composite or Stratified sound', 'Soundscape', 'Crackle', 'Noise']",
290
+ "42": "['Composite or Stratified sound', 'Soundscape', 'Drone', 'Vocal', 'Noise', 'Instrumental', 'Chirp', 'Percussive']",
291
+ "43": "['Composite or Stratified sound', 'Soundscape', 'Glitch']",
292
+ "44": "['Composite or Stratified sound', 'Soundscape', 'Synthesis', 'Textural']",
293
+ "45": "['Composite or Stratified sound', 'Sub', 'Impact', 'Synthesis', 'Percussive']",
294
+ "46": "['Composite or Stratified sound', 'Sub', 'Noise']",
295
+ "47": "['Composite or Stratified sound', 'Synthesis', 'Textural']",
296
+ "48": "['Composite or Stratified sound', 'Synthesis']",
297
+ "49": "['Composite or Stratified sound', 'Vocal', 'Percussive']",
298
+ "5": "['Breathlike sound', 'Vocal', 'Crackle', 'Noise']",
299
+ "50": "['Composite or Stratified sound']",
300
+ "51": "['Crackle', 'Textural']",
301
+ "52": "['Crackle']",
302
+ "53": "['Drone', 'Crackle', 'Noise']",
303
+ "54": "['Drone', 'Instrumental']",
304
+ "55": "['Drone', 'Synthesis', 'Textural']",
305
+ "56": "['Drone', 'Synthesis']",
306
+ "57": "['Drone']",
307
+ "58": "['Glitch', 'Noise']",
308
+ "59": "['Glitch', 'Synthesis', 'Textural']",
309
+ "6": "['Chop', 'Glitch', 'Noise']",
310
+ "60": "['Glitch', 'Synthesis']",
311
+ "61": "['Harmonic sound', 'Breathlike sound', 'Noisy sound']",
312
+ "62": "['Harmonic sound', 'Breathlike sound', 'Soundscape']",
313
+ "63": "['Harmonic sound', 'Breathlike sound', 'Synthesis']",
314
+ "64": "['Harmonic sound', 'Breathlike sound', 'Textural', 'Instrumental']",
315
+ "65": "['Harmonic sound', 'Chirp']",
316
+ "66": "['Harmonic sound', 'Cluster sound', 'Glitch', 'Impact']",
317
+ "67": "['Harmonic sound', 'Combination or harmonic sounds', 'Drone', 'Synthesis', 'Choral']",
318
+ "68": "['Harmonic sound', 'Combination or harmonic sounds', 'Drone', 'Textural']",
319
+ "69": "['Harmonic sound', 'Combination or harmonic sounds', 'Glitch', 'Synthesis', 'Textural']",
320
+ "7": "['Chop', 'Impact', 'Noise']",
321
+ "70": "['Harmonic sound', 'Combination or harmonic sounds', 'Instrumental', 'Percussive']",
322
+ "71": "['Harmonic sound', 'Combination or harmonic sounds', 'Instrumental']",
323
+ "72": "['Harmonic sound', 'Combination or harmonic sounds', 'Synthesis', 'Textural', 'Choral']",
324
+ "73": "['Harmonic sound', 'Combination or harmonic sounds', 'Synthesis', 'Textural', 'Instrumental']",
325
+ "74": "['Harmonic sound', 'Combination or harmonic sounds', 'Synthesis']",
326
+ "75": "['Harmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Drone', 'Synthesis', 'Textural']",
327
+ "76": "['Harmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Drone', 'Textural']",
328
+ "77": "['Harmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Glitch', 'Stab (attack resonance)', 'Noise', 'Textural']",
329
+ "78": "['Harmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Textural', 'Instrumental']",
330
+ "79": "['Harmonic sound', 'Composite or Stratified sound', 'Drone', 'Textural', 'Choral']",
331
+ "8": "['Choral']",
332
+ "80": "['Harmonic sound', 'Composite or Stratified sound', 'Impact', 'Synthesis', 'Textural']",
333
+ "81": "['Harmonic sound', 'Composite or Stratified sound', 'Instrumental']",
334
+ "82": "['Harmonic sound', 'Composite or Stratified sound', 'Sub', 'Synthesis', 'Noise']",
335
+ "83": "['Harmonic sound', 'Composite or Stratified sound', 'Synthesis', 'Textural']",
336
+ "84": "['Harmonic sound', 'Composite or Stratified sound', 'Synthesis']",
337
+ "85": "['Harmonic sound', 'Composite or Stratified sound', 'Textural', 'Instrumental']",
338
+ "86": "['Harmonic sound', 'Composite or Stratified sound', 'Vocal', 'Instrumental']",
339
+ "87": "['Harmonic sound', 'Drone', 'Noise', 'Instrumental']",
340
+ "88": "['Harmonic sound', 'Drone', 'Textural', 'Instrumental']",
341
+ "89": "['Harmonic sound', 'Drone', 'Textural']",
342
+ "9": "['Cluster sound', 'Breathlike sound', 'Composite or Stratified sound', 'Impact']",
343
+ "90": "['Harmonic sound', 'Drone', 'Vocal', 'Choral']",
344
+ "91": "['Harmonic sound', 'Glitch', 'Impact', 'Synthesis']",
345
+ "92": "['Harmonic sound', 'Glitch', 'Noise', 'Percussive']",
346
+ "93": "['Harmonic sound', 'Glitch', 'Noise']",
347
+ "94": "['Harmonic sound', 'Glitch', 'Synthesis', 'Percussive']",
348
+ "95": "['Harmonic sound', 'Glitch', 'Synthesis']",
349
+ "96": "['Harmonic sound', 'Glitch']",
350
+ "97": "['Harmonic sound', 'Inharmonic sound', 'Cluster sound', 'Combination or harmonic sounds', 'Synthesis']",
351
+ "98": "['Harmonic sound', 'Inharmonic sound', 'Cluster sound', 'Drone', 'Glitch', 'Noise']",
352
+ "99": "['Harmonic sound', 'Inharmonic sound', 'Combination or harmonic sounds', 'Synthesis', 'Textural', 'Percussive']"
353
+ },
354
+ "initializer_range": 0.02,
355
+ "intermediate_size": 3072,
356
+ "label2id": {
357
+ "['Breathlike sound', 'Drone']": "0",
358
+ "['Breathlike sound', 'Noisy sound', 'Noise', 'Textural', 'Instrumental']": "1",
359
+ "['Breathlike sound', 'Noisy sound', 'Noise', 'Textural']": "2",
360
+ "['Breathlike sound', 'Noisy sound', 'Soundscape', 'Drone', 'Sub', 'Scratch', 'Noise']": "3",
361
+ "['Breathlike sound', 'Sub', 'Synthesis']": "4",
362
+ "['Breathlike sound', 'Vocal', 'Crackle', 'Noise']": "5",
363
+ "['Chop', 'Glitch', 'Noise']": "6",
364
+ "['Chop', 'Impact', 'Noise']": "7",
365
+ "['Choral']": "8",
366
+ "['Cluster sound', 'Breathlike sound', 'Composite or Stratified sound', 'Impact']": "9",
367
+ "['Cluster sound', 'Chop', 'Glitch']": "10",
368
+ "['Cluster sound', 'Chop', 'Impact', 'Synthesis']": "11",
369
+ "['Cluster sound', 'Composite or Stratified sound', 'Percussive']": "12",
370
+ "['Cluster sound', 'Drone']": "13",
371
+ "['Cluster sound', 'Glitch', 'Crackle']": "14",
372
+ "['Cluster sound', 'Glitch', 'Synthesis', 'Textural']": "15",
373
+ "['Cluster sound', 'Glitch']": "16",
374
+ "['Cluster sound', 'Impact', 'Stab (attack resonance)']": "17",
375
+ "['Cluster sound', 'Impact', 'Synthesis', 'Percussive']": "18",
376
+ "['Cluster sound', 'Impact', 'Synthesis', 'Vocal']": "19",
377
+ "['Cluster sound', 'Impact']": "20",
378
+ "['Cluster sound', 'Noisy sound', 'Drone', 'Noise']": "21",
379
+ "['Cluster sound', 'Noisy sound', 'Glitch', 'Textural']": "22",
380
+ "['Cluster sound', 'Noisy sound', 'Synthesis']": "23",
381
+ "['Cluster sound', 'Sub']": "24",
382
+ "['Cluster sound', 'Synthesis']": "25",
383
+ "['Combination or harmonic sounds', 'Instrumental']": "26",
384
+ "['Combination or harmonic sounds', 'Soundscape', 'Instrumental']": "27",
385
+ "['Combination or harmonic sounds', 'Synthesis']": "28",
386
+ "['Combination or harmonic sounds', 'Textural']": "29",
387
+ "['Combination or harmonic sounds', 'Vocal', 'Choral']": "30",
388
+ "['Composite or Stratified sound', 'Chop', 'Synthesis', 'Textural']": "31",
389
+ "['Composite or Stratified sound', 'Chop', 'Synthesis']": "32",
390
+ "['Composite or Stratified sound', 'Combination or harmonic sounds', 'Soundscape']": "33",
391
+ "['Composite or Stratified sound', 'Drone', 'Synthesis', 'Textural']": "34",
392
+ "['Composite or Stratified sound', 'Drone', 'Synthesis']": "35",
393
+ "['Composite or Stratified sound', 'Drone']": "36",
394
+ "['Composite or Stratified sound', 'Glitch', 'Percussive']": "37",
395
+ "['Composite or Stratified sound', 'Glitch', 'Synthesis', 'Chirp']": "38",
396
+ "['Composite or Stratified sound', 'Glitch']": "39",
397
+ "['Composite or Stratified sound', 'Instrumental']": "40",
398
+ "['Composite or Stratified sound', 'Soundscape', 'Crackle', 'Noise']": "41",
399
+ "['Composite or Stratified sound', 'Soundscape', 'Drone', 'Vocal', 'Noise', 'Instrumental', 'Chirp', 'Percussive']": "42",
400
+ "['Composite or Stratified sound', 'Soundscape', 'Glitch']": "43",
401
+ "['Composite or Stratified sound', 'Soundscape', 'Synthesis', 'Textural']": "44",
402
+ "['Composite or Stratified sound', 'Sub', 'Impact', 'Synthesis', 'Percussive']": "45",
403
+ "['Composite or Stratified sound', 'Sub', 'Noise']": "46",
404
+ "['Composite or Stratified sound', 'Synthesis', 'Textural']": "47",
405
+ "['Composite or Stratified sound', 'Synthesis']": "48",
406
+ "['Composite or Stratified sound', 'Vocal', 'Percussive']": "49",
407
+ "['Composite or Stratified sound']": "50",
408
+ "['Crackle', 'Textural']": "51",
409
+ "['Crackle']": "52",
410
+ "['Drone', 'Crackle', 'Noise']": "53",
411
+ "['Drone', 'Instrumental']": "54",
412
+ "['Drone', 'Synthesis', 'Textural']": "55",
413
+ "['Drone', 'Synthesis']": "56",
414
+ "['Drone']": "57",
415
+ "['Glitch', 'Noise']": "58",
416
+ "['Glitch', 'Synthesis', 'Textural']": "59",
417
+ "['Glitch', 'Synthesis']": "60",
418
+ "['Harmonic sound', 'Breathlike sound', 'Noisy sound']": "61",
419
+ "['Harmonic sound', 'Breathlike sound', 'Soundscape']": "62",
420
+ "['Harmonic sound', 'Breathlike sound', 'Synthesis']": "63",
421
+ "['Harmonic sound', 'Breathlike sound', 'Textural', 'Instrumental']": "64",
422
+ "['Harmonic sound', 'Chirp']": "65",
423
+ "['Harmonic sound', 'Cluster sound', 'Glitch', 'Impact']": "66",
424
+ "['Harmonic sound', 'Combination or harmonic sounds', 'Drone', 'Synthesis', 'Choral']": "67",
425
+ "['Harmonic sound', 'Combination or harmonic sounds', 'Drone', 'Textural']": "68",
426
+ "['Harmonic sound', 'Combination or harmonic sounds', 'Glitch', 'Synthesis', 'Textural']": "69",
427
+ "['Harmonic sound', 'Combination or harmonic sounds', 'Instrumental', 'Percussive']": "70",
428
+ "['Harmonic sound', 'Combination or harmonic sounds', 'Instrumental']": "71",
429
+ "['Harmonic sound', 'Combination or harmonic sounds', 'Synthesis', 'Textural', 'Choral']": "72",
430
+ "['Harmonic sound', 'Combination or harmonic sounds', 'Synthesis', 'Textural', 'Instrumental']": "73",
431
+ "['Harmonic sound', 'Combination or harmonic sounds', 'Synthesis']": "74",
432
+ "['Harmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Drone', 'Synthesis', 'Textural']": "75",
433
+ "['Harmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Drone', 'Textural']": "76",
434
+ "['Harmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Glitch', 'Stab (attack resonance)', 'Noise', 'Textural']": "77",
435
+ "['Harmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Textural', 'Instrumental']": "78",
436
+ "['Harmonic sound', 'Composite or Stratified sound', 'Drone', 'Textural', 'Choral']": "79",
437
+ "['Harmonic sound', 'Composite or Stratified sound', 'Impact', 'Synthesis', 'Textural']": "80",
438
+ "['Harmonic sound', 'Composite or Stratified sound', 'Instrumental']": "81",
439
+ "['Harmonic sound', 'Composite or Stratified sound', 'Sub', 'Synthesis', 'Noise']": "82",
440
+ "['Harmonic sound', 'Composite or Stratified sound', 'Synthesis', 'Textural']": "83",
441
+ "['Harmonic sound', 'Composite or Stratified sound', 'Synthesis']": "84",
442
+ "['Harmonic sound', 'Composite or Stratified sound', 'Textural', 'Instrumental']": "85",
443
+ "['Harmonic sound', 'Composite or Stratified sound', 'Vocal', 'Instrumental']": "86",
444
+ "['Harmonic sound', 'Drone', 'Noise', 'Instrumental']": "87",
445
+ "['Harmonic sound', 'Drone', 'Textural', 'Instrumental']": "88",
446
+ "['Harmonic sound', 'Drone', 'Textural']": "89",
447
+ "['Harmonic sound', 'Drone', 'Vocal', 'Choral']": "90",
448
+ "['Harmonic sound', 'Glitch', 'Impact', 'Synthesis']": "91",
449
+ "['Harmonic sound', 'Glitch', 'Noise', 'Percussive']": "92",
450
+ "['Harmonic sound', 'Glitch', 'Noise']": "93",
451
+ "['Harmonic sound', 'Glitch', 'Synthesis', 'Percussive']": "94",
452
+ "['Harmonic sound', 'Glitch', 'Synthesis']": "95",
453
+ "['Harmonic sound', 'Glitch']": "96",
454
+ "['Harmonic sound', 'Inharmonic sound', 'Cluster sound', 'Combination or harmonic sounds', 'Synthesis']": "97",
455
+ "['Harmonic sound', 'Inharmonic sound', 'Cluster sound', 'Drone', 'Glitch', 'Noise']": "98",
456
+ "['Harmonic sound', 'Inharmonic sound', 'Combination or harmonic sounds', 'Synthesis', 'Textural', 'Percussive']": "99",
457
+ "['Harmonic sound', 'Inharmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Textural']": "100",
458
+ "['Harmonic sound', 'Inharmonic sound', 'Instrumental']": "101",
459
+ "['Harmonic sound', 'Inharmonic sound', 'Noise', 'Textural', 'Instrumental']": "102",
460
+ "['Harmonic sound', 'Inharmonic sound', 'Noisy sound', 'Drone', 'Stab (attack resonance)', 'Noise', 'Instrumental']": "103",
461
+ "['Harmonic sound', 'Inharmonic sound', 'Noisy sound', 'Drone']": "104",
462
+ "['Harmonic sound', 'Inharmonic sound', 'Soundscape', 'Glitch', 'Instrumental']": "105",
463
+ "['Harmonic sound', 'Inharmonic sound', 'Soundscape']": "106",
464
+ "['Harmonic sound', 'Instrumental', 'Chirp']": "107",
465
+ "['Harmonic sound', 'Instrumental']": "108",
466
+ "['Harmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Synthesis', 'Honk']": "109",
467
+ "['Harmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Drone', 'Glitch', 'Synthesis', 'Noise', 'Percussive']": "110",
468
+ "['Harmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Noise', 'Textural', 'Instrumental']": "111",
469
+ "['Harmonic sound', 'Noisy sound', 'Drone', 'Synthesis', 'Noise']": "112",
470
+ "['Harmonic sound', 'Noisy sound', 'Drone', 'Synthesis']": "113",
471
+ "['Harmonic sound', 'Noisy sound', 'Glitch']": "114",
472
+ "['Harmonic sound', 'Noisy sound', 'Instrumental']": "115",
473
+ "['Harmonic sound', 'Noisy sound', 'Soundscape', 'Synthesis', 'Noise']": "116",
474
+ "['Harmonic sound', 'Noisy sound', 'Synthesis', 'Noise']": "117",
475
+ "['Harmonic sound', 'Soundscape', 'Vocal', 'Choral']": "118",
476
+ "['Harmonic sound', 'Stab (attack resonance)', 'Instrumental', 'Percussive']": "119",
477
+ "['Harmonic sound', 'Stab (attack resonance)', 'Instrumental']": "120",
478
+ "['Harmonic sound', 'Stab (attack resonance)', 'Noise']": "121",
479
+ "['Harmonic sound', 'Stab (attack resonance)', 'Percussive']": "122",
480
+ "['Harmonic sound', 'Sub', 'Synthesis']": "123",
481
+ "['Harmonic sound', 'Synthesis', 'Instrumental', 'Percussive']": "124",
482
+ "['Harmonic sound', 'Synthesis', 'Instrumental']": "125",
483
+ "['Harmonic sound', 'Synthesis', 'Textural']": "126",
484
+ "['Harmonic sound', 'Synthesis']": "127",
485
+ "['Harmonic sound', 'Textural', 'Instrumental']": "128",
486
+ "['Harmonic sound', 'Textural']": "129",
487
+ "['Harmonic sound', 'Vocal', 'Textural', 'Choral']": "130",
488
+ "['Harmonic sound']": "131",
489
+ "['Impact', 'Stab (attack resonance)', 'Synthesis', 'Percussive']": "132",
490
+ "['Impact', 'Synthesis']": "133",
491
+ "['Inharmonic sound', 'Breathlike sound', 'Composite or Stratified sound', 'Vocal', 'Textural', 'Percussive']": "134",
492
+ "['Inharmonic sound', 'Breathlike sound', 'Noisy sound', 'Composite or Stratified sound', 'Soundscape', 'Impact', 'Stab (attack resonance)', 'Synthesis', 'Noise']": "135",
493
+ "['Inharmonic sound', 'Breathlike sound', 'Noisy sound', 'Noise', 'Percussive']": "136",
494
+ "['Inharmonic sound', 'Breathlike sound', 'Noisy sound', 'Vocal', 'Textural', 'Percussive']": "137",
495
+ "['Inharmonic sound', 'Breathlike sound', 'Soundscape', 'Crackle']": "138",
496
+ "['Inharmonic sound', 'Breathlike sound', 'Soundscape', 'Glitch', 'Crackle', 'Noise', 'Textural']": "139",
497
+ "['Inharmonic sound', 'Chirp']": "140",
498
+ "['Inharmonic sound', 'Chop', 'Glitch', 'Scratch', 'Crackle']": "141",
499
+ "['Inharmonic sound', 'Cluster sound', 'Crackle', 'Percussive']": "142",
500
+ "['Inharmonic sound', 'Cluster sound', 'Glitch', 'Noise']": "143",
501
+ "['Inharmonic sound', 'Cluster sound', 'Glitch', 'Stab (attack resonance)']": "144",
502
+ "['Inharmonic sound', 'Cluster sound', 'Glitch', 'Synthesis', 'Percussive']": "145",
503
+ "['Inharmonic sound', 'Cluster sound', 'Impact']": "146",
504
+ "['Inharmonic sound', 'Cluster sound', 'Noisy sound', 'Chop', 'Glitch', 'Percussive']": "147",
505
+ "['Inharmonic sound', 'Combination or harmonic sounds', 'Drone', 'Synthesis']": "148",
506
+ "['Inharmonic sound', 'Combination or harmonic sounds', 'Synthesis', 'Instrumental']": "149",
507
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Drone', 'Textural']": "150",
508
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Impact', 'Synthesis', 'Percussive']": "151",
509
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Soundscape', 'Synthesis']": "152",
510
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Synthesis', 'Scratch', 'Textural']": "153",
511
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Drone', 'Synthesis', 'Crackle', 'Noise']": "154",
512
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Drone', 'Synthesis', 'Crackle']": "155",
513
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Drone', 'Synthesis']": "156",
514
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Glitch', 'Impact', 'Synthesis', 'Percussive']": "157",
515
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Glitch', 'Synthesis']": "158",
516
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Scratch', 'Instrumental']": "159",
517
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Soundscape', 'Synthesis', 'Crackle', 'Textural']": "160",
518
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Soundscape']": "161",
519
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Synthesis', 'Crackle']": "162",
520
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Synthesis', 'Textural']": "163",
521
+ "['Inharmonic sound', 'Composite or Stratified sound', 'Vocal', 'Noise', 'Textural']": "164",
522
+ "['Inharmonic sound', 'Crackle', 'Noise']": "165",
523
+ "['Inharmonic sound', 'Crackle', 'Percussive']": "166",
524
+ "['Inharmonic sound', 'Crackle']": "167",
525
+ "['Inharmonic sound', 'Drone', 'Noise']": "168",
526
+ "['Inharmonic sound', 'Drone', 'Sub']": "169",
527
+ "['Inharmonic sound', 'Drone', 'Synthesis', 'Noise']": "170",
528
+ "['Inharmonic sound', 'Drone', 'Synthesis', 'Textural']": "171",
529
+ "['Inharmonic sound', 'Drone', 'Synthesis']": "172",
530
+ "['Inharmonic sound', 'Drone']": "173",
531
+ "['Inharmonic sound', 'Glitch', 'Noise', 'Percussive']": "174",
532
+ "['Inharmonic sound', 'Glitch', 'Percussive']": "175",
533
+ "['Inharmonic sound', 'Glitch', 'Synthesis', 'Chirp', 'Percussive']": "176",
534
+ "['Inharmonic sound', 'Glitch', 'Synthesis', 'Noise', 'Percussive']": "177",
535
+ "['Inharmonic sound', 'Glitch', 'Synthesis', 'Textural']": "178",
536
+ "['Inharmonic sound', 'Glitch', 'Synthesis']": "179",
537
+ "['Inharmonic sound', 'Glitch', 'Vocal']": "180",
538
+ "['Inharmonic sound', 'Glitch']": "181",
539
+ "['Inharmonic sound', 'Impact']": "182",
540
+ "['Inharmonic sound', 'Instrumental', 'Percussive']": "183",
541
+ "['Inharmonic sound', 'Instrumental']": "184",
542
+ "['Inharmonic sound', 'Noise']": "185",
543
+ "['Inharmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Chop', 'Glitch', 'Vocal', 'Crackle', 'Noise', 'Textural']": "186",
544
+ "['Inharmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Drone', 'Glitch', 'Synthesis', 'Noise']": "187",
545
+ "['Inharmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Glitch', 'Synthesis', 'Crackle', 'Noise']": "188",
546
+ "['Inharmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Percussive']": "189",
547
+ "['Inharmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Soundscape', 'Crackle', 'Noise', 'Textural']": "190",
548
+ "['Inharmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Synthesis']": "191",
549
+ "['Inharmonic sound', 'Noisy sound', 'Crackle', 'Noise']": "192",
550
+ "['Inharmonic sound', 'Noisy sound', 'Crackle', 'Textural']": "193",
551
+ "['Inharmonic sound', 'Noisy sound', 'Crackle']": "194",
552
+ "['Inharmonic sound', 'Noisy sound', 'Drone', 'Sub', 'Glitch', 'Noise', 'Textural', 'Instrumental']": "195",
553
+ "['Inharmonic sound', 'Noisy sound', 'Glitch', 'Noise']": "196",
554
+ "['Inharmonic sound', 'Noisy sound', 'Glitch', 'Synthesis', 'Noise', 'Percussive']": "197",
555
+ "['Inharmonic sound', 'Noisy sound', 'Glitch', 'Synthesis', 'Noise']": "198",
556
+ "['Inharmonic sound', 'Noisy sound', 'Glitch']": "199",
557
+ "['Inharmonic sound', 'Noisy sound', 'Impact', 'Synthesis', 'Noise', 'Percussive']": "200",
558
+ "['Inharmonic sound', 'Noisy sound', 'Impact', 'Textural']": "201",
559
+ "['Inharmonic sound', 'Noisy sound', 'Noise', 'Textural']": "202",
560
+ "['Inharmonic sound', 'Noisy sound', 'Noise']": "203",
561
+ "['Inharmonic sound', 'Noisy sound', 'Percussive']": "204",
562
+ "['Inharmonic sound', 'Noisy sound', 'Scratch', 'Crackle', 'Noise', 'Chirp']": "205",
563
+ "['Inharmonic sound', 'Noisy sound', 'Scratch', 'Crackle', 'Noise', 'Instrumental']": "206",
564
+ "['Inharmonic sound', 'Noisy sound', 'Sub', 'Glitch', 'Impact', 'Noise']": "207",
565
+ "['Inharmonic sound', 'Noisy sound', 'Sub', 'Synthesis', 'Noise']": "208",
566
+ "['Inharmonic sound', 'Noisy sound', 'Synthesis', 'Noise']": "209",
567
+ "['Inharmonic sound', 'Noisy sound', 'Vocal']": "210",
568
+ "['Inharmonic sound', 'Noisy sound']": "211",
569
+ "['Inharmonic sound', 'Percussive']": "212",
570
+ "['Inharmonic sound', 'Soundscape', 'Textural']": "213",
571
+ "['Inharmonic sound', 'Soundscape']": "214",
572
+ "['Inharmonic sound', 'Stab (attack resonance)', 'Instrumental']": "215",
573
+ "['Inharmonic sound', 'Sub', 'Textural']": "216",
574
+ "['Inharmonic sound', 'Synthesis', 'Chirp']": "217",
575
+ "['Inharmonic sound', 'Synthesis', 'Noise', 'Percussive']": "218",
576
+ "['Inharmonic sound', 'Synthesis', 'Noise']": "219",
577
+ "['Inharmonic sound', 'Synthesis', 'Percussive']": "220",
578
+ "['Inharmonic sound', 'Synthesis', 'Textural']": "221",
579
+ "['Inharmonic sound', 'Synthesis']": "222",
580
+ "['Inharmonic sound', 'Textural', 'Instrumental']": "223",
581
+ "['Inharmonic sound', 'Textural']": "224",
582
+ "['Inharmonic sound', 'Vocal']": "225",
583
+ "['Inharmonic sound']": "226",
584
+ "['Instrumental', 'Percussive']": "227",
585
+ "['Instrumental']": "228",
586
+ "['Noise']": "229",
587
+ "['Noisy sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Instrumental']": "230",
588
+ "['Noisy sound', 'Composite or Stratified sound', 'Drone', 'Glitch', 'Noise', 'Textural']": "231",
589
+ "['Noisy sound', 'Composite or Stratified sound', 'Drone', 'Synthesis', 'Noise']": "232",
590
+ "['Noisy sound', 'Composite or Stratified sound', 'Glitch', 'Noise', 'Percussive']": "233",
591
+ "['Noisy sound', 'Composite or Stratified sound', 'Glitch', 'Scratch', 'Crackle', 'Noise', 'Textural']": "234",
592
+ "['Noisy sound', 'Composite or Stratified sound', 'Soundscape', 'Crackle']": "235",
593
+ "['Noisy sound', 'Composite or Stratified sound', 'Soundscape', 'Drone', 'Crackle', 'Noise', 'Textural']": "236",
594
+ "['Noisy sound', 'Composite or Stratified sound', 'Soundscape', 'Glitch', 'Noise', 'Textural']": "237",
595
+ "['Noisy sound', 'Composite or Stratified sound', 'Soundscape', 'Impact', 'Stab (attack resonance)']": "238",
596
+ "['Noisy sound', 'Composite or Stratified sound', 'Soundscape', 'Percussive']": "239",
597
+ "['Noisy sound', 'Composite or Stratified sound', 'Sub', 'Glitch', 'Impact', 'Synthesis']": "240",
598
+ "['Noisy sound', 'Composite or Stratified sound', 'Synthesis', 'Noise']": "241",
599
+ "['Noisy sound', 'Composite or Stratified sound']": "242",
600
+ "['Noisy sound', 'Crackle', 'Instrumental']": "243",
601
+ "['Noisy sound', 'Crackle', 'Noise', 'Textural']": "244",
602
+ "['Noisy sound', 'Crackle']": "245",
603
+ "['Noisy sound', 'Drone', 'Honk']": "246",
604
+ "['Noisy sound', 'Drone']": "247",
605
+ "['Noisy sound', 'Glitch', 'Crackle', 'Noise', 'Textural']": "248",
606
+ "['Noisy sound', 'Glitch', 'Noise', 'Percussive']": "249",
607
+ "['Noisy sound', 'Glitch', 'Noise', 'Textural', 'Percussive']": "250",
608
+ "['Noisy sound', 'Glitch', 'Noise', 'Textural']": "251",
609
+ "['Noisy sound', 'Glitch', 'Noise']": "252",
610
+ "['Noisy sound', 'Glitch', 'Synthesis', 'Noise']": "253",
611
+ "['Noisy sound', 'Glitch']": "254",
612
+ "['Noisy sound', 'Impact']": "255",
613
+ "['Noisy sound', 'Noise']": "256",
614
+ "['Noisy sound', 'Percussive']": "257",
615
+ "['Noisy sound', 'Scratch', 'Crackle', 'Noise']": "258",
616
+ "['Noisy sound', 'Soundscape', 'Drone', 'Glitch', 'Synthesis']": "259",
617
+ "['Noisy sound', 'Soundscape', 'Noise', 'Textural']": "260",
618
+ "['Noisy sound', 'Soundscape', 'Synthesis', 'Noise', 'Textural']": "261",
619
+ "['Noisy sound', 'Soundscape', 'Textural']": "262",
620
+ "['Noisy sound', 'Sub', 'Glitch', 'Impact', 'Percussive']": "263",
621
+ "['Noisy sound', 'Sub', 'Glitch', 'Impact', 'Synthesis', 'Percussive']": "264",
622
+ "['Noisy sound', 'Sub', 'Glitch', 'Synthesis', 'Noise']": "265",
623
+ "['Noisy sound', 'Sub', 'Glitch', 'Synthesis']": "266",
624
+ "['Noisy sound', 'Synthesis', 'Noise', 'Textural']": "267",
625
+ "['Percussive']": "268",
626
+ "['Sinusoidal sound', 'Harmonic sound', 'Inharmonic sound', 'Cluster sound', 'Breathlike sound', 'Noisy sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Soundscape', 'Drone', 'Chop', 'Sub', 'Glitch', 'Impact', 'Stab (attack resonance)', 'Synthesis', 'Vocal', 'Scratch', 'Crackle', 'Noise', 'Textural', 'Instrumental', 'Chirp', 'Percussive', 'Honk', 'Choral']": "269",
627
+ "['Sinusoidal sound', 'Harmonic sound', 'Inharmonic sound', 'Cluster sound', 'Breathlike sound', 'Noisy sound', 'Composite or Stratified sound', 'Combination or harmonic sounds']": "270",
628
+ "['Sinusoidal sound', 'Harmonic sound', 'Inharmonic sound', 'Percussive']": "271",
629
+ "['Sinusoidal sound', 'Harmonic sound', 'Noisy sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Drone', 'Glitch', 'Synthesis', 'Crackle', 'Textural']": "272",
630
+ "['Sinusoidal sound', 'Harmonic sound', 'Noisy sound', 'Synthesis']": "273",
631
+ "['Sinusoidal sound', 'Harmonic sound', 'Sub', 'Stab (attack resonance)']": "274",
632
+ "['Sinusoidal sound', 'Inharmonic sound', 'Cluster sound', 'Breathlike sound', 'Noisy sound', 'Composite or Stratified sound']": "275",
633
+ "['Sinusoidal sound', 'Inharmonic sound', 'Cluster sound', 'Composite or Stratified sound', 'Chop', 'Synthesis']": "276",
634
+ "['Sinusoidal sound', 'Inharmonic sound', 'Cluster sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Drone', 'Synthesis', 'Choral']": "277",
635
+ "['Sinusoidal sound', 'Inharmonic sound', 'Cluster sound', 'Synthesis']": "278",
636
+ "['Sinusoidal sound', 'Inharmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Soundscape', 'Synthesis']": "279",
637
+ "['Sinusoidal sound', 'Inharmonic sound', 'Composite or Stratified sound', 'Combination or harmonic sounds', 'Synthesis']": "280",
638
+ "['Sinusoidal sound', 'Inharmonic sound', 'Sub']": "281",
639
+ "['Sinusoidal sound', 'Noisy sound', 'Percussive']": "282",
640
+ "['Sinusoidal sound', 'Synthesis', 'Chirp']": "283",
641
+ "['Sinusoidal sound', 'Synthesis']": "284",
642
+ "['Soundscape', 'Chop', 'Sub', 'Stab (attack resonance)', 'Instrumental']": "285",
643
+ "['Soundscape', 'Drone', 'Chop', 'Sub', 'Glitch', 'Impact', 'Stab (attack resonance)', 'Synthesis', 'Vocal', 'Scratch', 'Crackle', 'Noise', 'Textural', 'Instrumental', 'Chirp', 'Percussive', 'Honk', 'Choral']": "286",
644
+ "['Soundscape', 'Noise', 'Textural']": "287",
645
+ "['Soundscape', 'Synthesis', 'Textural']": "288",
646
+ "['Soundscape']": "289",
647
+ "['Stab (attack resonance)', 'Synthesis', 'Honk']": "290",
648
+ "['Sub', 'Glitch', 'Impact', 'Synthesis', 'Noise']": "291",
649
+ "['Sub', 'Textural']": "292",
650
+ "['Synthesis', 'Textural']": "293",
651
+ "['Synthesis']": "294",
652
+ "['Textural']": "295",
653
+ "['Vocal', 'Choral']": "296",
654
+ "['Vocal', 'Instrumental', 'Percussive']": "297",
655
+ "['Vocal', 'Percussive']": "298",
656
+ "['Vocal']": "299",
657
+ "[]": "300"
658
+ },
659
+ "layer_norm_eps": 1e-05,
660
+ "layerdrop": 0.0,
661
+ "mask_feature_length": 10,
662
+ "mask_feature_min_masks": 0,
663
+ "mask_feature_prob": 0.0,
664
+ "mask_time_length": 10,
665
+ "mask_time_min_masks": 2,
666
+ "mask_time_prob": 0.05,
667
+ "model_type": "hubert",
668
+ "num_attention_heads": 12,
669
+ "num_conv_pos_embedding_groups": 16,
670
+ "num_conv_pos_embeddings": 128,
671
+ "num_feat_extract_layers": 7,
672
+ "num_hidden_layers": 2,
673
+ "pad_token_id": 0,
674
+ "torch_dtype": "float32",
675
+ "transformers_version": "4.36.2",
676
+ "use_weighted_layer_sum": false,
677
+ "vocab_size": 32
678
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e609f6bf3e71b16f96821e00e016dc17eb9938b4d3c6ff294dff69a91ab3b721
3
+ size 95070892
preprocessor_config.json ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "do_normalize": true,
3
+ "feature_extractor_type": "Wav2Vec2FeatureExtractor",
4
+ "feature_size": 1,
5
+ "padding_side": "right",
6
+ "padding_value": 0,
7
+ "return_attention_mask": true,
8
+ "sampling_rate": 16000
9
+ }
runs/Dec24_11-15-51_52d39fffe3b5/events.out.tfevents.1703416555.52d39fffe3b5.1467.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c41faddfd397c696ec62b7b93ee36e2e9aa01c3aac65d2724b0e2ea13fcdf0f1
3
+ size 73488
runs/Dec24_11-15-51_52d39fffe3b5/events.out.tfevents.1703417959.52d39fffe3b5.1467.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0f1d154ce4d1b20ed85a8b618fe3d43a10cc5118cef6d3f96a2f88adc5253601
3
+ size 411
training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:061eaa6d6db4cd6a466c97cfe2882c5202a9843fec3342ea9b5afd135a70a924
3
+ size 4728