Jenthe commited on
Commit
3d8401e
1 Parent(s): 807160f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -85,11 +85,12 @@ feature = ecapa2_model(audio, label='gfe_1|pool|embedding')
85
  feature = ecapa2_model(audio, label='embedding|gfe_1|pool')
86
  ```
87
 
88
- The following table describes the available features:
89
 
90
  | Feature ID| Dimension | Description |
91
  | ----------- | ----------- | ----------- |
92
- | gfe_1, gfe_2 | 2048 | Mean and variance of frame-level features as indicated in Figure 1, extracted before ReLU and BatchNorm layer.
 
93
  | pool | 3072 | Pooled statistics (mean and variance) before the bottleneck speaker embedding layer, extracted before ReLU layer.
94
  | attention | 3072 | Same as the pooled statistics but with the attention weights applied.
95
  | embedding | 192 | The standard ECAPA2 speaker embedding.
 
85
  feature = ecapa2_model(audio, label='embedding|gfe_1|pool')
86
  ```
87
 
88
+ The following table describes the available features. All features consists of the mean and variance of the frame-level encodings at the indicated layer, expect for the speaker embedding.
89
 
90
  | Feature ID| Dimension | Description |
91
  | ----------- | ----------- | ----------- |
92
+ | gfe_1 | 2048 | Mean and variance of frame-level features as indicated in Figure 1, extracted before ReLU and BatchNorm layer.
93
+ | gfe_2 | 2048 | Mean and variance of frame-level features as indicated in Figure 1, extracted before ReLU and BatchNorm layer.
94
  | pool | 3072 | Pooled statistics (mean and variance) before the bottleneck speaker embedding layer, extracted before ReLU layer.
95
  | attention | 3072 | Same as the pooled statistics but with the attention weights applied.
96
  | embedding | 192 | The standard ECAPA2 speaker embedding.