Jenthe
/

ECAPA2

Jenthe commited on Oct 16, 2023

Commit

3d8401e

•

1 Parent(s): 807160f

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -85,11 +85,12 @@ feature = ecapa2_model(audio, label='gfe_1|pool|embedding')
 feature = ecapa2_model(audio, label='embedding|gfe_1|pool')
 ```
-The following table describes the available features:
 | Feature ID| Dimension | Description |
 | ----------- | ----------- | ----------- |
-| gfe_1, gfe_2 | 2048 | Mean and variance of frame-level features as indicated in Figure 1, extracted before ReLU and BatchNorm layer.
 | pool | 3072 | Pooled statistics (mean and variance) before the bottleneck speaker embedding layer, extracted before ReLU layer.
 | attention | 3072 | Same as the pooled statistics but with the attention weights applied.
 | embedding | 192 | The standard ECAPA2 speaker embedding.

 feature = ecapa2_model(audio, label='embedding|gfe_1|pool')
 ```
+The following table describes the available features. All features consists of the mean and variance of the frame-level encodings at the indicated layer, expect for the speaker embedding.
 | Feature ID| Dimension | Description |
 | ----------- | ----------- | ----------- |
+| gfe_1 | 2048 | Mean and variance of frame-level features as indicated in Figure 1, extracted before ReLU and BatchNorm layer.
+| gfe_2 | 2048 | Mean and variance of frame-level features as indicated in Figure 1, extracted before ReLU and BatchNorm layer.
 | pool | 3072 | Pooled statistics (mean and variance) before the bottleneck speaker embedding layer, extracted before ReLU layer.
 | attention | 3072 | Same as the pooled statistics but with the attention weights applied.
 | embedding | 192 | The standard ECAPA2 speaker embedding.