mechanicalsea commited on
Commit
bf864fd
1 Parent(s): 10915d6

update readme and git attr

Browse files
Files changed (2) hide show
  1. .gitattributes +2 -0
  2. README.md +64 -8
.gitattributes CHANGED
@@ -25,3 +25,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
25
  *.zip filter=lfs diff=lfs merge=lfs -text
26
  *.zstandard filter=lfs diff=lfs merge=lfs -text
27
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
25
  *.zip filter=lfs diff=lfs merge=lfs -text
26
  *.zstandard filter=lfs diff=lfs merge=lfs -text
27
  *tfevents* filter=lfs diff=lfs merge=lfs -text
28
+ *.torchparams filter=lfs diff=lfs merge=lfs -text
29
+ *.tar filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,12 +1,68 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  # EfficientTDNN
2
 
3
- Model Version
4
- - Maximum Size: One single model with the maximum size.
5
- - Dynamic Kernel: The model enables various kernel sizes in {1,3,5}.
6
- - Dynamic Depth: The model enables additional various depth in {2,3,4} based on **Dynamic Kernel** version.
7
- - Dynamic Width 1: The model enable additional various width in [0.5, 1.0] based on **Dynamic Depth** version.
8
- - Dynamic Width 2: The model enable additional various width in [0.25, 0.5] based on **Dynamic Width 1** version.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
 
10
- It was introduced by [EfficientTDNN](https://arxiv.org/abs/2103.13581).
 
 
 
 
11
 
12
- TODO upload weights.
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: mit
5
+ tags:
6
+ - embeddings
7
+ - Speaker
8
+ - Verification
9
+ - Identification
10
+ - NAS
11
+ - TDNN
12
+ - pytorch
13
+ datasets:
14
+ - voxceleb1
15
+ - voxceleb2
16
+ metrics:
17
+ - EER
18
+ - minDCF:
19
+ - p_target: 0.01
20
+ ---
21
+
22
+
23
  # EfficientTDNN
24
 
25
+ Model Version are listed as follows.
26
+
27
+ - **Dynamic Kernel**: The model enables various kernel sizes in {1,3,5}, `kernel/kernel.torchparams`.
28
+ - **Dynamic Depth**: The model enables additional various depth in {2,3,4} based on **Dynamic Kernel** version, `depth/depth.torchparams`.
29
+ - **Dynamic Width 1**: The model enable additional various width in [0.5, 1.0] based on **Dynamic Depth** version, `width1/width1.torchparams`.
30
+ - **Dynamic Width 2**: The model enable additional various width in [0.25, 0.5] based on **Dynamic Width 1** version, `width2/width2.torchparams`.
31
+
32
+ Furthermore, some subnets are given in the form of the weights of batchnorm corresponding to their trained supernets as follows.
33
+
34
+ - **Dynamic Kernel**
35
+ 1. `kernel/kernel.max.bn.tar`
36
+ 2. `kernel/kernel.Kmin.bn.tar`
37
+ - **Dynamic Depth**
38
+ 1. `depth/depth.max.bn.tar`
39
+ 2. `depth/depth.Kmin.bn.tar`
40
+ 3. `depth/depth.Dmin.bn.tar`
41
+ 4. `depth/depth.3.512.5.5.3.3.1536.bn.tar`
42
+ 5. `depth/depth.ecapa-tdnn.3.512.512.512.512.5.3.3.3.1536.bn.tar`
43
+ - **Dynamic Width 1**
44
+ 1. `width1/width1.torchparams`
45
+ 2. `width1/width1.max.bn.tar`
46
+ 3. `width1/width1.Kmin.bn.tar`
47
+ 4. `width1/width1.Dmin.bn.tar`
48
+ 5. `width1/width1.C1min.bn.tar`
49
+ 6. `width1/width1.3.383.256.256.256.5.3.3.3.768.bn.tar`
50
+ - **Dynamic Width 2**
51
+ 1. `width2/width2.max.bn.tar`
52
+ 2. `width2/width2.Kmin.bn.tar`
53
+ 3. `width2/width2.Dmin.bn.tar`
54
+ 4. `width2/width2.C1min.bn.tar`
55
+ 5. `width2/width2.C2min.bn.tar`
56
+ 6. `width2/width2.3.384.3.1152.bn.tar`
57
+ 7. `width2/width2.3.256.256.384.384.1.3.5.3.1152.bn.tar`
58
+ 8. `width2/width2.2.256.256.256.3.3.3.400.bn.tar`
59
+
60
+ The tag is described as follows.
61
 
62
+ - max: `(4, [512, 512, 512, 512, 512], [5, 5, 5, 5, 5], 1536)`
63
+ - Kmin: `(4, [512, 512, 512, 512, 512], [1, 1, 1, 1, 1], 1536)`
64
+ - Dmin: `(2, [512, 512, 512], [1, 1, 1], 1536)`
65
+ - C1min: `(2, [256, 256, 256], [1, 1, 1], 768)`
66
+ - C2min: `(2, [128, 128, 128], [1, 1, 1], 384)`
67
 
68
+ More details about EfficentTDNN can be found in the paper [EfficientTDNN](https://arxiv.org/abs/2103.13581).