kamanphoebe commited on
Commit
62596dc
1 Parent(s): 4673252

Add README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -0
README.md ADDED
@@ -0,0 +1,29 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # m2mKD
2
+
3
+ This repository contains the checkpoints for [m2mKD: Module-to-Module Knowledge Distillation for Modular Transformers](https://arxiv.org/abs/2402.16918).
4
+
5
+ ## Released checkpoints
6
+
7
+ For the usage of the checkpoints listed below, please refer to the instructions provided on our [GitHub repo](https://github.com/kamanphoebe/m2mKD).
8
+ - `nac_scale_tinyimnet.pth`/`nac_scale_imnet.pth`: NAC model with a scale-free prior trained using m2mKD.
9
+ - `vmoe_base.pth`: V-MoE-Base model trained using m2mKD.
10
+ - `FT_huge`: a directory containing DeiT-Huge teacher modules for NAC model training.
11
+ - `nac_tinyimnet_students`: a directory containing NAC student modules for Tiny-ImageNet.
12
+
13
+ ## Acknowledgement
14
+
15
+ Our implementation is mainly based on [Deep-Incubation](https://github.com/LeapLabTHU/Deep-Incubation).
16
+
17
+ ## Citation
18
+
19
+ If you use the checkpoints, please cite our paper:
20
+ ```
21
+ @misc{lo2024m2mkd,
22
+ title={m2mKD: Module-to-Module Knowledge Distillation for Modular Transformers},
23
+ author={Ka Man Lo and Yiming Liang and Wenyu Du and Yuantao Fan and Zili Wang and Wenhao Huang and Lei Ma and Jie Fu},
24
+ year={2024},
25
+ eprint={2402.16918},
26
+ archivePrefix={arXiv},
27
+ primaryClass={cs.LG}
28
+ }
29
+ ```