File size: 319 Bytes
972915a |
1 2 3 4 5 6 7 8 9 |
If you use these models, please cite the following paper: @article{turc2019, title={Well-Read Students Learn Better: On the Importance of Pre-training Compact Models}, author={Turc, Iulia and Chang, Ming-Wei and Lee, Kenton and Toutanova, Kristina}, journal={arXiv preprint arXiv:1908.08962v2 }, year={2019} } |