princeton-nlp
/

efficient_mlm_m0.40-801010

Model card Files Files and versions Community

efficient_mlm_m0.40-801010 / README.md

princeton-nlp's picture

Create README.md

df4bf41 over 2 years ago

|

history blame contribute delete

589 Bytes

	---
	inference: false
	---
	This is a model checkpoint for ["Should You Mask 15% in Masked Language Modeling"](https://arxiv.org/abs/2202.08005) [(code)](https://github.com/princeton-nlp/DinkyTrain.git). We use pre layer norm, which is not supported by HuggingFace. To use our model, go to our [github repo](https://github.com/princeton-nlp/DinkyTrain.git), download our code, and import the RoBERTa class from `huggingface/modeling_roberta_prelayernorm.py`. For example,

	``` bash
	from huggingface.modeling_roberta_prelayernorm import RobertaForMaskedLM, RobertaForSequenceClassification
	```