---
language: en
license: apache-2.0
tags:
  - fill-mask
datasets:
  - wikipedia
  - bookcorpus
---

# 85% Sparse BERT-Large (uncased) Prune OFA

This model is a result of our paper *Prune Once for All: Sparse Pre-Trained Language Models*, presented at the ENLSP NeurIPS Workshop 2021.

For further details on the model and its results, see our paper and our implementation available here.
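
As a quick sanity check, the checkpoint can be loaded like any BERT masked-LM with the `transformers` library. The sketch below is illustrative only: the model ID is an assumption and should be replaced with this checkpoint's actual Hugging Face Hub ID.

```python
from transformers import AutoTokenizer, AutoModelForMaskedLM, pipeline

# Hypothetical hub ID -- substitute the actual ID of this checkpoint.
model_id = "Intel/bert-large-uncased-sparse-85-unstructured-pruneofa"

# The sparse weights load through the standard BERT classes; the sparsity is
# unstructured (zeroed weights), so no special kernels are required.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForMaskedLM.from_pretrained(model_id)

# Fill-mask inference, matching the `fill-mask` tag in the metadata above.
fill_mask = pipeline("fill-mask", model=model, tokenizer=tokenizer)
for candidate in fill_mask("Paris is the [MASK] of France."):
    print(candidate["token_str"], round(candidate["score"], 4))
```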