Skratch99/bert-pretrained

This model was pre-trained using the bert-base-uncased architecture on the train split of wikitext-2-raw-v1 dataset.

References:

@article{DBLP:journals/corr/abs-1810-04805, author = {Jacob Devlin and Ming{-}Wei Chang and Kenton Lee and Kristina Toutanova}, title = {{BERT:} Pre-training of Deep Bidirectional Transformers for Language Understanding}, journal = {CoRR}, volume = {abs/1810.04805}, year = {2018}, url = {http://arxiv.org/abs/1810.04805}, archivePrefix = {arXiv}, eprint = {1810.04805}, timestamp = {Tue, 30 Oct 2018 20:39:56 +0100}, biburl = {https://dblp.org/rec/journals/corr/abs-1810-04805.bib}, bibsource = {dblp computer science bibliography, https://dblp.org} }

Skratch99
/

bert-pretrained

Model tree for Skratch99/bert-pretrained

Dataset used to train Skratch99/bert-pretrained