Marissa committed on
Commit
8170ea6
1 Parent(s): ce151e2

Update README.md

Files changed (1)
  1. README.md +7 -1
README.md CHANGED
@@ -155,7 +155,9 @@ Users (both direct and downstream) should be made aware of the risks, biases and
 
 # Training
 
-This model is the XLM model trained on Wikipedia text in 100 languages. The preprocessing included tokenization and byte-pair-encoding. See the [GitHub repo](https://github.com/facebookresearch/XLM#the-17-and-100-languages) and the [associated paper](https://arxiv.org/pdf/1911.02116.pdf) for further details on the training data and training procedure.
+This model is the XLM model trained on Wikipedia text in 100 languages. The preprocessing included tokenization with byte-pair-encoding. See the [GitHub repo](https://github.com/facebookresearch/XLM#the-17-and-100-languages) and the [associated paper](https://arxiv.org/pdf/1911.02116.pdf) for further details on the training data and training procedure.
+
+[Conneau et al. (2020)](https://arxiv.org/pdf/1911.02116.pdf) report that this model has 16 layers, 1280 hidden states, 16 attention heads, and the dimension of the feed-forward layer is 1520. The vocabulary size is 200k and the total number of parameters is 570M (see Table 7).
 
 # Evaluation
 
@@ -183,6 +185,10 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
 - **Compute Region:** More information needed
 - **Carbon Emitted:** More information needed
 
+# Technical Specifications
+
+[Conneau et al. (2020)](https://arxiv.org/pdf/1911.02116.pdf) report that this model has 16 layers, 1280 hidden states, 16 attention heads, and the dimension of the feed-forward layer is 1520. The vocabulary size is 200k and the total number of parameters is 570M (see Table 7).
+
 # Citation
 
 **BibTeX:**
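
For readers who want to sanity-check the figures added in the new Technical Specifications section, here is a minimal sketch using the Hugging Face `transformers` library. It is not part of the model card itself: the checkpoint name `xlm-mlm-100-1280` is an assumption based on the description above (XLM, masked language modeling, 100 languages, 1280 hidden states), and the attribute names are those of the `XLMConfig` class.

```python
# Minimal sketch (not from the model card): load the checkpoint and compare
# its configuration with the values quoted in the Technical Specifications.
# Assumption: the card describes the "xlm-mlm-100-1280" checkpoint.
from transformers import AutoConfig, AutoModel, AutoTokenizer

model_id = "xlm-mlm-100-1280"  # assumed checkpoint name

# Values reported in the card: 16 layers, 1280 hidden states,
# 16 attention heads, 200k vocabulary.
config = AutoConfig.from_pretrained(model_id)
print(config.n_layers, config.emb_dim, config.n_heads, config.vocab_size)

# BPE tokenization followed by a forward pass through the encoder.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

inputs = tokenizer("Hello, world!", return_tensors="pt")
outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (batch size, sequence length, 1280)
```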