Commit 3488326
Parent(s): be8cbf7
spelling mistake (#4)
- spelling mistake (f392e516c77bd6e54febe9b11cd60284ba268338)
Co-authored-by: Rohit Kumar <rohitdavas@users.noreply.huggingface.co>
README.md
CHANGED
@@ -43,7 +43,7 @@ This model can be used for Fill-Mask tasks.
 
 Significant research has explored bias and fairness issues with language models (see, e.g., [Sheng et al. (2021)](https://aclanthology.org/2021.acl-long.330.pdf) and [Bender et al. (2021)](https://dl.acm.org/doi/pdf/10.1145/3442188.3445922)).
 
-This model was
+This model was pretrained on a subcorpus of OSCAR multilingual corpus. Some of the limitations and risks associated with the OSCAR dataset, which are further detailed in the [OSCAR dataset card](https://huggingface.co/datasets/oscar), include the following:
 
 > The quality of some OSCAR sub-corpora might be lower than expected, specifically for the lowest-resource languages.
 