EIStakovskii
commited on
Commit
•
3e62031
1
Parent(s):
b24728d
Update README.md
Browse files
README.md
CHANGED
@@ -18,10 +18,10 @@ widget:
|
|
18 |
|
19 |
license: other
|
20 |
---
|
21 |
-
This model was trained for evaluating linguistic acceptability and grammaticality. The finetuning was carried out based off the camembert-base model
|
22 |
|
23 |
Label_1 means ACCEPTABLE - the sentence is perfectly understandable by native speakers and has no serious grammatic and syntactic flaws.
|
24 |
|
25 |
Label_0 means NOT ACCEPTABLE - the sentence is flawed both orthographically and grammatically.
|
26 |
|
27 |
-
The model was trained on 50 thousand French sentences from the news_commentary dataset
|
|
|
18 |
|
19 |
license: other
|
20 |
---
|
21 |
+
This model was trained for evaluating linguistic acceptability and grammaticality. The finetuning was carried out based off [the camembert-base model](https://huggingface.co/camembert/camembert-base).
|
22 |
|
23 |
Label_1 means ACCEPTABLE - the sentence is perfectly understandable by native speakers and has no serious grammatic and syntactic flaws.
|
24 |
|
25 |
Label_0 means NOT ACCEPTABLE - the sentence is flawed both orthographically and grammatically.
|
26 |
|
27 |
+
The model was trained on 50 thousand French sentences from [the news_commentary dataset](https://huggingface.co/datasets/news_commentary). Out of 50 thousand 25 thousand sentences were algorithmically corrupted using [the open source Python library](https://github.com/eistakovskii/text_corruption_plus). The library was originally developed by [aylliote](https://github.com/aylliote/corruption), but it was slightly adapted for the purposes of this model.
|