lorenzoscottb
commited on
Commit
•
6ab9997
1
Parent(s):
8c3c327
Update README.md
Browse files
README.md
CHANGED
@@ -30,7 +30,16 @@ The model is trained to perform a sequence classification task over phrase-level
|
|
30 |
|
31 |
## Intended uses & limitations
|
32 |
|
33 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
34 |
|
35 |
## Training and evaluation data
|
36 |
|
|
|
30 |
|
31 |
## Intended uses & limitations
|
32 |
|
33 |
+
The scope of the model is not to run lexical entailment (i.e., hypernym detection). The model is trained solely to perform a very specific subset of phrase-level entailment, based on adjective-nouns phrases. The type of question you should ask the model are limited, and should have one of three forms:
|
34 |
+
- An adjective+Noun is a Noun (e.g. A red car is a car)
|
35 |
+
- An adjective+Noun is a noun-hypernym (e.g. A red car is a vehicle)
|
36 |
+
- An adjective+Noun is a adjective+noun-hypernym (e.g. A red car is a red vehicle)
|
37 |
+
|
38 |
+
Linguistically speaking, adjectives belong to three macro classes (intersective, subsective, and intensional). From a linguistic and logical stand, these class shape the truth value of the three forms above. For instance, since red is an intersective adjective, the three from are all true. A subjective adjective like small allows just the first two, but not the last – that is, logically speaking, a small car is not a small vehicle.
|
39 |
+
|
40 |
+
In other words, the model was built to study out-of-distribution compositional generalisation with respect to a very specific set of compositional phenomena.
|
41 |
+
|
42 |
+
This poses clear limitations to the question you can ask the model. For instance, if you had to query the model with a basic (false) hypernym detection task (e.g., *A dog is a cat*), the model will consider it as true.
|
43 |
|
44 |
## Training and evaluation data
|
45 |
|