wzuidema commited on
Commit
7620c8a
1 Parent(s): 99b20e7

Update description.md

Browse files
Files changed (1) hide show
  1. description.md +3 -5
description.md CHANGED
@@ -1,9 +1,7 @@
1
  # Detecting stereotypes in the GPT-2 language model using CrowS-Pairs
2
 
3
- *GPT-2* is a language model which can score how likely it is that some text is a valid English sentence: not only grammaticality, but also the 'meaning' of the sentence is part of this score. *CrowS-Pairs* is a dataset with pairs of more and less stereotypical examples for different social groups (e.g., gender and nationality stereotypes).
4
 
5
- You can either select a CrowS-Pairs bias type from the drop-down below and click `Sample`, and then we
6
- sample 10 random pairs from CrowS-Pairs and show whether the stereotypical example gets
7
- a higher score ('is more likely').
8
 
9
- **If GPT-2 systematically prefers the stereotypical examples, it has probably learnt these stereotypes from the training data.**
 
1
  # Detecting stereotypes in the GPT-2 language model using CrowS-Pairs
2
 
3
+ *GPT-2* is a language model that can score how likely it is that some text is a valid English sentence: not only grammaticality, but also the 'meaning' of the sentence is part of this score. *CrowS-Pairs* is a dataset with pairs of more and less stereotypical examples for different social groups (e.g., gender and nationality stereotypes).
4
 
5
+ Below, you can select a CrowS-Pairs bias type from the drop-down menu, and click `Sample` to sample 10 random pairs from CrowS-Pairs. Alternatively, type your own pair of sentences. The demo shows for each pair of sentences which one receives the higher score ('is more likely').
 
 
6
 
7
+ If a language model systematically prefers more stereotypical examples, this is taken as evidence that has learnt these stereotypes from the training data and shows undesirable bias.