Spaces:

amsterdamNLP
/

contrastive-pairs

Running

App Files Files Community

contrastive-pairs / description.md

Martijn van Beers

Use Blocks instead of Interface

55d104b almost 2 years ago

|

No virus

660 Bytes

	# Detecting stereotypes in the GPT-2 language model using CrowS-Pairs

	GPT-2 is a language model which can score how likely it is that some text is a valid English sentence: not only grammaticality, but also the 'meaning' of the sentence is part of this score. CrowS-Pairs is a dataset with pairs of more and less stereotypical examples for different social groups (e.g., gender and nationality stereotypes). We sample 10 random pairs from CrowS-Pairs and show whether the stereotypical example gets a higher score ('is more likely'). If GPT-2 systematically prefers the stereotypical examples, it has probably learnt these stereotypes from the training data.