abdullahmeda commited on
Commit
b3c492a
1 Parent(s): b76af8a

Update app.py

Browse files
Files changed (1) hide show
  1. app.py +9 -0
app.py CHANGED
@@ -136,6 +136,15 @@ with gr.Blocks() as demo:
136
  predictions, and is therefore considered to be a better model. The training of LMs is carried out on large-scale text corpora, it can \
137
  be considered that it has learned some common language patterns and text structures. Therefore, PPL can be used to measure how \
138
  well a text conforms to common characteristics.
 
 
 
 
 
 
 
 
 
139
 
140
  ### GLTR: Giant Language Model Test Room
141
  This idea originates from the following paper: arxiv.org/pdf/1906.04043.pdf. It studies 3 tests to compute features of an input text. Their \
 
136
  predictions, and is therefore considered to be a better model. The training of LMs is carried out on large-scale text corpora, it can \
137
  be considered that it has learned some common language patterns and text structures. Therefore, PPL can be used to measure how \
138
  well a text conforms to common characteristics.
139
+
140
+ I used all variants of the open-source GPT-2 model except xl size to compute the PPL (both text-level and sentence-level PPLs) of the \
141
+ collected texts. It is observed that, regardless of whether it is at the text level or the sentence level, the content generated by LLMs \
142
+ have relatively lower PPLs compared to the text written by humans. LLM captured common patterns and structures in the text it was trained on, \
143
+ and is very good at reproducing them. As a result, text generated by LLMs have relatively concentrated low PPLs.
144
+
145
+ Humans have the ability to express themselves in a wide variety of ways, depending on the context, audience, and purpose of the text they are \
146
+ writing. This can include using creative or imaginative elements, such as metaphors, similes, and unique word choices, which can make it more \
147
+ difficult for GPT2 to predict.
148
 
149
  ### GLTR: Giant Language Model Test Room
150
  This idea originates from the following paper: arxiv.org/pdf/1906.04043.pdf. It studies 3 tests to compute features of an input text. Their \