Update README.md
README.md
CHANGED
@@ -309,7 +309,7 @@ The table below contains a more detailed overview of the corpus.
 | 2645 | The Press | CONSERVATIVE | LONDON | 15.702.276 |
 | 2646 | The Star | NONE | LONDON | 163.072.742 |
 | 2647 | The Statesman | RADICAL | LONDON | 61.225.215 |
-
+Table 1: Overview of Newspapers included in the Heritage Made Digital newspaper corpus
 
 Temporally, most of the articles date from the second half of the nineteenth century. The figure below gives an overview of the number of articles by year.
 
@@ -335,6 +335,7 @@ In general, [ERWT-year-masked-25](https://huggingface.co/Livingwithmachines/erwt
 | ERWT-year-masked-75 | 31.02 | 61.41 | 24.63 | 44.40 |
 | PEA | 31.63 | 62.09 | 25.58 | 44.99 |
 | PEA-st | 31.65 | 62.19 | 25.59 | 44.99 |
+Table 2: Mean and standard deviation of pseudo-perplexity scores computed on 1000 fragments of 64 and 128 tokens in length, respectively
 
 
 ## Questions?