suproteem commited on
Commit
83f8db0
1 Parent(s): a352cbb

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +26 -0
README.md ADDED
@@ -0,0 +1,26 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - dell-research-harvard/AmericanStories
5
+ language:
6
+ - en
7
+ ---
8
+
9
+ # StoriesLM: A Family of Language Models With Sequentially-Expanding Pretraining Windows
10
+
11
+ ## Model Family
12
+
13
+ StoriesLM is a family of language models with sequentially-expanding pretraining windows. The pretraining data for the model family comes from the American Stories dataset—a collection of language from historical American news articles. The first language model in the StoriesLM family is pretrained on language data from 1900. Each subsequent language model further trains the previous year’s model checkpoint using data from the following year, up until 1963.
14
+
15
+ ## Dataset
16
+
17
+ The StoriesLM family is pretrained on the American Stories dataset. If you use a model from this family, please also cite the original dataset's authors:
18
+ ```
19
+ @article{dell2024american,
20
+ title={American stories: A large-scale structured text dataset of historical us newspapers},
21
+ author={Dell, Melissa and Carlson, Jacob and Bryan, Tom and Silcock, Emily and Arora, Abhishek and Shen, Zejiang and D'Amico-Wong, Luca and Le, Quan and Querubin, Pablo and Heldring, Leander},
22
+ journal={Advances in Neural Information Processing Systems},
23
+ volume={36},
24
+ year={2024}
25
+ }
26
+ ```