patrickvonplaten commited on
Commit
c1cf658
1 Parent(s): 996c62c

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -3
README.md CHANGED
@@ -1,3 +1,7 @@
1
- ---
2
- pipeline_tag: text2text-generation
3
- ---
 
 
 
 
1
+ # ArXiv-NLP GPT-2 checkpoint
2
+
3
+ This is a GPT-2 small checkpoint for PyTorch. It is the official `gpt2-small` fine-tuned to ArXiv paper on the computational linguistics field.
4
+
5
+ ## Training data
6
+
7
+ This model was trained on a subset of ArXiv papers that were parsed from PDF to txt. The resulting data is made of 80MB of text from the computational linguistics (cs.CL) field.