File size: 391 Bytes
c7a2e68
 
 
4d8f098
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
---
language: en
---
# ArXiv-NLP GPT-2 checkpoint

This is a GPT-2 small checkpoint for PyTorch. It is the official `gpt2-small` fine-tuned to ArXiv paper on the computational linguistics field.

## Training data

This model was trained on a subset of ArXiv papers that were parsed from PDF to txt. The resulting data is made of 80MB of text from the computational linguistics (cs.CL) field.