How to use this model directly from the
from transformers import AutoTokenizer, AutoModel tokenizer = AutoTokenizer.from_pretrained("lysandre/arxiv") model = AutoModel.from_pretrained("lysandre/arxiv")
This is a GPT-2 small checkpoint for PyTorch. It is the official
gpt2-small finetuned to ArXiv paper on physics fields.
This model was trained on a subset of ArXiv papers that were parsed from PDF to txt. The resulting data is made of 130MB of text, mostly from quantum physics (quant-ph) and other physics sub-fields.