Transformers
Model: gpt2-large

Frameworks: pytorch, tf

How to use this model directly from the 🤗/transformers library:

			
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2-large")
model = AutoModel.from_pretrained("gpt2-large")

Config

attn_pdrop: 0.1
embd_pdrop: 0.1
finetuning_task: null
initializer_range: 0.02
layer_norm_epsilon: 0.00001
n_ctx: 1024
n_embd: 1280
n_head: 20
n_layer: 36
n_positions: 1024
num_labels: 1
output_attentions: false
output_hidden_states: false
resid_pdrop: 0.1
summary_activation: null
summary_first_dropout: 0.1
summary_proj_to_labels: true
summary_type: "cls_index"
summary_use_proj: true
torchscript: false
vocab_size: 50257
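
To make the config values more concrete, here is a rough sketch that derives the per-head width and an approximate parameter count from them, using only the Python standard library. It rests on assumptions that are not stated in the config: the standard GPT-2 block layout (a fused query/key/value projection, an output projection, two layer norms, and a two-layer MLP), an MLP hidden width of 4 * n_embd, and an output layer that shares weights with the token embedding. The names GPT2_LARGE_CONFIG, head_dim, count_parameters, and mlp_ratio are illustrative and are not part of the transformers library.

```python
# Values copied from the config listing above.
GPT2_LARGE_CONFIG = {
    "vocab_size": 50257,
    "n_positions": 1024,
    "n_embd": 1280,
    "n_head": 20,
    "n_layer": 36,
}


def head_dim(config):
    """Width of one attention head: n_embd split evenly across n_head."""
    if config["n_embd"] % config["n_head"] != 0:
        raise ValueError("n_embd must be divisible by n_head")
    return config["n_embd"] // config["n_head"]


def count_parameters(config, mlp_ratio=4):
    """Approximate trainable parameter count for a GPT-2 style decoder.

    Assumes the standard GPT-2 block layout and an MLP hidden width of
    mlp_ratio * n_embd (assumption: 4, which is not listed in the config).
    The output layer is assumed to reuse the token embedding matrix, so it
    adds no parameters of its own.
    """
    d = config["n_embd"]
    hidden = mlp_ratio * d

    token_embedding = config["vocab_size"] * d
    position_embedding = config["n_positions"] * d

    layer_norm = 2 * d                   # scale and shift vectors
    qkv_projection = d * 3 * d + 3 * d   # fused query/key/value weights + bias
    attn_output = d * d + d              # attention output projection
    mlp_in = d * hidden + hidden         # first MLP layer
    mlp_out = hidden * d + d             # second MLP layer

    per_block = 2 * layer_norm + qkv_projection + attn_output + mlp_in + mlp_out
    final_layer_norm = layer_norm

    return (
        token_embedding
        + position_embedding
        + config["n_layer"] * per_block
        + final_layer_norm
    )


if __name__ == "__main__":
    print("head_dim:", head_dim(GPT2_LARGE_CONFIG))
    print("parameters:", count_parameters(GPT2_LARGE_CONFIG))
```

Under these assumptions the sketch gives a head width of 64 and roughly 774 million parameters, most of them in the 36 transformer blocks rather than in the embeddings.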