Transformers
Back to all models
Model: distilgpt2

Monthly model downloads

distilgpt2 distilgpt2
- downloads
last 30 days

pytorch

tf

How to use this model directly from the 🤗/transformers library:

			
Copy model
tokenizer = AutoTokenizer.from_pretrained("distilgpt2") model = AutoModel.from_pretrained("distilgpt2")

Config

See raw config file
attn_pdrop: 0.1 ...
embd_pdrop: 0.1 ...
▾ finetuning_task: null ...
initializer_range: 0.02 ...
layer_norm_epsilon: 0.00001 ...
n_ctx: 1024 ...
n_embd: 768 ...
n_head: 12 ...
n_layer: 6 ...
n_positions: 1024 ...
num_labels: 1 ...
output_attentions: false ...
output_hidden_states: false ...
▾ pruned_heads: {} ...
resid_pdrop: 0.1 ...
▾ summary_activation: null ...
summary_first_dropout: 0.1 ...
summary_proj_to_labels: true ...
summary_type: "cls_index" ...
summary_use_proj: true ...
torchscript: false ...
use_bfloat16: false ...
vocab_size: 50257 ...