Edit model card

This is a GPT2 model uploaded using the KerasNLP library and can be used with JAX, TensorFlow, and PyTorch backends. This model is related to a CausalLM task.

Model config:

  • name: gpt2_backbone
  • trainable: True
  • vocabulary_size: 50257
  • num_layers: 12
  • num_heads: 12
  • hidden_dim: 768
  • intermediate_dim: 3072
  • dropout: 0.1
  • max_sequence_length: 1024

This model card has been generated automatically and should be completed by the model author. See Model Cards documentation for more information.

Downloads last month
14
Inference Examples
Inference API (serverless) does not yet support keras-nlp models for this pipeline type.