Utilities for Generation

class transformers.generation.GreedySearchEncoderDecoderOutput

(
    sequences: LongTensor = None,
    scores: typing.Optional[typing.Tuple[torch.FloatTensor]] = None,
    encoder_attentions: typing.Optional[typing.Tuple[torch.FloatTensor]] = None,
    encoder_hidden_states: typing.Optional[typing.Tuple[torch.FloatTensor]] = None,
    decoder_attentions: typing.Optional[typing.Tuple[typing.Tuple[torch.FloatTensor]]] = None,
    cross_attentions: typing.Optional[typing.Tuple[typing.Tuple[torch.FloatTensor]]] = None,
    decoder_hidden_states: typing.Optional[typing.Tuple[typing.Tuple[torch.FloatTensor]]] = None
)

Parameters

sequences (torch.LongTensor of shape (batch_size, sequence_length)) — The generated sequences. The second dimension (sequence_length) is either equal to max_length or shorter if all batches finished early due to the eos_token_id.
scores (tuple(torch.FloatTensor), optional, returned when output_scores=True is passed or when config.output_scores=True) — Processed prediction scores of the language modeling head (scores for each vocabulary token before SoftMax) at each generation step. Tuple of torch.FloatTensor with up to max_new_tokens elements (one element for each generated token), with each tensor of shape (batch_size, config.vocab_size).
encoder_attentions (tuple(torch.FloatTensor), optional, returned when output_attentions=True is passed or when config.output_attentions=True) — Tuple of torch.FloatTensor (one for each layer of the encoder) of shape (batch_size, num_heads, sequence_length, sequence_length).
encoder_hidden_states (tuple(torch.FloatTensor), optional, returned when output_hidden_states=True is passed or when config.output_hidden_states=True) — Tuple of torch.FloatTensor (one for the output of the embeddings + one for the output of each layer) of shape (batch_size, sequence_length, hidden_size).
decoder_attentions (tuple(tuple(torch.FloatTensor)), optional, returned when output_attentions=True is passed or config.output_attentions=True) — Tuple (one element for each generated token) of tuples (one element for each layer of the decoder) of torch.FloatTensor of shape (batch_size, num_heads, generated_length, sequence_length).
cross_attentions (tuple(tuple(torch.FloatTensor)), optional, returned when output_attentions=True is passed or config.output_attentions=True) — Tuple (one element for each generated token) of tuples (one element for each layer of the decoder) of torch.FloatTensor of shape (batch_size, num_heads, generated_length, sequence_length).
decoder_hidden_states (tuple(tuple(torch.FloatTensor)), optional, returned when output_hidden_states=True is passed or when config.output_hidden_states=True) — Tuple (one element for each generated token) of tuples (one element for each layer of the decoder) of torch.FloatTensor of shape (batch_size, generated_length, hidden_size).

Base class for outputs of encoder-decoder generation models using greedy search. Hidden states and attention weights of the encoder can be accessed via the encoder_hidden_states and encoder_attentions attributes; those of the decoder can be accessed via the decoder_hidden_states and decoder_attentions attributes.
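The nesting of these attributes is easy to misread, so here is a minimal sketch that walks it. In practice an object like this is returned by generate() when return_dict_in_generate=True is passed (together with output_scores=True and output_attentions=True for the optional fields). The helper summarize_generation_output and the stand-ins _fake_tensor and fake_output are hypothetical names introduced only for illustration. They are not part of the library, and the shapes used are arbitrary placeholders.

```python
from types import SimpleNamespace


def summarize_generation_output(output):
    """Describe the nesting of a generation output (hypothetical helper).

    Works on any object exposing the attributes documented above.
    Attributes that are None (i.e. not requested) are skipped.
    """
    summary = {"sequences_shape": tuple(output.sequences.shape)}

    # scores: flat tuple, one tensor per generated token.
    scores = getattr(output, "scores", None)
    if scores is not None:
        summary["num_generation_steps"] = len(scores)
        summary["score_shape"] = tuple(scores[0].shape)

    # decoder_attentions: outer tuple over generated tokens,
    # inner tuple over decoder layers.
    decoder_attentions = getattr(output, "decoder_attentions", None)
    if decoder_attentions is not None:
        summary["decoder_attention_steps"] = len(decoder_attentions)
        summary["decoder_layers"] = len(decoder_attentions[0])

    # encoder_attentions: flat tuple, one tensor per layer
    # (the encoder runs once, so there is no per-token nesting).
    encoder_attentions = getattr(output, "encoder_attentions", None)
    if encoder_attentions is not None:
        summary["encoder_layers"] = len(encoder_attentions)

    return summary


def _fake_tensor(*shape):
    # Stand-in for a torch tensor: only the .shape attribute is needed here.
    return SimpleNamespace(shape=shape)


# Hypothetical output: batch of 2, 3 generated tokens, 6 decoder layers.
fake_output = SimpleNamespace(
    sequences=_fake_tensor(2, 4),
    scores=tuple(_fake_tensor(2, 100) for _ in range(3)),
    encoder_attentions=None,  # not requested in this sketch
    decoder_attentions=tuple(
        tuple(_fake_tensor(2, 8, 1, 1) for _ in range(6)) for _ in range(3)
    ),
)

summary = summarize_generation_output(fake_output)
print(summary)
```

Because the helper only reads len() and .shape, the same function should also work on a real output object whose fields are torch tensors.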

Generate Outputs

PyTorch

class transformers.generation.GreedySearchEncoderDecoderOutput

class transformers.generation.GreedySearchDecoderOnlyOutput

class transformers.generation.SampleEncoderDecoderOutput

class transformers.generation.SampleDecoderOnlyOutput

class transformers.generation.BeamSearchEncoderDecoderOutput

class transformers.generation.BeamSearchDecoderOnlyOutput

class transformers.generation.BeamSampleEncoderDecoderOutput

class transformers.generation.BeamSampleDecoderOnlyOutput

class transformers.generation.ContrastiveSearchEncoderDecoderOutput

class transformers.generation.ContrastiveSearchDecoderOnlyOutput

TensorFlow

class transformers.generation.TFGreedySearchEncoderDecoderOutput

class transformers.generation.TFGreedySearchDecoderOnlyOutput

class transformers.generation.TFSampleEncoderDecoderOutput

class transformers.generation.TFSampleDecoderOnlyOutput

class transformers.generation.TFBeamSearchEncoderDecoderOutput

class transformers.generation.TFBeamSearchDecoderOnlyOutput

class transformers.generation.TFBeamSampleEncoderDecoderOutput

class transformers.generation.TFBeamSampleDecoderOnlyOutput

class transformers.generation.TFContrastiveSearchEncoderDecoderOutput

class transformers.generation.TFContrastiveSearchDecoderOnlyOutput

FLAX

class transformers.generation.FlaxSampleOutput

replace

class transformers.generation.FlaxGreedySearchOutput

replace

class transformers.generation.FlaxBeamSearchOutput

replace

LogitsProcessor

PyTorch

class transformers.AlternatingCodebooksLogitsProcessor

__call__

class transformers.ClassifierFreeGuidanceLogitsProcessor

__call__

class transformers.EncoderNoRepeatNGramLogitsProcessor

__call__

class transformers.EncoderRepetitionPenaltyLogitsProcessor

__call__

class transformers.EpsilonLogitsWarper

__call__

class transformers.EtaLogitsWarper

__call__

class transformers.ExponentialDecayLengthPenalty

__call__

class transformers.ForcedBOSTokenLogitsProcessor

__call__

class transformers.ForcedEOSTokenLogitsProcessor

__call__

class transformers.ForceTokensLogitsProcessor

__call__

class transformers.HammingDiversityLogitsProcessor

__call__

class transformers.InfNanRemoveLogitsProcessor

__call__

class transformers.LogitNormalization

__call__

class transformers.LogitsProcessor

__call__

class transformers.LogitsProcessorList

__call__

class transformers.LogitsWarper

__call__

class transformers.MinLengthLogitsProcessor

__call__

class transformers.MinNewTokensLengthLogitsProcessor

__call__

class transformers.NoBadWordsLogitsProcessor

__call__

class transformers.NoRepeatNGramLogitsProcessor

__call__

class transformers.PrefixConstrainedLogitsProcessor

__call__

class transformers.RepetitionPenaltyLogitsProcessor

__call__

class transformers.SequenceBiasLogitsProcessor

__call__
