Utilities for Generation

class transformers.generation.GreedySearchDecoderOnlyOutput

( sequences: LongTensor = None scores: typing.Optional[typing.Tuple[torch.FloatTensor]] = None attentions: typing.Optional[typing.Tuple[typing.Tuple[torch.FloatTensor]]] = None hidden_states: typing.Optional[typing.Tuple[typing.Tuple[torch.FloatTensor]]] = None )

Parameters

sequences (torch.LongTensor of shape (batch_size, sequence_length)) — The generated sequences. The second dimension (sequence_length) is either equal to max_length or shorter if all batches finished early due to the eos_token_id.
scores (tuple(torch.FloatTensor) optional, returned when output_scores=True is passed or when config.output_scores=True) — Processed prediction scores of the language modeling head (scores for each vocabulary token before SoftMax) at each generation step. Tuple of torch.FloatTensor with up to max_new_tokens elements (one element for each generated token), with each tensor of shape (batch_size, config.vocab_size).
attentions (tuple(tuple(torch.FloatTensor)), optional, returned when output_attentions=True is passed or config.output_attentions=True) — Tuple (one element for each generated token) of tuples (one element for each layer of the decoder) of torch.FloatTensor of shape (batch_size, num_heads, generated_length, sequence_length).
hidden_states (tuple(tuple(torch.FloatTensor)), optional, returned when output_hidden_states=True is passed or when config.output_hidden_states=True) — Tuple (one element for each generated token) of tuples (one element for each layer of the decoder) of torch.FloatTensor of shape (batch_size, generated_length, hidden_size).

Base class for outputs of decoder-only generation models using greedy search.

Transformers

Utilities for Generation

Generate Outputs

GreedySearchOutput

class transformers.generation.GreedySearchDecoderOnlyOutput

class transformers.generation.GreedySearchEncoderDecoderOutput

class transformers.generation.FlaxGreedySearchOutput

replace

SampleOutput

class transformers.generation.SampleDecoderOnlyOutput

class transformers.generation.SampleEncoderDecoderOutput

class transformers.generation.FlaxSampleOutput

replace

BeamSearchOutput

class transformers.generation.BeamSearchDecoderOnlyOutput

class transformers.generation.BeamSearchEncoderDecoderOutput

BeamSampleOutput

class transformers.generation.BeamSampleDecoderOnlyOutput

class transformers.generation.BeamSampleEncoderDecoderOutput

LogitsProcessor

class transformers.LogitsProcessor

__call__

class transformers.LogitsProcessorList

__call__

class transformers.LogitsWarper

__call__

class transformers.MinLengthLogitsProcessor

__call__

class transformers.MinNewTokensLengthLogitsProcessor

__call__

class transformers.TemperatureLogitsWarper

__call__

class transformers.RepetitionPenaltyLogitsProcessor

__call__

class transformers.TopPLogitsWarper

__call__

class transformers.TopKLogitsWarper

__call__

class transformers.TypicalLogitsWarper

__call__

class transformers.NoRepeatNGramLogitsProcessor

__call__

class transformers.NoBadWordsLogitsProcessor

__call__

class transformers.PrefixConstrainedLogitsProcessor

__call__

class transformers.HammingDiversityLogitsProcessor

__call__

class transformers.ForcedBOSTokenLogitsProcessor

__call__

class transformers.ForcedEOSTokenLogitsProcessor

__call__

class transformers.InfNanRemoveLogitsProcessor

__call__

class transformers.TFLogitsProcessor

__call__

class transformers.TFLogitsProcessorList

__call__

class transformers.TFLogitsWarper

__call__

class transformers.TFTemperatureLogitsWarper

__call__

class transformers.TFTopPLogitsWarper

__call__

class transformers.TFTopKLogitsWarper

__call__

class transformers.TFMinLengthLogitsProcessor

__call__

class transformers.TFNoBadWordsLogitsProcessor

__call__

class transformers.TFNoRepeatNGramLogitsProcessor

__call__

class transformers.TFRepetitionPenaltyLogitsProcessor

__call__

class transformers.TFForcedBOSTokenLogitsProcessor

__call__

class transformers.TFForcedEOSTokenLogitsProcessor

__call__

class transformers.FlaxLogitsProcessor

__call__

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call