Transformers documentation

Pipelines

Transformers

You are viewing v4.38.2 version. A newer version v4.56.2 is available.

Join the Hugging Face community

and get access to the augmented documentation experience

Collaborate on models, datasets and Spaces

Faster examples with accelerated inference

Switch between documentation themes

to get started

Pipelines

パイプラインは、推論にモデルを使うための簡単で優れた方法である。パイプラインは、複雑なコードのほとんどを抽象化したオブジェクトです。パイプラインは、ライブラリから複雑なコードのほとんどを抽象化したオブジェクトで、名前付き固有表現認識、マスク言語モデリング、感情分析、特徴抽出、質問応答などのタスクに特化したシンプルなAPIを提供します。 Recognition、Masked Language Modeling、Sentiment Analysis、Feature Extraction、Question Answeringなどのタスクに特化したシンプルなAPIを提供します。以下を参照のこと。タスク概要を参照してください。

パイプラインの抽象化には2つのカテゴリーがある：

pipeline() は、他のすべてのパイプラインをカプセル化する最も強力なオブジェクトです。
タスク固有のパイプラインは、オーディオ、コンピュータービジョン、自然言語処理、およびマルチモーダルタスクで使用できます。

The pipeline abstraction

パイプライン 抽象化は、他のすべての利用可能なパイプラインのラッパーです。他のものと同様にインスタンス化されますパイプラインですが、さらなる生活の質を提供できます。

1 つの項目に対する単純な呼び出し:

>>> pipe = pipeline("text-classification")
>>> pipe("This restaurant is awesome")
[{'label': 'POSITIVE', 'score': 0.9998743534088135}]

ハブの特定のモデルを使用したい場合は、モデルがオンになっている場合はタスクを無視できます。ハブはすでにそれを定義しています。

>>> pipe = pipeline(model="FacebookAI/roberta-large-mnli")
>>> pipe("This restaurant is awesome")
[{'label': 'NEUTRAL', 'score': 0.7313136458396912}]

多くの項目に対してパイプラインを呼び出すには、list を使用してパイプラインを呼び出すことができます。

>>> pipe = pipeline("text-classification")
>>> pipe(["This restaurant is awesome", "This restaurant is awful"])
[{'label': 'POSITIVE', 'score': 0.9998743534088135},
 {'label': 'NEGATIVE', 'score': 0.9996669292449951}]

完全なデータセットを反復するには、Datasetを直接使用することをお勧めします。これは、割り当てる必要がないことを意味しますデータセット全体を一度に処理することも、自分でバッチ処理を行う必要もありません。これはカスタムループと同じくらい速く動作するはずです。 GPU。それが問題でない場合は、ためらわずに問題を作成してください。

import datasets
from transformers import pipeline
from transformers.pipelines.pt_utils import KeyDataset
from tqdm.auto import tqdm

pipe = pipeline("automatic-speech-recognition", model="facebook/wav2vec2-base-960h", device=0)
dataset = datasets.load_dataset("superb", name="asr", split="test")

# KeyDataset (only *pt*) will simply return the item in the dict returned by the dataset item
# as we're not interested in the *target* part of the dataset. For sentence pair use KeyPairDataset
for out in tqdm(pipe(KeyDataset(dataset, "file"))):
    print(out)
    # {"text": "NUMBER TEN FRESH NELLY IS WAITING ON YOU GOOD NIGHT HUSBAND"}
    # {"text": ....}
    # ....

使いやすくするために、ジェネレーターを使用することもできます。

from transformers import pipeline

pipe = pipeline("text-classification")


def data():
    while True:
        # This could come from a dataset, a database, a queue or HTTP request
        # in a server
        # Caveat: because this is iterative, you cannot use `num_workers > 1` variable
        # to use multiple threads to preprocess data. You can still have 1 thread that
        # does the preprocessing while the main runs the big inference
        yield "This is a test"


for out in pipe(data()):
    print(out)
    # {"text": "NUMBER TEN FRESH NELLY IS WAITING ON YOU GOOD NIGHT HUSBAND"}
    # {"text": ....}
    # ....

Transformers

Pipelines

The pipeline abstraction

transformers.pipeline

Pipeline batching

Pipeline chunk batching

Pipeline custom code

Implementing a pipeline

Audio

AudioClassificationPipeline

class transformers.AudioClassificationPipeline

__call__

AutomaticSpeechRecognitionPipeline

class transformers.AutomaticSpeechRecognitionPipeline

__call__

TextToAudioPipeline

class transformers.TextToAudioPipeline

__call__

ZeroShotAudioClassificationPipeline

class transformers.ZeroShotAudioClassificationPipeline

__call__

Computer vision

DepthEstimationPipeline

class transformers.DepthEstimationPipeline

__call__

ImageClassificationPipeline

class transformers.ImageClassificationPipeline

__call__

ImageSegmentationPipeline

class transformers.ImageSegmentationPipeline

__call__

ImageToImagePipeline

class transformers.ImageToImagePipeline

__call__

ObjectDetectionPipeline

class transformers.ObjectDetectionPipeline

__call__

VideoClassificationPipeline

class transformers.VideoClassificationPipeline

__call__

ZeroShotImageClassificationPipeline

class transformers.ZeroShotImageClassificationPipeline

__call__

ZeroShotObjectDetectionPipeline

class transformers.ZeroShotObjectDetectionPipeline

__call__

Natural Language Processing

ConversationalPipeline

class transformers.Conversation

add_user_input

append_response

mark_processed

class transformers.ConversationalPipeline

__call__

FillMaskPipeline

class transformers.FillMaskPipeline

__call__

NerPipeline

class transformers.TokenClassificationPipeline

aggregate_words

gather_pre_entities

group_entities

group_sub_entities

QuestionAnsweringPipeline

class transformers.QuestionAnsweringPipeline

__call__

create_sample

span_to_answer

SummarizationPipeline

class transformers.SummarizationPipeline

__call__

TableQuestionAnsweringPipeline

class transformers.TableQuestionAnsweringPipeline

__call__

TextClassificationPipeline

class transformers.TextClassificationPipeline

__call__

TextGenerationPipeline

class transformers.TextGenerationPipeline

__call__

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call

call