Thytu committed
Commit
097107d
1 Parent(s): 37e891c

feat: model

Signed-off-by: Thytu <vdmatos@gladia.io>

bhadresh-savani_distilbert-base-uncased-emotion_onnx_inference/1/.gitkeep ADDED
File without changes
bhadresh-savani_distilbert-base-uncased-emotion_onnx_inference/config.pbtxt ADDED
@@ -0,0 +1,58 @@
+ name: "bhadresh-savani_distilbert-base-uncased-emotion_onnx_inference"
+ max_batch_size: 0
+ platform: "ensemble"
+
+ input [
+   {
+     name: "TEXT"
+     data_type: TYPE_STRING
+     dims: [ -1 ]
+   }
+ ]
+
+ output {
+   name: "output"
+   data_type: TYPE_FP32
+   dims: [-1, 6]
+ }
+
+ ensemble_scheduling {
+   step [
+     {
+       model_name: "bhadresh-savani_distilbert-base-uncased-emotion_onnx_tokenize"
+       model_version: -1
+       input_map {
+         key: "TEXT"
+         value: "TEXT"
+       }
+       output_map [
+         {
+           key: "input_ids"
+           value: "input_ids"
+         },
+         {
+           key: "attention_mask"
+           value: "attention_mask"
+         }
+       ]
+     },
+     {
+       model_name: "bhadresh-savani_distilbert-base-uncased-emotion_onnx_model"
+       model_version: -1
+       input_map [
+         {
+           key: "input_ids"
+           value: "input_ids"
+         },
+         {
+           key: "attention_mask"
+           value: "attention_mask"
+         }
+       ]
+       output_map {
+         key: "output"
+         value: "output"
+       }
+     }
+   ]
+ }
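The ensemble above chains the tokenization step and the ONNX model behind a single endpoint, so a client sends raw text and gets back the (batch, 6) emotion logits directly. Below is a minimal client sketch, not part of this commit, assuming a Triton server reachable on the default HTTP port; the URL and the example sentence are illustrative.

# Minimal sketch: query the ensemble through the tritonclient HTTP API.
# Assumption: Triton is running locally on the default port 8000.
import numpy as np
import tritonclient.http as triton_http

client = triton_http.InferenceServerClient(url="localhost:8000")

# TYPE_STRING inputs travel as BYTES tensors backed by a numpy object array.
text = np.array(["I am so happy today"], dtype=object)
text_input = triton_http.InferInput("TEXT", list(text.shape), "BYTES")
text_input.set_data_from_numpy(text)

result = client.infer(
    model_name="bhadresh-savani_distilbert-base-uncased-emotion_onnx_inference",
    inputs=[text_input],
    outputs=[triton_http.InferRequestedOutput("output")],
)
print(result.as_numpy("output"))  # shape (1, 6): one logit per emotion class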
bhadresh-savani_distilbert-base-uncased-emotion_onnx_model/1/model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:88da46062f1337aaa66c58d9529f8f4b5dd02b0410fe02d839f8d69441c9e929
+ size 139298114
bhadresh-savani_distilbert-base-uncased-emotion_onnx_model/config.pbtxt ADDED
@@ -0,0 +1,30 @@
+ name: "bhadresh-savani_distilbert-base-uncased-emotion_onnx_model"
+ max_batch_size: 0
+ platform: "onnxruntime_onnx"
+ default_model_filename: "model.bin"
+
+ input [
+   {
+     name: "input_ids"
+     data_type: TYPE_INT32
+     dims: [-1, -1]
+   },
+   {
+     name: "attention_mask"
+     data_type: TYPE_INT32
+     dims: [-1, -1]
+   }
+ ]
+
+ output {
+   name: "output"
+   data_type: TYPE_FP32
+   dims: [-1, 6]
+ }
+
+ instance_group [
+   {
+     count: 1
+     kind: KIND_GPU
+   }
+ ]
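Since the ONNX graph is shipped as model.bin with int32 input_ids / attention_mask inputs and a (-1, 6) float output, it can be sanity-checked with ONNX Runtime before being served. A short sketch, assuming the tensor names inside the graph match the config above and that the file has been pulled from Git LFS:

# Sketch: load the exported graph with ONNX Runtime and feed dummy int32 inputs.
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession(
    "bhadresh-savani_distilbert-base-uncased-emotion_onnx_model/1/model.bin",
    providers=["CPUExecutionProvider"],
)
dummy = {
    "input_ids": np.ones((1, 8), dtype=np.int32),
    "attention_mask": np.ones((1, 8), dtype=np.int32),
}
(logits,) = session.run(["output"], dummy)
print(logits.shape)  # expected: (1, 6)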
bhadresh-savani_distilbert-base-uncased-emotion_onnx_tokenize/1/config.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:dbdd79309149fa1f33496a5fd1bc8b377e98e338539bc4868a7b66ee18f4d66a
+ size 808
bhadresh-savani_distilbert-base-uncased-emotion_onnx_tokenize/1/model.py ADDED
@@ -0,0 +1,73 @@
+ # Copyright 2022, Lefebvre Dalloz Services
+ #
+ # Licensed under the Apache License, Version 2.0 (the "License");
+ # you may not use this file except in compliance with the License.
+ # You may obtain a copy of the License at
+ #
+ #     http://www.apache.org/licenses/LICENSE-2.0
+ #
+ # Unless required by applicable law or agreed to in writing, software
+ # distributed under the License is distributed on an "AS IS" BASIS,
+ # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ # See the License for the specific language governing permissions and
+ # limitations under the License.
+
+ """
+ This module is copy-pasted into the generated Triton configuration folder to perform the tokenization step.
+ """
+
+ # noinspection DuplicatedCode
+ from pathlib import Path
+ from typing import Dict, List
+
+ import numpy as np
+
+
+ try:
+     # noinspection PyUnresolvedReferences
+     import triton_python_backend_utils as pb_utils
+ except ImportError:
+     pass  # triton_python_backend_utils exists only inside the Triton Python backend.
+
+ from transformers import AutoTokenizer, BatchEncoding, PreTrainedTokenizer, TensorType
+
+
+ class TritonPythonModel:
+     tokenizer: PreTrainedTokenizer
+
+     def initialize(self, args: Dict[str, str]) -> None:
+         """
+         Initialize the tokenization process
+         :param args: arguments from Triton config file
+         """
+         # more variables in https://github.com/triton-inference-server/python_backend/blob/main/src/python.cc
+
+         path: str = str(Path(args["model_repository"]).parent.absolute())
+         self.tokenizer = AutoTokenizer.from_pretrained(path)
+
+     def execute(self, requests) -> "List[List[pb_utils.Tensor]]":
+         """
+         Parse and tokenize each request
+         :param requests: 1 or more requests received by Triton server.
+         :return: text as input tensors
+         """
+         responses = []
+         # for loop for batch requests (disabled in our case)
+         for request in requests:
+             # binary data typed back to string
+             query = [t.decode("UTF-8") for t in pb_utils.get_input_tensor_by_name(request, "TEXT").as_numpy().tolist()]
+             tokens: BatchEncoding = self.tokenizer(
+                 text=query, return_tensors=TensorType.NUMPY, padding=True, pad_to_multiple_of=8
+             )
+             # the tokenizer returns int64 tensors while the ONNX model above declares TYPE_INT32 inputs, hence the cast
+             tokens_dict = {k: v.astype(np.int32) for k, v in tokens.items()}
+             # communicate the tokenization results to Triton server
+             outputs = list()
+             for input_name in self.tokenizer.model_input_names:
+                 tensor_input = pb_utils.Tensor(input_name, tokens_dict[input_name])
+                 outputs.append(tensor_input)
+
+             inference_response = pb_utils.InferenceResponse(output_tensors=outputs)
+             responses.append(inference_response)
+
+         return responses
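The preprocessing performed by execute() above can be reproduced outside Triton with the tokenizer files committed in the 1/ version folder, which is a convenient way to check the shapes and dtypes the ensemble will pass to the ONNX model. A sketch, with an illustrative input sentence and a path relative to the repository root:

# Standalone sketch of the preprocessing done in TritonPythonModel.execute().
import numpy as np
from transformers import AutoTokenizer, TensorType

tokenizer = AutoTokenizer.from_pretrained(
    "bhadresh-savani_distilbert-base-uncased-emotion_onnx_tokenize/1"
)
tokens = tokenizer(
    text=["I am so happy today"],  # illustrative input
    return_tensors=TensorType.NUMPY,
    padding=True,
    pad_to_multiple_of=8,
)
# Same int32 cast as in model.py: the ONNX model declares TYPE_INT32 inputs.
tokens_int32 = {k: v.astype(np.int32) for k, v in tokens.items()}
print({k: (v.shape, v.dtype) for k, v in tokens_int32.items()})
# e.g. {'input_ids': ((1, 8), dtype('int32')), 'attention_mask': ((1, 8), dtype('int32'))}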
bhadresh-savani_distilbert-base-uncased-emotion_onnx_tokenize/1/special_tokens_map.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b6d346be366a7d1d48332dbc9fdf3bf8960b5d879522b7799ddba59e76237ee3
+ size 125
bhadresh-savani_distilbert-base-uncased-emotion_onnx_tokenize/1/tokenizer.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d241a60d5e8f04cc1b2b3e9ef7a4921b27bf526d9f6050ab90f9267a1f9e5c66
+ size 711396
bhadresh-savani_distilbert-base-uncased-emotion_onnx_tokenize/1/tokenizer_config.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:e5a3982826e0689e0698a39be0ad3c588562cec9b45594ed8c33b32e0cbb85a6
+ size 436
bhadresh-savani_distilbert-base-uncased-emotion_onnx_tokenize/1/vocab.txt ADDED
The diff for this file is too large to render. See raw diff
 
bhadresh-savani_distilbert-base-uncased-emotion_onnx_tokenize/config.pbtxt ADDED
@@ -0,0 +1,31 @@
+ name: "bhadresh-savani_distilbert-base-uncased-emotion_onnx_tokenize"
+ max_batch_size: 0
+ backend: "python"
+
+ input [
+   {
+     name: "TEXT"
+     data_type: TYPE_STRING
+     dims: [ -1 ]
+   }
+ ]
+
+ output [
+   {
+     name: "input_ids"
+     data_type: TYPE_INT32
+     dims: [-1, -1]
+   },
+   {
+     name: "attention_mask"
+     data_type: TYPE_INT32
+     dims: [-1, -1]
+   }
+ ]
+
+ instance_group [
+   {
+     count: 1
+     kind: KIND_GPU
+   }
+ ]
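The tokenization model can also be queried on its own, which is handy for inspecting the intermediate tensors the ensemble feeds to the ONNX model. Same assumptions as in the earlier client sketch (local server on the default HTTP port, illustrative sentence):

# Sketch: call the tokenize model directly and inspect its outputs.
import numpy as np
import tritonclient.http as triton_http

client = triton_http.InferenceServerClient(url="localhost:8000")

text = np.array(["I am so happy today"], dtype=object)
text_input = triton_http.InferInput("TEXT", list(text.shape), "BYTES")
text_input.set_data_from_numpy(text)

result = client.infer(
    model_name="bhadresh-savani_distilbert-base-uncased-emotion_onnx_tokenize",
    inputs=[text_input],
    outputs=[
        triton_http.InferRequestedOutput("input_ids"),
        triton_http.InferRequestedOutput("attention_mask"),
    ],
)
# Both tensors come back as int32, padded to a multiple of 8 (see model.py above).
print(result.as_numpy("input_ids").shape, result.as_numpy("input_ids").dtype)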