ONNX-converted version of the model

by asofter - opened Feb 2, 2024

base: refs/heads/main ← from: refs/pr/1


+155767

-0

ONNX-converted version of the model (21ab9673)

asofter

Feb 2, 2024

We decided to replace the existing Code Scanner model in llm-guard with your model. In our tests, it is considerably more accurate than the HuggingFace one.

For faster inference, we use ONNX models converted with Optimum from HuggingFace.

Example of a repo with ONNX built in: https://huggingface.co/laiyer/deberta-v3-base-prompt-injection

asofter

Feb 2, 2024

pip install transformers "optimum[onnxruntime]"

from pathlib import Path

from optimum.onnxruntime import ORTModelForSequenceClassification
from transformers import AutoTokenizer

model_path = "philomath-1209/programming-language-identification"
onnx_path = Path("onnx")

# export=True converts the PyTorch checkpoint to ONNX while loading
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = ORTModelForSequenceClassification.from_pretrained(model_path, export=True)

# Save the ONNX model and the tokenizer side by side in ./onnx
model.save_pretrained(onnx_path)
tokenizer.save_pretrained(onnx_path)
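
For anyone who wants to try the exported files, here is a minimal sketch of loading the saved directory back and classifying a snippet. It assumes the export above has already written the model and tokenizer to ./onnx. The helper names `load_onnx_classifier` and `best_label` are hypothetical and used only for illustration; this is not how llm-guard itself loads the model.

```python
from pathlib import Path


def load_onnx_classifier(model_dir="onnx"):
    """Build a text-classification pipeline from a locally exported ONNX model.

    Assumes model_dir holds the files written by save_pretrained() above.
    """
    # Imported lazily so this module can be imported without the heavy dependencies.
    from optimum.onnxruntime import ORTModelForSequenceClassification
    from transformers import AutoTokenizer, pipeline

    model_dir = Path(model_dir)
    tokenizer = AutoTokenizer.from_pretrained(model_dir)
    # No export=True here: the directory already contains the ONNX weights.
    model = ORTModelForSequenceClassification.from_pretrained(model_dir)
    return pipeline("text-classification", model=model, tokenizer=tokenizer)


def best_label(predictions):
    """Return the label with the highest score from a list of {label, score} dicts."""
    return max(predictions, key=lambda p: p["score"])["label"]


if __name__ == "__main__":
    classifier = load_onnx_classifier("onnx")
    snippet = "def add(a, b):\n    return a + b"
    # The pipeline returns a list of {"label": ..., "score": ...} dicts.
    print(best_label(classifier(snippet)))
```

The label strings come from the model's own config, so the printed value depends on how the original model names its classes.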

philomath-1209 changed pull request status to merged Feb 2, 2024
