Code corresponding to the onnx upload

#12
by ProfLinh - opened

I noticed that @izhx uploaded the onnx weights recently and am wondering if he can also share the code that generated it. I'm seeing a discrepancy my runs and am wondering if I'm processing the ONNX files differently than intended. For example, I'm unable to reproduce the embedding values that I'm getting when using AutoModel or SentenceTransformers. For reference, here is the onnx code I'm using:

sess = onnxruntime.InferenceSession(onnx_model_path)
tokenizer = AutoTokenizer.from_pretrained(hf_model_path)
input_text = "Convert this sentence to ONNX."
input_names = ["input_ids", "attention_mask", "token_type_ids"]
inputs_1 = tokenizer(input_text, return_tensors="pt")
inputs = {
    "input_ids": inputs_1["input_ids"].numpy(), 
    "attention_mask": inputs_1["attention_mask"].numpy(), 
    "token_type_ids": inputs_1["token_type_ids"].numpy(),
}
output = sess.run(None, inputs)

Thanks all!

Alibaba-NLP org

Thanks! So I guess @Xenova has the code then?

Alibaba-NLP org

Yes

Sign up or log in to comment