Update checkpoint for transformers>=4.29

#7
by ArthurZ (HF staff) - opened

Following the merge of a PR in transformers, it appeared that this model was not properly converted. This PR fixes inference and was tested using the following script:

>>> from transformers import AutoTokenizer, MarianMTModel
>>> tokenizer = AutoTokenizer.from_pretrained('Helsinki-NLP/opus-mt-tc-big-ar-en')
>>> inputs = tokenizer("يا!دعونا نتعلم معا", return_tensors="pt", padding=True)
>>> model = MarianMTModel.from_pretrained('Helsinki-NLP/opus-mt-tc-big-ar-en')
>>> print(tokenizer.batch_decode(model.generate(**inputs)))
["<pad> Hey! Let's learn together</s>"]

Hi, I have tried the test script, but the output is still random words. For the above Arabic text I'm getting:
[' Salafist Salafist Salafist Salafist Salafist Salafist Salafist Salafist Salafist Salafist Salafist Salafist Salafist Salafist Salafist ']
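A minimal sketch of one thing to check, assuming the fixed weights live only on this PR's branch and have not been merged into main yet: the Hub exposes pull requests as git refs (here refs/pr/7, from the PR number above), so the updated checkpoint can be loaded explicitly via the revision argument. Running a recent transformers (>=4.29, per the PR title) would presumably also be needed.

>>> from transformers import AutoTokenizer, MarianMTModel
>>> model_id = 'Helsinki-NLP/opus-mt-tc-big-ar-en'
>>> revision = 'refs/pr/7'  # load from this PR's branch instead of main
>>> tokenizer = AutoTokenizer.from_pretrained(model_id, revision=revision)
>>> model = MarianMTModel.from_pretrained(model_id, revision=revision)
>>> inputs = tokenizer("يا!دعونا نتعلم معا", return_tensors="pt", padding=True)
>>> print(tokenizer.batch_decode(model.generate(**inputs), skip_special_tokens=True))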

Ready to merge
This branch is ready to get merged automatically.
