Helsinki-NLP/opus-100
Viewer • Updated • 55.1M • 51.3k • 239
How to use Aleton/be-en-translator with Transformers:
# Use a pipeline as a high-level helper
# Warning: Pipeline type "translation" is no longer supported in transformers v5.
# You must load the model directly (see below) or downgrade to v4.x with:
# 'pip install "transformers<5.0.0'
from transformers import pipeline
pipe = pipeline("translation", model="Aleton/be-en-translator") # Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
tokenizer = AutoTokenizer.from_pretrained("Aleton/be-en-translator")
model = AutoModelForSeq2SeqLM.from_pretrained("Aleton/be-en-translator")This model is a fine-tuned version of Helsinki-NLP/opus-mt-mul-en for translating text from Belarusian (be) to English (en).
The model was fine-tuned using the transformers library on the Belarusian-English split of the OPUS-100 dataset. It is based on the MarianMT architecture and is optimized for quick and accurate translation of short to medium-length sentences.
You can use this model directly with the transformers library:
import torch
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
# Load model and tokenizer
model_name = "Aleton/be-en-translator"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
# Set device
device = "cuda" if torch.cuda.is_available() else "cpu"
model = model.to(device)
# Text to translate
text = "Прывітанне, як справы?"
# Generate translation
inputs = tokenizer(text, return_tensors="pt").to(device)
outputs = model.generate(**inputs, max_length=128)
translated_text = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(translated_text)
# Expected output: Hello, how are you?
Base model
Helsinki-NLP/opus-mt-mul-en