Dataset: sacrebleu
How to load this metric directly with the 🤗/nlp library:

from nlp import load_metric
metric = load_metric("sacrebleu")


SacreBLEU provides hassle-free computation of shareable, comparable, and reproducible BLEU scores. Inspired by Rico Sennrich's `multi-bleu-detok.perl`, it produces the official WMT scores but works with plain text. It also knows all the standard test sets and handles downloading, processing, and tokenization for you. See the SacreBLEU project documentation for more information.
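
To give a rough idea of what a BLEU score measures, here is a minimal pure-Python sketch of corpus-level BLEU. It is not SacreBLEU's implementation: the function names `ngrams` and `simple_corpus_bleu` are hypothetical, and the sketch assumes whitespace tokenization, one reference per hypothesis, and no smoothing. Its scores will therefore generally differ from the ones SacreBLEU reports.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of order n in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def simple_corpus_bleu(hypotheses, references, max_order=4):
    """Illustrative corpus-level BLEU on a 0-100 scale.

    Assumptions (not SacreBLEU behavior): whitespace tokenization,
    exactly one reference per hypothesis, no smoothing.
    """
    matches = [0] * max_order  # clipped n-gram matches, per order
    totals = [0] * max_order   # hypothesis n-gram counts, per order
    hyp_len = ref_len = 0

    for hyp, ref in zip(hypotheses, references):
        hyp_tok, ref_tok = hyp.split(), ref.split()
        hyp_len += len(hyp_tok)
        ref_len += len(ref_tok)
        for n in range(1, max_order + 1):
            hyp_counts = ngrams(hyp_tok, n)
            ref_counts = ngrams(ref_tok, n)
            # Counter intersection clips each count to the reference count.
            matches[n - 1] += sum((hyp_counts & ref_counts).values())
            totals[n - 1] += sum(hyp_counts.values())

    # Without smoothing, a zero precision at any order gives BLEU = 0.
    if hyp_len == 0 or min(matches) == 0:
        return 0.0

    # Geometric mean of the n-gram precisions, computed in log space.
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_order
    # Brevity penalty: penalize output that is shorter than the references.
    brevity_penalty = 1.0 if hyp_len >= ref_len else math.exp(1 - ref_len / hyp_len)
    return 100.0 * brevity_penalty * math.exp(log_precision)


if __name__ == "__main__":
    hyps = ["the cat sat on the mat"]
    refs = ["the cat sat on a mat"]
    print(round(simple_corpus_bleu(hyps, refs), 2))
```

The sketch takes plain, untokenized strings as input, mirroring the "works with plain text" point above. For scores that are comparable across papers, use SacreBLEU itself rather than a hand-rolled function like this one.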


