Update Space (evaluate main: 05209ece)
README.md CHANGED
@@ -10,6 +10,14 @@ pinned: false
 tags:
 - evaluate
 - metric
+description: >-
+  BLEU (Bilingual Evaluation Understudy) is an algorithm for evaluating the quality of text which has been machine-translated from one natural language to another.
+  Quality is considered to be the correspondence between a machine's output and that of a human: "the closer a machine translation is to a professional human translation, the better it is"
+  – this is the central idea behind BLEU. BLEU was one of the first metrics to claim a high correlation with human judgements of quality, and remains one of the most popular automated and inexpensive metrics.
+
+  Scores are calculated for individual translated segments—generally sentences—by comparing them with a set of good quality reference translations.
+  Those scores are then averaged over the whole corpus to reach an estimate of the translation's overall quality.
+  Neither intelligibility nor grammatical correctness are taken into account.
 ---
 
 # Metric Card for BLEU
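
The description added above summarizes how BLEU is computed: per-segment n-gram comparison against reference translations, averaged over the corpus. As a quick illustration of the metric this Space wraps, here is a minimal sketch using the Hugging Face `evaluate` library; the example predictions and references are made up for illustration, and the exact result keys may differ across library versions.

```python
# Minimal sketch: computing BLEU with the Hugging Face `evaluate` library.
# The example sentences below are invented purely for illustration.
import evaluate

bleu = evaluate.load("bleu")

predictions = ["the cat sat on the mat"]          # candidate translations, one string per segment
references = [["the cat is sitting on the mat"]]  # one or more reference translations per segment

results = bleu.compute(predictions=predictions, references=references)
print(results["bleu"])        # corpus-level BLEU score in [0, 1]
print(results["precisions"])  # modified n-gram precisions for n = 1..4
```

Note that BLEU is a corpus-level metric: the per-segment n-gram counts are aggregated before the final score is computed, so scoring sentences one at a time and averaging will generally not reproduce the corpus score.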