Spaces:

manueldeprada
/

beer

Runtime error

Manuel de Prada commited on Apr 29, 2023

Commit

21d7631

•

1 Parent(s): 80dcff0

beer metric

Files changed (1) hide show

README.md CHANGED Viewed

@@ -11,11 +11,7 @@ tags:
 - evaluate
 - metric
 description: >-
-   BEER 2.0 (BEtter Evaluation as Ranking) is a trained machine translation evaluation metric with high correlation with human judgment both on sentence and corpus level. It is a linear model-based metric for sentence-level evaluation in machine translation (MT) that combines 33 relatively dense features, including character n-grams and reordering features.
-It employs a learning-to-rank framework to differentiate between function and non-function words and weighs each word type according to its importance for evaluation.
-The model is trained on ranking similar translations using a vector of feature values for each system output.
-BEER outperforms the strong baseline metric METEOR in five out of eight language pairs, showing that less sparse features at the sentence level can lead to state-of-the-art results.
-Features on character n-grams are crucial, and higher-order character n-grams are less prone to sparse counts than word n-grams.
 ---
 # Metric Card for BEER

 - evaluate
 - metric
 description: >-
+   BEER 2.0 (BEtter Evaluation as Ranking) is a trained machine translation evaluation metric with high correlation with human judgment both on sentence and corpus level. It is a linear model-based metric for sentence-level evaluation in machine translation (MT) that combines 33 relatively dense features, including character n-grams and reordering features. It employs a learning-to-rank framework to differentiate between function and non-function words and weighs each word type according to its importance for evaluation. The model is trained on ranking similar translations using a vector of feature values for each system output. BEER outperforms the strong baseline metric METEOR in five out of eight language pairs, showing that less sparse features at the sentence level can lead to state-of-the-art results. Features on character n-grams are crucial, and higher-order character n-grams are less prone to sparse counts than word n-grams.
 ---
 # Metric Card for BEER