Back to all metrics
Dataset: glue πŸ“‰
Update on GitHub

How to load this metric directly with the πŸ€—/nlp library:

Copy to clipboard
from nlp import load_metric metric = load_metric("glue")


GLUE, the General Language Understanding Evaluation benchmark ( is a collection of resources for training, evaluating, and analyzing natural language understanding systems.


  title={{GLUE}: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding},
  author={Wang, Alex and Singh, Amanpreet and Michael, Julian and Hill, Felix and Levy, Omer and Bowman, Samuel R.},
  note={In the Proceedings of ICLR.},
Note that each GLUE dataset has its own citation. Please see the source to see
the correct citation for each contained dataset.