Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ibm 's Collections
Materials
BioMed
✨ Highlights
Power-LM
Genie: Wishes datasets
🔬 Research
Paraphrase and perturbation question-answering robustness

Paraphrase and perturbation question-answering robustness

updated Aug 19, 2024

Datasets from "A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios" (https://arxiv.org/abs/2408.01963)

Upvote
1

  • ibm-research/BoolQ_robustness

    Viewer • Updated Aug 19, 2024 • 29.4k • 27

  • ibm-research/identity_group_abuse_robustness

    Viewer • Updated Aug 19, 2024 • 21.8k • 22 • 1

  • ibm-research/PopQA_robustness

    Viewer • Updated Aug 19, 2024 • 204k • 119
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs