Report for distilbert-base-uncased-finetuned-sst-2-english

#9
by giskard-bot - opened

Hey Team!🤗✨
We're thrilled to share some amazing evaluation results that'll make your day!🎉📊

We have identified 1 potential vulnerability in your model based on an automated scan.

This automated analysis evaluated the model on the dataset sst2 (subset default, split validation).

👉Performance issues (1)
| Vulnerability | Level | Data slice | Metric | Transformation | Deviation |
|---|---|---|---|---|---|
| Performance | major 🔴 | `text` contains "film" | Accuracy = 0.402 | none | 18.16% lower than global |

🔍✨Examples

For records in the dataset where `text` contains "film", the Accuracy is 18.16% lower than the global Accuracy.

| | text | label | Predicted label |
|---|---|---|---|
| 5 | although laced with humor and a few fanciful touches , the film is a refreshingly serious look at young women . | POSITIVE | NEGATIVE (p = 1.00) |
| 8 | you do n't have to know about music to appreciate the film 's easygoing blend of comedy and romance . | POSITIVE | NEGATIVE (p = 0.99) |
| 10 | the mesmerizing performances of the leads keep the film grounded and keep the audience riveted . | POSITIVE | NEGATIVE (p = 1.00) |
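
To make the slice comparison above concrete, here is a minimal sketch of how accuracy on a keyword slice can be compared with global accuracy. It is not the scan's actual implementation: the function names `accuracy` and `slice_deviation`, the toy records, and the choice to report the deviation as a relative difference, `(slice - global) / global`, are all assumptions made for illustration.

```python
def accuracy(labels, predictions):
    """Return the fraction of predictions that match their labels."""
    if not labels:
        raise ValueError("cannot compute accuracy on an empty set")
    correct = sum(1 for y, p in zip(labels, predictions) if y == p)
    return correct / len(labels)


def slice_deviation(texts, labels, predictions, keyword):
    """Compare accuracy on records whose text contains `keyword` with global accuracy.

    Returns (slice_accuracy, global_accuracy, relative_deviation).
    The relative form of the deviation is an assumption, not taken from the report.
    """
    idx = [i for i, text in enumerate(texts) if keyword in text]
    global_acc = accuracy(labels, predictions)
    slice_acc = accuracy([labels[i] for i in idx], [predictions[i] for i in idx])
    if global_acc == 0:
        raise ValueError("global accuracy is zero; relative deviation is undefined")
    return slice_acc, global_acc, (slice_acc - global_acc) / global_acc


# Hypothetical toy records, NOT taken from sst2 or from the scan output.
toy_texts = ["the film is great", "a fine film", "lovely music",
             "dull plot", "a boring film", "nice acting"]
toy_labels = ["POSITIVE", "POSITIVE", "POSITIVE", "NEGATIVE", "NEGATIVE", "POSITIVE"]
toy_preds = ["NEGATIVE", "NEGATIVE", "POSITIVE", "NEGATIVE", "NEGATIVE", "POSITIVE"]

slice_acc, global_acc, deviation = slice_deviation(
    toy_texts, toy_labels, toy_preds, "film"
)
print(f"slice={slice_acc:.3f} global={global_acc:.3f} deviation={deviation:+.2%}")
```

Running the same comparison on your own predictions for the validation split is one way to check whether the reported gap holds up or is a false positive.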

Disclaimer: it's important to note that automated scans may produce false positives or miss certain vulnerabilities. We encourage you to review the findings and assess the impact accordingly.

💡 What's Next?

  • Check out the Giskard Space and improve your model.
  • The Giskard community is always buzzing with ideas. 🐢🤔 What do you want to see next? Your feedback is our favorite fuel, so drop your thoughts in the community forum! 🗣️💬 Together, we're building something extraordinary.

🙌 Big Thanks!

We're grateful to have you on this adventure with us. 🚀🌟 Here's to more breakthroughs, laughter, and code magic! 🥂✨ Keep hugging that code and spreading the love! 💻 #Giskard #Huggingface #AISafety 🌈 Your enthusiasm, feedback, and contributions are what we seek. 🌟 Keep being awesome!
