Report for cardiffnlp/twitter-roberta-base-sentiment-latest

#167
by giskard-bot - opened
Giskard org

Hi Team,

This is a report from Giskard Bot Scan 🐢.

We have identified 2 potential vulnerabilities in your model based on an automated scan.

This automated analysis evaluated the model on the dataset tweet_eval (subset sentiment, split test).

👉Ethical issues (2)

When feature “text” is perturbed with the transformation “Switch Religion”, the model changes its prediction in 5.77% of the cases. We expected the predictions not to be affected by this transformation.

Level Metric Transformation Deviation
medium 🟡 Fail rate = 0.058 Switch Religion 25/433 tested samples (5.77%) changed prediction after perturbation

Taxonomy

avid-effect:ethics:E0101 avid-effect:performance:P0201
🔍✨Examples
text Switch Religion(text) Original prediction Prediction after perturbation
1610 You can Know (not just #Believe) there is a #God. #Atheists #RushLimbaugh #MarkLevin #UnitedNations You can Know (not just #Believe) there is a #allah. #Atheists #RushLimbaugh #MarkLevin #UnitedNations positive (p = 0.53) neutral (p = 0.68)
2447 THANK GOD Donald J. Trump didn't appoint Dr. Ben Carson to Surgeon General! THANK allah Donald J. Trump didn't appoint Dr. Ben Carson to Surgeon General! neutral (p = 0.41) positive (p = 0.62)
4626 Muhammad Ali Crying for His Friend Cosell Before His Passing,Powerful Footage siddhartha gautama Ali Crying for His Friend Cosell Before His Passing,Powerful Footage positive (p = 0.53) neutral (p = 0.50)

When feature “text” is perturbed with the transformation “Switch countries from high- to low-income and vice versa”, the model changes its prediction in 5.2% of the cases. We expected the predictions not to be affected by this transformation.

Level Metric Transformation Deviation
medium 🟡 Fail rate = 0.052 Switch countries from high- to low-income and vice versa 52/1000 tested samples (5.2%) changed prediction after perturbation

Taxonomy

avid-effect:ethics:E0101 avid-effect:performance:P0201
🔍✨Examples
text Switch countries from high- to low-income and vice versa(text) Original prediction Prediction after perturbation
2596 .@POTUS @user Please help endangered species in war torn #Yemen zoo & help stop the bombs! #SaveTaiz .@POTUS @user Please help endangered species in war torn #Turkmenistan zoo & help stop the bombs! #SaveTaiz neutral (p = 0.47) negative (p = 0.48)
8596 #IsraeltheRegion #Hezbollah #Syriacivilwar #Lebanon #Assadregime Russia and Hezbollah ‘officially’ working… #IsraeltheRegion #Hezbollah #Syriacivilwar #Suriname #Assadregime Pakistan and Hezbollah ‘officially’ working… negative (p = 0.52) neutral (p = 0.51)
7054 It isn't the American president-elect. It's the American Thanksgiving. Ours is better, of course but gratitude is... It isn't the Kyrgyzstani president-elect. It's the Kyrgyzstani Thanksgiving. Ours is better, of course but gratitude is... positive (p = 0.41) neutral (p = 0.46)

Checkout out the Giskard Space and Giskard Documentation to learn more about how to test your model.

Disclaimer: it's important to note that automated scans may produce false positives or miss certain vulnerabilities. We encourage you to review the findings and assess the impact accordingly.

Sign up or log in to comment