Report for mrm8488/bert-tiny-finetuned-sms-spam-detection
Hey Team!🤗✨
We’re thrilled to share some amazing evaluation results that’ll make your day!🎉📊
We have identified 9 potential vulnerabilities in your model based on an automated scan.
This automated analysis evaluated the model on the dataset sms_spam (subset plain_text
, split train
).
👉Spurious Correlation issues (2)
Vulnerability | Level | Data slice | Metric | Transformation | Deviation |
---|---|---|---|---|---|
Spurious Correlation | minor 🟡 | avg_digits(text) < 0.032 |
Nominal association (Theil's U) = 0.684 | — | Prediction label = LABEL_0 for 99.09% of samples in the slice |
🔍✨Examples
Data slice `avg_digits(text)` < 0.032 seems to be highly associated to prediction label = `LABEL_0` (99.09% of predictions in the data slice).text | avg_digits(text) | label | Predicted label |
|
---|---|---|---|---|
0 | Go until jurong point, crazy.. Available only in bugis n great world la e buffet... Cine there got amore wat... | 0 | LABEL_0 | LABEL_0 (p = 0.94) |
1 | Ok lar... Joking wif u oni... | 0 | LABEL_0 | LABEL_0 (p = 0.94) |
3 | U dun say so early hor... U c already then say... | 0 | LABEL_0 | LABEL_0 (p = 0.94) |
Vulnerability | Level | Data slice | Metric | Transformation | Deviation |
---|---|---|---|---|---|
Spurious Correlation | minor 🟡 | avg_digits(text) >= 0.084 |
Nominal association (Theil's U) = 0.630 | — | Prediction label = LABEL_1 for 96.11% of samples in the slice |
🔍✨Examples
Data slice `avg_digits(text)` >= 0.084 seems to be highly associated to prediction label = `LABEL_1` (96.11% of predictions in the data slice).text | avg_digits(text) | label | Predicted label |
|
---|---|---|---|---|
2 | Free entry in 2 a wkly comp to win FA Cup final tkts 21st May 2005. Text FA to 87121 to receive entry question(std txt rate)T&C's apply 08452810075over18's | 0.160256 | LABEL_1 | LABEL_1 (p = 0.91) |
8 | WINNER!! As a valued network customer you have been selected to receivea £900 prize reward! To claim call 09061701461. Claim code KL341. Valid 12 hours only. | 0.120253 | LABEL_1 | LABEL_1 (p = 0.91) |
9 | Had your mobile 11 months or more? U R entitled to Update to the latest colour mobiles with camera for Free! Call The Mobile Update Co FREE on 08002986030 | 0.083871 | LABEL_1 | LABEL_1 (p = 0.90) |
👉Performance issues (7)
Vulnerability | Level | Data slice | Metric | Transformation | Deviation |
---|---|---|---|---|---|
Performance | major 🔴 | avg_digits(text) < 0.005 |
Recall = 0.154 | — | -83.05% than global |
🔍✨Examples
For records in the dataset where `avg_digits(text)` < 0.005, the Recall is 83.05% lower than the global Recall.text | avg_digits(text) | label | Predicted label |
|
---|---|---|---|---|
54 | SMS. ac Sptv: The New Jersey Devils and the Detroit Red Wings play Ice Hockey. Correct or Incorrect? End? Reply END SPTV | 0 | LABEL_1 | LABEL_0 (p = 0.62) |
68 | Did you hear about the new "Divorce Barbie"? It comes with all of Ken's stuff! | 0 | LABEL_1 | LABEL_0 (p = 0.94) |
270 | Ringtone Club: Get the UK singles chart on your mobile each week and choose any top quality ringtone! This message is free of charge. | 0 | LABEL_1 | LABEL_0 (p = 0.57) |
Vulnerability | Level | Data slice | Metric | Transformation | Deviation |
---|---|---|---|---|---|
Performance | major 🔴 | avg_whitespace(text) >= 0.225 |
Balanced Accuracy = 0.749 | — | -21.32% than global |
🔍✨Examples
For records in the dataset where `avg_whitespace(text)` >= 0.225, the Balanced Accuracy is 21.32% lower than the global Balanced Accuracy.text | avg_whitespace(text) | label | Predicted label |
|
---|---|---|---|---|
323 | cud u tell ppl im gona b a bit l8 cos 2 buses hav gon past cos they were full & im still waitin 4 1. Pete x | 0.259259 | LABEL_0 | LABEL_1 (p = 0.66) |
4514 | Money i have won wining number 946 wot do i do next | 0.230769 | LABEL_1 | LABEL_0 (p = 0.92) |
4873 | Hi dis is yijue i would be happy to work wif ü all for gek1510... | 0.227273 | LABEL_0 | LABEL_1 (p = 0.71) |
Vulnerability | Level | Data slice | Metric | Transformation | Deviation |
---|---|---|---|---|---|
Performance | major 🔴 | text contains "ok" |
Balanced Accuracy = 0.800 | — | -15.95% than global |
🔍✨Examples
For records in the dataset where `text` contains "ok", the Balanced Accuracy is 15.95% lower than the global Balanced Accuracy.text | label | Predicted label |
|
---|---|---|---|
5 | FreeMsg Hey there darling it's been 3 week's now and no word back! I'd like some fun you up for it still? Tb ok! XxX std chgs to send, £1.50 to rcv | LABEL_1 | LABEL_0 (p = 0.78) |
4249 | accordingly. I repeat, just text the word ok on your mobile phone and send | LABEL_1 | LABEL_0 (p = 0.93) |
Vulnerability | Level | Data slice | Metric | Transformation | Deviation |
---|---|---|---|---|---|
Performance | major 🔴 | avg_word_length(text) < 3.891 AND avg_word_length(text) >= 3.306 |
Recall = 0.784 | — | -13.59% than global |
🔍✨Examples
For records in the dataset where `avg_word_length(text)` < 3.891 AND `avg_word_length(text)` >= 3.306, the Recall is 13.59% lower than the global Recall.text | avg_word_length(text) | label | Predicted label |
|
---|---|---|---|---|
5 | FreeMsg Hey there darling it's been 3 week's now and no word back! I'd like some fun you up for it still? Tb ok! XxX std chgs to send, £1.50 to rcv | 3.625 | LABEL_1 | LABEL_0 (p = 0.78) |
227 | Will u meet ur dream partner soon? Is ur career off 2 a flyng start? 2 find out free, txt HORO followed by ur star sign, e. g. HORO ARIES | 3.6 | LABEL_1 | LABEL_0 (p = 0.91) |
263 | MY NO. IN LUTON 0125698789 RING ME IF UR AROUND! H* | 3.72727 | LABEL_0 | LABEL_1 (p = 0.87) |
Vulnerability | Level | Data slice | Metric | Transformation | Deviation |
---|---|---|---|---|---|
Performance | major 🔴 | text_length(text) < 51.500 AND text_length(text) >= 40.500 |
Balanced Accuracy = 0.845 | — | -11.19% than global |
🔍✨Examples
For records in the dataset where `text_length(text)` < 51.500 AND `text_length(text)` >= 40.500, the Balanced Accuracy is 11.19% lower than the global Balanced Accuracy.text | text_length(text) | label | Predicted label |
|
---|---|---|---|---|
955 | Filthy stories and GIRLS waiting for your | 42 | LABEL_1 | LABEL_0 (p = 0.94) |
3094 | staff.science.nus.edu.sg/~phyhcmk/teaching/pc1323 | 50 | LABEL_0 | LABEL_1 (p = 0.88) |
3302 | RCT' THNQ Adrian for U text. Rgds Vatian | 41 | LABEL_1 | LABEL_0 (p = 0.92) |
Vulnerability | Level | Data slice | Metric | Transformation | Deviation |
---|---|---|---|---|---|
Performance | medium 🟡 | avg_whitespace(text) >= 0.212 AND avg_whitespace(text) < 0.223 |
Balanced Accuracy = 0.860 | — | -9.62% than global |
🔍✨Examples
For records in the dataset where `avg_whitespace(text)` >= 0.212 AND `avg_whitespace(text)` < 0.223, the Balanced Accuracy is 9.62% lower than the global Balanced Accuracy.text | avg_whitespace(text) | label | Predicted label |
|
---|---|---|---|---|
5 | FreeMsg Hey there darling it's been 3 week's now and no word back! I'd like some fun you up for it still? Tb ok! XxX std chgs to send, £1.50 to rcv | 0.216216 | LABEL_1 | LABEL_0 (p = 0.78) |
227 | Will u meet ur dream partner soon? Is ur career off 2 a flyng start? 2 find out free, txt HORO followed by ur star sign, e. g. HORO ARIES | 0.217391 | LABEL_1 | LABEL_0 (p = 0.91) |
2402 | Babe: U want me dont u baby! Im nasty and have a thing 4 filthyguys. Fancy a rude time with a sexy bitch. How about we go slo n hard! Txt XXX SLO(4msgs) | 0.215686 | LABEL_1 | LABEL_0 (p = 0.81) |
Vulnerability | Level | Data slice | Metric | Transformation | Deviation |
---|---|---|---|---|---|
Performance | medium 🟡 | avg_word_length(text) < 4.258 AND avg_word_length(text) >= 4.102 |
Recall = 0.857 | — | -5.56% than global |
🔍✨Examples
For records in the dataset where `avg_word_length(text)` < 4.258 AND `avg_word_length(text)` >= 4.102, the Recall is 5.56% lower than the global Recall.text | avg_word_length(text) | label | Predicted label |
|
---|---|---|---|---|
2003 | TheMob>Yo yo yo-Here comes a new selection of hot downloads for our members to get for FREE! Just click & open the next link sent to ur fone... | 4.14286 | LABEL_1 | LABEL_0 (p = 0.92) |
3302 | RCT' THNQ Adrian for U text. Rgds Vatian | 4.125 | LABEL_1 | LABEL_0 (p = 0.92) |
4676 | Hi babe its Chloe, how r u? I was smashed on saturday night, it was great! How was your weekend? U been missing me? SP visionsms.com Text stop to stop 150p/text | 4.19355 | LABEL_1 | LABEL_0 (p = 0.84) |
Disclaimer: it's important to note that automated scans may produce false positives or miss certain vulnerabilities. We encourage you to review the findings and assess the impact accordingly.
💡 What's Next?
- Checkout the Giskard Space and improve your model.
- The Giskard community is always buzzing with ideas. 🐢🤔 What do you want to see next? Your feedback is our favorite fuel, so drop your thoughts in the community forum! 🗣️💬 Together, we're building something extraordinary.
🙌 Big Thanks!
We're grateful to have you on this adventure with us. 🚀🌟 Here's to more breakthroughs, laughter, and code magic! 🥂✨ Keep hugging that code and spreading the love! 💻 #Giskard #Huggingface #AISafety 🌈👏 Your enthusiasm, feedback, and contributions are what seek. 🌟 Keep being awesome!