inoki-giskard/scan-report-temp · Report for HooshvareLab/bert-fa-base-uncased-sentiment-deepsentipers-binary

Hi Team,

This is a report from Giskard Bot Scan 🐢.

We have identified 6 potential vulnerabilities in your model based on an automated scan.

This automated analysis evaluated the model on the dataset sst2 (subset default, split validation).

👉Performance issues (6)

Vulnerability	Level	Data slice	Metric	Transformation	Deviation
Performance	major 🔴	`avg_word_length(text)` >= 4.109 AND `avg_word_length(text)` < 4.212	Precision = 0.361	—	-29.10% than global

🔍✨Examples

For records in the dataset where `avg_word_length(text)` >= 4.109 AND `avg_word_length(text)` < 4.212, the Precision is 29.1% lower than the global Precision.

	text	avg_word_length(text)	label	Predicted `label`
19	in its best moments , resembles a bad high school production of grease , without benefit of song .	4.21053	negative	positive (p = 1.00)
28	it 's a cookie-cutter movie , a cut-and-paste job .	4.2	negative	positive (p = 1.00)
44	the title not only describes its main characters , but the lazy people behind the camera as well .	4.21053	negative	positive (p = 1.00)

Vulnerability	Level	Data slice	Metric	Transformation	Deviation
Performance	major 🔴	`avg_whitespace(text)` < 0.196 AND `avg_whitespace(text)` >= 0.192	Precision = 0.361	—	-29.10% than global

🔍✨Examples

For records in the dataset where `avg_whitespace(text)` < 0.196 AND `avg_whitespace(text)` >= 0.192, the Precision is 29.1% lower than the global Precision.

	text	avg_whitespace(text)	label	Predicted `label`
19	in its best moments , resembles a bad high school production of grease , without benefit of song .	0.191919	negative	positive (p = 1.00)
28	it 's a cookie-cutter movie , a cut-and-paste job .	0.192308	negative	positive (p = 1.00)
44	the title not only describes its main characters , but the lazy people behind the camera as well .	0.191919	negative	positive (p = 1.00)

Vulnerability	Level	Data slice	Metric	Transformation	Deviation
Performance	major 🔴	`text_length(text)` >= 151.500 AND `text_length(text)` < 165.500	Precision = 0.407	—	-20.03% than global

🔍✨Examples

For records in the dataset where `text_length(text)` >= 151.500 AND `text_length(text)` < 165.500, the Precision is 20.03% lower than the global Precision.

	text	text_length(text)	label	Predicted `label`
9	in exactly 89 minutes , most of which passed as slowly as if i 'd been sitting naked on an igloo , formula 51 sank from quirky to jerky to utter turkey .	154	negative	positive (p = 1.00)
11	it takes a strange kind of laziness to waste the talents of robert forster , anne meara , eugene levy , and reginald veljohnson all in the same movie .	152	negative	positive (p = 1.00)
26	the action switches between past and present , but the material link is too tenuous to anchor the emotional connections that purport to span a 125-year divide .	161	negative	positive (p = 1.00)

Vulnerability	Level	Data slice	Metric	Transformation	Deviation
Performance	major 🔴	`text` contains "movie"	Precision = 0.421	—	-17.22% than global

🔍✨Examples

For records in the dataset where `text` contains "movie", the Precision is 17.22% lower than the global Precision.

	text	label	Predicted `label`
11	it takes a strange kind of laziness to waste the talents of robert forster , anne meara , eugene levy , and reginald veljohnson all in the same movie .	negative	positive (p = 1.00)
14	even horror fans will most likely not find what they 're seeking with trouble every day ; the movie lacks both thrills and humor .	negative	positive (p = 1.00)
18	... the movie is just a plain old monster .	negative	positive (p = 1.00)

Vulnerability	Level	Data slice	Metric	Transformation	Deviation
Performance	major 🔴	`avg_word_length(text)` >= 4.317 AND `avg_word_length(text)` < 4.613	Precision = 0.445	—	-12.46% than global

🔍✨Examples

For records in the dataset where `avg_word_length(text)` >= 4.317 AND `avg_word_length(text)` < 4.613, the Precision is 12.46% lower than the global Precision.

	text	avg_word_length(text)	label	Predicted `label`
6	a sometimes tedious film .	4.4	negative	positive (p = 1.00)
20	pumpkin takes an admirable look at the hypocrisy of political correctness , but it does so with such an uneven tone that you never know when humor ends and tragedy begins .	4.40625	negative	positive (p = 1.00)
22	holden caulfield did it better .	4.5	negative	positive (p = 1.00)

Vulnerability	Level	Data slice	Metric	Transformation	Deviation
Performance	major 🔴	`avg_whitespace(text)` < 0.188 AND `avg_whitespace(text)` >= 0.178	Precision = 0.445	—	-12.46% than global

🔍✨Examples

For records in the dataset where `avg_whitespace(text)` < 0.188 AND `avg_whitespace(text)` >= 0.178, the Precision is 12.46% lower than the global Precision.

	text	avg_whitespace(text)	label	Predicted `label`
6	a sometimes tedious film .	0.185185	negative	positive (p = 1.00)
20	pumpkin takes an admirable look at the hypocrisy of political correctness , but it does so with such an uneven tone that you never know when humor ends and tragedy begins .	0.184971	negative	positive (p = 1.00)
22	holden caulfield did it better .	0.181818	negative	positive (p = 1.00)

Checkout out the Giskard Space and test your model.

Disclaimer: it's important to note that automated scans may produce false positives or miss certain vulnerabilities. We encourage you to review the findings and assess the impact accordingly.