Poor results with informal English greetings

#10
by ymurenko - opened

I tested common English greetings used on the internet, and the results are pretty poor:

hi -> ur, 0.71
HI -> ur, 0.70
Hi -> sw, 0.37
Hi! -> sw, 0.50
hey -> sw, 0.33
HEY -> hi, 0.71
Hey -> sw, 0.93
Hey! -> sw, 0.92
hello -> sw, 0.32
HELLO -> hi, 0.95
Hello -> en, 0.71
Hello! -> en, 0.53
yo -> sw, 0.96
YO -> hi, 0.86
Yo -> sw, 0.74
Yo! -> sw, 0.73
sup -> sw, 0.82
SUP -> sw, 0.55
Sup -> sw, 0.43
Sup! -> sw, 0.35

"Hey!" and "HELLO" get very high confidence for the wrong language, and a lot of these are classified as Swahili. Based on a quick Google search, none of them appear to be actual words in that language.
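
To make the pattern easier to see, here is a small sketch that tallies the predictions listed above. The data is copied from this post; the 0.9 cutoff for "confidently wrong" is just an assumption I picked for illustration, and "en" is assumed to be the correct label for every input.

```python
from collections import Counter

# Predictions copied from the list in this post: "input -> language, confidence".
RESULTS = """
hi -> ur, 0.71
HI -> ur, 0.70
Hi -> sw, 0.37
Hi! -> sw, 0.50
hey -> sw, 0.33
HEY -> hi, 0.71
Hey -> sw, 0.93
Hey! -> sw, 0.92
hello -> sw, 0.32
HELLO -> hi, 0.95
Hello -> en, 0.71
Hello! -> en, 0.53
yo -> sw, 0.96
YO -> hi, 0.86
Yo -> sw, 0.74
Yo! -> sw, 0.73
sup -> sw, 0.82
SUP -> sw, 0.55
Sup -> sw, 0.43
Sup! -> sw, 0.35
"""

def parse(line):
    """Split one 'text -> lang, conf' line into (text, lang, conf)."""
    text, rest = line.split(" -> ")
    lang, conf = rest.split(", ")
    return text, lang, float(conf)

rows = [parse(line) for line in RESULTS.strip().splitlines()]

# How often each language was predicted.
by_lang = Counter(lang for _, lang, _ in rows)

# Inputs labeled as English, which is the expected answer here.
correct = [text for text, lang, _ in rows if lang == "en"]

# Wrong predictions at or above the (arbitrary) 0.9 confidence cutoff.
confident_wrong = [(t, l, c) for t, l, c in rows if l != "en" and c >= 0.9]

print(by_lang.most_common())
print(correct)
print(confident_wrong)
```

Only "Hello" and "Hello!" come back as English, and Swahili accounts for 13 of the 20 predictions.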

The training set consists of complete sentences. The model can be expected to perform poorly on a single word, especially a greeting like 'hi', which is used in many languages.
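
If single-word inputs can't be avoided, one possible workaround is to refuse to label them at all. This is only a minimal sketch: `classify` stands in for whatever function wraps the model and returns a `(language, confidence)` pair, and `min_words` and `min_confidence` are hypothetical thresholds, not anything shipped with this model.

```python
def detect_or_unknown(text, classify, min_words=3, min_confidence=0.9):
    """Return the predicted language, or "unknown" for short or uncertain input.

    `classify` is a hypothetical callable: text -> (language, confidence).
    """
    # Skip the model entirely for inputs shorter than a sentence fragment.
    if len(text.split()) < min_words:
        return "unknown"
    lang, conf = classify(text)
    # Only trust predictions the model is confident about.
    return lang if conf >= min_confidence else "unknown"


# Stand-in classifier for demonstration; a real one would call the model.
def fake_classify(text):
    return ("sw", 0.96)


print(detect_or_unknown("yo", fake_classify))
print(detect_or_unknown("this is a full sentence", fake_classify))
```

A confidence threshold alone would not be enough, since inputs like "yo" and "HELLO" score above 0.9 for the wrong language, so the length check does most of the work here.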
