Model failed to predict if the word is lower

#4
by vpkprasanna - opened

model can able to predict Pariisi as LOC but it failed to predict if the same word is in lower case pariisi , how to resolve this issue ?

Hello!
Great question. This model uses the bert-base-multilingual-cased model, meaning that it differentiates Pariisi and pariisi. Because it only sees the former during training, it doesn't work well for pariisi, as you've noticed. However, as the README says:

Is your data not (always) capitalized correctly? Then consider using this uncased variant of this model by @lxyuan for better performance:
lxyuan/span-marker-bert-base-multilingual-uncased-multinerd.

@lxyuan their model is equivalent to this one, with the exception that it is uncased, i.e. it works just as well for pariisi as Pariisi:

image.png

I recommend that one if your data isn't always correctly capitalized :) Hope this helps.

  • Tom Aarsen

got the answer thanks for the replay

tomaarsen changed discussion status to closed

Sign up or log in to comment