Issue with the NER task

#1
by neavo - opened

When training an NER task with this series of models, I ran into some issues.
After a normal training run, I obtained a model with fairly good evaluation metrics (e.g., F1 score).
In actual use, however, the base and small versions assigned very low scores (<0.5) to some non-contiguous characters, while the large version did not have this problem.
Based on your documentation, could this be caused by the difference between Unigram and BPE tokenization? How should I deal with this?

I think I've solved this. The problem was that SentencePiece may add a token consisting only of "_" at the beginning of a sentence. The offset mapping for that token was not the (0, 0) expected of a special token but (0, 1), which broke the normal operation of char_to_token. After correcting the alignment, the data looks normal, though it still needs further observation.
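For anyone hitting the same thing, here is a minimal self-contained sketch of the workaround described above. The token and offset data are simulated (no tokenizer is loaded), the function names are illustrative, and "▁" is the SentencePiece meta symbol U+2581 (often displayed as "_"): a bare "▁" token whose offset is (0, 1) instead of (0, 0) claims character 0 of the text, so a char_to_token-style lookup resolves to the meta token instead of the first real subword.

```python
# Sketch of the alignment fix, assuming a SentencePiece tokenizer that
# sometimes emits a bare "▁" token at the start of a sentence with an
# offset of (0, 1) instead of the (0, 0) expected for a meta token.

def fix_leading_meta_offsets(tokens, offsets):
    """Zero out the span of a bare '▁' meta token so it no longer
    claims character 0 of the input text."""
    fixed = list(offsets)
    for i, (tok, span) in enumerate(zip(tokens, offsets)):
        if tok == "\u2581" and span == (0, 1):
            fixed[i] = (0, 0)
    return fixed

def char_to_token(offsets, char_idx):
    """Minimal char->token lookup over an offset mapping; zero-width
    entries (special/meta tokens) never match and are skipped."""
    for i, (start, end) in enumerate(offsets):
        if start <= char_idx < end:
            return i
    return None

# Simulated tokenizer output for "Tokyo is big": the bare "▁" token
# wrongly covers character 0.
tokens  = ["[CLS]", "\u2581", "Tokyo", "\u2581is", "\u2581big", "[SEP]"]
offsets = [(0, 0), (0, 1), (0, 5), (5, 8), (8, 12), (0, 0)]

print(char_to_token(offsets, 0))  # resolves to the "▁" meta token (index 1)
fixed = fix_leading_meta_offsets(tokens, offsets)
print(char_to_token(fixed, 0))    # now resolves to "Tokyo" (index 2)
```

With the corrected mapping, character 0 aligns to the first real subword, so NER labels are no longer shifted by one token at the start of each sentence.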
