Model Types

#1
by luedman - opened

What kind of model types/architecture did you use for the NER and RE tasks repectivly? BERT, ... ?

Yes, the models are BERT based. The NER model was trained on conll2003, you can find the model source code here, the config is here and the code for input encoding / decoding is here. It is a span classification based approach, i.e. all token n-grams up to a certain size (8) are classified. The RE model was trained on tacred (model config, model source code, input encoding / decoding). For that model, special marker tokens are added around the head and tail entities and the text is then classified.

ArneBinder changed discussion status to closed

Sign up or log in to comment