--- language: - en tags: - pytorch - ner - text generation - seq2seq inference: false license: mit datasets: - conll2003 metrics: - f1 --- # t5-base-qa-ner-conll Unofficial implementation of [InstructionNER](https://arxiv.org/pdf/2203.03903v1.pdf). t5-base model tuned on conll2003 dataset. https://github.com/ovbystrova/InstructionNER ## Inference ```shell git clone https://github.com/ovbystrova/InstructionNER cd InstructionNER ``` ```python from instruction_ner.model import Model model = Model( model_path_or_name="olgaduchovny/t5-base-ner-conll", tokenizer_path_or_name="olgaduchovny/t5-base-ner-conll" ) options = ["LOC", "PER", "ORG", "MISC"] instruction = "please extract entities and their types from the input sentence, " \ "all entity types are in options" text = "The protest , which attracted several thousand supporters , coincided with the 18th anniversary of Spain 's constitution ." generation_kwargs = { "num_beams": 2, "max_length": 128 } pred_text, pred_spans = model.predict( text=text, generation_kwargs=generation_kwargs, instruction=instruction, options=options ) >>> ('Spain is a Loc.', [(99, 104, 'LOC')]) ``` ## Prediction Sample ``` Sentence: The protest , which attracted several thousand supporters , coincided with the 18th anniversary of Spain 's constitution . Instruction: please extract entities and their types from the input sentence, all entity types are in options Options: ORG, PER, LOC Prediction (raw text): Spain is a LOC. Prediction (span): [(99, 104, 'LOC')] ```